Sample records for systematic gene expression

  1. The landscape of genomic imprinting across diverse adult human tissues

    PubMed Central

    Baran, Yael; Subramaniam, Meena; Biton, Anne; Tukiainen, Taru; Tsang, Emily K.; Rivas, Manuel A.; Pirinen, Matti; Gutierrez-Arcelus, Maria; Smith, Kevin S.; Kukurba, Kim R.; Zhang, Rui; Eng, Celeste; Torgerson, Dara G.; Urbanek, Cydney; Li, Jin Billy; Rodriguez-Santana, Jose R.; Burchard, Esteban G.; Seibold, Max A.; MacArthur, Daniel G.; Montgomery, Stephen B.; Zaitlen, Noah A.; Lappalainen, Tuuli

    2015-01-01

    Genomic imprinting is an important regulatory mechanism that silences one of the parental copies of a gene. To systematically characterize this phenomenon, we analyze tissue specificity of imprinting from allelic expression data in 1582 primary tissue samples from 178 individuals from the Genotype-Tissue Expression (GTEx) project. We characterize imprinting in 42 genes, including both novel and previously identified genes. Tissue specificity of imprinting is widespread, and gender-specific effects are revealed in a small number of genes in muscle with stronger imprinting in males. IGF2 shows maternal expression in the brain instead of the canonical paternal expression elsewhere. Imprinting appears to have only a subtle impact on tissue-specific expression levels, with genes lacking a systematic expression difference between tissues with imprinted and biallelic expression. In summary, our systematic characterization of imprinting in adult tissues highlights variation in imprinting between genes, individuals, and tissues. PMID:25953952

  2. The landscape of genomic imprinting across diverse adult human tissues.

    PubMed

    Baran, Yael; Subramaniam, Meena; Biton, Anne; Tukiainen, Taru; Tsang, Emily K; Rivas, Manuel A; Pirinen, Matti; Gutierrez-Arcelus, Maria; Smith, Kevin S; Kukurba, Kim R; Zhang, Rui; Eng, Celeste; Torgerson, Dara G; Urbanek, Cydney; Li, Jin Billy; Rodriguez-Santana, Jose R; Burchard, Esteban G; Seibold, Max A; MacArthur, Daniel G; Montgomery, Stephen B; Zaitlen, Noah A; Lappalainen, Tuuli

    2015-07-01

    Genomic imprinting is an important regulatory mechanism that silences one of the parental copies of a gene. To systematically characterize this phenomenon, we analyze tissue specificity of imprinting from allelic expression data in 1582 primary tissue samples from 178 individuals from the Genotype-Tissue Expression (GTEx) project. We characterize imprinting in 42 genes, including both novel and previously identified genes. Tissue specificity of imprinting is widespread, and gender-specific effects are revealed in a small number of genes in muscle with stronger imprinting in males. IGF2 shows maternal expression in the brain instead of the canonical paternal expression elsewhere. Imprinting appears to have only a subtle impact on tissue-specific expression levels, with genes lacking a systematic expression difference between tissues with imprinted and biallelic expression. In summary, our systematic characterization of imprinting in adult tissues highlights variation in imprinting between genes, individuals, and tissues. © 2015 Baran et al.; Published by Cold Spring Harbor Laboratory Press.

  3. Evaluation and selection of reliable reference genes for gene expression under abiotic stress in cotton (Gossypium hirsutum L.).

    PubMed

    Wang, Min; Wang, Qinglian; Zhang, Baohong

    2013-11-01

    Reference genes are critical for normalization of the gene expression level of target genes. The widely used housekeeping genes may change their expression levels at different tissue under different treatment or stress conditions. Therefore, systematical evaluation on the housekeeping genes is required for gene expression analysis. Up to date, no work was performed to evaluate the housekeeping genes in cotton under stress treatment. In this study, we chose 10 housekeeping genes to systematically assess their expression levels at two different tissues (leaves and roots) under two different abiotic stresses (salt and drought) with three different concentrations. Our results show that there is no best reference gene for all tissues at all stress conditions. The reliable reference gene should be selected based on a specific condition. For example, under salt stress, UBQ7, GAPDH and EF1A8 are better reference genes in leaves; TUA10, UBQ7, CYP1, GAPDH and EF1A8 were better in roots. Under drought stress, UBQ7, EF1A8, TUA10, and GAPDH showed less variety of expression level in leaves and roots. Thus, it is better to identify reliable reference genes first before performing any gene expression analysis. However, using a combination of housekeeping genes as reference gene may provide a new strategy for normalization of gene expression. In this study, we found that combination of four housekeeping genes worked well as reference genes under all the stress conditions. © 2013.

  4. Macronutrients and the FTO gene expression in hypothalamus; a systematic review of experimental studies.

    PubMed

    Doaei, Saeid; Kalantari, Naser; Mohammadi, Nastaran Keshavarz; Tabesh, Ghasem Azizi; Gholamalizadeh, Maryam

    The various studies have examined the relationship between FTO gene expression and macronutrients levels. In order to obtain better viewpoint from this interactions, all of existing studies were reviewed systematically. All published papers have been obtained and reviewed using standard and sensitive keywords from databases such as CINAHL, Embase, PubMed, PsycInfo, and the Cochrane, from 1990 to 2016. The results indicated that all of 6 studies that met the inclusion criteria (from a total of 428 published article) found FTO gene expression changes at short-term follow-ups. Four of six studies found an increased FTO gene expression after calorie restriction, while two of them indicated decreased FTO gene expression. The effect of protein, carbohydrate and fat were separately assessed and suggested by all of six studies. In Conclusion, The level of FTO gene expression in hypothalamus is related to macronutrients levels. Future research should evaluate the long-term impact of dietary interventions. Copyright © 2017. Published by Elsevier B.V.

  5. Gene Expression Profiling and Molecular Signaling of Various Cells in Response to Tricalcium Silicate Cements: A Systematic Review.

    PubMed

    Rathinam, Elanagai; Rajasekharan, Sivaprakash; Chitturi, Ravi Teja; Declercq, Heidi; Martens, Luc; De Coster, Peter

    2016-12-01

    The aim of this study was to present a systematic review investigating the gene expression of various cells (other than dental pulp cells) in response to different variants of tricalcium silicate cements (TSCs). A systematic search of the literature was performed by 2 independent reviewers followed by article selection and data extraction. Studies analyzing any cell type except dental pulp stem cells and any variant of tricalcium silicate cement either as the experimental or as the control group were included. A total of 41 relevant articles were included in this review. Among the included studies, ProRoot MTA (Dentsply, Tulsa, OK) was the most commonly studied (69.1%) TSC variant, and 11 cell types were identified, with 13 articles investigating gene expression in osteoblasts. A total of 39 different genes/molecules expressed were found in the selected studies. The experimental group (irrespective of the TSC variant) was identified to express significantly increased gene expression compared with the control group (untreated) in all included studies. Recent studies have provided useful insight into the gene expression and molecular signaling of various cells in response to TSCs, and new elements have been supplied on the pathways activated in this process. TSCs are capable of eliciting a favorable cellular response in periapical regeneration. Copyright © 2016 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  6. Bayesian approach to transforming public gene expression repositories into disease diagnosis databases.

    PubMed

    Huang, Haiyan; Liu, Chun-Chi; Zhou, Xianghong Jasmine

    2010-04-13

    The rapid accumulation of gene expression data has offered unprecedented opportunities to study human diseases. The National Center for Biotechnology Information Gene Expression Omnibus is currently the largest database that systematically documents the genome-wide molecular basis of diseases. However, thus far, this resource has been far from fully utilized. This paper describes the first study to transform public gene expression repositories into an automated disease diagnosis database. Particularly, we have developed a systematic framework, including a two-stage Bayesian learning approach, to achieve the diagnosis of one or multiple diseases for a query expression profile along a hierarchical disease taxonomy. Our approach, including standardizing cross-platform gene expression data and heterogeneous disease annotations, allows analyzing both sources of information in a unified probabilistic system. A high level of overall diagnostic accuracy was shown by cross validation. It was also demonstrated that the power of our method can increase significantly with the continued growth of public gene expression repositories. Finally, we showed how our disease diagnosis system can be used to characterize complex phenotypes and to construct a disease-drug connectivity map.

  7. Tightly Regulated Expression of Autographa californica Multicapsid Nucleopolyhedrovirus Immediate Early Genes Emerges from Their Interactions and Possible Collective Behaviors

    PubMed Central

    Taka, Hitomi; Asano, Shin-ichiro; Matsuura, Yoshiharu; Bando, Hisanori

    2015-01-01

    To infect their hosts, DNA viruses must successfully initiate the expression of viral genes that control subsequent viral gene expression and manipulate the host environment. Viral genes that are immediately expressed upon infection play critical roles in the early infection process. In this study, we investigated the expression and regulation of five canonical regulatory immediate-early (IE) genes of Autographa californica multicapsid nucleopolyhedrovirus: ie0, ie1, ie2, me53, and pe38. A systematic transient gene-expression analysis revealed that these IE genes are generally transactivators, suggesting the existence of a highly interactive regulatory network. A genetic analysis using gene knockout viruses demonstrated that the expression of these IE genes was tolerant to the single deletions of activator IE genes in the early stage of infection. A network graph analysis on the regulatory relationships observed in the transient expression analysis suggested that the robustness of IE gene expression is due to the organization of the IE gene regulatory network and how each IE gene is activated. However, some regulatory relationships detected by the genetic analysis were contradictory to those observed in the transient expression analysis, especially for IE0-mediated regulation. Statistical modeling, combined with genetic analysis using knockout alleles for ie0 and ie1, showed that the repressor function of ie0 was due to the interaction between ie0 and ie1, not ie0 itself. Taken together, these systematic approaches provided insight into the topology and nature of the IE gene regulatory network. PMID:25816136

  8. FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data.

    PubMed

    Manijak, Mieszko P; Nielsen, Henrik B

    2011-06-11

    Although, systematic analysis of gene annotation is a powerful tool for interpreting gene expression data, it sometimes is blurred by incomplete gene annotation, missing expression response of key genes and secondary gene expression responses. These shortcomings may be partially circumvented by instead matching gene expression signatures to signatures of other experiments. To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700 Arabidopsis microarray experiments. Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/.

  9. Gene expression variability in human hepatic drug metabolizing enzymes and transporters.

    PubMed

    Yang, Lun; Price, Elvin T; Chang, Ching-Wei; Li, Yan; Huang, Ying; Guo, Li-Wu; Guo, Yongli; Kaput, Jim; Shi, Leming; Ning, Baitang

    2013-01-01

    Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs) in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in the expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineates the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.

  10. o-p′-DDT-mediated uterotrophy and gene expression in immature C57BL/6 mice and Sprague–Dawley rats

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kwekel, Joshua C.; Forgacs, Agnes L.; Center for Integrative Toxicology, Michigan State University, East Lansing, MI

    1,1,1-Trichloro-2,2-bis(2-chlorophenyl-4-chlorophenyl)ethane (o,p′-DDT) is an organochlorine pesticide and endocrine disruptor known to activate the estrogen receptor. Comprehensive ligand- and species-comparative dose- and time-dependent studies were conducted to systematically assess the uterine physiological, morphological and gene expression responses elicited by o,p′-DDT and ethynyl estradiol (EE) in immature ovariectomized C57BL/6 mice and Sprague–Dawley rats. Custom cDNA microarrays were used to identify conserved and divergent differential gene expression responses. A total of 1256 genes were differentially expressed by both ligands in both species, 559 of which exhibited similar temporal expression profiles suggesting that o,p′-DDT elicits estrogenic effects at high doses when compared to EE.more » However, 51 genes exhibited species-specific uterine expression elicited by o,p′-DDT. For example, carbonic anhydrase 2 exhibited species- and ligand-divergent expression as confirmed by quantitative real-time PCR. The identification of comparable temporal phenotypic responses linked to gene expression demonstrates that systematic comparative gene expression assessments are valuable for elucidating conserved and divergent estrogen signaling mechanisms in rodent uterotrophy. - Highlights: • o,p′-DDT and enthynyl estradiol (EE) both elicit uterotrophy in mice and rats. • o,p′-DDT and EE have different kinetics in uterine wet weight induction. • o,p′-DDT elicited stromal hypertrophy in rats but myometrial hypertrophy in mice. • 1256 genes were differentially expressed by both ligands in both species. • Only 51 genes had species-specific uterine expression.« less

  11. Differences in brain gene expression between sleep and waking as revealed by mRNA differential display and cDNA microarray technology.

    PubMed

    Cirelli, C; Tononi, G

    1999-06-01

    The consequences of sleep and sleep deprivation at the molecular level are largely unexplored. Knowledge of such molecular events is essential to understand the restorative processes occurring during sleep as well as the cellular mechanisms of sleep regulation. Here we review the available data about changes in neural gene expression across different behavioural states using candidate gene approaches such as in situ hybridization and immunocytochemistry. We then describe new techniques for systematic screening of gene expression in the brain, such as subtractive hybridization, mRNA differential display, and cDNA microarray technology, outlining advantages and disadvantages of these methods. Finally, we summarize our initial results of a systematic screening of gene expression in the rat brain across behavioural states using mRNA differential display and cDNA microarray technology. The expression pattern of approximately 7000 genes was analysed in the cerebral cortex of rats after 3 h of spontaneous sleep, 3 h of spontaneous waking, or 3 h of sleep deprivation. While the majority of transcripts were expressed at the same level among these three conditions, 14 mRNAs were modulated by sleep and waking. Six transcripts, four more expressed in waking and two more expressed in sleep, corresponded to novel genes. The eight known transcripts were all expressed at higher levels in waking than in sleep and included transcription factors and mitochondrial genes. A possible role for these known transcripts in mediating neural plasticity during waking is discussed.

  12. Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

    PubMed

    Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

    2017-09-01

    The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative, quantitative portrait of the relative, typical gene‑expression profile in the form of searchable database tables.

  13. The low noise limit in gene expression

    DOE PAGES

    Dar, Roy D.; Weinberger, Leor S.; Cox, Chris D.; ...

    2015-10-21

    Protein noise measurements are increasingly used to elucidate biophysical parameters. Unfortunately noise analyses are often at odds with directly measured parameters. Here we show that these inconsistencies arise from two problematic analytical choices: (i) the assumption that protein translation rate is invariant for different proteins of different abundances, which has inadvertently led to (ii) the assumption that a large constitutive extrinsic noise sets the low noise limit in gene expression. While growing evidence suggests that transcriptional bursting may set the low noise limit, variability in translational bursting has been largely ignored. We show that genome-wide systematic variation in translational efficiencymore » can-and in the case of E. coli does-control the low noise limit in gene expression. Therefore constitutive extrinsic noise is small and only plays a role in the absence of a systematic variation in translational efficiency. Lastly, these results show the existence of two distinct expression noise patterns: (1) a global noise floor uniformly imposed on all genes by expression bursting; and (2) high noise distributed to only a select group of genes.« less

  14. SZDB: A Database for Schizophrenia Genetic Research

    PubMed Central

    Wu, Yong; Yao, Yong-Gang

    2017-01-01

    Abstract Schizophrenia (SZ) is a debilitating brain disorder with a complex genetic architecture. Genetic studies, especially recent genome-wide association studies (GWAS), have identified multiple variants (loci) conferring risk to SZ. However, how to efficiently extract meaningful biological information from bulk genetic findings of SZ remains a major challenge. There is a pressing need to integrate multiple layers of data from various sources, eg, genetic findings from GWAS, copy number variations (CNVs), association and linkage studies, gene expression, protein–protein interaction (PPI), co-expression, expression quantitative trait loci (eQTL), and Encyclopedia of DNA Elements (ENCODE) data, to provide a comprehensive resource to facilitate the translation of genetic findings into SZ molecular diagnosis and mechanism study. Here we developed the SZDB database (http://www.szdb.org/), a comprehensive resource for SZ research. SZ genetic data, gene expression data, network-based data, brain eQTL data, and SNP function annotation information were systematically extracted, curated and deposited in SZDB. In-depth analyses and systematic integration were performed to identify top prioritized SZ genes and enriched pathways. Multiple types of data from various layers of SZ research were systematically integrated and deposited in SZDB. In-depth data analyses and integration identified top prioritized SZ genes and enriched pathways. We further showed that genes implicated in SZ are highly co-expressed in human brain and proteins encoded by the prioritized SZ risk genes are significantly interacted. The user-friendly SZDB provides high-confidence candidate variants and genes for further functional characterization. More important, SZDB provides convenient online tools for data search and browse, data integration, and customized data analyses. PMID:27451428

  15. Systematic identification of genes involved in divergent skeletal muscle growth rates of broiler and layer chickens.

    PubMed

    Zheng, Qi; Zhang, Yong; Chen, Ying; Yang, Ning; Wang, Xiu-Jie; Zhu, Dahai

    2009-02-22

    The genetic closeness and divergent muscle growth rates of broilers and layers make them great models for myogenesis study. In order to discover the molecular mechanisms determining the divergent muscle growth rates and muscle mass control in different chicken lines, we systematically identified differentially expressed genes between broiler and layer skeletal muscle cells during different developmental stages by microarray hybridization experiment. Taken together, 543 differentially expressed genes were identified between broilers and layers across different developmental stages. We found that differential regulation of slow-type muscle gene expression, satellite cell proliferation and differentiation, protein degradation rate and genes in some metabolic pathways could give great contributions to the divergent muscle growth rates of the two chicken lines. Interestingly, the expression profiles of a few differentially expressed genes were positively or negatively correlated with the growth rates of broilers and layers, indicating that those genes may function in regulating muscle growth during development. The multiple muscle cell growth regulatory processes identified by our study implied that complicated molecular networks involved in the regulation of chicken muscle growth. These findings will not only offer genetic information for identifying candidate genes for chicken breeding, but also provide new clues for deciphering mechanisms underlining muscle development in vertebrates.

  16. Genetic Variation and Gene Expression in Antioxidant-Related Enzymes and Risk of Chronic Obstructive Pulmonary Disease: A Systematic Review

    PubMed Central

    Bentley, Amy R; Emrani, Parastu; Cassano, Patricia A

    2011-01-01

    Observational epidemiologic studies of dietary antioxidant intake, serum antioxidant concentration, and lung outcomes suggest that lower levels of antioxidant defenses are associated with decreased lung function. Another approach to understanding the role of oxidant/antioxidant imbalance in risk of Chronic Obstructive Pulmonary Disease (COPD) is to investigate the role of genetic variation in antioxidant enzymes, and indeed family-based studies suggest a heritable component to lung disease. Many studies of the genes encoding antioxidant enzymes have considered COPD or COPD-related outcomes, and a systematic review is needed to summarise the evidence to date, and to provide insights for further research. Genetic association studies of antioxidant enzymes and COPD/COPD-related traits, and comparative gene expression studies with disease or smoking as the exposure were systematically identified and reviewed. Antioxidant enzymes considered included enzymes involved in glutathione (GSH) metabolism, in the thioredoxin (TXN) system, superoxide dismutases (SOD), and catalase (CAT). A total of 29 genetic association and 15 comparative gene expression studies met the inclusion criteria. The strongest and most consistent effects were in the genes GCL, GSTM1, GSTP1, and SOD3. This review also highlights the lack of studies for genes of interest, particularly GSR, GGT, and those related to TXN. There were limited opportunities to evaluate a gene’s contribution to disease risk through a synthesis of results from different study designs, as the majority of studies considered either association of sequence variants with disease or effect of disease on gene expression. Network-driven approaches that consider potential interaction between genes and amoung genes, smoke exposure, and antioxidant intake are needed to fully characterise the role of oxidant/antioxidant balance in pathogenesis. PMID:18566111

  17. Gene expression in obstetric antiphospholipid syndrome: a systematic review.

    PubMed

    Muhammad Aliff, M; Muhammad Shazwan, S; Nur Fariha, M M; Hayati, A R; Nur Syahrina, A R; Maizatul Azma, M; Nazefah, A H; Jameela, S; Asral Wirda, A A

    2016-12-01

    Antiphospholipid syndrome (APS) is a multisystem disease that may present as venous or arterial thrombosis and/or pregnancy complications with the presence of antiphospholipid antibodies. Until today, heterogeneity of pathogenic mechanism fits well with various clinical manifestations. Moreover, previous studies have indicated that genes are differentially expressed between normal and in the disease state. Hence, this study systematically searched the literature on human gene expression that was differentially expressed in Obstetric APS. Electronic search was performed until 31st March 2015 through PubMed and Embase databases; where the following Medical Subject Heading (MeSH) terms were used and they had been specified as the primary focus of the articles; gene, antiphospholipid, obstetric, and pregnancy in the title or abstract. From 502 studies retrieved from the search, only original publications that had performed gene expression analyses of human placental tissue that reported on differentially expressed gene in pregnancies with Obstetric APS were included. Two reviewers independently scrutinized the titles and the abstracts before examining the eligibility of studies that met the inclusion criteria. For each study; diagnostic criteria for APS, method for analysis, and the gene signature were extracted independently by two reviewers. The genes listed were further analysed with the DAVID and the KEGG pathways. Three eligible gene expression studies involving obstetric APS, comprising the datasets on gene expression, were identified. All three studies showed a reduction in transcript expression on PRL, STAT5, TF, DAF, ABCA1, and HBEGF in Obstetric APS. The high enrichment score for functionality in DAVID had been positive regulation of cell proliferation. Meanwhile, pertaining to the KEGG pathway, two pathways were associated with some of the listed genes, which were ErBb signalling pathway and JAK-STAT signalling pathway. Ultimately, studies on a genetic level have the potential to provide new insights into the regulation and to widen the basis for identification of changes in the mechanism of Obstetric APS.

  18. Genomic identification, characterization and differential expression analysis of SBP-box gene family in Brassica napus.

    PubMed

    Cheng, Hongtao; Hao, Mengyu; Wang, Wenxiang; Mei, Desheng; Tong, Chaobo; Wang, Hui; Liu, Jia; Fu, Li; Hu, Qiong

    2016-09-08

    SBP-box genes belong to one of the largest families of transcription factors. Though members of this family have been characterized to be important regulators of diverse biological processes, information of SBP-box genes in the third most important oilseed crop Brassica napus is largely undefined. In the present study, by whole genome bioinformatics analysis and transcriptional profiling, 58 putative members of SBP-box gene family in oilseed rape (Brassica napus L.) were identified and their expression pattern in different tissues as well as possible interaction with miRNAs were analyzed. In addition, B. napus lines with contrasting branch angle were used for investigating the involvement of SBP-box genes in plant architecture regulation. Detailed gene information, including genomic organization, structural feature, conserved domain and phylogenetic relationship of the genes were systematically characterized. By phylogenetic analysis, BnaSBP proteins were classified into eight distinct groups representing the clear orthologous relationships to their family members in Arabidopsis and rice. Expression analysis in twelve tissues including vegetative and reproductive organs showed different expression patterns among the SBP-box genes and a number of the genes exhibit tissue specific expression, indicating their diverse functions involved in the developmental process. Forty-four SBP-box genes were ascertained to contain the putative miR156 binding site, with 30 and 14 of the genes targeted by miR156 at the coding and 3'UTR region, respectively. Relative expression level of miR156 is varied across tissues. Different expression pattern of some BnaSBP genes and the negative correlation of transcription levels between miR156 and its target BnaSBP gene were observed in lines with different branch angle. Taken together, this study represents the first systematic analysis of the SBP-box gene family in Brassica napus. The data presented here provides base foundation for understanding the crucial roles of BnaSBP genes in plant development and other biological processes.

  19. Identification and Validation of Reference Genes for RT-qPCR Analysis in Non-Heading Chinese Cabbage Flowers

    PubMed Central

    Wang, Cheng; Cui, Hong-Mi; Huang, Tian-Hong; Liu, Tong-Kun; Hou, Xi-Lin; Li, Ying

    2016-01-01

    Non-heading Chinese cabbage (Brassica rapa ssp. chinensis Makino) is an important vegetable member of Brassica rapa crops. It exhibits a typical sporophytic self-incompatibility (SI) system and is an ideal model plant to explore the mechanism of SI. Gene expression research are frequently used to unravel the complex genetic mechanism and in such studies appropriate reference selection is vital. Validation of reference genes have neither been conducted in Brassica rapa flowers nor in SI trait. In this study, 13 candidate reference genes were selected and examined systematically in 96 non-heading Chinese cabbage flower samples that represent four strategic groups in compatible and self-incompatible lines of non-heading Chinese cabbage. Two RT-qPCR analysis software, geNorm and NormFinder, were used to evaluate the expression stability of these genes systematically. Results revealed that best-ranked references genes should be selected according to specific sample subsets. DNAJ, UKN1, and PP2A were identified as the most stable reference genes among all samples. Moreover, our research further revealed that the widely used reference genes, CYP and ACP, were the least suitable reference genes in most non-heading Chinese cabbage flower sample sets. To further validate the suitability of the reference genes identified in this study, the expression level of SRK and Exo70A1 genes which play important roles in regulating interaction between pollen and stigma were studied. Our study presented the first systematic study of reference gene(s) selection for SI study and provided guidelines to obtain more accurate RT-qPCR results in non-heading Chinese cabbage. PMID:27375663

  20. Systematic comparison of co-expression of multiple recombinant thermophilic enzymes in Escherichia coli BL21(DE3).

    PubMed

    Chen, Hui; Huang, Rui; Zhang, Y-H Percival

    2017-06-01

    The precise control of multiple heterologous enzyme expression levels in one Escherichia coli strain is important for cascade biocatalysis, metabolic engineering, synthetic biology, natural product synthesis, and studies of complexed proteins. We systematically investigated the co-expression of up to four thermophilic enzymes (i.e., α-glucan phosphorylase (αGP), phosphoglucomutase (PGM), glucose 6-phosphate dehydrogenase (G6PDH), and 6-phosphogluconate dehydrogenase (6PGDH)) in E. coli BL21(DE3) by adding T7 promoter or T7 terminator of each gene for multiple genes in tandem, changing gene alignment, and comparing one or two plasmid systems. It was found that the addition of T7 terminator after each gene was useful to decrease the influence of the upstream gene. The co-expression of the four enzymes in E. coli BL21(DE3) was demonstrated to generate two NADPH molecules from one glucose unit of maltodextrin, where NADPH was oxidized to convert xylose to xylitol. The best four-gene co-expression system was based on two plasmids (pET and pACYC) which harbored two genes. As a result, apparent enzymatic activities of the four enzymes were regulated to be at similar levels and the overall four-enzyme activity was the highest based on the formation of xylitol. This study provides useful information for the precise control of multi-enzyme-coordinated expression in E. coli BL21(DE3).

  1. Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies

    PubMed Central

    2014-01-01

    Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878

  2. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    PubMed

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  3. Identification of AUXIN RESPONSE FACTOR gene family from Prunus sibirica and its expression analysis during mesocarp and kernel development.

    PubMed

    Niu, Jun; Bi, Quanxin; Deng, Shuya; Chen, Huiping; Yu, Haiyan; Wang, Libing; Lin, Shanzhi

    2018-01-24

    Auxin response factors (ARFs) in auxin signaling pathway are an important component that can regulate the transcription of auxin-responsive genes involved in almost all aspects of plant growth and development. To our knowledge, the comprehensive and systematic characterization of ARF genes has never been reported in Prunus sibirica, a novel woody biodiesel feedstock in China. In this study, we identified 14 PsARF genes with a perfect open reading frame (ORF) in P. sibirica by using its previous transcriptomic data. Conserved motif analysis showed that all identified PsARF proteins had typical DNA-binding and ARF domain, but 5 members (PsARF3, 8 10, 16 and 17) lacked the dimerization domain. Phylogenetic analysis of the ARF proteins generated from various plant species indicated that ARFs could be categorized into 4 major groups (Class I, II, III and IV), in which all identified ARFs from P. sibirica showed a closest relationship with those from P. mume. Comparison of the expression profiles of 14 PsARF genes in different developmental stages of Siberian apricot mesocarp (SAM) and kernel (SAK) reflected distinct temporal or spatial expression patterns for PsARF genes. Additionally, based on the expressed data from fruit and seed development of multiple plant species, we identified 1514 ARF-correlated genes using weighted gene co-expression network analysis (WGCNA). And the major portion of ARF-correlated gene was characterized to be involved in protein, nucleic acid and carbohydrate metabolic, transport and regulatory processes. In summary, we systematically and comprehensively analyzed the structure, expression pattern and co-expression network of ARF gene family in P. sibirica. All our findings provide theoretical foundation for the PsARF gene family and will pave the way for elucidating the precise role of PsARF genes in SAM and SAK development.

  4. Quantifying translational coupling in E. coli synthetic operons using RBS modulation and fluorescent reporters.

    PubMed

    Levin-Karp, Ayelet; Barenholz, Uri; Bareia, Tasneem; Dayagi, Michal; Zelcbuch, Lior; Antonovsky, Niv; Noor, Elad; Milo, Ron

    2013-06-21

    Translational coupling is the interdependence of translation efficiency of neighboring genes encoded within an operon. The degree of coupling may be quantified by measuring how the translation rate of a gene is modulated by the translation rate of its upstream gene. Translational coupling was observed in prokaryotic operons several decades ago, but the quantitative range of modulation translational coupling leads to and the factors governing this modulation were only partially characterized. In this study, we systematically quantify and characterize translational coupling in E. coli synthetic operons using a library of plasmids carrying fluorescent reporter genes that are controlled by a set of different ribosome binding site (RBS) sequences. The downstream gene expression level is found to be enhanced by the upstream gene expression via translational coupling with the enhancement level varying from almost no coupling to over 10-fold depending on the upstream gene's sequence. Additionally, we find that the level of translational coupling in our system is similar between the second and third locations in the operon. The coupling depends on the distance between the stop codon of the upstream gene and the start codon of the downstream gene. This study is the first to systematically and quantitatively characterize translational coupling in a synthetic E. coli operon. Our analysis will be useful in accurate manipulation of gene expression in synthetic biology and serves as a step toward understanding the mechanisms involved in translational expression modulation.

  5. Evidence of Dynamically Dysregulated Gene Expression Pathways in Hyperresponsive B Cells from African American Lupus Patients

    PubMed Central

    Dozmorov, Igor; Dominguez, Nicolas; Sestak, Andrea L.; Robertson, Julie M.; Harley, John B.; James, Judith A.; Guthridge, Joel M.

    2013-01-01

    Recent application of gene expression profiling to the immune system has shown a great potential for characterization of complex regulatory processes. It is becoming increasingly important to characterize functional systems through multigene interactions to provide valuable insights into differences between healthy controls and autoimmune patients. Here we apply an original systematic approach to the analysis of changes in regulatory gene interconnections between in Epstein-Barr virus transformed hyperresponsive B cells from SLE patients and normal control B cells. Both traditional analysis of differential gene expression and analysis of the dynamics of gene expression variations were performed in combination to establish model networks of functional gene expression. This Pathway Dysregulation Analysis identified known transcription factors and transcriptional regulators activated uniquely in stimulated B cells from SLE patients. PMID:23977035

  6. Distinct Gene Expression Profiles in Peripheral Blood Mononuclear Cells from Patients Infected with Vaccinia Virus, Yellow Fever 17D Virus, or Upper Respiratory Infections Running Title: PBMC Expression Response to Viral Agents

    PubMed Central

    Scherer, Christina A.; Magness, Charles L.; Steiger, Kathryn V.; Poitinger, Nicholas D.; Caputo, Christine M.; Miner, Douglas G.; Winokur, Patricia L.; Klinzman, Donna; McKee, Janice; Pilar, Christine; Ward, Patricia A.; Gillham, Martha H.; Haulman, N. Jean; Stapleton, Jack T.; Iadonato, Shawn P.

    2007-01-01

    Gene expression in human peripheral blood mononuclear cells was systematically evaluated following smallpox and yellow fever vaccination, and naturally occurring upper respiratory infection (URI). All three infections were characterized by the induction of many interferon stimulated genes, as well as enhanced expression of genes involved in proteolysis and antigen presentation. Vaccinia infection was also characterized by a distinct expression signature composed of up-regulation of monocyte response genes, with repression of genes expressed by B and T-cells. In contrast, the yellow fever host response was characterized by a suppression of ribosomal and translation factors, distinguishing this infection from vaccinia and URI. No significant URI-specific signature was observed, perhaps reflecting greater heterogeneity in the study population and etiological agents. Taken together, these data suggest that specific host gene expression signatures may be identified that distinguish one or a small number of virus agents. PMID:17651872

  7. Statistical Inference and Reverse Engineering of Gene Regulatory Networks from Observational Expression Data

    PubMed Central

    Emmert-Streib, Frank; Glazko, Galina V.; Altay, Gökmen; de Matos Simoes, Ricardo

    2012-01-01

    In this paper, we present a systematic and conceptual overview of methods for inferring gene regulatory networks from observational gene expression data. Further, we discuss two classic approaches to infer causal structures and compare them with contemporary methods by providing a conceptual categorization thereof. We complement the above by surveying global and local evaluation measures for assessing the performance of inference algorithms. PMID:22408642

  8. Bioinformatics approach reveals systematic mechanism underlying lung adenocarcinoma.

    PubMed

    Wu, Xiya; Zhang, Wei; Hu, Yunhua; Yi, Xianghua

    2015-01-01

    The purpose of this work was to explore the systematic molecular mechanism of lung adenocarcinoma and gain a deeper insight into it. Comprehensive bioinformatics methods were applied. Initially, significant differentially expressed genes (DEGs) were analyzed from the Affymetrix microarray data (GSE27262) deposited in the Gene Expression Omnibus (GEO). Subsequently, gene ontology (GO) analysis was performed using online Database for Annotation, Visualization and Integration Discovery (DAVID) software. Finally, significant pathway crosstalk was investigated based on the information derived from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. According to our results, the N-terminal globular domain of the type X collagen (COL10A1) gene and transmembrane protein 100 (TMEM100) gene were identified to be the most significant DEGs in tumor tissue compared with the adjacent normal tissues. The main GO categories were biological process, cellular component and molecular function. In addition, the crosstalk was significantly different between non-small cell lung cancer pathways and inositol phosphate metabolism pathway, focal adhesion signal pathway, vascular smooth muscle contraction signal pathway, peroxisome proliferator-activated receptor (PPAR) signaling pathway and calcium signaling pathway in tumor. Dysfunctional genes and pathways may play key roles in the progression and development of lung adenocarcinoma. Our data provide a systematic perspective for understanding this mechanism and may be helpful in discovering an effective treatment for lung adenocarcinoma.

  9. Validation of miRNA genes suitable as reference genes in qPCR analyses of miRNA gene expression in Atlantic salmon (Salmo salar).

    PubMed

    Johansen, Ilona; Andreassen, Rune

    2014-12-23

    MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the post-transcriptional level. They play important roles by regulating genes that control multiple biological processes, and recent years there has been an increased interest in studying miRNA genes and miRNA gene expression. The most common method applied to study gene expression of single genes is quantitative PCR (qPCR). However, before expression of mature miRNAs can be studied robust qPCR methods (miRNA-qPCR) must be developed. This includes identification and validation of suitable reference genes. We are particularly interested in Atlantic salmon (Salmo salar). This is an economically important aquaculture species, but no reference genes dedicated for use in miRNA-qPCR methods has been validated for this species. Our aim was, therefore, to identify suitable reference genes for miRNA-qPCR methods in Salmo salar. We used a systematic approach where we utilized similar studies in other species, some biological criteria, results from deep sequencing of small RNAs and, finally, experimental validation of candidate reference genes by qPCR to identify the most suitable reference genes. Ssa-miR-25-3p was identified as most suitable single reference gene. The best combinations of two reference genes were ssa-miR-25-3p and ssa-miR-455-5p. These two genes were constitutively and stably expressed across many different tissues. Furthermore, infectious salmon anaemia did not seem to affect their expression levels. These genes were amplified with high specificity, good efficiency and the qPCR assays showed a good linearity when applying a simple cybergreen miRNA-PCR method using miRNA gene specific forward primers. We have identified suitable reference genes for miRNA-qPCR in Atlantic salmon. These results will greatly facilitate further studies on miRNA genes in this species. The reference genes identified are conserved genes that are identical in their mature sequence in many aquaculture species. Therefore, they may also be suitable as reference genes in other teleosts. Finally, the systematic approach used in our study successfully identified suitable reference genes, suggesting that this may be a useful strategy to apply in similar validation studies in other aquaculture species.

  10. Effect of promoter architecture on the cell-to-cell variability in gene expression.

    PubMed

    Sanchez, Alvaro; Garcia, Hernan G; Jones, Daniel; Phillips, Rob; Kondev, Jané

    2011-03-01

    According to recent experimental evidence, promoter architecture, defined by the number, strength and regulatory role of the operators that control transcription, plays a major role in determining the level of cell-to-cell variability in gene expression. These quantitative experiments call for a corresponding modeling effort that addresses the question of how changes in promoter architecture affect variability in gene expression in a systematic rather than case-by-case fashion. In this article we make such a systematic investigation, based on a microscopic model of gene regulation that incorporates stochastic effects. In particular, we show how operator strength and operator multiplicity affect this variability. We examine different modes of transcription factor binding to complex promoters (cooperative, independent, simultaneous) and how each of these affects the level of variability in transcriptional output from cell-to-cell. We propose that direct comparison between in vivo single-cell experiments and theoretical predictions for the moments of the probability distribution of mRNA number per cell can be used to test kinetic models of gene regulation. The emphasis of the discussion is on prokaryotic gene regulation, but our analysis can be extended to eukaryotic cells as well.

  11. Effect of Promoter Architecture on the Cell-to-Cell Variability in Gene Expression

    PubMed Central

    Sanchez, Alvaro; Garcia, Hernan G.; Jones, Daniel; Phillips, Rob; Kondev, Jané

    2011-01-01

    According to recent experimental evidence, promoter architecture, defined by the number, strength and regulatory role of the operators that control transcription, plays a major role in determining the level of cell-to-cell variability in gene expression. These quantitative experiments call for a corresponding modeling effort that addresses the question of how changes in promoter architecture affect variability in gene expression in a systematic rather than case-by-case fashion. In this article we make such a systematic investigation, based on a microscopic model of gene regulation that incorporates stochastic effects. In particular, we show how operator strength and operator multiplicity affect this variability. We examine different modes of transcription factor binding to complex promoters (cooperative, independent, simultaneous) and how each of these affects the level of variability in transcriptional output from cell-to-cell. We propose that direct comparison between in vivo single-cell experiments and theoretical predictions for the moments of the probability distribution of mRNA number per cell can be used to test kinetic models of gene regulation. The emphasis of the discussion is on prokaryotic gene regulation, but our analysis can be extended to eukaryotic cells as well. PMID:21390269

  12. Distinct gene expression profiles in peripheral blood mononuclear cells from patients infected with vaccinia virus, yellow fever 17D virus, or upper respiratory infections.

    PubMed

    Scherer, Christina A; Magness, Charles L; Steiger, Kathryn V; Poitinger, Nicholas D; Caputo, Christine M; Miner, Douglas G; Winokur, Patricia L; Klinzman, Donna; McKee, Janice; Pilar, Christine; Ward, Patricia A; Gillham, Martha H; Haulman, N Jean; Stapleton, Jack T; Iadonato, Shawn P

    2007-08-29

    Gene expression in human peripheral blood mononuclear cells was systematically evaluated following smallpox and yellow fever vaccination, and naturally occurring upper respiratory infection (URI). All three infections were characterized by the induction of many interferon stimulated genes, as well as enhanced expression of genes involved in proteolysis and antigen presentation. Vaccinia infection was also characterized by a distinct expression signature composed of up-regulation of monocyte response genes, with repression of genes expressed by B and T-cells. In contrast, the yellow fever host response was characterized by a suppression of ribosomal and translation factors, distinguishing this infection from vaccinia and URI. No significant URI-specific signature was observed, perhaps reflecting greater heterogeneity in the study population and etiological agents. Taken together, these data suggest that specific host gene expression signatures may be identified that distinguish one or a small number of virus agents.

  13. Validation of Reference Genes for Gene Expression Studies in Virus-Infected Nicotiana benthamiana Using Quantitative Real-Time PCR

    PubMed Central

    Han, Chenggui; Yu, Jialin; Li, Dawei; Zhang, Yongliang

    2012-01-01

    Nicotiana benthamiana is the most widely-used experimental host in plant virology. The recent release of the draft genome sequence for N. benthamiana consolidates its role as a model for plant–pathogen interactions. Quantitative real-time PCR (qPCR) is commonly employed for quantitative gene expression analysis. For valid qPCR analysis, accurate normalisation of gene expression against an appropriate internal control is required. Yet there has been little systematic investigation of reference gene stability in N. benthamiana under conditions of viral infections. In this study, the expression profiles of 16 commonly used housekeeping genes (GAPDH, 18S, EF1α, SAMD, L23, UK, PP2A, APR, UBI3, SAND, ACT, TUB, GBP, F-BOX, PPR and TIP41) were determined in N. benthamiana and those with acceptable expression levels were further selected for transcript stability analysis by qPCR of complementary DNA prepared from N. benthamiana leaf tissue infected with one of five RNA plant viruses (Tobacco necrosis virus A, Beet black scorch virus, Beet necrotic yellow vein virus, Barley stripe mosaic virus and Potato virus X). Gene stability was analysed in parallel by three commonly-used dedicated algorithms: geNorm, NormFinder and BestKeeper. Statistical analysis revealed that the PP2A, F-BOX and L23 genes were the most stable overall, and that the combination of these three genes was sufficient for accurate normalisation. In addition, the suitability of PP2A, F-BOX and L23 as reference genes was illustrated by expression-level analysis of AGO2 and RdR6 in virus-infected N. benthamiana leaves. This is the first study to systematically examine and evaluate the stability of different reference genes in N. benthamiana. Our results not only provide researchers studying these viruses a shortlist of potential housekeeping genes to use as normalisers for qPCR experiments, but should also guide the selection of appropriate reference genes for gene expression studies of N. benthamiana under other biotic and abiotic stress conditions. PMID:23029521

  14. What Is the Molecular Signature of Mind-Body Interventions? A Systematic Review of Gene Expression Changes Induced by Meditation and Related Practices.

    PubMed

    Buric, Ivana; Farias, Miguel; Jong, Jonathan; Mee, Christopher; Brazil, Inti A

    2017-01-01

    There is considerable evidence for the effectiveness of mind-body interventions (MBIs) in improving mental and physical health, but the molecular mechanisms of these benefits remain poorly understood. One hypothesis is that MBIs reverse expression of genes involved in inflammatory reactions that are induced by stress. This systematic review was conducted to examine changes in gene expression that occur after MBIs and to explore how these molecular changes are related to health. We searched PubMed throughout September 2016 to look for studies that have used gene expression analysis in MBIs (i.e., mindfulness, yoga, Tai Chi, Qigong, relaxation response, and breath regulation). Due to the limited quantity of studies, we included both clinical and non-clinical samples with any type of research design. Eighteen relevant studies were retrieved and analyzed. Overall, the studies indicate that these practices are associated with a downregulation of nuclear factor kappa B pathway; this is the opposite of the effects of chronic stress on gene expression and suggests that MBI practices may lead to a reduced risk of inflammation-related diseases. However, it is unclear how the effects of MBIs compare to other healthy interventions such as exercise or nutrition due to the small number of available studies. More research is required to be able to understand the effects of MBIs at the molecular level.

  15. A Systematic Investigation into Aging Related Genes in Brain and Their Relationship with Alzheimer's Disease.

    PubMed

    Meng, Guofeng; Zhong, Xiaoyan; Mei, Hongkang

    2016-01-01

    Aging, as a complex biological process, is accompanied by the accumulation of functional loses at different levels, which makes age to be the biggest risk factor to many neurological diseases. Even following decades of investigation, the process of aging is still far from being fully understood, especially at a systematic level. In this study, we identified aging related genes in brain by collecting the ones with sustained and consistent gene expression or DNA methylation changes in the aging process. Functional analysis with Gene Ontology to these genes suggested transcriptional regulators to be the most affected genes in the aging process. Transcription regulation analysis found some transcription factors, especially Specificity Protein 1 (SP1), to play important roles in regulating aging related gene expression. Module-based functional analysis indicated these genes to be associated with many well-known aging related pathways, supporting the validity of our approach to select aging related genes. Finally, we investigated the roles of aging related genes on Alzheimer's Disease (AD). We found that aging and AD related genes both involved some common pathways, which provided a possible explanation why aging made the brain more vulnerable to Alzheimer's Disease.

  16. Correlated gene expression and anatomical communication support synchronized brain activity in the mouse functional connectome.

    PubMed

    Mills, Brian D; Grayson, David S; Shunmugavel, Anandakumar; Miranda-Dominguez, Oscar; Feczko, Eric; Earl, Eric; Neve, Kim; Fair, Damien A

    2018-05-22

    Cognition and behavior depend on synchronized intrinsic brain activity that is organized into functional networks across the brain. Research has investigated how anatomical connectivity both shapes and is shaped by these networks, but not how anatomical connectivity interacts with intra-areal molecular properties to drive functional connectivity. Here, we present a novel linear model to explain functional connectivity by integrating systematically obtained measurements of axonal connectivity, gene expression, and resting state functional connectivity MRI in the mouse brain. The model suggests that functional connectivity arises from both anatomical links and inter-areal similarities in gene expression. By estimating these effects, we identify anatomical modules in which correlated gene expression and anatomical connectivity support functional connectivity. Along with providing evidence that not all genes equally contribute to functional connectivity, this research establishes new insights regarding the biological underpinnings of coordinated brain activity measured by BOLD fMRI. SIGNIFICANCE STATEMENT Efforts at characterizing the functional connectome with fMRI have risen exponentially over the last decade. Yet despite this rise, the biological underpinnings of these functional measurements are still largely unknown. The current report begins to fill this void by investigating the molecular underpinnings of the functional connectome through an integration of systematically obtained structural information and gene expression data throughout the rodent brain. We find that both white matter connectivity and similarity in regional gene expression relate to resting state functional connectivity. The current report furthers our understanding of the biological underpinnings of the functional connectome and provides a linear model that can be utilized to streamline preclinical animal studies of disease. Copyright © 2018 the authors.

  17. Systematic Integration of Brain eQTL and GWAS Identifies ZNF323 as a Novel Schizophrenia Risk Gene and Suggests Recent Positive Selection Based on Compensatory Advantage on Pulmonary Function

    PubMed Central

    Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D.; Als, Thomas D.; van den Oord, Edwin J.; Aberg, Karolina A.; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G.; Nöthen, Markus M.; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang

    2015-01-01

    Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10–6). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10–6; single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10−10). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10–5 and P = 9.00×10–5, respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. PMID:25759474

  18. High-resolution gene expression data from blastoderm embryos of the scuttle fly Megaselia abdita

    PubMed Central

    Wotton, Karl R; Jiménez-Guri, Eva; Crombach, Anton; Cicin-Sain, Damjan; Jaeger, Johannes

    2015-01-01

    Gap genes are involved in segment determination during early development in dipteran insects (flies, midges, and mosquitoes). We carried out a systematic quantitative comparative analysis of the gap gene network across different dipteran species. Our work provides mechanistic insights into the evolution of this pattern-forming network. As a central component of our project, we created a high-resolution quantitative spatio-temporal data set of gap and maternal co-ordinate gene expression in the blastoderm embryo of the non-drosophilid scuttle fly, Megaselia abdita. Our data include expression patterns in both wild-type and RNAi-treated embryos. The data—covering 10 genes, 10 time points, and over 1,000 individual embryos—consist of original embryo images, quantified expression profiles, extracted positions of expression boundaries, and integrated expression patterns, plus metadata and intermediate processing steps. These data provide a valuable resource for researchers interested in the comparative study of gene regulatory networks and pattern formation, an essential step towards a more quantitative and mechanistic understanding of developmental evolution. PMID:25977812

  19. Mapping cis- and trans-regulatory effects across multiple tissues in twins

    PubMed Central

    Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.

    2013-01-01

    Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192

  20. HIV promoter integration site primarily modulates transcriptional burst size rather than frequency.

    PubMed

    Skupsky, Ron; Burnett, John C; Foley, Jonathan E; Schaffer, David V; Arkin, Adam P

    2010-09-30

    Mammalian gene expression patterns, and their variability across populations of cells, are regulated by factors specific to each gene in concert with its surrounding cellular and genomic environment. Lentiviruses such as HIV integrate their genomes into semi-random genomic locations in the cells they infect, and the resulting viral gene expression provides a natural system to dissect the contributions of genomic environment to transcriptional regulation. Previously, we showed that expression heterogeneity and its modulation by specific host factors at HIV integration sites are key determinants of infected-cell fate and a possible source of latent infections. Here, we assess the integration context dependence of expression heterogeneity from diverse single integrations of a HIV-promoter/GFP-reporter cassette in Jurkat T-cells. Systematically fitting a stochastic model of gene expression to our data reveals an underlying transcriptional dynamic, by which multiple transcripts are produced during short, infrequent bursts, that quantitatively accounts for the wide, highly skewed protein expression distributions observed in each of our clonal cell populations. Interestingly, we find that the size of transcriptional bursts is the primary systematic covariate over integration sites, varying from a few to tens of transcripts across integration sites, and correlating well with mean expression. In contrast, burst frequencies are scattered about a typical value of several per cell-division time and demonstrate little correlation with the clonal means. This pattern of modulation generates consistently noisy distributions over the sampled integration positions, with large expression variability relative to the mean maintained even for the most productive integrations, and could contribute to specifying heterogeneous, integration-site-dependent viral production patterns in HIV-infected cells. Genomic environment thus emerges as a significant control parameter for gene expression variation that may contribute to structuring mammalian genomes, as well as be exploited for survival by integrating viruses.

  1. Weighted gene co-expression network analysis reveals potential genes involved in early metamorphosis process in sea cucumber Apostichopus japonicus.

    PubMed

    Li, Yongxin; Kikuchi, Mani; Li, Xueyan; Gao, Qionghua; Xiong, Zijun; Ren, Yandong; Zhao, Ruoping; Mao, Bingyu; Kondo, Mariko; Irie, Naoki; Wang, Wen

    2018-01-01

    Sea cucumbers, one main class of Echinoderms, have a very fast and drastic metamorphosis process during their development. However, the molecular basis under this process remains largely unknown. Here we systematically examined the gene expression profiles of Japanese common sea cucumber (Apostichopus japonicus) for the first time by RNA sequencing across 16 developmental time points from fertilized egg to juvenile stage. Based on the weighted gene co-expression network analysis (WGCNA), we identified 21 modules. Among them, MEdarkmagenta was highly expressed and correlated with the early metamorphosis process from late auricularia to doliolaria larva. Furthermore, gene enrichment and differentially expressed gene analysis identified several genes in the module that may play key roles in the metamorphosis process. Our results not only provide a molecular basis for experimentally studying the development and morphological complexity of sea cucumber, but also lay a foundation for improving its emergence rate. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. Dynamic CRM occupancy reflects a temporal map of developmental progression.

    PubMed

    Wilczyński, Bartek; Furlong, Eileen E M

    2010-06-22

    Development is driven by tightly coordinated spatio-temporal patterns of gene expression, which are initiated through the action of transcription factors (TFs) binding to cis-regulatory modules (CRMs). Although many studies have investigated how spatial patterns arise, precise temporal control of gene expression is less well understood. Here, we show that dynamic changes in the timing of CRM occupancy is a prevalent feature common to all TFs examined in a developmental ChIP time course to date. CRMs exhibit complex binding patterns that cannot be explained by the sequence motifs or expression of the TFs themselves. The temporal changes in TF binding are highly correlated with dynamic patterns of target gene expression, which in turn reflect transitions in cellular function during different stages of development. Thus, it is not only the timing of a TF's expression, but also its temporal occupancy in refined time windows, which determines temporal gene expression. Systematic measurement of dynamic CRM occupancy may therefore serve as a powerful method to decode dynamic changes in gene expression driving developmental progression.

  3. Evaluation of RNAi and CRISPR technologies by large-scale gene expression profiling in the Connectivity Map.

    PubMed

    Smith, Ian; Greenside, Peyton G; Natoli, Ted; Lahr, David L; Wadden, David; Tirosh, Itay; Narayan, Rajiv; Root, David E; Golub, Todd R; Subramanian, Aravind; Doench, John G

    2017-11-01

    The application of RNA interference (RNAi) to mammalian cells has provided the means to perform phenotypic screens to determine the functions of genes. Although RNAi has revolutionized loss-of-function genetic experiments, it has been difficult to systematically assess the prevalence and consequences of off-target effects. The Connectivity Map (CMAP) represents an unprecedented resource to study the gene expression consequences of expressing short hairpin RNAs (shRNAs). Analysis of signatures for over 13,000 shRNAs applied in 9 cell lines revealed that microRNA (miRNA)-like off-target effects of RNAi are far stronger and more pervasive than generally appreciated. We show that mitigating off-target effects is feasible in these datasets via computational methodologies to produce a consensus gene signature (CGS). In addition, we compared RNAi technology to clustered regularly interspaced short palindromic repeat (CRISPR)-based knockout by analysis of 373 single guide RNAs (sgRNAs) in 6 cells lines and show that the on-target efficacies are comparable, but CRISPR technology is far less susceptible to systematic off-target effects. These results will help guide the proper use and analysis of loss-of-function reagents for the determination of gene function.

  4. Function does not follow form in gene regulatory circuits.

    PubMed

    Payne, Joshua L; Wagner, Andreas

    2015-08-20

    Gene regulatory circuits are to the cell what arithmetic logic units are to the chip: fundamental components of information processing that map an input onto an output. Gene regulatory circuits come in many different forms, distinct structural configurations that determine who regulates whom. Studies that have focused on the gene expression patterns (functions) of circuits with a given structure (form) have examined just a few structures or gene expression patterns. Here, we use a computational model to exhaustively characterize the gene expression patterns of nearly 17 million three-gene circuits in order to systematically explore the relationship between circuit form and function. Three main conclusions emerge. First, function does not follow form. A circuit of any one structure can have between twelve and nearly thirty thousand distinct gene expression patterns. Second, and conversely, form does not follow function. Most gene expression patterns can be realized by more than one circuit structure. And third, multifunctionality severely constrains circuit form. The number of circuit structures able to drive multiple gene expression patterns decreases rapidly with the number of these patterns. These results indicate that it is generally not possible to infer circuit function from circuit form, or vice versa.

  5. Selecting and validating reference genes for quantitative real-time PCR in Plutella xylostella (L.).

    PubMed

    You, Yanchun; Xie, Miao; Vasseur, Liette; You, Minsheng

    2018-05-01

    Gene expression analysis provides important clues regarding gene functions, and quantitative real-time PCR (qRT-PCR) is a widely used method in gene expression studies. Reference genes are essential for normalizing and accurately assessing gene expression. In the present study, 16 candidate reference genes (ACTB, CyPA, EF1-α, GAPDH, HSP90, NDPk, RPL13a, RPL18, RPL19, RPL32, RPL4, RPL8, RPS13, RPS4, α-TUB, and β-TUB) from Plutella xylostella were selected to evaluate gene expression stability across different experimental conditions using five statistical algorithms (geNorm, NormFinder, Delta Ct, BestKeeper, and RefFinder). The results suggest that different reference genes or combinations of reference genes are suitable for normalization in gene expression studies of P. xylostella according to the different developmental stages, strains, tissues, and insecticide treatments. Based on the given experimental sets, the most stable reference genes were RPS4 across different developmental stages, RPL8 across different strains and tissues, and EF1-α across different insecticide treatments. A comprehensive and systematic assessment of potential reference genes for gene expression normalization is essential for post-genomic functional research in P. xylostella, a notorious pest with worldwide distribution and a high capacity to adapt and develop resistance to insecticides.

  6. The regulatory software of cellular metabolism.

    PubMed

    Segrè, Daniel

    2004-06-01

    Understanding the regulation of metabolic pathways in the cell is like unraveling the 'software' that is running on the 'hardware' of the metabolic network. Transcriptional regulation of enzymes is an important component of this software. A recent systematic analysis of metabolic gene-expression data in Saccharomyces cerevisiae reveals a complex modular organization of co-expressed genes, which could increase our ability to understand and engineer cellular metabolic functions.

  7. Pan-Cancer Analysis of the Mediator Complex Transcriptome Identifies CDK19 and CDK8 as Therapeutic Targets in Advanced Prostate Cancer.

    PubMed

    Brägelmann, Johannes; Klümper, Niklas; Offermann, Anne; von Mässenhausen, Anne; Böhm, Diana; Deng, Mario; Queisser, Angela; Sanders, Christine; Syring, Isabella; Merseburger, Axel S; Vogel, Wenzel; Sievers, Elisabeth; Vlasic, Ignacija; Carlsson, Jessica; Andrén, Ove; Brossart, Peter; Duensing, Stefan; Svensson, Maria A; Shaikhibrahim, Zaki; Kirfel, Jutta; Perner, Sven

    2017-04-01

    Purpose: The Mediator complex is a multiprotein assembly, which serves as a hub for diverse signaling pathways to regulate gene expression. Because gene expression is frequently altered in cancer, a systematic understanding of the Mediator complex in malignancies could foster the development of novel targeted therapeutic approaches. Experimental Design: We performed a systematic deconvolution of the Mediator subunit expression profiles across 23 cancer entities ( n = 8,568) using data from The Cancer Genome Atlas (TCGA). Prostate cancer-specific findings were validated in two publicly available gene expression cohorts and a large cohort of primary and advanced prostate cancer ( n = 622) stained by immunohistochemistry. The role of CDK19 and CDK8 was evaluated by siRNA-mediated gene knockdown and inhibitor treatment in prostate cancer cell lines with functional assays and gene expression analysis by RNAseq. Results: Cluster analysis of TCGA expression data segregated tumor entities, indicating tumor-type-specific Mediator complex compositions. Only prostate cancer was marked by high expression of CDK19 In primary prostate cancer, CDK19 was associated with increased aggressiveness and shorter disease-free survival. During cancer progression, highest levels of CDK19 and of its paralog CDK8 were present in metastases. In vitro , inhibition of CDK19 and CDK8 by knockdown or treatment with a selective CDK8/CDK19 inhibitor significantly decreased migration and invasion. Conclusions: Our analysis revealed distinct transcriptional expression profiles of the Mediator complex across cancer entities indicating differential modes of transcriptional regulation. Moreover, it identified CDK19 and CDK8 to be specifically overexpressed during prostate cancer progression, highlighting their potential as novel therapeutic targets in advanced prostate cancer. Clin Cancer Res; 23(7); 1829-40. ©2016 AACR . ©2016 American Association for Cancer Research.

  8. Genetic Network Inference: From Co-Expression Clustering to Reverse Engineering

    NASA Technical Reports Server (NTRS)

    Dhaeseleer, Patrik; Liang, Shoudan; Somogyi, Roland

    2000-01-01

    Advances in molecular biological, analytical, and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using high-throughput gene expression assays, we are able to measure the output of the gene regulatory network. We aim here to review datamining and modeling approaches for conceptualizing and unraveling the functional relationships implicit in these datasets. Clustering of co-expression profiles allows us to infer shared regulatory inputs and functional pathways. We discuss various aspects of clustering, ranging from distance measures to clustering algorithms and multiple-duster memberships. More advanced analysis aims to infer causal connections between genes directly, i.e., who is regulating whom and how. We discuss several approaches to the problem of reverse engineering of genetic networks, from discrete Boolean networks, to continuous linear and non-linear models. We conclude that the combination of predictive modeling with systematic experimental verification will be required to gain a deeper insight into living organisms, therapeutic targeting, and bioengineering.

  9. Gene Expression Profile Analysis is Directly Affected by the Selected Reference Gene: The Case of Leaf-Cutting Atta Sexdens

    PubMed Central

    Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.

    2018-01-01

    Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample and the result generated by RefFinder differed about which is the most suitable reference gene. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample set. In contrast, when using less stable reference genes for normalization a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The choice of reference genes on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794

  10. Systematic CRISPR-Cas9-Mediated Modifications of Plasmodium yoelii ApiAP2 Genes Reveal Functional Insights into Parasite Development

    PubMed Central

    Zhang, Cui; Li, Zhenkui; Cui, Huiting; Jiang, Yuanyuan; Yang, Zhenke; Wang, Xu; Gao, Han; Liu, Cong; Zhang, Shujia

    2017-01-01

    ABSTRACT Malaria parasites have a complex life cycle with multiple developmental stages in mosquito and vertebrate hosts, and different developmental stages express unique sets of genes. Unexpectedly, many transcription factors (TFs) commonly found in eukaryotic organisms are absent in malaria parasites; instead, a family of genes encoding proteins similar to the plant Apetala2 (ApiAP2) transcription factors is expanded in the parasites. Several malaria ApiAP2 genes have been shown to play a critical role in parasite development; however, the functions of the majority of the ApiAP2 genes remain to be elucidated. In particular, no study on the Plasmodium yoelii ApiAP2 (PyApiAP2) gene family has been reported so far. This study systematically investigated the functional roles of PyApiAP2 genes in parasite development. Twenty-four of the 26 PyApiAP2 genes were selected for disruption, and 12 were successfully knocked out using the clustered regularly interspaced short palindromic repeat–CRISPR-associated protein 9 (CRISPR-Cas9) method. The effects of gene knockout (KO) on parasite development in mouse and mosquito stages were evaluated. Ten of 12 successfully disrupted genes, including two genes that have not been functionally characterized in any Plasmodium species previously, were shown to be critical for P. yoelii development of sexual and mosquito stages. Additionally, seven of the genes were labeled for protein expression analysis, revealing important information supporting their functions. This study represents the first systematic functional characterization of the P. yoelii ApiAP2 gene family and discovers important insights on the roles of the ApiAP2 genes in parasite development. PMID:29233900

  11. Matrix factorization reveals aging-specific co-expression gene modules in the fat and muscle tissues in nonhuman primates

    NASA Astrophysics Data System (ADS)

    Wang, Yongcui; Zhao, Weiling; Zhou, Xiaobo

    2016-10-01

    Accurate identification of coherent transcriptional modules (subnetworks) in adipose and muscle tissues is important for revealing the related mechanisms and co-regulated pathways involved in the development of aging-related diseases. Here, we proposed a systematically computational approach, called ICEGM, to Identify the Co-Expression Gene Modules through a novel mathematical framework of Higher-Order Generalized Singular Value Decomposition (HO-GSVD). ICEGM was applied on the adipose, and heart and skeletal muscle tissues in old and young female African green vervet monkeys. The genes associated with the development of inflammation, cardiovascular and skeletal disorder diseases, and cancer were revealed by the ICEGM. Meanwhile, genes in the ICEGM modules were also enriched in the adipocytes, smooth muscle cells, cardiac myocytes, and immune cells. Comprehensive disease annotation and canonical pathway analysis indicated that immune cells, adipocytes, cardiomyocytes, and smooth muscle cells played a synergistic role in cardiac and physical functions in the aged monkeys by regulation of the biological processes associated with metabolism, inflammation, and atherosclerosis. In conclusion, the ICEGM provides an efficiently systematic framework for decoding the co-expression gene modules in multiple tissues. Analysis of genes in the ICEGM module yielded important insights on the cooperative role of multiple tissues in the development of diseases.

  12. Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

    PubMed

    Seligmann, Hervé

    2013-05-07

    GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.

  13. Expression profiling of mouse subplate reveals a dynamic gene network and disease association with autism and schizophrenia

    PubMed Central

    Hoerder-Suabedissen, Anna; Oeschger, Franziska M.; Krishnan, Michelle L.; Belgard, T. Grant; Wang, Wei Zhi; Lee, Sheena; Webber, Caleb; Petretto, Enrico; Edwards, A. David; Molnár, Zoltán

    2013-01-01

    The subplate zone is a highly dynamic transient sector of the developing cerebral cortex that contains some of the earliest generated neurons and the first functional synapses of the cerebral cortex. Subplate cells have important functions in early establishment and maturation of thalamocortical connections, as well as in the development of inhibitory cortical circuits in sensory areas. So far no role has been identified for cells in the subplate in the mature brain and disease association of the subplate-specific genes has not been analyzed systematically. Here we present gene expression evidence for distinct roles of the mouse subplate across development as well as unique molecular markers to extend the repertoire of subplate labels. Performing systematic comparisons between different ages (embryonic days 15 and 18, postnatal day 8, and adult), we reveal the dynamic and constant features of the markers labeling subplate cells during embryonic and early postnatal development and in the adult. This can be visualized using the online database of subplate gene expression at https://molnar.dpag.ox.ac.uk/subplate/. We also identify embryonic similarities in gene expression between the ventricular zones, intermediate zone, and subplate, and distinct postnatal similarities between subplate, layer 5, and layers 2/3. The genes expressed in a subplate-specific manner at some point during development show a statistically significant enrichment for association with autism spectrum disorders and schizophrenia. Our report emphasizes the importance of the study of transient features of the developing brain to better understand neurodevelopmental disorders. PMID:23401504

  14. Dynamic gene expression changes precede dioxin-induced liver pathogenesis in medaka fish.

    PubMed

    Volz, David C; Hinton, David E; Law, J McHugh; Kullman, Seth W

    2006-02-01

    A major challenge for environmental genomics is linking gene expression to cellular toxicity and morphological alteration. Herein, we address complexities related to hepatic gene expression responses after a single injection of the aryl hydrocarbon receptor (AHR) agonist 2,3,7,8-tetrachlorodibenzo-p-dioxin (dioxin) and illustrate an initial stress response followed by cytologic and adaptive changes in the teleost fish medaka. Using a custom 175-gene array, we find that overall hepatic gene expression and histological changes are strongly dependent on dose and time. The most pronounced dioxin-induced gene expression changes occurred early and preceded morphologic alteration in the liver. Following a systematic search for putative Ah response elements (AHREs) (5'-CACGCA-3') within 2000 bp upstream of the predicted transcriptional start site, the majority (87%) of genes screened in this study did not contain an AHRE, suggesting that gene expression was not solely dependent on AHRE-mediated transcription. Moreover, in the highest dosage, we observed gene expression changes associated with adaptation that persisted for almost two weeks, including induction of a gene putatively identified as ependymin that may function in hepatic injury repair. These data suggest that the cellular response to dioxin involves both AHRE- and non-AHRE-mediated transcription, and that coupling gene expression profiling with analysis of morphologic pathogenesis is essential for establishing temporal relationships between transcriptional changes, toxicity, and adaptation to hepatic injury.

  15. With Reference to Reference Genes: A Systematic Review of Endogenous Controls in Gene Expression Studies.

    PubMed

    Chapman, Joanne R; Waldenström, Jonas

    2015-01-01

    The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.

  16. Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

    PubMed

    Seligmann, Hervé

    2013-03-01

    Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  17. Technical variables in high-throughput miRNA expression profiling: much work remains to be done.

    PubMed

    Nelson, Peter T; Wang, Wang-Xia; Wilfred, Bernard R; Tang, Guiliang

    2008-11-01

    MicroRNA (miRNA) gene expression profiling has provided important insights into plant and animal biology. However, there has not been ample published work about pitfalls associated with technical parameters in miRNA gene expression profiling. One source of pertinent information about technical variables in gene expression profiling is the separate and more well-established literature regarding mRNA expression profiling. However, many aspects of miRNA biochemistry are unique. For example, the cellular processing and compartmentation of miRNAs, the differential stability of specific miRNAs, and aspects of global miRNA expression regulation require specific consideration. Additional possible sources of systematic bias in miRNA expression studies include the differential impact of pre-analytical variables, substrate specificity of nucleic acid processing enzymes used in labeling and amplification, and issues regarding new miRNA discovery and annotation. We conclude that greater focus on technical parameters is required to bolster the validity, reliability, and cultural credibility of miRNA gene expression profiling studies.

  18. Identification of Reference Genes and Analysis of Heat Shock Protein Gene Expression in Lingzhi or Reishi Medicinal Mushroom, Ganoderma lucidum, after Exposure to Heat Stress.

    PubMed

    Liu, Yong-Nan; Lu, Xiao-Xiao; Ren, Ang; Shi, Liang; Jiang, Ai-Liang; Yu, Han-Shou; Zhao, Ming-Wen

    2017-01-01

    Ganoderma lucidum has been considered an emerging model species for studying how environmental factors regulate the growth, development, and secondary metabolism of Basidiomycetes. Heat stress, which is one of the most important environmental abiotic stresses, seriously affects the growth, development, and yield of microorganisms. Understanding the response to heat stress has gradually become a hotspot in microorganism research. But suitable reference genes for expression analysis under heat stress have not been reported in G. lucidum. In this study, we systematically identified 11 candidate reference genes that were measured using reverse transcriptase quantitative polymerase chain reaction, and the gene expression stability was analyzed under heat stress conditions using geNorm and NormFinder. The results show that 5 reference genes-CYP and TIF, followed by UCE2, ACTIN, and UBQ1-are the most stable genes under our experimental conditions. Moreover, the relative expression levels of 3 heat stress response genes (hsp17.4, hsp70, and hsp90) were analyzed under heat stress conditions with different normalization strategies. The results show that use of a gene with unstable expression (SAND) as the reference gene leads to biased data and misinterpretations of the target gene expression level under heat stress.

  19. Evaluation of a toxicogenomic approach to the local lymph node assay (LLNA).

    PubMed

    Boverhof, Darrell R; Gollapudi, B Bhaskar; Hotchkiss, Jon A; Osterloh-Quiroz, Mandy; Woolhiser, Michael R

    2009-02-01

    Genomic technologies have the potential to enhance and complement existing toxicology endpoints; however, assessment of these approaches requires a systematic evaluation including a robust experimental design with genomic endpoints anchored to traditional toxicology endpoints. The present study was conducted to assess the sensitivity of genomic responses when compared with the traditional local lymph node assay (LLNA) endpoint of lymph node cell proliferation and to evaluate the responses for their ability to provide insights into mode of action. Female BALB/c mice were treated with the sensitizer trimellitic anhydride (TMA), following the standard LLNA dosing regimen, at doses of 0.1, 1, or 10% and traditional tritiated thymidine ((3)HTdR) incorporation and gene expression responses were monitored in the auricular lymph nodes. Additional mice dosed with either vehicle or 10% TMA and sacrificed on day 4 or 10, were also included to examine temporal effects on gene expression. Analysis of (3)HTdR incorporation revealed TMA-induced stimulation indices of 2.8, 22.9, and 61.0 relative to vehicle with an EC(3) of 0.11%. Examination of the dose-response gene expression responses identified 9, 833, and 2122 differentially expressed genes relative to vehicle for the 0.1, 1, and 10% TMA dose groups, respectively. Calculation of EC(3) values for differentially expressed genes did not identify a response that was more sensitive than the (3)HTdR value, although a number of genes displayed comparable sensitivity. Examination of temporal responses revealed 1760, 1870, and 953 differentially expressed genes at the 4-, 6-, and 10-day time points respectively. Functional analysis revealed many responses displayed dose- and time-specific induction patterns within the functional categories of cellular proliferation and immune response, including numerous immunoglobin genes which were highly induced at the day 10 time point. Overall, these experiments have systematically illustrated the potential utility of genomic endpoints to enhance the LLNA and support further exploration of this approach through examination of a more diverse array of chemicals.

  20. Dynamics of Wolbachia pipientis Gene Expression Across the Drosophila melanogaster Life Cycle

    PubMed Central

    Gutzwiller, Florence; Carmo, Catarina R.; Miller, Danny E.; Rice, Danny W.; Newton, Irene L. G.; Hawley, R. Scott; Teixeira, Luis; Bergman, Casey M.

    2015-01-01

    Symbiotic interactions between microbes and their multicellular hosts have manifold biological consequences. To better understand how bacteria maintain symbiotic associations with animal hosts, we analyzed genome-wide gene expression for the endosymbiotic α-proteobacteria Wolbachia pipientis across the entire life cycle of Drosophila melanogaster. We found that the majority of Wolbachia genes are expressed stably across the D. melanogaster life cycle, but that 7.8% of Wolbachia genes exhibit robust stage- or sex-specific expression differences when studied in the whole-organism context. Differentially-expressed Wolbachia genes are typically up-regulated after Drosophila embryogenesis and include many bacterial membrane, secretion system, and ankyrin repeat-containing proteins. Sex-biased genes are often organized as small operons of uncharacterized genes and are mainly up-regulated in adult Drosophila males in an age-dependent manner. We also systematically investigated expression levels of previously-reported candidate genes thought to be involved in host-microbe interaction, including those in the WO-A and WO-B prophages and in the Octomom region, which has been implicated in regulating bacterial titer and pathogenicity. Our work provides comprehensive insight into the developmental dynamics of gene expression for a widespread endosymbiont in its natural host context, and shows that public gene expression data harbor rich resources to probe the functional basis of the Wolbachia-Drosophila symbiosis and annotate the transcriptional outputs of the Wolbachia genome. PMID:26497146

  1. Anterior-posterior regionalized gene expression in the Ciona notochord

    PubMed Central

    Veeman, Michael

    2014-01-01

    Background In the simple ascidian chordate Ciona the signaling pathways and gene regulatory networks giving rise to initial notochord induction are largely understood and the mechanisms of notochord morphogenesis are being systematically elucidated. The notochord has generally been thought of as a non-compartmentalized or regionalized organ that is not finely patterned at the level of gene expression. Quantitative imaging methods have recently shown, however, that notochord cell size, shape and behavior vary consistently along the anterior-posterior (AP) axis. Results Here we screen candidate genes by whole mount in situ hybridization for potential AP asymmetry. We identify 4 genes that show non-uniform expression in the notochord. Ezrin/radixin/moesin (ERM) is expressed more strongly in the secondary notochord lineage than the primary. CTGF is expressed stochastically in a subset of notochord cells. A novel calmodulin-like gene (BCamL) is expressed more strongly at both the anterior and posterior tips of the notochord. A TGF-β ortholog is expressed in a gradient from posterior to anterior. The asymmetries in ERM, BCamL and TGF-β expression are evident even before the notochord cells have intercalated into a single-file column. Conclusions We conclude that the Ciona notochord is not a homogeneous tissue but instead shows distinct patterns of regionalized gene expression. PMID:24288133

  2. Anterior-posterior regionalized gene expression in the Ciona notochord.

    PubMed

    Reeves, Wendy; Thayer, Rachel; Veeman, Michael

    2014-04-01

    In the simple ascidian chordate Ciona, the signaling pathways and gene regulatory networks giving rise to initial notochord induction are largely understood and the mechanisms of notochord morphogenesis are being systematically elucidated. The notochord has generally been thought of as a non-compartmentalized or regionalized organ that is not finely patterned at the level of gene expression. Quantitative imaging methods have recently shown, however, that notochord cell size, shape, and behavior vary consistently along the anterior-posterior (AP) axis. Here we screen candidate genes by whole mount in situ hybridization for potential AP asymmetry. We identify 4 genes that show non-uniform expression in the notochord. Ezrin/radixin/moesin (ERM) is expressed more strongly in the secondary notochord lineage than the primary. CTGF is expressed stochastically in a subset of notochord cells. A novel calmodulin-like gene (BCamL) is expressed more strongly at both the anterior and posterior tips of the notochord. A TGF-β ortholog is expressed in a gradient from posterior to anterior. The asymmetries in ERM, BCamL, and TGF-β expression are evident even before the notochord cells have intercalated into a single-file column. We conclude that the Ciona notochord is not a homogeneous tissue but instead shows distinct patterns of regionalized gene expression. Copyright © 2013 Wiley Periodicals, Inc.

  3. Systematic Functional Interrogation of Rare Cancer Variants Identifies Oncogenic Alleles | Office of Cancer Genomics

    Cancer.gov

    Cancer genome characterization efforts now provide an initial view of the somatic alterations in primary tumors. However, most point mutations occur at low frequency, and the function of these alleles remains undefined. We have developed a scalable systematic approach to interrogate the function of cancer-associated gene variants. We subjected 474 mutant alleles curated from 5,338 tumors to pooled in vivo tumor formation assays and gene expression profiling. We identified 12 transforming alleles, including two in genes (PIK3CB, POT1) that have not been shown to be tumorigenic.

  4. HOX gene expression in phenotypic and genotypic subgroups and low HOXA gene expression as an adverse prognostic factor in pediatric ALL.

    PubMed

    Starkova, Julia; Zamostna, Blanka; Mejstrikova, Ester; Krejci, Roman; Drabkin, Harry A; Trka, Jan

    2010-12-01

    HOX genes play an important role in both normal lymphopoiesis and leukemogenesis. However, HOX expression patterns in leukemia cells compared to normal lymphoid progenitors have not been systematically studied in acute lymphoblastic leukemia (ALL) subtypes. The RNA expression levels of HOXA, HOXB, and CDX1/2 genes were analyzed by qRT-PCR in a cohort of 61 diagnostic pediatric ALL samples and FACS-sorted subpopulations of normal lymphoid progenitors. The RNA expression of HOXA7-10, HOXA13, and HOXB2-4 genes was exclusively detected in leukemic cells and immature progenitors. The RNA expression of HOXB6 and CDX2 genes was exclusively detected in leukemic cells but not in B-lineage cells at any of the studied developmental stages. HOXA3-4, HOXA7, and HOXB3-4 genes were differentially expressed between BCP-ALL and T-ALL subgroups, and among genotypically defined MLL/AF4, TEL/AML1, BCR/ABL, hyperdiploid and normal karyotype subgroups. However, this differential expression did not define specific clusters in hierarchical cluster analysis. HOXA7 gene was low expressed at the RNA level in patients with hyperdiploid leukemia, whereas HOXB7 and CDX2 genes were low expressed in TEL/AML1-positive and BCR/ABL-positive cases, respectively. In contrast to previous findings in acute myeloid leukemia, high HOXA RNA expression was associated with an excellent prognosis in Cox's regression model (P = 0.03). In MLL/AF4-positive ALL, lower HOXA RNA expression correlated with the methylation status of their promoters. HOX gene RNA expression cannot discriminate leukemia subgroups or relative maturity of leukemic cells. However, HOXA RNA expression correlates with prognosis, and particular HOX genes are expressed in specific genotypically characterized subgroups.

  5. Identification and validation of reference genes for qRT-PCR studies of the obligate aphid pathogenic fungus Pandora neoaphidis during different developmental stages.

    PubMed

    Zhang, Shutao; Chen, Chun; Xie, Tingna; Ye, Sudan

    2017-01-01

    The selection of stable reference genes is a critical step for the accurate quantification of gene expression. To identify and validate the reference genes in Pandora neoaphidis-an obligate aphid pathogenic fungus-the expression of 13classical candidate reference genes were evaluated by quantitative real-time reverse transcriptase polymerase chain reaction(qPCR) at four developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae). Four statistical algorithms, including geNorm, NormFinder, BestKeeper and Delta Ct method were used to rank putative reference genes according to their expression stability and indicate the best reference gene or combination of reference genes for accurate normalization. The analysis of comprehensive ranking revealed that ACT1and 18Swas the most stably expressed genes throughout the developmental stages. To further validate the suitability of the reference genes identified in this study, the expression of cell division control protein 25 (CDC25) and Chitinase 1(CHI1) genes were used to further confirm the validated candidate reference genes. Our study presented the first systematic study of reference gene(s) selection for P. neoaphidis study and provided guidelines to obtain more accurate qPCR results for future developmental efforts.

  6. Biological interpretation of genome-wide association studies using predicted gene functions.

    PubMed

    Pers, Tune H; Karjalainen, Juha M; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R; Yang, Jian; Lui, Julian C; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S N; Hirschhorn, Joel N; Franke, Lude

    2015-01-19

    The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes.

  7. Systematic Analysis of Sequences and Expression Patterns of Drought-Responsive Members of the HD-Zip Gene Family in Maize

    PubMed Central

    Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu

    2011-01-01

    Background Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). Methods and Findings In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Conclusions Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development. PMID:22164299

  8. Systematic analysis of sequences and expression patterns of drought-responsive members of the HD-Zip gene family in maize.

    PubMed

    Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu

    2011-01-01

    Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development.

  9. Nature versus nurture: A systematic approach to elucidate gene-environment interactions in the development of myopic refractive errors.

    PubMed

    Miraldi Utz, Virginia

    2017-01-01

    Myopia is the most common eye disorder and major cause of visual impairment worldwide. As the incidence of myopia continues to rise, the need to further understand the complex roles of molecular and environmental factors controlling variation in refractive error is of increasing importance. Tkatchenko and colleagues applied a systematic approach using a combination of gene set enrichment analysis, genome-wide association studies, and functional analysis of a murine model to identify a myopia susceptibility gene, APLP2. Differential expression of refractive error was associated with time spent reading for those with low frequency variants in this gene. This provides support for the longstanding hypothesis of gene-environment interactions in refractive error development.

  10. Systematic correlation of environmental exposure and physiological and self-reported behaviour factors with leukocyte telomere length.

    PubMed

    Patel, Chirag J; Manrai, Arjun K; Corona, Erik; Kohane, Isaac S

    2017-02-01

    It is hypothesized that environmental exposures and behaviour influence telomere length, an indicator of cellular ageing. We systematically associated 461 indicators of environmental exposures, physiology and self-reported behaviour with telomere length in data from the US National Health and Nutrition Examination Survey (NHANES) in 1999-2002. Further, we tested whether factors identified in the NHANES participants are also correlated with gene expression of telomere length modifying genes. We correlated 461 environmental exposures, behaviours and clinical variables with telomere length, using survey-weighted linear regression, adjusting for sex, age, age squared, race/ethnicity, poverty level, education and born outside the USA, and estimated the false discovery rate to adjust for multiple hypotheses. We conducted a secondary analysis to investigate the correlation between identified environmental variables and gene expression levels of telomere-associated genes in publicly available gene expression samples. After correlating 461 variables with telomere length, we found 22 variables significantly associated with telomere length after adjustment for multiple hypotheses. Of these varaibales, 14 were associated with longer telomeres, including biomarkers of polychlorinated biphenyls([PCBs; 0.1 to 0.2 standard deviation (SD) increase for 1 SD increase in PCB level, P  < 0.002] and a form of vitamin A, retinyl stearate. Eight variables associated with shorter telomeres, including biomarkers of cadmium, C-reactive protein and lack of physical activity. We could not conclude that PCBs are correlated with gene expression of telomere-associated genes. Both environmental exposures and chronic disease-related risk factors may play a role in telomere length. Our secondary analysis found no evidence of association between PCBs/smoking and gene expression of telomere-associated genes. All correlations between exposures, behaviours and clinical factors and changes in telomere length will require further investigation regarding biological influence of exposure. © The Author 2016. Published by Oxford University Press on behalf of the International Epidemiological Association

  11. Striking Similarity in the Gene Expression Levels of Individual Myc Module Members among ESCs, EpiSCs, and Partial iPSCs

    PubMed Central

    Hirasaki, Masataka; Hiraki-Kamon, Keiko; Kamon, Masayoshi; Suzuki, Ayumu; Katano, Miyuki; Nishimoto, Masazumi; Okuda, Akihiko

    2013-01-01

    Predominant transcriptional subnetworks called Core, Myc, and PRC modules have been shown to participate in preservation of the pluripotency and self-renewality of embryonic stem cells (ESCs). Epiblast stem cells (EpiSCs) are another cell type that possesses pluripotency and self-renewality. However, the roles of these modules in EpiSCs have not been systematically examined to date. Here, we compared the average expression levels of Core, Myc, and PRC module genes between ESCs and EpiSCs. EpiSCs showed substantially higher and lower expression levels of PRC and Core module genes, respectively, compared with those in ESCs, while Myc module members showed almost equivalent levels of average gene expression. Subsequent analyses revealed that the similarity in gene expression levels of the Myc module between these two cell types was not just overall, but striking similarities were evident even when comparing the expression of individual genes. We also observed equivalent levels of similarity in the expression of individual Myc module genes between induced pluripotent stem cells (iPSCs) and partial iPSCs that are an unwanted byproduct generated during iPSC induction. Moreover, our data demonstrate that partial iPSCs depend on a high level of c-Myc expression for their self-renewal properties. PMID:24386274

  12. Promoter architecture dictates cell-to-cell variability in gene expression.

    PubMed

    Jones, Daniel L; Brewster, Robert C; Phillips, Rob

    2014-12-19

    Variability in gene expression among genetically identical cells has emerged as a central preoccupation in the study of gene regulation; however, a divide exists between the predictions of molecular models of prokaryotic transcriptional regulation and genome-wide experimental studies suggesting that this variability is indifferent to the underlying regulatory architecture. We constructed a set of promoters in Escherichia coli in which promoter strength, transcription factor binding strength, and transcription factor copy numbers are systematically varied, and used messenger RNA (mRNA) fluorescence in situ hybridization to observe how these changes affected variability in gene expression. Our parameter-free models predicted the observed variability; hence, the molecular details of transcription dictate variability in mRNA expression, and transcriptional noise is specifically tunable and thus represents an evolutionarily accessible phenotypic parameter. Copyright © 2014, American Association for the Advancement of Science.

  13. Reliable reference genes for normalization of gene expression data in tea plants (Camellia sinensis) exposed to metal stresses.

    PubMed

    Wang, Ming-Le; Li, Qing-Hui; Xin, Hua-Hong; Chen, Xuan; Zhu, Xu-Jun; Li, Xing-Hui

    2017-01-01

    Tea plants [Camellia sinensis (L.) O. Kuntze] are an important leaf-type crop that are widely used for the production of non-alcoholic beverages in the world. Exposure to excessive amounts of heavy metals adversely affects the quality and yield of tea leaves. To analyze the molecular responses of tea plants to heavy metals, a reliable quantification of gene expression is important and of major importance herein is the normalization of the measured expression levels for the target genes. Ideally, stably expressed reference genes should be evaluated in all experimental systems. In this study, 12 candidate reference genes (i.e., 18S rRNA, Actin, CYP, EF-1α, eIF-4α, GAPDH, MON1, PP2AA3, TBP, TIP41, TUA, and UBC) were cloned from tea plants, and the stability of their expression was examined systematically in 60 samples exposed to diverse heavy metals (i.e., manganese, aluminum, copper, iron, and zinc). Three Excel-based algorithms (geNorm, NormFinder, and BestKeeper) were used to evaluate the expression stability of these genes. PP2AA3 and 18S rRNA were the most stably expressed genes, even though their expression profiles exhibited some variability. Moreover, commonly used reference genes (i.e., GAPDH and TBP) were the least appropriate reference genes for most samples. To further validate the suitability of the analyzed reference genes, the expression level of a phytochelatin synthase gene (i.e., CsPCS1) was determined using the putative reference genes for data normalizations. Our results may be beneficial for future studies involving the quantification of relative gene expression levels in tea plants.

  14. Reliable reference genes for normalization of gene expression data in tea plants (Camellia sinensis) exposed to metal stresses

    PubMed Central

    Wang, Ming-Le; Li, Qing-Hui; Xin, Hua-Hong; Chen, Xuan; Zhu, Xu-Jun

    2017-01-01

    Tea plants [Camellia sinensis (L.) O. Kuntze] are an important leaf-type crop that are widely used for the production of non-alcoholic beverages in the world. Exposure to excessive amounts of heavy metals adversely affects the quality and yield of tea leaves. To analyze the molecular responses of tea plants to heavy metals, a reliable quantification of gene expression is important and of major importance herein is the normalization of the measured expression levels for the target genes. Ideally, stably expressed reference genes should be evaluated in all experimental systems. In this study, 12 candidate reference genes (i.e., 18S rRNA, Actin, CYP, EF-1α, eIF-4α, GAPDH, MON1, PP2AA3, TBP, TIP41, TUA, and UBC) were cloned from tea plants, and the stability of their expression was examined systematically in 60 samples exposed to diverse heavy metals (i.e., manganese, aluminum, copper, iron, and zinc). Three Excel-based algorithms (geNorm, NormFinder, and BestKeeper) were used to evaluate the expression stability of these genes. PP2AA3 and 18S rRNA were the most stably expressed genes, even though their expression profiles exhibited some variability. Moreover, commonly used reference genes (i.e., GAPDH and TBP) were the least appropriate reference genes for most samples. To further validate the suitability of the analyzed reference genes, the expression level of a phytochelatin synthase gene (i.e., CsPCS1) was determined using the putative reference genes for data normalizations. Our results may be beneficial for future studies involving the quantification of relative gene expression levels in tea plants. PMID:28453515

  15. Characterization of TALE genes expression during the first lineage segregation in mammalian embryos.

    PubMed

    Sonnet, Wendy; Rezsöhazy, Rene; Donnay, Isabelle

    2012-11-01

    Three amino acid loop extension (TALE) homeodomain-containing transcription factors are generally recognized for their role in organogenesis and differentiation during embryogenesis. However, very little is known about the expression and function of Meis, Pbx, and Prep genes during early development. In order to determine whether TALE proteins could contribute to the early cell fate decisions in mammalian development, this study aimed to characterize in a systematic manner the pattern of expression of all Meis, Pbx, and Prep genes from the precompaction to blastocyst stage corresponding to the first step of cell differentiation in mammals. To reveal to what extent TALE genes expression at these early stages is a conserved feature among mammals, this study was performed in parallel in the bovine and mouse models. We demonstrated the transcription and translation of TALE genes, before gastrulation in the two species. At least one member of Meis, Pbx, and Prep subfamilies was found expressed at the RNA and protein levels but different patterns of expression were observed between genes and between species, suggesting specific gene regulations. Taken together, these results suggest a previously unexpected involvement of these factors during the early development in mammals. Copyright © 2012 Wiley Periodicals, Inc.

  16. Systematic identification and validation of candidate genes for detection of circulating tumor cells in peripheral blood specimens of colorectal cancer patients.

    PubMed

    Findeisen, Peter; Röckel, Matthias; Nees, Matthias; Röder, Christian; Kienle, Peter; Von Knebel Doeberitz, Magnus; Kalthoff, Holger; Neumaier, Michael

    2008-11-01

    The presence of tumor cells in peripheral blood is being regarded increasingly as a clinically relevant prognostic factor for colorectal cancer patients. Current molecular methods are very sensitive but due to low specificity their diagnostic value is limited. This study was undertaken in order to systematically identify and validate new colorectal cancer (CRC) marker genes for improved detection of minimal residual disease in peripheral blood mononuclear cells of colorectal cancer patients. Marker genes with upregulated gene expression in colorectal cancer tissue and cell lines were identified using microarray experiments and publicly available gene expression data. A systematic iterative approach was used to reduce a set of 346 candidate genes, reportedly associated with CRC to a selection of candidate genes that were then further validated by relative quantitative real-time RT-PCR. Analytical sensitivity of RT-PCR assays was determined by spiking experiments with CRC cells. Diagnostic sensitivity as well as specificity was tested on a control group consisting of 18 CRC patients compared to 12 individuals without malignant disease. From a total of 346-screened genes only serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 5 (SERPINB5) showed significantly elevated transcript levels in peripheral venous blood specimens of tumor patients when compared to the nonmalignant control group. These results were confirmed by analysis of an enlarged collective consisting of 63 CRC patients and 36 control individuals without malignant disease. In conclusion SERPINB5 seems to be a promising marker for detection of circulating tumor cells in peripheral blood of colorectal cancer patients.

  17. The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana

    PubMed Central

    Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea

    2012-01-01

    The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738

  18. Systematical analysis of cutaneous squamous cell carcinoma network of microRNAs, transcription factors, and target and host genes.

    PubMed

    Wang, Ning; Xu, Zhi-Wen; Wang, Kun-Hao

    2014-01-01

    MicroRNAs (miRNAs) are small non-coding RNA molecules found in multicellular eukaryotes which are implicated in development of cancer, including cutaneous squamous cell carcinoma (cSCC). Expression is controlled by transcription factors (TFs) that bind to specific DNA sequences, thereby controlling the flow (or transcription) of genetic information from DNA to messenger RNA. Interactions result in biological signal control networks. Molecular components involved in cSCC were here assembled at abnormally expressed, related and global levels. Networks at these three levels were constructed with corresponding biological factors in term of interactions between miRNAs and target genes, TFs and miRNAs, and host genes and miRNAs. Up/down regulation or mutation of the factors were considered in the context of the regulation and significant patterns were extracted. Participants of the networks were evaluated based on their expression and regulation of other factors. Sub-networks with two core TFs, TP53 and EIF2C2, as the centers are identified. These share self-adapt feedback regulation in which a mutual restraint exists. Up or down regulation of certain genes and miRNAs are discussed. Some, for example the expression of MMP13, were in line with expectation while others, including FGFR3, need further investigation of their unexpected behavior. The present research suggests that dozens of components, miRNAs, TFs, target genes and host genes included, unite as networks through their regulation to function systematically in human cSCC. Networks built under the currently available sources provide critical signal controlling pathways and frequent patterns. Inappropriate controlling signal flow from abnormal expression of key TFs may push the system into an incontrollable situation and therefore contributes to cSCC development.

  19. Systematic Integration of Brain eQTL and GWAS Identifies ZNF323 as a Novel Schizophrenia Risk Gene and Suggests Recent Positive Selection Based on Compensatory Advantage on Pulmonary Function.

    PubMed

    Luo, Xiong-Jian; Mattheisen, Manuel; Li, Ming; Huang, Liang; Rietschel, Marcella; Børglum, Anders D; Als, Thomas D; van den Oord, Edwin J; Aberg, Karolina A; Mors, Ole; Mortensen, Preben Bo; Luo, Zhenwu; Degenhardt, Franziska; Cichon, Sven; Schulze, Thomas G; Nöthen, Markus M; Su, Bing; Zhao, Zhongming; Gan, Lin; Yao, Yong-Gang

    2015-11-01

    Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10(-6)). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10(-6); single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10(-10)). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10(-5) and P = 9.00×10(-5), respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  20. Identification and evaluation of reference genes for qRT-PCR normalization in Ganoderma lucidum.

    PubMed

    Xu, Jiang; Xu, ZhiChao; Zhu, YingJie; Luo, HongMei; Qian, Jun; Ji, AiJia; Hu, YuanLei; Sun, Wei; Wang, Bo; Song, JingYuan; Sun, Chao; Chen, ShiLin

    2014-01-01

    Quantitative real-time reverse transcription PCR (qRT-PCR) is a rapid, sensitive, and reliable technique for gene expression studies. The accuracy and reliability of qRT-PCR results depend on the stability of the reference genes used for gene normalization. Therefore, a systematic process of reference gene evaluation is needed. Ganoderma lucidum is a famous medicinal mushroom in East Asia. In the current study, 10 potential reference genes were selected from the G. lucidum genomic data. The sequences of these genes were manually curated, and primers were designed following strict criteria. The experiment was conducted using qRT-PCR, and the stability of each candidate gene was assessed using four commonly used statistical programs-geNorm, NormFinder, BestKeeper, and RefFinder. According to our results, PP2A was expressed at the most stable levels under different fermentation conditions, and RPL4 was the most stably expressed gene in different tissues. RPL4, PP2A, and β-tubulin are the most commonly recommended reference genes for normalizing gene expression in the entire sample set. The current study provides a foundation for the further use of qRT-PCR in G. lucidum gene analysis.

  1. Analysis of a Gene Regulatory Cascade Mediating Circadian Rhythm in Zebrafish

    PubMed Central

    Wang, Haifang; Du, Jiulin; Yan, Jun

    2013-01-01

    In the study of circadian rhythms, it has been a puzzle how a limited number of circadian clock genes can control diverse aspects of physiology. Here we investigate circadian gene expression genome-wide using larval zebrafish as a model system. We made use of a spatial gene expression atlas to investigate the expression of circadian genes in various tissues and cell types. Comparison of genome-wide circadian gene expression data between zebrafish and mouse revealed a nearly anti-phase relationship and allowed us to detect novel evolutionarily conserved circadian genes in vertebrates. We identified three groups of zebrafish genes with distinct responses to light entrainment: fast light-induced genes, slow light-induced genes, and dark-induced genes. Our computational analysis of the circadian gene regulatory network revealed several transcription factors (TFs) involved in diverse aspects of circadian physiology through transcriptional cascade. Of these, microphthalmia-associated transcription factor a (mitfa), a dark-induced TF, mediates a circadian rhythm of melanin synthesis, which may be involved in zebrafish's adaptation to daily light cycling. Our study describes a systematic method to discover previously unidentified TFs involved in circadian physiology in complex organisms. PMID:23468616

  2. A Systematic Survey of Expression and Function of Zebrafish frizzled Genes

    PubMed Central

    Nikaido, Masataka; Law, Edward W. P.; Kelsh, Robert N.

    2013-01-01

    Wnt signaling is crucial for the regulation of numerous processes in development. Consistent with this, the gene families for both the ligands (Wnts) and receptors (Frizzleds) are very large. Surprisingly, while we have a reasonable understanding of the Wnt ligands likely to mediate specific Wnt-dependent processes, the corresponding receptors usually remain to be elucidated. Taking advantage of the zebrafish model's excellent genomic and genetic properties, we undertook a comprehensive analysis of the expression patterns of frizzled (fzd) genes in zebrafish. To explore their functions, we focused on testing their requirement in several developmental events known to be regulated by Wnt signaling, convergent extension movements of gastrulation, neural crest induction, and melanocyte specification. We found fourteen distinct fzd genes in the zebrafish genome. Systematic analysis of their expression patterns between 1-somite and 30 hours post-fertilization revealed complex, dynamic and overlapping expression patterns. This analysis demonstrated that only fzd3a, fzd9b, and fzd10 are expressed in the dorsal neural tube at stages corresponding to the timing of melanocyte specification. Surprisingly, however, morpholino knockdown of these, alone or in combination, gave no indication of reduction of melanocytes, suggesting the important involvement of untested fzds or another type of Wnt receptor in this process. Likewise, we found only fzd7b and fzd10 expressed at the border of the neural plate at stages appropriate for neural crest induction. However, neural crest markers were not reduced by knockdown of these receptors. Instead, these morpholino knockdown studies showed that fzd7a and fzd7b work co-operatively to regulate convergent extension movement during gastrulation. Furthermore, we show that the two fzd7 genes function together with fzd10 to regulate epiboly movements and mesoderm differentiation. PMID:23349976

  3. Biological interpretation of genome-wide association studies using predicted gene functions

    PubMed Central

    Pers, Tune H.; Karjalainen, Juha M.; Chan, Yingleong; Westra, Harm-Jan; Wood, Andrew R.; Yang, Jian; Lui, Julian C.; Vedantam, Sailaja; Gustafsson, Stefan; Esko, Tonu; Frayling, Tim; Speliotes, Elizabeth K.; Boehnke, Michael; Raychaudhuri, Soumya; Fehrmann, Rudolf S.N.; Hirschhorn, Joel N.; Franke, Lude

    2015-01-01

    The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes. PMID:25597830

  4. Quantitative comparison of microarray experiments with published leukemia related gene expression signatures.

    PubMed

    Klein, Hans-Ulrich; Ruckert, Christian; Kohlmann, Alexander; Bullinger, Lars; Thiede, Christian; Haferlach, Torsten; Dugas, Martin

    2009-12-15

    Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset. A database storing 138 leukemia-related published gene signatures was designed. Each gene signature was manually annotated with terms according to a leukemia-specific taxonomy. Two analysis steps are implemented to compare a new microarray dataset with the results from previous experiments stored and curated in the database. First, the global test method is applied to assess gene signatures and to constitute a ranking among them. In a subsequent analysis step, the focus is shifted from single gene signatures to chromosomal aberrations or molecular mutations as modeled in the taxonomy. Potentially interesting disease characteristics are detected based on the ranking of gene signatures associated with these aberrations stored in the database. Two example analyses are presented. An implementation of the approach is freely available as web-based application. The presented approach helps researchers to systematically integrate the knowledge derived from numerous microarray experiments into the analysis of a new dataset. By means of example leukemia datasets we demonstrate that this approach detects related experiments as well as related molecular mutations and may help to interpret new microarray data.

  5. Systematic drug safety evaluation based on public genomic expression (Connectivity Map) data: Myocardial and infectious adverse reactions as application cases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Kejian, E-mail: kejian.wang.bio@gmail.com; Weng, Zuquan; Sun, Liya

    Adverse drug reaction (ADR) is of great importance to both regulatory agencies and the pharmaceutical industry. Various techniques, such as quantitative structure–activity relationship (QSAR) and animal toxicology, are widely used to identify potential risks during the preclinical stage of drug development. Despite these efforts, drugs with safety liabilities can still pass through safety checkpoints and enter the market. This situation raises the concern that conventional chemical structure analysis and phenotypic screening are not sufficient to avoid all clinical adverse events. Genomic expression data following in vitro drug treatments characterize drug actions and thus have become widely used in drug repositioning. Inmore » the present study, we explored prediction of ADRs based on the drug-induced gene-expression profiles from cultured human cells in the Connectivity Map (CMap) database. The results showed that drugs inducing comparable ADRs generally lead to similar CMap expression profiles. Based on such ADR-gene expression association, we established prediction models for various ADRs, including severe myocardial and infectious events. Drugs with FDA boxed warnings of safety liability were effectively identified. We therefore suggest that drug-induced gene expression change, in combination with effective computational methods, may provide a new dimension of information to facilitate systematic drug safety evaluation. - Highlights: • Drugs causing common toxicity lead to similar in vitro gene expression changes. • We built a model to predict drug toxicity with drug-specific expression profiles. • Drugs with FDA black box warnings were effectively identified by our model. • In vitro assay can detect severe toxicity in the early stage of drug development.« less

  6. Plastid Transcriptomics and Translatomics of Tomato Fruit Development and Chloroplast-to-Chromoplast Differentiation: Chromoplast Gene Expression Largely Serves the Production of a Single Protein[W][OA

    PubMed Central

    Kahlau, Sabine; Bock, Ralph

    2008-01-01

    Plastid genes are expressed at high levels in photosynthetically active chloroplasts but are generally believed to be drastically downregulated in nongreen plastids. The genome-wide changes in the expression patterns of plastid genes during the development of nongreen plastid types as well as the contributions of transcriptional versus translational regulation are largely unknown. We report here a systematic transcriptomics and translatomics analysis of the tomato (Solanum lycopersicum) plastid genome during fruit development and chloroplast-to-chromoplast conversion. At the level of RNA accumulation, most but not all plastid genes are strongly downregulated in fruits compared with leaves. By contrast, chloroplast-to-chromoplast differentiation during fruit ripening is surprisingly not accompanied by large changes in plastid RNA accumulation. However, most plastid genes are translationally downregulated during chromoplast development. Both transcriptional and translational downregulation are more pronounced for photosynthesis-related genes than for genes involved in gene expression, indicating that some low-level plastid gene expression must be sustained in chromoplasts. High-level expression during chromoplast development identifies accD, the only plastid-encoded gene involved in fatty acid biosynthesis, as the target gene for which gene expression activity in chromoplasts is maintained. In addition, we have determined the developmental patterns of plastid RNA polymerase activities, intron splicing, and RNA editing and report specific developmental changes in the splicing and editing patterns of plastid transcripts. PMID:18441214

  7. MethHC: a database of DNA methylation and gene expression in human cancer.

    PubMed

    Huang, Wei-Yun; Hsu, Sheng-Da; Huang, Hsi-Yuan; Sun, Yi-Ming; Chou, Chih-Hung; Weng, Shun-Long; Huang, Hsien-Da

    2015-01-01

    We present MethHC (http://MethHC.mbc.nctu.edu.tw), a database comprising a systematic integration of a large collection of DNA methylation data and mRNA/microRNA expression profiles in human cancer. DNA methylation is an important epigenetic regulator of gene transcription, and genes with high levels of DNA methylation in their promoter regions are transcriptionally silent. Increasing numbers of DNA methylation and mRNA/microRNA expression profiles are being published in different public repositories. These data can help researchers to identify epigenetic patterns that are important for carcinogenesis. MethHC integrates data such as DNA methylation, mRNA expression, DNA methylation of microRNA gene and microRNA expression to identify correlations between DNA methylation and mRNA/microRNA expression from TCGA (The Cancer Genome Atlas), which includes 18 human cancers in more than 6000 samples, 6548 microarrays and 12 567 RNA sequencing data. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Optimizing information flow in small genetic networks. IV. Spatial coupling

    NASA Astrophysics Data System (ADS)

    Sokolowski, Thomas R.; Tkačik, Gašper

    2015-06-01

    We typically think of cells as responding to external signals independently by regulating their gene expression levels, yet they often locally exchange information and coordinate. Can such spatial coupling be of benefit for conveying signals subject to gene regulatory noise? Here we extend our information-theoretic framework for gene regulation to spatially extended systems. As an example, we consider a lattice of nuclei responding to a concentration field of a transcriptional regulator (the input) by expressing a single diffusible target gene. When input concentrations are low, diffusive coupling markedly improves information transmission; optimal gene activation functions also systematically change. A qualitatively different regulatory strategy emerges where individual cells respond to the input in a nearly steplike fashion that is subsequently averaged out by strong diffusion. While motivated by early patterning events in the Drosophila embryo, our framework is generically applicable to spatially coupled stochastic gene expression models.

  9. Automation of fluorescent differential display with digital readout.

    PubMed

    Meade, Jonathan D; Cho, Yong-Jig; Fisher, Jeffrey S; Walden, Jamie C; Guo, Zhen; Liang, Peng

    2006-01-01

    Since its invention in 1992, differential display (DD) has become the most commonly used technique for identifying differentially expressed genes because of its many advantages over competing technologies such as DNA microarray, serial analysis of gene expression (SAGE), and subtractive hybridization. Despite the great impact of the method on biomedical research, there has been a lack of automation of DD technology to increase its throughput and accuracy for systematic gene expression analysis. Most of previous DD work has taken a "shot-gun" approach of identifying one gene at a time, with a limited number of polymerase chain reaction (PCR) reactions set up manually, giving DD a low-tech and low-throughput image. We have optimized the DD process with a new platform that incorporates fluorescent digital readout, automated liquid handling, and large-format gels capable of running entire 96-well plates. The resulting streamlined fluorescent DD (FDD) technology offers an unprecedented accuracy, sensitivity, and throughput in comprehensive and quantitative analysis of gene expression. These major improvements will allow researchers to find differentially expressed genes of interest, both known and novel, quickly and easily.

  10. Systematic analysis of microarray datasets to identify Parkinson's disease‑associated pathways and genes.

    PubMed

    Feng, Yinling; Wang, Xuefeng

    2017-03-01

    In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.

  11. Prediction of gene expression in embryonic structures of Drosophila melanogaster.

    PubMed

    Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis

    2007-07-01

    Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms.

  12. Prediction of Gene Expression in Embryonic Structures of Drosophila melanogaster

    PubMed Central

    Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis

    2007-01-01

    Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms. PMID:17658945

  13. Transcriptional oscillation of canonical clock genes in mouse peripheral tissues

    PubMed Central

    Yamamoto, Takuro; Nakahata, Yasukazu; Soma, Haruhiko; Akashi, Makoto; Mamine, Takayoshi; Takumi, Toru

    2004-01-01

    Background The circadian rhythm of about 24 hours is a fundamental physiological function observed in almost all organisms from prokaryotes to humans. Identification of clock genes has allowed us to study the molecular bases for circadian behaviors and temporal physiological processes such as hormonal secretion, and has prompted the idea that molecular clocks reside not only in a central pacemaker, the suprachiasmatic nuclei (SCN) of hypothalamus in mammals, but also in peripheral tissues, even in immortalized cells. Furthermore, previous molecular dissection revealed that the mechanism of circadian oscillation at a molecular level is based on transcriptional regulation of clock and clock-controlled genes. Results We systematically analyzed the mRNA expression of clock and clock-controlled genes in mouse peripheral tissues. Eight genes (mBmal1, mNpas2, mRev-erbα, mDbp, mRev-erbβ, mPer3, mPer1 and mPer2; given in the temporal order of the rhythm peak) showed robust circadian expressions of mRNAs in all tissues except testis, suggesting that these genes are core molecules of the molecular biological clock. The bioinformatics analysis revealed that these genes have one or a combination of 3 transcriptional elements (RORE, DBPE, and E-box), which are conserved among human, mouse, and rat genome sequences, and indicated that these 3 elements may be responsible for the biological timing of expression of canonical clock genes. Conclusions The observation of oscillatory profiles of canonical clock genes is not only useful for physiological and pathological examination of the circadian clock in various organs but also important for systematic understanding of transcriptional regulation on a genome-wide basis. Our finding of the oscillatory expression of canonical clock genes with a temporal order provides us an interesting hypothesis, that cyclic timing of all clock and clock-controlled genes may be dependent on several transcriptional elements including 3 known elements, E-box, RORE, and DBPE. PMID:15473909

  14. Genetic dissection of the Gpnmb network in the eye.

    PubMed

    Lu, Hong; Wang, Xusheng; Pullen, Matthew; Guan, Huaijin; Chen, Hui; Sahu, Shwetapadma; Zhang, Bing; Chen, Hao; Williams, Robert W; Geisert, Eldon E; Lu, Lu; Jablonski, Monica M

    2011-06-13

    To use a systematic genetics approach to investigate the regulation of Gpnmb, a gene that contributes to pigmentary dispersion syndrome (PDS) and pigmentary glaucoma (PG) in the DBA/2J (D2) mouse. Global patterns of gene expression were studied in whole eyes of a large family of BXD mouse strains (n = 67) generated by crossing the PDS- and PG-prone parent (DBA/2J) with a resistant strain (C57BL/6J). Quantitative trait locus (eQTL) mapping methods and gene set analysis were used to evaluate Gpnmb coexpression networks in wild-type and mutant cohorts. The level of Gpnmb expression was associated with a highly significant cis-eQTL at the location of the gene itself. This autocontrol of Gpnmb is likely to be a direct consequence of the known premature stop codon in exon 4. Both gene ontology and coexpression network analyses demonstrated that the mutation in Gpnmb radically modified the set of genes with which Gpnmb expression is correlated. The covariates of wild-type Gpnmb are involved in biological processes including melanin synthesis and cell migration, whereas the covariates of mutant Gpnmb are involved in the biological processes of posttranslational modification, stress activation, and sensory processing. These results demonstrated that a systematic genetics approach provides a powerful tool for constructing coexpression networks that define the biological process categories within which similarly regulated genes function. The authors showed that the R150X mutation in Gpnmb dramatically modified its list of genetic covariates, which may explain the associated ocular pathology.

  15. Reduction in expression of the benign AR transcriptome is a hallmark of localised prostate cancer progression.

    PubMed

    Stuchbery, Ryan; Macintyre, Geoff; Cmero, Marek; Harewood, Laurence M; Peters, Justin S; Costello, Anthony J; Hovens, Christopher M; Corcoran, Niall M

    2016-05-24

    Despite the importance of androgen receptor (AR) signalling to prostate cancer development, little is known about how this signalling pathway changes with increasing grade and stage of the disease. To explore changes in the normal AR transcriptome in localised prostate cancer, and its relation to adverse pathological features and disease recurrence. Publically accessible human prostate cancer expression arrays as well as RNA sequencing data from the prostate TCGA. Tumour associated PSA and PSAD were calculated for a large cohort of men (n=1108) undergoing prostatectomy. We performed a meta-analysis of the expression of an androgen-regulated gene set across datasets using Oncomine. Differential expression of selected genes in the prostate TCGA database was probed using the edgeR Bioconductor package. Changes in tumour PSA density with stage and grade were assessed by Student's t-test, and its association with biochemical recurrence explored by Kaplan-Meier curves and Cox regression. Meta-analysis revealed a systematic decline in the expression of a previously identified benign prostate androgen-regulated gene set with increasing tumour grade, reaching significance in nine of 25 genes tested despite increasing AR expression. These results were confirmed in a large independent dataset from the TCGA. At the protein level, when serum PSA was corrected for tumour volume, significantly lower levels were observed with increasing tumour grade and stage, and predicted disease recurrence. Lower PSA secretion-per-tumour-volume is associated with increasing grade and stage of prostate cancer, has prognostic relevance, and reflects a systematic perturbation of androgen signalling.

  16. Comprehensive Genome-Wide Survey, Genomic Constitution and Expression Profiling of the NAC Transcription Factor Family in Foxtail Millet (Setaria italica L.)

    PubMed Central

    Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B., Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj

    2013-01-01

    The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants. PMID:23691254

  17. Comprehensive genome-wide survey, genomic constitution and expression profiling of the NAC transcription factor family in foxtail millet (Setaria italica L.).

    PubMed

    Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B, Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj

    2013-01-01

    The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.

  18. Analysis tools for the interplay between genome layout and regulation.

    PubMed

    Bouyioukos, Costas; Elati, Mohamed; Képès, François

    2016-06-06

    Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation.

  19. The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.

    PubMed

    Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun

    2017-03-27

    Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gao, Junpeng; Innovation Experimental College, Northwest A&F University, Yangling, Shaanxi 712100; Cao, Xiaoli

    The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs were analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles ofmore » StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and in during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes. - Highlights: • A systematic analysis of the potato AUX/IAA gene family were performed. • StIAA genes were related to auxin perception and signal transduction. • Candidate StIAA genes likely related to tuber initiation and expansion were screened.« less

  1. Normalizing gene expression by quantitative PCR during somatic embryogenesis in two representative conifer species: Pinus pinaster and Picea abies.

    PubMed

    de Vega-Bartol, José J; Santos, Raquen Raissa; Simões, Marta; Miguel, Célia M

    2013-05-01

    Suitable internal control genes to normalize qPCR data from different stages of embryo development and germination were identified in two representative conifer species. Clonal propagation by somatic embryogenesis has a great application potentiality in conifers. Quantitative PCR (qPCR) is widely used for gene expression analysis during somatic embryogenesis and embryo germination. No single reference gene is universal, so a systematic characterization of endogenous genes for concrete conditions is fundamental for accuracy. We identified suitable internal control genes to normalize qPCR data obtained at different steps of somatic embryogenesis (embryonal mass proliferation, embryo maturation and germination) in two representative conifer species, Pinus pinaster and Picea abies. Candidate genes included endogenous genes commonly used in conifers, genes previously tested in model plants, and genes with a lower variation of the expression along embryo development according to genome-wide transcript profiling studies. Three different algorithms were used to evaluate expression stability. The geometric average of the expression values of elongation factor-1α, α-tubulin and histone 3 in P. pinaster, and elongation factor-1α, α-tubulin, adenosine kinase and CAC in P. abies were adequate for expression studies throughout somatic embryogenesis. However, improved accuracy was achieved when using other gene combinations in experiments with samples at a single developmental stage. The importance of studies selecting reference genes to use in different tissues or developmental stages within one or close species, and the instability of commonly used reference genes, is highlighted.

  2. Ion Channel Gene Expression in Lung Adenocarcinoma: Potential Role in Prognosis and Diagnosis

    PubMed Central

    Ko, Jae-Hong; Gu, Wanjun; Lim, Inja; Bang, Hyoweon; Ko, Eun A.; Zhou, Tong

    2014-01-01

    Ion channels are known to regulate cancer processes at all stages. The roles of ion channels in cancer pathology are extremely diverse. We systematically analyzed the expression patterns of ion channel genes in lung adenocarcinoma. First, we compared the expression of ion channel genes between normal and tumor tissues in patients with lung adenocarcinoma. Thirty-seven ion channel genes were identified as being differentially expressed between the two groups. Next, we investigated the prognostic power of ion channel genes in lung adenocarcinoma. We assigned a risk score to each lung adenocarcinoma patient based on the expression of the differentially expressed ion channel genes. We demonstrated that the risk score effectively predicted overall survival and recurrence-free survival in lung adenocarcinoma. We also found that the risk scores for ever-smokers were higher than those for never-smokers. Multivariate analysis indicated that the risk score was a significant prognostic factor for survival, which is independent of patient age, gender, stage, smoking history, Myc level, and EGFR/KRAS/ALK gene mutation status. Finally, we investigated the difference in ion channel gene expression between the two major subtypes of non-small cell lung cancer: adenocarcinoma and squamous-cell carcinoma. Thirty ion channel genes were identified as being differentially expressed between the two groups. We suggest that ion channel gene expression can be used to improve the subtype classification in non-small cell lung cancer at the molecular level. The findings in this study have been validated in several independent lung cancer cohorts. PMID:24466154

  3. Integrated analyses for genetic markers of polycystic ovary syndrome with 9 case-control studies of gene expression profiles.

    PubMed

    Lu, Chenqi; Liu, Xiaoqin; Wang, Lin; Jiang, Ning; Yu, Jun; Zhao, Xiaobo; Hu, Hairong; Zheng, Saihua; Li, Xuelian; Wang, Guiying

    2017-01-10

    Due to genetic heterogeneity and variable diagnostic criteria, genetic studies of polycystic ovary syndrome are particularly challenging. Furthermore, lack of sufficiently large cohorts limits the identification of susceptibility genes contributing to polycystic ovary syndrome. Here, we carried out a systematic search of studies deposited in the Gene Expression Omnibus database through August 31, 2016. The present analyses included studies with: 1) patients with polycystic ovary syndrome and normal controls, 2) gene expression profiling of messenger RNA, and 3) sufficient data for our analysis. Ultimately, a total of 9 studies with 13 datasets met the inclusion criteria and were performed for the subsequent integrated analyses. Through comprehensive analyses, there were 13 genetic factors overlapped in all datasets and identified as significant specific genes for polycystic ovary syndrome. After quality control assessment, there were six datasets remained. Further gene ontology enrichment and pathway analyses suggested that differentially expressed genes mainly enriched in oocyte pathways. These findings provide potential molecular markers for diagnosis and prognosis of polycystic ovary syndrome, and need in-depth studies on the exact function and mechanism in polycystic ovary syndrome.

  4. Identification and Characterization of Long Non-Coding RNAs Related to Mouse Embryonic Brain Development from Available Transcriptomic Data

    PubMed Central

    He, Hongjuan; Xiu, Youcheng; Guo, Jing; Liu, Hui; Liu, Qi; Zeng, Tiebo; Chen, Yan; Zhang, Yan; Wu, Qiong

    2013-01-01

    Long non-coding RNAs (lncRNAs) as a key group of non-coding RNAs have gained widely attention. Though lncRNAs have been functionally annotated and systematic explored in higher mammals, few are under systematical identification and annotation. Owing to the expression specificity, known lncRNAs expressed in embryonic brain tissues remain still limited. Considering a large number of lncRNAs are only transcribed in brain tissues, studies of lncRNAs in developmental brain are therefore of special interest. Here, publicly available RNA-sequencing (RNA-seq) data in embryonic brain are integrated to identify thousands of embryonic brain lncRNAs by a customized pipeline. A significant proportion of novel transcripts have not been annotated by available genomic resources. The putative embryonic brain lncRNAs are shorter in length, less spliced and show less conservation than known genes. The expression of putative lncRNAs is in one tenth on average of known coding genes, while comparable with known lncRNAs. From chromatin data, putative embryonic brain lncRNAs are associated with active chromatin marks, comparable with known lncRNAs. Embryonic brain expressed lncRNAs are also indicated to have expression though not evident in adult brain. Gene Ontology analysis of putative embryonic brain lncRNAs suggests that they are associated with brain development. The putative lncRNAs are shown to be related to possible cis-regulatory roles in imprinting even themselves are deemed to be imprinted lncRNAs. Re-analysis of one knockdown data suggests that four regulators are associated with lncRNAs. Taken together, the identification and systematic analysis of putative lncRNAs would provide novel insights into uncharacterized mouse non-coding regions and the relationships with mammalian embryonic brain development. PMID:23967161

  5. Genome-wide identification, phylogeny and expressional profiles of mitogen activated protein kinase kinase kinase (MAPKKK) gene family in bread wheat (Triticum aestivum L.).

    PubMed

    Wang, Meng; Yue, Hong; Feng, Kewei; Deng, Pingchuan; Song, Weining; Nie, Xiaojun

    2016-08-22

    Mitogen-activated protein kinase kinase kinases (MAPKKKs) are the important components of MAPK cascades, which play the crucial role in plant growth and development as well as in response to diverse stresses. Although this family has been systematically studied in many plant species, little is known about MAPKKK genes in wheat (Triticum aestivum L.), especially those involved in the regulatory network of stress processes. In this study, we identified 155 wheat MAPKKK genes through a genome-wide search method based on the latest available wheat genome information, of which 29 belonged to MEKK, 11 to ZIK and 115 to Raf subfamily, respectively. Then, chromosome localization, gene structure and conserved protein motifs and phylogenetic relationship as well as regulatory network of these TaMAPKKKs were systematically investigated and results supported the prediction. Furthermore, a total of 11 homologous groups between A, B and D sub-genome and 24 duplication pairs among them were detected, which contributed to the expansion of wheat MAPKKK gene family. Finally, the expression profiles of these MAPKKKs during development and under different abiotic stresses were investigated using the RNA-seq data. Additionally, 10 tissue-specific and 4 salt-responsive TaMAPKKK genes were selected to validate their expression level through qRT-PCR analysis. This study for the first time reported the genome organization, evolutionary features and expression profiles of the wheat MAPKKK gene family, which laid the foundation for further functional analysis of wheat MAPKKK genes, and contributed to better understanding the roles and regulatory mechanism of MAPKKKs in wheat.

  6. Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations.

    PubMed

    Wu, Shuang; Liu, Zhi-Ping; Qiu, Xing; Wu, Hulin

    2014-01-01

    The immune response to viral infection is regulated by an intricate network of many genes and their products. The reverse engineering of gene regulatory networks (GRNs) using mathematical models from time course gene expression data collected after influenza infection is key to our understanding of the mechanisms involved in controlling influenza infection within a host. A five-step pipeline: detection of temporally differentially expressed genes, clustering genes into co-expressed modules, identification of network structure, parameter estimate refinement, and functional enrichment analysis, is developed for reconstructing high-dimensional dynamic GRNs from genome-wide time course gene expression data. Applying the pipeline to the time course gene expression data from influenza-infected mouse lungs, we have identified 20 distinct temporal expression patterns in the differentially expressed genes and constructed a module-based dynamic network using a linear ODE model. Both intra-module and inter-module annotations and regulatory relationships of our inferred network show some interesting findings and are highly consistent with existing knowledge about the immune response in mice after influenza infection. The proposed method is a computationally efficient, data-driven pipeline bridging experimental data, mathematical modeling, and statistical analysis. The application to the influenza infection data elucidates the potentials of our pipeline in providing valuable insights into systematic modeling of complicated biological processes.

  7. Turning publicly available gene expression data into discoveries using gene set context analysis.

    PubMed

    Ji, Zhicheng; Vokes, Steven A; Dang, Chi V; Ji, Hongkai

    2016-01-08

    Gene Set Context Analysis (GSCA) is an open source software package to help researchers use massive amounts of publicly available gene expression data (PED) to make discoveries. Users can interactively visualize and explore gene and gene set activities in 25,000+ consistently normalized human and mouse gene expression samples representing diverse biological contexts (e.g. different cells, tissues and disease types, etc.). By providing one or multiple genes or gene sets as input and specifying a gene set activity pattern of interest, users can query the expression compendium to systematically identify biological contexts associated with the specified gene set activity pattern. In this way, researchers with new gene sets from their own experiments may discover previously unknown contexts of gene set functions and hence increase the value of their experiments. GSCA has a graphical user interface (GUI). The GUI makes the analysis convenient and customizable. Analysis results can be conveniently exported as publication quality figures and tables. GSCA is available at https://github.com/zji90/GSCA. This software significantly lowers the bar for biomedical investigators to use PED in their daily research for generating and screening hypotheses, which was previously difficult because of the complexity, heterogeneity and size of the data. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Validation of reference genes for quantitative gene expression analysis in experimental epilepsy.

    PubMed

    Sadangi, Chinmaya; Rosenow, Felix; Norwood, Braxton A

    2017-12-01

    To grasp the molecular mechanisms and pathophysiology underlying epilepsy development (epileptogenesis) and epilepsy itself, it is important to understand the gene expression changes that occur during these phases. Quantitative real-time polymerase chain reaction (qPCR) is a technique that rapidly and accurately determines gene expression changes. It is crucial, however, that stable reference genes are selected for each experimental condition to ensure that accurate values are obtained for genes of interest. If reference genes are unstably expressed, this can lead to inaccurate data and erroneous conclusions. To date, epilepsy studies have used mostly single, nonvalidated reference genes. This is the first study to systematically evaluate reference genes in male Sprague-Dawley rat models of epilepsy. We assessed 15 potential reference genes in hippocampal tissue obtained from 2 different models during epileptogenesis, 1 model during chronic epilepsy, and a model of noninjurious seizures. Reference gene ranking varied between models and also differed between epileptogenesis and chronic epilepsy time points. There was also some variance between the four mathematical models used to rank reference genes. Notably, we found novel reference genes to be more stably expressed than those most often used in experimental epilepsy studies. The consequence of these findings is that reference genes suitable for one epilepsy model may not be appropriate for others and that reference genes can change over time. It is, therefore, critically important to validate potential reference genes before using them as normalizing factors in expression analysis in order to ensure accurate, valid results. © 2017 Wiley Periodicals, Inc.

  9. PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches

    PubMed Central

    Fujibuchi, Wataru; Anderson, John S. J.; Landsman, David

    2001-01-01

    Consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences have historically been subject to large numbers of false positives. We sought to decrease false positives by incorporating expression profile data into a consensus pattern-based search method. We have systematically analyzed the expression phenotypes of over 6000 yeast genes, across 121 expression profile experiments, and correlated them with the distribution of 14 known regulatory elements over sequences upstream of the genes. Our method is based on a metric we term probabilistic element assessment (PEA), which is a ranking of potential sites based on sequence similarity in the upstream regions of genes with similar expression phenotypes. For eight of the 14 known elements that we examined, our method had a much higher selectivity than a naïve consensus pattern search. Based on our analysis, we have developed a web-based tool called PROSPECT, which allows consensus pattern-based searching of gene clusters obtained from microarray data. PMID:11574681

  10. Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.

    PubMed

    Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias

    2015-06-25

    Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.

  11. Optimization and evaluation of T7 based RNA linear amplification protocols for cDNA microarray analysis

    PubMed Central

    Zhao, Hongjuan; Hastie, Trevor; Whitfield, Michael L; Børresen-Dale, Anne-Lise; Jeffrey, Stefanie S

    2002-01-01

    Background T7 based linear amplification of RNA is used to obtain sufficient antisense RNA for microarray expression profiling. We optimized and systematically evaluated the fidelity and reproducibility of different amplification protocols using total RNA obtained from primary human breast carcinomas and high-density cDNA microarrays. Results Using an optimized protocol, the average correlation coefficient of gene expression of 11,123 cDNA clones between amplified and unamplified samples is 0.82 (0.85 when a virtual array was created using repeatedly amplified samples to minimize experimental variation). Less than 4% of genes show changes in expression level by 2-fold or greater after amplification compared to unamplified samples. Most changes due to amplification are not systematic both within one tumor sample and between different tumors. Amplification appears to dampen the variation of gene expression for some genes when compared to unamplified poly(A)+ RNA. The reproducibility between repeatedly amplified samples is 0.97 when performed on the same day, but drops to 0.90 when performed weeks apart. The fidelity and reproducibility of amplification is not affected by decreasing the amount of input total RNA in the 0.3–3 micrograms range. Adding template-switching primer, DNA ligase, or column purification of double-stranded cDNA does not improve the fidelity of amplification. The correlation coefficient between amplified and unamplified samples is higher when total RNA is used as template for both experimental and reference RNA amplification. Conclusion T7 based linear amplification reproducibly generates amplified RNA that closely approximates original sample for gene expression profiling using cDNA microarrays. PMID:12445333

  12. Missing data and technical variability in single-cell RNA-sequencing experiments.

    PubMed

    Hicks, Stephanie C; Townes, F William; Teng, Mingxiang; Irizarry, Rafael A

    2017-11-06

    Until recently, high-throughput gene expression technology, such as RNA-Sequencing (RNA-seq) required hundreds of thousands of cells to produce reliable measurements. Recent technical advances permit genome-wide gene expression measurement at the single-cell level. Single-cell RNA-Seq (scRNA-seq) is the most widely used and numerous publications are based on data produced with this technology. However, RNA-seq and scRNA-seq data are markedly different. In particular, unlike RNA-seq, the majority of reported expression levels in scRNA-seq are zeros, which could be either biologically-driven, genes not expressing RNA at the time of measurement, or technically-driven, genes expressing RNA, but not at a sufficient level to be detected by sequencing technology. Another difference is that the proportion of genes reporting the expression level to be zero varies substantially across single cells compared to RNA-seq samples. However, it remains unclear to what extent this cell-to-cell variation is being driven by technical rather than biological variation. Furthermore, while systematic errors, including batch effects, have been widely reported as a major challenge in high-throughput technologies, these issues have received minimal attention in published studies based on scRNA-seq technology. Here, we use an assessment experiment to examine data from published studies and demonstrate that systematic errors can explain a substantial percentage of observed cell-to-cell expression variability. Specifically, we present evidence that some of these reported zeros are driven by technical variation by demonstrating that scRNA-seq produces more zeros than expected and that this bias is greater for lower expressed genes. In addition, this missing data problem is exacerbated by the fact that this technical variation varies cell-to-cell. Then, we show how this technical cell-to-cell variability can be confused with novel biological results. Finally, we demonstrate and discuss how batch-effects and confounded experiments can intensify the problem. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. | Office of Cancer Genomics

    Cancer.gov

    Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall.

  14. Long-Range Chromosome Interactions Mediated by Cohesin Shape Circadian Gene Expression

    PubMed Central

    Xu, Yichi; Guo, Weimin; Li, Ping; Zhang, Yan; Zhao, Meng; Fan, Zenghua; Zhao, Zhihu; Yan, Jun

    2016-01-01

    Mammalian circadian rhythm is established by the negative feedback loops consisting of a set of clock genes, which lead to the circadian expression of thousands of downstream genes in vivo. As genome-wide transcription is organized under the high-order chromosome structure, it is largely uncharted how circadian gene expression is influenced by chromosome architecture. We focus on the function of chromatin structure proteins cohesin as well as CTCF (CCCTC-binding factor) in circadian rhythm. Using circular chromosome conformation capture sequencing, we systematically examined the interacting loci of a Bmal1-bound super-enhancer upstream of a clock gene Nr1d1 in mouse liver. These interactions are largely stable in the circadian cycle and cohesin binding sites are enriched in the interactome. Global analysis showed that cohesin-CTCF co-binding sites tend to insulate the phases of circadian oscillating genes while cohesin-non-CTCF sites are associated with high circadian rhythmicity of transcription. A model integrating the effects of cohesin and CTCF markedly improved the mechanistic understanding of circadian gene expression. Further experiments in cohesin knockout cells demonstrated that cohesin is required at least in part for driving the circadian gene expression by facilitating the enhancer-promoter looping. This study provided a novel insight into the relationship between circadian transcriptome and the high-order chromosome structure. PMID:27135601

  15. Molecular and functional definition of the developing human striatum.

    PubMed

    Onorati, Marco; Castiglioni, Valentina; Biasci, Daniele; Cesana, Elisabetta; Menon, Ramesh; Vuono, Romina; Talpo, Francesca; Laguna Goya, Rocio; Lyons, Paul A; Bulfamante, Gaetano P; Muzio, Luca; Martino, Gianvito; Toselli, Mauro; Farina, Cinthia; Barker, Roger A; Biella, Gerardo; Cattaneo, Elena

    2014-12-01

    The complexity of the human brain derives from the intricate interplay of molecular instructions during development. Here we systematically investigated gene expression changes in the prenatal human striatum and cerebral cortex during development from post-conception weeks 2 to 20. We identified tissue-specific gene coexpression networks, differentially expressed genes and a minimal set of bimodal genes, including those encoding transcription factors, that distinguished striatal from neocortical identities. Unexpected differences from mouse striatal development were discovered. We monitored 36 determinants at the protein level, revealing regional domains of expression and their refinement, during striatal development. We electrophysiologically profiled human striatal neurons differentiated in vitro and determined their refined molecular and functional properties. These results provide a resource and opportunity to gain global understanding of how transcriptional and functional processes converge to specify human striatal and neocortical neurons during development.

  16. Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species.

    PubMed

    Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping

    2018-04-23

    The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .

  17. Distinguishing the rates of gene activation from phenotypic variations.

    PubMed

    Chen, Ye; Lv, Cheng; Li, Fangting; Li, Tiejun

    2015-06-18

    Stochastic genetic switching driven by intrinsic noise is an important process in gene expression. When the rates of gene activation/inactivation are relatively slow, fast, or medium compared with the synthesis/degradation rates of mRNAs and proteins, the variability of protein and mRNA levels may exhibit very different dynamical patterns. It is desirable to provide a systematic approach to identify their key dynamical features in different regimes, aiming at distinguishing which regime a considered gene regulatory network is in from their phenotypic variations. We studied a gene expression model with positive feedbacks when genetic switching rates vary over a wide range. With the goal of providing a method to distinguish the regime of the switching rates, we first focus on understanding the essential dynamics of gene expression system in different cases. In the regime of slow switching rates, we found that the effective dynamics can be reduced to independent evolutions on two separate layers corresponding to gene activation and inactivation states, and the transitions between two layers are rare events, after which the system goes mainly along deterministic ODE trajectories on a particular layer to reach new steady states. The energy landscape in this regime can be well approximated by using Gaussian mixture model. In the regime of intermediate switching rates, we analyzed the mean switching time to investigate the stability of the system in different parameter ranges. We also discussed the case of fast switching rates from the viewpoint of transition state theory. Based on the obtained results, we made a proposal to distinguish these three regimes in a simulation experiment. We identified the intermediate regime from the fact that the strength of cellular memory is lower than the other two cases, and the fast and slow regimes can be distinguished by their different perturbation-response behavior with respect to the switching rates perturbations. We proposed a simulation experiment to distinguish the slow, intermediate and fast regimes, which is the main point of our paper. In order to achieve this goal, we systematically studied the essential dynamics of gene expression system when the switching rates are in different regimes. Our theoretical understanding provides new insights on the gene expression experiments.

  18. dynGENIE3: dynamical GENIE3 for the inference of gene networks from time series expression data.

    PubMed

    Huynh-Thu, Vân Anh; Geurts, Pierre

    2018-02-21

    The elucidation of gene regulatory networks is one of the major challenges of systems biology. Measurements about genes that are exploited by network inference methods are typically available either in the form of steady-state expression vectors or time series expression data. In our previous work, we proposed the GENIE3 method that exploits variable importance scores derived from Random forests to identify the regulators of each target gene. This method provided state-of-the-art performance on several benchmark datasets, but it could however not specifically be applied to time series expression data. We propose here an adaptation of the GENIE3 method, called dynamical GENIE3 (dynGENIE3), for handling both time series and steady-state expression data. The proposed method is evaluated extensively on the artificial DREAM4 benchmarks and on three real time series expression datasets. Although dynGENIE3 does not systematically yield the best performance on each and every network, it is competitive with diverse methods from the literature, while preserving the main advantages of GENIE3 in terms of scalability.

  19. Genome-wide identification of suitable zebrafish Danio rerio reference genes for normalization of gene expression data by RT-qPCR.

    PubMed

    Xu, H; Li, C; Zeng, Q; Agrawal, I; Zhu, X; Gong, Z

    2016-06-01

    In this study, to systematically identify the most stably expressed genes for internal reference in zebrafish Danio rerio investigations, 37 D. rerio transcriptomic datasets (both RNA sequencing and microarray data) were collected from gene expression omnibus (GEO) database and unpublished data, and gene expression variations were analysed under three experimental conditions: tissue types, developmental stages and chemical treatments. Forty-four putative candidate genes were identified with the c.v. <0·2 from all datasets. Following clustering into different functional groups, 21 genes, in addition to four conventional housekeeping genes (eef1a1l1, b2m, hrpt1l and actb1), were selected from different functional groups for further quantitative real-time (qrt-)PCR validation using 25 RNA samples from different adult tissues, developmental stages and chemical treatments. The qrt-PCR data were then analysed using the statistical algorithm refFinder for gene expression stability. Several new candidate genes showed better expression stability than the conventional housekeeping genes in all three categories. It was found that sep15 and metap1 were the top two stable genes for tissue types, ube2a and tmem50a the top two for different developmental stages, and rpl13a and rp1p0 the top two for chemical treatments. Thus, based on the extensive transcriptomic analyses and qrt-PCR validation, these new reference genes are recommended for normalization of D. rerio qrt-PCR data respectively for the three different experimental conditions. © 2016 The Fisheries Society of the British Isles.

  20. Inferring causal genomic alterations in breast cancer using gene expression data

    PubMed Central

    2011-01-01

    Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811

  1. A Genome-Wide Screen Indicates Correlation between Differentiation and Expression of Metabolism Related Genes

    PubMed Central

    Shende, Akhilesh; Singh, Anupama; Meena, Anil; Ghosal, Ritika; Ranganathan, Madhav; Bandyopadhyay, Amitabha

    2013-01-01

    Differentiated tissues may be considered as materials with distinct properties. The differentiation program of a given tissue ensures that it acquires material properties commensurate with its function. It may be hypothesized that some of these properties are acquired through production of tissue-specific metabolites synthesized by metabolic enzymes. To establish correlation between metabolism and organogenesis we have carried out a genome-wide expression study of metabolism related genes by RNA in-situ hybridization. 23% of the metabolism related genes studied are expressed in a tissue-restricted but not tissue-exclusive manner. We have conducted the screen on whole mount chicken (Gallus gallus) embryos from four distinct developmental stages to correlate dynamic changes in expression patterns of metabolic enzymes with spatio-temporally unique developmental events. Our data strongly suggests that unique combinations of metabolism related genes, and not specific metabolic pathways, are upregulated during differentiation. Further, expression of metabolism related genes in well established signaling centers that regulate different aspects of morphogenesis indicates developmental roles of some of the metabolism related genes. The database of tissue-restricted expression patterns of metabolism related genes, generated in this study, should serve as a resource for systematic identification of these genes with tissue-specific functions during development. Finally, comprehensive understanding of differentiation is not possible unless the downstream genes of a differentiation cascade are identified. We propose, metabolic enzymes constitute a significant portion of these downstream target genes. Thus our study should help elucidate different aspects of tissue differentiation. PMID:23717462

  2. A genome-wide screen indicates correlation between differentiation and expression of metabolism related genes.

    PubMed

    Roy, Priti; Kumar, Brijesh; Shende, Akhilesh; Singh, Anupama; Meena, Anil; Ghosal, Ritika; Ranganathan, Madhav; Bandyopadhyay, Amitabha

    2013-01-01

    Differentiated tissues may be considered as materials with distinct properties. The differentiation program of a given tissue ensures that it acquires material properties commensurate with its function. It may be hypothesized that some of these properties are acquired through production of tissue-specific metabolites synthesized by metabolic enzymes. To establish correlation between metabolism and organogenesis we have carried out a genome-wide expression study of metabolism related genes by RNA in-situ hybridization. 23% of the metabolism related genes studied are expressed in a tissue-restricted but not tissue-exclusive manner. We have conducted the screen on whole mount chicken (Gallus gallus) embryos from four distinct developmental stages to correlate dynamic changes in expression patterns of metabolic enzymes with spatio-temporally unique developmental events. Our data strongly suggests that unique combinations of metabolism related genes, and not specific metabolic pathways, are upregulated during differentiation. Further, expression of metabolism related genes in well established signaling centers that regulate different aspects of morphogenesis indicates developmental roles of some of the metabolism related genes. The database of tissue-restricted expression patterns of metabolism related genes, generated in this study, should serve as a resource for systematic identification of these genes with tissue-specific functions during development. Finally, comprehensive understanding of differentiation is not possible unless the downstream genes of a differentiation cascade are identified. We propose, metabolic enzymes constitute a significant portion of these downstream target genes. Thus our study should help elucidate different aspects of tissue differentiation.

  3. Defining the Transcriptional Landscape during Cytomegalovirus Latency with Single-Cell RNA Sequencing

    PubMed Central

    2018-01-01

    ABSTRACT Primary infection with human cytomegalovirus (HCMV) results in a lifelong infection due to its ability to establish latent infection, with one characterized viral reservoir being hematopoietic cells. Although reactivation from latency causes serious disease in immunocompromised individuals, our molecular understanding of latency is limited. Here, we delineate viral gene expression during natural HCMV persistent infection by analyzing the massive transcriptome RNA sequencing (RNA-seq) atlas generated by the Genotype-Tissue Expression (GTEx) project. This systematic analysis reveals that HCMV persistence in vivo is prevalent in diverse tissues. Notably, we find only viral transcripts that resemble gene expression during various stages of lytic infection with no evidence of any highly restricted latency-associated viral gene expression program. To further define the transcriptional landscape during HCMV latent infection, we also used single-cell RNA-seq and a tractable experimental latency model. In contrast to some current views on latency, we also find no evidence for any highly restricted latency-associated viral gene expression program. Instead, we reveal that latency-associated gene expression largely mirrors a late lytic viral program, albeit at much lower levels of expression. Overall, our work has the potential to revolutionize our understanding of HCMV persistence and suggests that latency is governed mainly by quantitative changes, with a limited number of qualitative changes, in viral gene expression. PMID:29535194

  4. Reference Genes for Accurate Transcript Normalization in Citrus Genotypes under Different Experimental Conditions

    PubMed Central

    Mafra, Valéria; Kubo, Karen S.; Alves-Ferreira, Marcio; Ribeiro-Alves, Marcelo; Stuart, Rodrigo M.; Boava, Leonardo P.; Rodrigues, Carolina M.; Machado, Marcos A.

    2012-01-01

    Real-time reverse transcription PCR (RT-qPCR) has emerged as an accurate and widely used technique for expression profiling of selected genes. However, obtaining reliable measurements depends on the selection of appropriate reference genes for gene expression normalization. The aim of this work was to assess the expression stability of 15 candidate genes to determine which set of reference genes is best suited for transcript normalization in citrus in different tissues and organs and leaves challenged with five pathogens (Alternaria alternata, Phytophthora parasitica, Xylella fastidiosa and Candidatus Liberibacter asiaticus). We tested traditional genes used for transcript normalization in citrus and orthologs of Arabidopsis thaliana genes described as superior reference genes based on transcriptome data. geNorm and NormFinder algorithms were used to find the best reference genes to normalize all samples and conditions tested. Additionally, each biotic stress was individually analyzed by geNorm. In general, FBOX (encoding a member of the F-box family) and GAPC2 (GAPDH) was the most stable candidate gene set assessed under the different conditions and subsets tested, while CYP (cyclophilin), TUB (tubulin) and CtP (cathepsin) were the least stably expressed genes found. Validation of the best suitable reference genes for normalizing the expression level of the WRKY70 transcription factor in leaves infected with Candidatus Liberibacter asiaticus showed that arbitrary use of reference genes without previous testing could lead to misinterpretation of data. Our results revealed FBOX, SAND (a SAND family protein), GAPC2 and UPL7 (ubiquitin protein ligase 7) to be superior reference genes, and we recommend their use in studies of gene expression in citrus species and relatives. This work constitutes the first systematic analysis for the selection of superior reference genes for transcript normalization in different citrus organs and under biotic stress. PMID:22347455

  5. Global identification and expression analysis of stress-responsive genes of the Argonaute family in apple.

    PubMed

    Xu, Ruirui; Liu, Caiyun; Li, Ning; Zhang, Shizhong

    2016-12-01

    Argonaute (AGO) proteins, which are found in yeast, animals, and plants, are the core molecules of the RNA-induced silencing complex. These proteins play important roles in plant growth, development, and responses to biotic stresses. The complete analysis and classification of the AGO gene family have been recently reported in different plants. Nevertheless, systematic analysis and expression profiling of these genes have not been performed in apple (Malus domestica). Approximately 15 AGO genes were identified in the apple genome. The phylogenetic tree, chromosome location, conserved protein motifs, gene structure, and expression of the AGO gene family in apple were analyzed for gene prediction. All AGO genes were phylogenetically clustered into four groups (i.e., AGO1, AGO4, MEL1/AGO5, and ZIPPY/AGO7) with the AGO genes of Arabidopsis. These groups of the AGO gene family were statistically analyzed and compared among 31 plant species. The predicted apple AGO genes are distributed across nine chromosomes at different densities and include three segment duplications. Expression studies indicated that 15 AGO genes exhibit different expression patterns in at least one of the tissues tested. Additionally, analysis of gene expression levels indicated that the genes are mostly involved in responses to NaCl, PEG, heat, and low-temperature stresses. Hence, several candidate AGO genes are involved in different aspects of physiological and developmental processes and may play an important role in abiotic stress responses in apple. To the best of our knowledge, this study is the first to report a comprehensive analysis of the apple AGO gene family. Our results provide useful information to understand the classification and putative functions of these proteins, especially for gene members that may play important roles in abiotic stress responses in M. hupehensis.

  6. Identification and validation of reference genes for quantitative real-time PCR normalization and its applications in lycium.

    PubMed

    Zeng, Shaohua; Liu, Yongliang; Wu, Min; Liu, Xiaomin; Shen, Xiaofei; Liu, Chunzhao; Wang, Ying

    2014-01-01

    Lycium barbarum and L. ruthenicum are extensively used as traditional Chinese medicinal plants. Next generation sequencing technology provides a powerful tool for analyzing transcriptomic profiles of gene expression in non-model species. Such gene expression can then be confirmed with quantitative real-time polymerase chain reaction (qRT-PCR). Therefore, use of systematically identified suitable reference genes is a prerequisite for obtaining reliable gene expression data. Here, we calculated the expression stability of 18 candidate reference genes across samples from different tissues and grown under salt stress using geNorm and NormFinder procedures. The geNorm-determined rank of reference genes was similar to those defined by NormFinder with some differences. Both procedures confirmed that the single most stable reference gene was ACNTIN1 for L. barbarum fruits, H2B1 for L. barbarum roots, and EF1α for L. ruthenicum fruits. PGK3, H2B2, and PGK3 were identified as the best stable reference genes for salt-treated L. ruthenicum leaves, roots, and stems, respectively. H2B1 and GAPDH1+PGK1 for L. ruthenicum and SAMDC2+H2B1 for L. barbarum were the best single and/or combined reference genes across all samples. Finally, expression of salt-responsive gene NAC, fruit ripening candidate gene LrPG, and anthocyanin genes were investigated to confirm the validity of the selected reference genes. Suitable reference genes identified in this study provide a foundation for accurately assessing gene expression and further better understanding of novel gene function to elucidate molecular mechanisms behind particular biological/physiological processes in Lycium.

  7. lpxC and yafS are the most suitable internal controls to normalize real time RT-qPCR expression in the phytopathogenic bacteria Dickeya dadantii.

    PubMed

    Hommais, Florence; Zghidi-Abouzid, Ouafa; Oger-Desfeux, Christine; Pineau-Chapelle, Emilie; Van Gijsegem, Frederique; Nasser, William; Reverchon, Sylvie

    2011-01-01

    Quantitative RT-PCR is the method of choice for studying, with both sensitivity and accuracy, the expression of genes. A reliable normalization of the data, using several reference genes, is critical for an accurate quantification of gene expression. Here, we propose a set of reference genes, of the phytopathogenic bacteria Dickeya dadantii and Pectobacterium atrosepticum, which are stable in a wide range of growth conditions. We extracted, from a D. dadantii micro-array transcript profile dataset comprising thirty-two different growth conditions, an initial set of 49 expressed genes with very low variation in gene expression. Out of these, we retained 10 genes representing different functional categories, different levels of expression (low, medium, and high) and with no systematic variation in expression correlating with growth conditions. We measured the expression of these reference gene candidates using quantitative RT-PCR in 50 different experimental conditions, mimicking the environment encountered by the bacteria in their host and directly during the infection process in planta. The two most stable genes (ABF-0017965 (lpxC) and ABF-0020529 (yafS) were successfully used for normalization of RT-qPCR data. Finally, we demonstrated that the ortholog of lpxC and yafS in Pectobacterium atrosepticum also showed stable expression in diverse growth conditions. We have identified at least two genes, lpxC (ABF-0017965) and yafS (ABF-0020509), whose expressions are stable in a wide range of growth conditions and during infection. Thus, these genes are considered suitable for use as reference genes for the normalization of real-time RT-qPCR data of the two main pectinolytic phytopathogenic bacteria D. dadantii and P. atrosepticum and, probably, of other Enterobacteriaceae. Moreover, we defined general criteria to select good reference genes in bacteria.

  8. Sex-based differences in gene expression in hippocampus following postnatal lead exposure

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schneider, J.S., E-mail: jay.schneider@jefferson.edu; Anderson, D.W.; Sonnenahalli, H.

    The influence of sex as an effect modifier of childhood lead poisoning has received little systematic attention. Considering the paucity of information available concerning the interactive effects of lead and sex on the brain, the current study examined the interactive effects of lead and sex on gene expression patterns in the hippocampus, a structure involved in learning and memory. Male or female rats were fed either 1500 ppm lead-containing chow or control chow for 30 days beginning at weaning.Blood lead levels were 26.7 {+-} 2.1 {mu}g/dl and 27.1 {+-} 1.7 {mu}g/dl for females and males, respectively. The expression of 175more » unique genes was differentially regulated between control male and female rats. A total of 167 unique genes were differentially expressed in response to lead in either males or females. Lead exposure had a significant effect without a significant difference between male and female responses in 77 of these genes. In another set of 71 genes, there were significant differences in male vs. female response. A third set of 30 genes was differentially expressed in opposite directions in males vs. females, with the majority of genes expressed at a lower level in females than in males. Highly differentially expressed genes in males and females following lead exposure were associated with diverse biological pathways and functions. These results show that a brief exposure to lead produced significant changes in expression of a variety of genes in the hippocampus and that the response of the brain to a given lead exposure may vary depending on sex. - Highlights: > Postnatal lead exposure has a significant effect on hippocampal gene expression patterns. > At least one set of genes was affected in opposite directions in males and females. > Differentially expressed genes were associated with diverse biological pathways.« less

  9. Transcriptomics and the Mediterranean Diet: A Systematic Review

    PubMed Central

    Herrera-Marcos, Luis V.; Lou-Bonafonte, José M.; Arnal, Carmen; Navarro, María A.; Osada, Jesús

    2017-01-01

    The Mediterranean diet has been proven to be highly effective in the prevention of cardiovascular diseases and cancer and in decreasing overall mortality. Nowadays, transcriptomics is gaining particular relevance due to the existence of non-coding RNAs capable of regulating many biological processes. The present work describes a systematic review of current evidence supporting the influence of the Mediterranean diet on transcriptomes of different tissues in various experimental models. While information on regulatory RNA is very limited, they seem to contribute to the effect. Special attention has been given to the oily matrix of virgin olive oil. In this regard, monounsaturated fatty acid-rich diets prevented the expression of inflammatory genes in different tissues, an action also observed after the administration of olive oil phenolic compounds. Among these, tyrosol, hydroxytyrosol, and secoiridoids have been found to be particularly effective in cell cycle expression. Less explored terpenes, such as oleanolic acid, are important modulators of circadian clock genes. The wide range of studied tissues and organisms indicate that response to these compounds is universal and poses an important level of complexity considering the different genes expressed in each tissue and the number of different tissues in an organism. PMID:28486416

  10. Effect of Micro-RNA on Tenocytes and Tendon-Related Gene Expression: A Systematic Review.

    PubMed

    Dubin, Jeremy A; Greenberg, Daniel R; Iglinski-Benjamin, Kag C; Abrams, Geoffrey D

    2018-06-06

    The purpose of the review was to synthesize the current literature regarding the effect of miRNA on biological processes known to be involved in tendon and tenocyte development and homeostasis. Using multiple databases, a systematic review was performed with a customized search term crafted to identify any study examining micro-RNA in relation to tendon and/or tenocytes. Results were classified based on the following categories: gene expression, tenocyte development and differentiation, tendon tissue repair, and tenocyte senescence. A total of 3,112 potentially relevant studies were reviewed, and after exclusion criteria was applied, 15 investigations were included in the final analysis. There were 14 specific miRNA included in this review, with 11 studies reporting on tendon-related gene expression, five reporting on tendon development and/or tenocyte differentiation, six reporting on tendon tissue repair, and five reporting on tenocyte senescence. The miR-29 family was the most commonly reported micro-RNA in the investigation. We also report on a number of micro-RNA which are associated with both positive and negative effects on tendon homeostasis. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  11. Transcriptomics and the Mediterranean Diet: A Systematic Review.

    PubMed

    Herrera-Marcos, Luis V; Lou-Bonafonte, José M; Arnal, Carmen; Navarro, María A; Osada, Jesús

    2017-05-09

    The Mediterranean diet has been proven to be highly effective in the prevention of cardiovascular diseases and cancer and in decreasing overall mortality. Nowadays, transcriptomics is gaining particular relevance due to the existence of non-coding RNAs capable of regulating many biological processes. The present work describes a systematic review of current evidence supporting the influence of the Mediterranean diet on transcriptomes of different tissues in various experimental models. While information on regulatory RNA is very limited, they seem to contribute to the effect. Special attention has been given to the oily matrix of virgin olive oil. In this regard, monounsaturated fatty acid-rich diets prevented the expression of inflammatory genes in different tissues, an action also observed after the administration of olive oil phenolic compounds. Among these, tyrosol, hydroxytyrosol, and secoiridoids have been found to be particularly effective in cell cycle expression. Less explored terpenes, such as oleanolic acid, are important modulators of circadian clock genes. The wide range of studied tissues and organisms indicate that response to these compounds is universal and poses an important level of complexity considering the different genes expressed in each tissue and the number of different tissues in an organism.

  12. Characterization of candidate genes in inflammatory bowel disease–associated risk loci

    PubMed Central

    Peloquin, Joanna M.; Sartor, R. Balfour; Newberry, Rodney D.; McGovern, Dermot P.; Yajnik, Vijay; Lira, Sergio A.

    2016-01-01

    GWAS have linked SNPs to risk of inflammatory bowel disease (IBD), but a systematic characterization of disease-associated genes has been lacking. Prior studies utilized microarrays that did not capture many genes encoded within risk loci or defined expression quantitative trait loci (eQTLs) using peripheral blood, which is not the target tissue in IBD. To address these gaps, we sought to characterize the expression of IBD-associated risk genes in disease-relevant tissues and in the setting of active IBD. Terminal ileal (TI) and colonic mucosal tissues were obtained from patients with Crohn’s disease or ulcerative colitis and from healthy controls. We developed a NanoString code set to profile 678 genes within IBD risk loci. A subset of patients and controls were genotyped for IBD-associated risk SNPs. Analyses included differential expression and variance analysis, weighted gene coexpression network analysis, and eQTL analysis. We identified 116 genes that discriminate between healthy TI and colon samples and uncovered patterns in variance of gene expression that highlight heterogeneity of disease. We identified 107 coexpressed gene pairs for which transcriptional regulation is either conserved or reversed in an inflammation-independent or -dependent manner. We demonstrate that on average approximately 60% of disease-associated genes are differentially expressed in inflamed tissue. Last, we identified eQTLs with either genotype-only effects on expression or an interaction effect between genotype and inflammation. Our data reinforce tissue specificity of expression in disease-associated candidate genes, highlight genes and gene pairs that are regulated in disease-relevant tissue and inflammation, and provide a foundation to advance the understanding of IBD pathogenesis. PMID:27668286

  13. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii.

    PubMed

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong

    2014-10-16

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.

  14. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii

    PubMed Central

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong

    2014-01-01

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260

  15. Genome-wide dynamics of alternative polyadenylation in rice

    PubMed Central

    Fu, Haihui; Yang, Dewei; Su, Wenyue; Ma, Liuyin; Shen, Yingjia; Ji, Guoli; Ye, Xinfu; Wu, Xiaohui

    2016-01-01

    Alternative polyadenylation (APA), in which a transcript uses one of the poly(A) sites to define its 3′-end, is a common regulatory mechanism in eukaryotic gene expression. However, the potential of APA in determining crop agronomic traits remains elusive. This study systematically tallied poly(A) sites of 14 different rice tissues and developmental stages using the poly(A) tag sequencing (PAT-seq) approach. The results indicate significant involvement of APA in developmental and quantitative trait loci (QTL) gene expression. About 48% of all expressed genes use APA to generate transcriptomic and proteomic diversity. Some genes switch APA sites, allowing differentially expressed genes to use alternate 3′ UTRs. Interestingly, APA in mature pollen is distinct where differential expression levels of a set of poly(A) factors and different distributions of APA sites are found, indicating a unique mRNA 3′-end formation regulation during gametophyte development. Equally interesting, statistical analyses showed that QTL tends to use APA for regulation of gene expression of many agronomic traits, suggesting a potential important role of APA in rice production. These results provide thus far the most comprehensive and high-resolution resource for advanced analysis of APA in crops and shed light on how APA is associated with trait formation in eukaryotes. PMID:27733415

  16. A Genetic Toolbox for Modulating the Expression of Heterologous Genes in the Cyanobacterium Synechocystis sp. PCC 6803

    DOE PAGES

    Wang, Bo; Eckert, Carrie; Maness, Pin -Ching; ...

    2017-12-12

    Cyanobacteria, genetic models for photosynthesis research for decades, have recently become attractive hosts for producing renewable fuels and chemicals, owing to their genetic tractability, relatively fast growth, and their ability to utilize sunlight, fix carbon dioxide, and in some cases, fix nitrogen. Despite significant advances, there is still an urgent demand for synthetic biology tools in order to effectively manipulate genetic circuits in cyanobacteria. In this study, we have compared a total of 17 natural and chimeric promoters, focusing on expression of the ethylene-forming enzyme (EFE) in the cyanobacterium Synechocystis sp. PCC 6803. We report the finding that the E.more » coli σ 70 promoter Ptrc is superior compared to the previously reported strong promoters, such as PcpcB and PpsbA, for the expression of EFE. In addition, we found that the EFE expression level was very sensitive to the 5'-untranslated region upstream of the open reading frame. A library of ribosome binding sites (RBSs) was rationally designed and was built and systematically characterized. We demonstrate a strategy complementary to the RBS prediction software to facilitate the rational design of an RBS library to optimize the gene expression in cyanobacteria. Our results show that the EFE expression level is dramatically enhanced through these synthetic biology tools and is no longer the rate-limiting step for cyanobacterial ethylene production. Furthermore, these systematically characterized promoters and the RBS design strategy can serve as useful tools to tune gene expression levels and to identify and mitigate metabolic bottlenecks in cyanobacteria.« less

  17. A Genetic Toolbox for Modulating the Expression of Heterologous Genes in the Cyanobacterium Synechocystis sp. PCC 6803

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Bo; Eckert, Carrie; Maness, Pin -Ching

    Cyanobacteria, genetic models for photosynthesis research for decades, have recently become attractive hosts for producing renewable fuels and chemicals, owing to their genetic tractability, relatively fast growth, and their ability to utilize sunlight, fix carbon dioxide, and in some cases, fix nitrogen. Despite significant advances, there is still an urgent demand for synthetic biology tools in order to effectively manipulate genetic circuits in cyanobacteria. In this study, we have compared a total of 17 natural and chimeric promoters, focusing on expression of the ethylene-forming enzyme (EFE) in the cyanobacterium Synechocystis sp. PCC 6803. We report the finding that the E.more » coli σ 70 promoter Ptrc is superior compared to the previously reported strong promoters, such as PcpcB and PpsbA, for the expression of EFE. In addition, we found that the EFE expression level was very sensitive to the 5'-untranslated region upstream of the open reading frame. A library of ribosome binding sites (RBSs) was rationally designed and was built and systematically characterized. We demonstrate a strategy complementary to the RBS prediction software to facilitate the rational design of an RBS library to optimize the gene expression in cyanobacteria. Our results show that the EFE expression level is dramatically enhanced through these synthetic biology tools and is no longer the rate-limiting step for cyanobacterial ethylene production. Furthermore, these systematically characterized promoters and the RBS design strategy can serve as useful tools to tune gene expression levels and to identify and mitigate metabolic bottlenecks in cyanobacteria.« less

  18. The effects of omega-3 polyunsaturated fatty acids and genetic variants on methylation levels of the interleukin-6 gene promoter

    USDA-ARS?s Scientific Manuscript database

    Scope: Omega-3 PUFAs (n-3 PUFAs) reduce IL-6 gene expression, but their effects on transcription regulatory mechanisms are unknown. We aimed to conduct an integrated analysis with both population and in vitro studies to systematically explore the relationships among n-3 PUFA, DNA methylation, single...

  19. The effect of ACACB cis-variants on gene expression and metabolic traits.

    PubMed

    Ma, Lijun; Mondal, Ashis K; Murea, Mariana; Sharma, Neeraj K; Tönjes, Anke; Langberg, Kurt A; Das, Swapan K; Franks, Paul W; Kovacs, Peter; Antinozzi, Peter A; Stumvoll, Michael; Parks, John S; Elbein, Steven C; Freedman, Barry I

    2011-01-01

    Acetyl Coenzyme A carboxylase β (ACACB) is the rate-limiting enzyme in fatty acid oxidation, and continuous fatty acid oxidation in Acacb knock-out mice increases insulin sensitivity. Systematic human studies have not been performed to evaluate whether ACACB variants regulate gene expression and insulin sensitivity in skeletal muscle and adipose tissues. We sought to determine whether ACACB transcribed variants were associated with ACACB gene expression and insulin sensitivity in non-diabetic African American (AA) and European American (EA) adults. ACACB transcribed single nucleotide polymorphisms (SNPs) were genotyped in 105 EAs and 46 AAs whose body mass index (BMI), lipid profiles and ACACB gene expression in subcutaneous adipose and skeletal muscle had been measured. Allelic expression imbalance (AEI) was assessed in lymphoblast cell lines from heterozygous subjects in an additional EA sample (n = 95). Selected SNPs were further examined for association with insulin sensitivity in a cohort of 417 EAs and 153 AAs. ACACB transcribed SNP rs2075260 (A/G) was associated with adipose ACACB messenger RNA expression in EAs and AAs (p = 3.8×10(-5), dominant model in meta-analysis, Stouffer method), with the (A) allele representing lower gene expression in adipose and higher insulin sensitivity in EAs (p = 0.04). In EAs, adipose ACACB expression was negatively associated with age and sex-adjusted BMI (r = -0.35, p = 0.0002). Common variants within the ACACB locus appear to regulate adipose gene expression in humans. Body fat (represented by BMI) may further regulate adipose ACACB gene expression in the EA population.

  20. Discovering Condition-Specific Gene Co-Expression Patterns Using Gaussian Mixture Models: A Cancer Case Study.

    PubMed

    Ficklin, Stephen P; Dunwoodie, Leland J; Poehlman, William L; Watson, Christopher; Roche, Kimberly E; Feltus, F Alex

    2017-08-17

    A gene co-expression network (GCN) describes associations between genes and points to genetic coordination of biochemical pathways. However, genetic correlations in a GCN are only detectable if they are present in the sampled conditions. With the increasing quantity of gene expression samples available in public repositories, there is greater potential for discovery of genetic correlations from a variety of biologically interesting conditions. However, even if gene correlations are present, their discovery can be masked by noise. Noise is introduced from natural variation (intrinsic and extrinsic), systematic variation (caused by sample measurement protocols and instruments), and algorithmic and statistical variation created by selection of data processing tools. A variety of published studies, approaches and methods attempt to address each of these contributions of variation to reduce noise. Here we describe an approach using Gaussian Mixture Models (GMMs) to address natural extrinsic (condition-specific) variation during network construction from mixed input conditions. To demonstrate utility, we build and analyze a condition-annotated GCN from a compendium of 2,016 mixed gene expression data sets from five tumor subtypes obtained from The Cancer Genome Atlas. Our results show that GMMs help discover tumor subtype specific gene co-expression patterns (modules) that are significantly enriched for clinical attributes.

  1. Comparative analysis of gene expression profiles of hip articular cartilage between non-traumatic necrosis and osteoarthritis.

    PubMed

    Wang, Wenyu; Liu, Yang; Hao, Jingcan; Zheng, Shuyu; Wen, Yan; Xiao, Xiao; He, Awen; Fan, Qianrui; Zhang, Feng; Liu, Ruiyu

    2016-10-10

    Hip cartilage destruction is consistently observed in the non-traumatic osteonecrosis of femoral head (NOFH) and accelerates its bone necrosis. The molecular mechanism underlying the cartilage damage of NOFH remains elusive. In this study, we conducted a systematically comparative study of gene expression profiles between NOFH and osteoarthritis (OA). Hip articular cartilage specimens were collected from 12 NOFH patients and 12 controls with traumatic femoral neck fracture for microarray (n=4) and quantitative real-time PCR validation experiments (n=8). Gene expression profiling of articular cartilage was performed using Agilent Human 4×44K Microarray chip. The accuracy of microarray experiment was further validated by qRT-PCR. Gene expression results of OA hip cartilage were derived from previously published study. Significance Analysis of Microarrays (SAM) software was applied for identifying differently expressed genes. Gene ontology (GO) and pathway enrichment analysis were conducted by Gene Set Enrichment Analysis software and DAVID tool, respectively. Totally, 27 differently expressed genes were identified for NOFH. Comparing the gene expression profiles of NOFH cartilage and OA cartilage detected 8 common differently expressed genes, including COL5A1, OGN, ANGPTL4, CRIP1, NFIL3, METRNL, ID2 and STEAP1. GO comparative analysis identified 10 common significant GO terms, mainly implicated in apoptosis and development process. Pathway comparative analysis observed that ECM-receptor interaction pathway and focal adhesion pathway were enriched in the differently expressed genes of both NOFH and hip OA. In conclusion, we identified a set of differently expressed genes, GO and pathways for NOFH articular destruction, some of which were also involved in the hip OA. Our study results may help to reveal the pathogenetic similarities and differences of cartilage damage of NOFH and hip OA. Copyright © 2016 Elsevier B.V. All rights reserved.

  2. Deletion analysis of Streptococcus pneumoniae late competence genes distinguishes virulence determinants that are dependent or independent of competence induction

    PubMed Central

    Zhu, Luchang; Lin, Jingjun; Kuang, Zhizhou; Vidal, Jorge E.; Lau, Gee W.

    2015-01-01

    Summary The competence regulon of Streptococcus pneumoniae (pneumococcus) is crucial for genetic transformation. During competence development, the alternative sigma factor ComX is activated, which in turn, initiates transcription of 80 “late” competence genes. Interestingly, only 16 late genes are essential for genetic transformation. We hypothesized that these late genes that are dispensable for competence are beneficial to pneumococcal fitness during infection. These late genes were systematically deleted, and the resulting mutants were examined for their fitness during mouse models of bacteremia and acute pneumonia. Among these, 14 late genes were important for fitness in mice. Significantly, deletion of some late genes attenuated pneumococcal fitness to the same level in both wild-type and ComX-null genetic backgrounds, suggesting that the constitutive baseline expression of these genes was important for bacterial fitness. In contrast, some mutants were attenuated only in the wild-type genetic background but not in the ComX-null background, suggesting that specific expression of these genes during competence state contributed to pneumococcal fitness. Increased virulence during competence state was partially caused by the induction of allolytic enzymes that enhanced pneumolysin release. These results distinguish the role of basal expression versus competence induction in virulence functions encoded by ComX-regulated late competence genes. Graphical abstract During genetic transformation of pneumococcus, the alternative sigma factor ComX regulates expression of 14 late competence genes important for virulence. The constitutive baseline expression of some of these genes is important for bacteremia and acute pneumonia infections. In contrast, elevated expression of DprA, CbpD, CibAB, and Cinbox are dependent on competence development, enhancing the release of pneumolysin. These results distinguish the role of basal expression versus competence induction in virulence determinants regulated by ComX. PMID:25846124

  3. The Schizophrenia Risk Gene MIR137 Acts as a Hippocampal Gene Network Node Orchestrating the Expression of Genes Relevant to Nervous System Development and Function

    PubMed Central

    Loohuis, Nikkie FM Olde; Kasri, Nael Nadif; Glennon, Jeffrey C; van Bokhoven, Hans; Hébert, Sébastien S; Kaplan, Barry B.; Martens, Gerard JM; Aschrafi, Armaz

    2016-01-01

    MicroRNAs (miRs) are small regulatory molecules, which orchestrate neuronal development and plasticity through modulation of complex gene networks. microRNA-137 (miR-137) is a brain-enriched RNA with a critical role in regulating brain development and in mediating synaptic plasticity. Importantly, mutations in this miR are associated with the pathoetiology of schizophrenia (SZ), and there is a widespread assumption that disruptions in miR-137 expression lead to aberrant expression of gene regulatory networks associated with SZ. To systematically identify the mRNA targets for this miR, we performed miR-137 gain- and loss-of-function experiments in primary rat hippocampal neurons and profiled differentially expressed mRNAs through next-generation sequencing. We identified 500 genes that were bidirectionally activated or repressed in their expression by the modulation of miR-137 levels. Gene ontology analysis using two independent software resources suggested functions for these miR-137-regulated genes in neurodevelopmental processes, neuronal maturation processes and cell maintenance, all of which known to be critical for proper brain circuitry formation. Since many of the putative miR-137 targets identified here also have been previously shown to be associated with SZ, we propose that this miR acts as a critical gene network hub contributing to the pathophysiology of this neurodevelopmental disorder. PMID:26925706

  4. From Genome to Function: Systematic Analysis of the Soil Bacterium Bacillus Subtilis

    PubMed Central

    Crawshaw, Samuel G.; Wipat, Anil

    2001-01-01

    Bacillus subtilis is a sporulating Gram-positive bacterium that lives primarily in the soil and associated water sources. Whilst this bacterium has been studied extensively in the laboratory, relatively few studies have been undertaken to study its activity in natural environments. The publication of the B. subtilis genome sequence and subsequent systematic functional analysis programme have provided an opportunity to develop tools for analysing the role and expression of Bacillus genes in situ. In this paper we discuss analytical approaches that are being developed to relate genes to function in environments such as the rhizosphere. PMID:18628943

  5. Direct Capture and Heterologous Expression of Salinispora Natural Product Genes for the Biosynthesis of Enterocin

    PubMed Central

    2015-01-01

    Heterologous expression of secondary metabolic pathways is a promising approach for the discovery and characterization of bioactive natural products. Herein we report the first heterologous expression of a natural product from the model marine actinomycete genus Salinispora. Using the recently developed method of yeast-mediated transformation-associated recombination for natural product gene clusters, we captured a type II polyketide synthase pathway from Salinispora pacifica with high homology to the enterocin pathway from Streptomyces maritimus and successfully produced enterocin in two different Streptomyces host strains. This result paves the way for the systematic interrogation of Salinispora’s promising secondary metabolome. PMID:25382643

  6. Direct capture and heterologous expression of Salinispora natural product genes for the biosynthesis of enterocin.

    PubMed

    Bonet, Bailey; Teufel, Robin; Crüsemann, Max; Ziemert, Nadine; Moore, Bradley S

    2015-03-27

    Heterologous expression of secondary metabolic pathways is a promising approach for the discovery and characterization of bioactive natural products. Herein we report the first heterologous expression of a natural product from the model marine actinomycete genus Salinispora. Using the recently developed method of yeast-mediated transformation-associated recombination for natural product gene clusters, we captured a type II polyketide synthase pathway from Salinispora pacifica with high homology to the enterocin pathway from Streptomyces maritimus and successfully produced enterocin in two different Streptomyces host strains. This result paves the way for the systematic interrogation of Salinispora's promising secondary metabolome.

  7. DCGL v2.0: an R package for unveiling differential regulation from differential co-expression.

    PubMed

    Yang, Jing; Yu, Hui; Liu, Bao-Hong; Zhao, Zhongming; Liu, Lei; Ma, Liang-Xiao; Li, Yi-Xue; Li, Yuan-Yuan

    2013-01-01

    Differential co-expression analysis (DCEA) has emerged in recent years as a novel, systematic investigation into gene expression data. While most DCEA studies or tools focus on the co-expression relationships among genes, some are developing a potentially more promising research domain, differential regulation analysis (DRA). In our previously proposed R package DCGL v1.0, we provided functions to facilitate basic differential co-expression analyses; however, the output from DCGL v1.0 could not be translated into differential regulation mechanisms in a straightforward manner. To advance from DCEA to DRA, we upgraded the DCGL package from v1.0 to v2.0. A new module named "Differential Regulation Analysis" (DRA) was designed, which consists of three major functions: DRsort, DRplot, and DRrank. DRsort selects differentially regulated genes (DRGs) and differentially regulated links (DRLs) according to the transcription factor (TF)-to-target information. DRrank prioritizes the TFs in terms of their potential relevance to the phenotype of interest. DRplot graphically visualizes differentially co-expressed links (DCLs) and/or TF-to-target links in a network context. In addition to these new modules, we streamlined the codes from v1.0. The evaluation results proved that our differential regulation analysis is able to capture the regulators relevant to the biological subject. With ample functions to facilitate differential regulation analysis, DCGL v2.0 was upgraded from a DCEA tool to a DRA tool, which may unveil the underlying differential regulation from the observed differential co-expression. DCGL v2.0 can be applied to a wide range of gene expression data in order to systematically identify novel regulators that have not yet been documented as critical. DCGL v2.0 package is available at http://cran.r-project.org/web/packages/DCGL/index.html or at our project home page http://lifecenter.sgst.cn/main/en/dcgl.jsp.

  8. Ethylene regulation of carotenoid accumulation and carotenogenic gene expression in colour-contrasted apricot varieties (Prunus armeniaca).

    PubMed

    Marty, I; Bureau, S; Sarkissian, G; Gouble, B; Audergon, J M; Albagnac, G

    2005-07-01

    In order to elucidate the regulation mechanisms of carotenoid biosynthesis in apricot fruit (Prunus armeniaca), carotenoid content and carotenogenic gene expression were analysed as a function of ethylene production in two colour-contrasted apricot varieties. Fruits from Goldrich (GO) were orange, while Moniqui (MO) fruits were white. Biochemical analysis showed that GO accumulated precursors of the uncoloured carotenoids, phytoene and phytofluene, and the coloured carotenoid, beta-carotene, while Moniqui (MO) fruits only accumulated phytoene and phytofluene but no beta-carotene. Physiological analysis showed that ethylene production was clearly weaker in GO than in MO. Carotenogenic gene expression (Psy-1, Pds, and Zds) and carotenoid accumulation were measured with respect to ethylene production which is initiated in mature green fruits at the onset of the climacteric stage or following exo-ethylene or ethylene-receptor inhibitor (1-MCP) treatments. Results showed (i) systematically stronger expression of carotenogenic genes in white than in orange fruits, even for the Zds gene involved in beta-carotene synthesis that is undetectable in MO fruits, (ii) ethylene-induction of Psy-1 and Pds gene expression and the corresponding product accumulation, (iii) Zds gene expression and beta-carotene production independent of ethylene. The different results obtained at physiological, biochemical, and molecular levels revealed the complex regulation of carotenoid biosynthesis in apricots and led to suggestions regarding some possible ways to regulate it.

  9. Transcriptomic Profiling Analysis of Arabidopsis thaliana Treated with Exogenous Myo-Inositol

    PubMed Central

    Ye, Wenxing; Ren, Weibo; Kong, Lingqi; Zhang, Wanjun; Wang, Tao

    2016-01-01

    Myo-insositol (MI) is a crucial substance in the growth and developmental processes in plants. It is commonly added to the culture medium to promote adventitious shoot development. In our previous work, MI was found in influencing Agrobacterium-mediated transformation. In this report, a high-throughput RNA sequencing technique (RNA-Seq) was used to investigate differently expressed genes in one-month-old Arabidopsis seedling grown on MI free or MI supplemented culture medium. The results showed that 21,288 and 21,299 genes were detected with and without MI treatment, respectively. The detected genes included 184 new genes that were not annotated in the Arabidopsis thaliana reference genome. Additionally, 183 differentially expressed genes were identified (DEGs, FDR ≤0.05, log2 FC≥1), including 93 up-regulated genes and 90 down-regulated genes. The DEGs were involved in multiple pathways, such as cell wall biosynthesis, biotic and abiotic stress response, chromosome modification, and substrate transportation. Some significantly differently expressed genes provided us with valuable information for exploring the functions of exogenous MI. RNA-Seq results showed that exogenous MI could alter gene expression and signaling transduction in plant cells. These results provided a systematic understanding of the functions of exogenous MI in detail and provided a foundation for future studies. PMID:27603208

  10. Evaluation of reference genes for quantitative real-time PCR in oil palm elite planting materials propagated by tissue culture.

    PubMed

    Chan, Pek-Lan; Rose, Ray J; Abdul Murad, Abdul Munir; Zainal, Zamri; Low, Eng-Ti Leslie; Ooi, Leslie Cheng-Li; Ooi, Siew-Eng; Yahya, Suzaini; Singh, Rajinder

    2014-01-01

    The somatic embryogenesis tissue culture process has been utilized to propagate high yielding oil palm. Due to the low callogenesis and embryogenesis rates, molecular studies were initiated to identify genes regulating the process, and their expression levels are usually quantified using reverse transcription quantitative real-time PCR (RT-qPCR). With the recent release of oil palm genome sequences, it is crucial to establish a proper strategy for gene analysis using RT-qPCR. Selection of the most suitable reference genes should be performed for accurate quantification of gene expression levels. In this study, eight candidate reference genes selected from cDNA microarray study and literature review were evaluated comprehensively across 26 tissue culture samples using RT-qPCR. These samples were collected from two tissue culture lines and media treatments, which consisted of leaf explants cultures, callus and embryoids from consecutive developmental stages. Three statistical algorithms (geNorm, NormFinder and BestKeeper) confirmed that the expression stability of novel reference genes (pOP-EA01332, PD00380 and PD00569) outperformed classical housekeeping genes (GAPDH, NAD5, TUBULIN, UBIQUITIN and ACTIN). PD00380 and PD00569 were identified as the most stably expressed genes in total samples, MA2 and MA8 tissue culture lines. Their applicability to validate the expression profiles of a putative ethylene-responsive transcription factor 3-like gene demonstrated the importance of using the geometric mean of two genes for normalization. Systematic selection of the most stably expressed reference genes for RT-qPCR was established in oil palm tissue culture samples. PD00380 and PD00569 were selected for accurate and reliable normalization of gene expression data from RT-qPCR. These data will be valuable to the research associated with the tissue culture process. Also, the method described here will facilitate the selection of appropriate reference genes in other oil palm tissues and in the expression profiling of genes relating to yield, biotic and abiotic stresses.

  11. Immuno-Navigator, a batch-corrected coexpression database, reveals cell type-specific gene networks in the immune system

    PubMed Central

    Vandenbon, Alexis; Dinh, Viet H.; Mikami, Norihisa; Kitagawa, Yohko; Teraguchi, Shunsuke; Ohkura, Naganari; Sakaguchi, Shimon

    2016-01-01

    High-throughput gene expression data are one of the primary resources for exploring complex intracellular dynamics in modern biology. The integration of large amounts of public data may allow us to examine general dynamical relationships between regulators and target genes. However, obstacles for such analyses are study-specific biases or batch effects in the original data. Here we present Immuno-Navigator, a batch-corrected gene expression and coexpression database for 24 cell types of the mouse immune system. We systematically removed batch effects from the underlying gene expression data and showed that this removal considerably improved the consistency between inferred correlations and prior knowledge. The data revealed widespread cell type-specific correlation of expression. Integrated analysis tools allow users to use this correlation of expression for the generation of hypotheses about biological networks and candidate regulators in specific cell types. We show several applications of Immuno-Navigator as examples. In one application we successfully predicted known regulators of importance in naturally occurring Treg cells from their expression correlation with a set of Treg-specific genes. For one high-scoring gene, integrin β8 (Itgb8), we confirmed an association between Itgb8 expression in forkhead box P3 (Foxp3)-positive T cells and Treg-specific epigenetic remodeling. Our results also suggest that the regulation of Treg-specific genes within Treg cells is relatively independent of Foxp3 expression, supporting recent results pointing to a Foxp3-independent component in the development of Treg cells. PMID:27078110

  12. pySAPC, a python package for sparse affinity propagation clustering: Application to odontogenesis whole genome time series gene-expression data.

    PubMed

    Cao, Huojun; Amendt, Brad A

    2016-11-01

    Developmental dental anomalies are common forms of congenital defects. The molecular mechanisms of dental anomalies are poorly understood. Systematic approaches such as clustering genes based on similar expression patterns could identify novel genes involved in dental anomalies and provide a framework for understanding molecular regulatory mechanisms of these genes during tooth development (odontogenesis). A python package (pySAPC) of sparse affinity propagation clustering algorithm for large datasets was developed. Whole genome pair-wise similarity was calculated based on expression pattern similarity based on 45 microarrays of several stages during odontogenesis. pySAPC identified 743 gene clusters based on expression pattern similarity during mouse tooth development. Three clusters are significantly enriched for genes associated with dental anomalies (with FDR <0.1). The three clusters of genes have distinct expression patterns during odontogenesis. Clustering genes based on similar expression profiles recovered several known regulatory relationships for genes involved in odontogenesis, as well as many novel genes that may be involved with the same genetic pathways as genes that have already been shown to contribute to dental defects. By using sparse similarity matrix, pySAPC use much less memory and CPU time compared with the original affinity propagation program that uses a full similarity matrix. This python package will be useful for many applications where dataset(s) are too large to use full similarity matrix. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016. Published by Elsevier B.V.

  13. Genomic survey, expression profile and co-expression network analysis of OsWD40 family in rice

    PubMed Central

    2012-01-01

    Background WD40 proteins represent a large family in eukaryotes, which have been involved in a broad spectrum of crucial functions. Systematic characterization and co-expression analysis of OsWD40 genes enable us to understand the networks of the WD40 proteins and their biological processes and gene functions in rice. Results In this study, we identify and analyze 200 potential OsWD40 genes in rice, describing their gene structures, genome localizations, and evolutionary relationship of each member. Expression profiles covering the whole life cycle in rice has revealed that transcripts of OsWD40 were accumulated differentially during vegetative and reproductive development and preferentially up or down-regulated in different tissues. Under phytohormone treatments, 25 OsWD40 genes were differentially expressed with treatments of one or more of the phytohormone NAA, KT, or GA3 in rice seedlings. We also used a combined analysis of expression correlation and Gene Ontology annotation to infer the biological role of the OsWD40 genes in rice. The results suggested that OsWD40 genes may perform their diverse functions by complex network, thus were predictive for understanding their biological pathways. The analysis also revealed that OsWD40 genes might interact with each other to take part in metabolic pathways, suggesting a more complex feedback network. Conclusions All of these analyses suggest that the functions of OsWD40 genes are diversified, which provide useful references for selecting candidate genes for further functional studies. PMID:22429805

  14. Parallel habitat acclimatization is realized by the expression of different genes in two closely related salamander species (genus Salamandra).

    PubMed

    Goedbloed, D J; Czypionka, T; Altmüller, J; Rodriguez, A; Küpfer, E; Segev, O; Blaustein, L; Templeton, A R; Nolte, A W; Steinfartz, S

    2017-12-01

    The utilization of similar habitats by different species provides an ideal opportunity to identify genes underlying adaptation and acclimatization. Here, we analysed the gene expression of two closely related salamander species: Salamandra salamandra in Central Europe and Salamandra infraimmaculata in the Near East. These species inhabit similar habitat types: 'temporary ponds' and 'permanent streams' during larval development. We developed two species-specific gene expression microarrays, each targeting over 12 000 transcripts, including an overlapping subset of 8331 orthologues. Gene expression was examined for systematic differences between temporary ponds and permanent streams in larvae from both salamander species to establish gene sets and functions associated with these two habitat types. Only 20 orthologues were associated with a habitat in both species, but these orthologues did not show parallel expression patterns across species more than expected by chance. Functional annotation of a set of 106 genes with the highest effect size for a habitat suggested four putative gene function categories associated with a habitat in both species: cell proliferation, neural development, oxygen responses and muscle capacity. Among these high effect size genes was a single orthologue (14-3-3 protein zeta/YWHAZ) that was downregulated in temporary ponds in both species. The emergence of four gene function categories combined with a lack of parallel expression of orthologues (except 14-3-3 protein zeta) suggests that parallel habitat adaptation or acclimatization by larvae from S. salamandra and S. infraimmaculata to temporary ponds and permanent streams is mainly realized by different genes with a converging functionality.

  15. Gene expression-based chemical genomics identifies potential therapeutic drugs in hepatocellular carcinoma.

    PubMed

    Chen, Ming-Huang; Yang, Wu-Lung R; Lin, Kuan-Ting; Liu, Chia-Hung; Liu, Yu-Wen; Huang, Kai-Wen; Chang, Peter Mu-Hsin; Lai, Jin-Mei; Hsu, Chun-Nan; Chao, Kun-Mao; Kao, Cheng-Yan; Huang, Chi-Ying F

    2011-01-01

    Hepatocellular carcinoma (HCC) is an aggressive tumor with a poor prognosis. Currently, only sorafenib is approved by the FDA for advanced HCC treatment; therefore, there is an urgent need to discover candidate therapeutic drugs for HCC. We hypothesized that if a drug signature could reverse, at least in part, the gene expression signature of HCC, it might have the potential to inhibit HCC-related pathways and thereby treat HCC. To test this hypothesis, we first built an integrative platform, the "Encyclopedia of Hepatocellular Carcinoma genes Online 2", dubbed EHCO2, to systematically collect, organize and compare the publicly available data from HCC studies. The resulting collection includes a total of 4,020 genes. To systematically query the Connectivity Map (CMap), which includes 6,100 drug-mediated expression profiles, we further designed various gene signature selection and enrichment methods, including a randomization technique, majority vote, and clique analysis. Subsequently, 28 out of 50 prioritized drugs, including tanespimycin, trichostatin A, thioguanosine, and several anti-psychotic drugs with anti-tumor activities, were validated via MTT cell viability assays and clonogenic assays in HCC cell lines. To accelerate their future clinical use, possibly through drug-repurposing, we selected two well-established drugs to test in mice, chlorpromazine and trifluoperazine. Both drugs inhibited orthotopic liver tumor growth. In conclusion, we successfully discovered and validated existing drugs for potential HCC therapeutic use with the pipeline of Connectivity Map analysis and lab verification, thereby suggesting the usefulness of this procedure to accelerate drug repurposing for HCC treatment.

  16. Selection of suitable reference genes for gene expression studies in Staphylococcus capitis during growth under erythromycin stress.

    PubMed

    Cui, Bintao; Smooker, Peter M; Rouch, Duncan A; Deighton, Margaret A

    2016-08-01

    Accurate and reproducible measurement of gene transcription requires appropriate reference genes, which are stably expressed under different experimental conditions to provide normalization. Staphylococcus capitis is a human pathogen that produces biofilm under stress, such as imposed by antimicrobial agents. In this study, a set of five commonly used staphylococcal reference genes (gyrB, sodA, recA, tuf and rpoB) were systematically evaluated in two clinical isolates of Staphylococcus capitis (S. capitis subspecies urealyticus and capitis, respectively) under erythromycin stress in mid-log and stationary phases. Two public software programs (geNorm and NormFinder) and two manual calculation methods, reference residue normalization (RRN) and relative quantitative (RQ), were applied. The potential reference genes selected by the four algorithms were further validated by comparing the expression of a well-studied biofilm gene (icaA) with phenotypic biofilm formation in S. capitis under four different experimental conditions. The four methods differed considerably in their ability to predict the most suitable reference gene or gene combination for comparing icaA expression under different conditions. Under the conditions used here, the RQ method provided better selection of reference genes than the other three algorithms; however, this finding needs to be confirmed with a larger number of isolates. This study reinforces the need to assess the stability of reference genes for analysis of target gene expression under different conditions and the use of more than one algorithm in such studies. Although this work was conducted using a specific human pathogen, it emphasizes the importance of selecting suitable reference genes for accurate normalization of gene expression more generally.

  17. Floral organ MADS-box genes in Cercidiphyllum japonicum (Cercidiphyllaceae): Implications for systematic evolution and bracts definition.

    PubMed

    Jin, Yupei; Wang, Yubing; Zhang, Dechun; Shen, Xiangling; Liu, Wen; Chen, Faju

    2017-01-01

    The dioecious relic Cercidiphyllum japonicum is one of two species of the sole genus Cercidiphyllum, with a tight inflorescence lacking an apparent perianth structure. In addition, its systematic place has been much debated and, so far researches have mainly focused on its morphology and chloroplast genes. In our investigation, we identified 10 floral organ identity genes, including four A-class, three B-class, two C-class and one D-class. Phylogenetic analyses showed that all ten genes are grouped with Saxifragales plants, which confirmed the phylogenetic place of C. japonicum. Expression patterns of those genes were examined by quantitative reverse transcriptase PCR, with some variations that did not completely coincide with the ABCDE model, suggesting some subfunctionalization. As well, our research supported the idea that thebract actually is perianth according to our morphological and molecular analyses in Cercidiphyllum japonicum.

  18. Floral organ MADS-box genes in Cercidiphyllum japonicum (Cercidiphyllaceae): Implications for systematic evolution and bracts definition

    PubMed Central

    Zhang, Dechun; Shen, Xiangling; Chen, Faju

    2017-01-01

    The dioecious relic Cercidiphyllum japonicum is one of two species of the sole genus Cercidiphyllum, with a tight inflorescence lacking an apparent perianth structure. In addition, its systematic place has been much debated and, so far researches have mainly focused on its morphology and chloroplast genes. In our investigation, we identified 10 floral organ identity genes, including four A-class, three B-class, two C-class and one D-class. Phylogenetic analyses showed that all ten genes are grouped with Saxifragales plants, which confirmed the phylogenetic place of C. japonicum. Expression patterns of those genes were examined by quantitative reverse transcriptase PCR, with some variations that did not completely coincide with the ABCDE model, suggesting some subfunctionalization. As well, our research supported the idea that thebract actually is perianth according to our morphological and molecular analyses in Cercidiphyllum japonicum. PMID:28562649

  19. MicroRNA expression, target genes, and signaling pathways in infants with a ventricular septal defect.

    PubMed

    Chai, Hui; Yan, Zhaoyuan; Huang, Ke; Jiang, Yuanqing; Zhang, Lin

    2018-02-01

    This study aimed to systematically investigate the relationship between miRNA expression and the occurrence of ventricular septal defect (VSD), and characterize the miRNA target genes and pathways that can lead to VSD. The miRNAs that were differentially expressed in blood samples from VSD and normal infants were screened and validated by implementing miRNA microarrays and qRT-PCR. The target genes regulated by differentially expressed miRNAs were predicted using three target gene databases. The functions and signaling pathways of the target genes were enriched using the GO database and KEGG database, respectively. The transcription and protein expression of specific target genes in critical pathways were compared in the VSD and normal control groups using qRT-PCR and western blotting, respectively. Compared with the normal control group, the VSD group had 22 differentially expressed miRNAs; 19 were downregulated and three were upregulated. The 10,677 predicted target genes participated in many biological functions related to cardiac development and morphogenesis. Four target genes (mGLUR, Gq, PLC, and PKC) were involved in the PKC pathway and four (ECM, FAK, PI3 K, and PDK1) were involved in the PI3 K-Akt pathway. The transcription and protein expression of these eight target genes were significantly upregulated in the VSD group. The 22 miRNAs that were dysregulated in the VSD group were mainly downregulated, which may result in the dysregulation of several key genes and biological functions related to cardiac development. These effects could also be exerted via the upregulation of eight specific target genes, the subsequent over-activation of the PKC and PI3 K-Akt pathways, and the eventual abnormal cardiac development and VSD.

  20. Unraveling transcriptional control and cis-regulatory codes using the software suite GeneACT

    PubMed Central

    Cheung, Tom Hiu; Kwan, Yin Lam; Hamady, Micah; Liu, Xuedong

    2006-01-01

    Deciphering gene regulatory networks requires the systematic identification of functional cis-acting regulatory elements. We present a suite of web-based bioinformatics tools, called GeneACT , that can rapidly detect evolutionarily conserved transcription factor binding sites or microRNA target sites that are either unique or over-represented in differentially expressed genes from DNA microarray data. GeneACT provides graphic visualization and extraction of common regulatory sequence elements in the promoters and 3'-untranslated regions that are conserved across multiple mammalian species. PMID:17064417

  1. Is There a Genetic Predisposition to Anterior Cruciate Ligament Tear? A Systematic Review.

    PubMed

    John, Rakesh; Dhillon, Mandeep Singh; Sharma, Siddhartha; Prabhakar, Sharad; Bhandari, Mohit

    2016-12-01

    Injuries to the anterior cruciate ligament (ACL) are among the most common knee ligament injuries and frequently warrant reconstruction. The etiopathogenesis of these injuries has focused mainly on mechanism of trauma, patient sex, and anatomic factors as predisposing causes. Several genetic factors that could predispose to an ACL tear have recently been reported. This systematic review summarizes the current evidence for a genetic predisposition to ACL tears. The principal research question was to identify genetic factors, based on the available literature, that could predispose an individual to an ACL tear. Systematic review. The PubMed, EMBASE, Cochrane, and HuGE databases were searched; the search was run from the period of inception until June 21, 2015. A secondary search was performed by screening the references of full-text articles obtained and by manually searching selected journals. Articles were screened with prespecified inclusion criteria. The quality of studies included in the review was assessed for risk of bias by 2 reviewers using the Newcastle-Ottawa Scale. A total of 994 records were identified by the search, out of which 17 studies (16 case-control studies and 1 cross-sectional study) were included in the final review. Two studies observed a familial predisposition to an ACL tear. Fourteen studies looked at specific gene polymorphisms in 20 genes, from which different polymorphisms in 10 genes were positively associated with an ACL tear. In addition to these polymorphisms, 8 haplotypes were associated with ACL tear. One study looked at gene expression analysis. Although specific gene polymorphisms and haplotypes have been identified, it is difficult to come to a conclusion on the basis of the existing literature. Several sources of bias have been identified in these studies, and the results cannot be extrapolated to the general population. More studies are needed in larger populations of different ethnicities. Gene-gene interactions and gene expression studies in the future may delineate the exact role of these gene polymorphisms in ACL tears. © 2016 The Author(s).

  2. Dynamics of lineage commitment revealed by single-cell transcriptomics of differentiating embryonic stem cells.

    PubMed

    Semrau, Stefan; Goldmann, Johanna E; Soumillon, Magali; Mikkelsen, Tarjei S; Jaenisch, Rudolf; van Oudenaarden, Alexander

    2017-10-23

    Gene expression heterogeneity in the pluripotent state of mouse embryonic stem cells (mESCs) has been increasingly well-characterized. In contrast, exit from pluripotency and lineage commitment have not been studied systematically at the single-cell level. Here we measure the gene expression dynamics of retinoic acid driven mESC differentiation from pluripotency to lineage commitment, using an unbiased single-cell transcriptomics approach. We find that the exit from pluripotency marks the start of a lineage transition as well as a transient phase of increased susceptibility to lineage specifying signals. Our study reveals several transcriptional signatures of this phase, including a sharp increase of gene expression variability and sequential expression of two classes of transcriptional regulators. In summary, we provide a comprehensive analysis of the exit from pluripotency and lineage commitment at the single cell level, a potential stepping stone to improved lineage manipulation through timing of differentiation cues.

  3. Classification of Genes and Putative Biomarker Identification Using Distribution Metrics on Expression Profiles

    PubMed Central

    Huang, Hung-Chung; Jupiter, Daniel; VanBuren, Vincent

    2010-01-01

    Background Identification of genes with switch-like properties will facilitate discovery of regulatory mechanisms that underlie these properties, and will provide knowledge for the appropriate application of Boolean networks in gene regulatory models. As switch-like behavior is likely associated with tissue-specific expression, these gene products are expected to be plausible candidates as tissue-specific biomarkers. Methodology/Principal Findings In a systematic classification of genes and search for biomarkers, gene expression profiles (GEPs) of more than 16,000 genes from 2,145 mouse array samples were analyzed. Four distribution metrics (mean, standard deviation, kurtosis and skewness) were used to classify GEPs into four categories: predominantly-off, predominantly-on, graded (rheostatic), and switch-like genes. The arrays under study were also grouped and examined by tissue type. For example, arrays were categorized as ‘brain group’ and ‘non-brain group’; the Kolmogorov-Smirnov distance and Pearson correlation coefficient were then used to compare GEPs between brain and non-brain for each gene. We were thus able to identify tissue-specific biomarker candidate genes. Conclusions/Significance The methodology employed here may be used to facilitate disease-specific biomarker discovery. PMID:20140228

  4. Integrative Analysis Reveals Regulatory Programs in Endometriosis

    PubMed Central

    Yang, Huan; Kang, Kai; Cheng, Chao; Mamillapalli, Ramanaiah; Taylor, Hugh S.

    2015-01-01

    Endometriosis is a common gynecological disease found in approximately 10% of reproductive-age women. Gene expression analysis has been performed to explore alterations in gene expression associated with endometriosis; however, the underlying transcription factors (TFs) governing such expression changes have not been investigated in a systematic way. In this study, we propose a method to integrate gene expression with TF binding data and protein–protein interactions to construct an integrated regulatory network (IRN) for endometriosis. The IRN has shown that the most regulated gene in endometriosis is RUNX1, which is targeted by 14 of 26 TFs also involved in endometriosis. Using 2 published cohorts, GSE7305 (Hover, n = 20) and GSE7307 (Roth, n = 36) from the Gene Expression Omnibus database, we identified a network of TFs, which bind to target genes that are differentially expressed in endometriosis. Enrichment analysis based on the hypergeometric distribution allowed us to predict the TFs involved in endometriosis (n = 40). This included known TFs such as androgen receptor (AR) and critical factors in the pathology of endometriosis, estrogen receptor α, and estrogen receptor β. We also identified several new ones from which we selected FOXA2 and TFAP2C, and their regulation was confirmed by quantitative real-time polymerase chain reaction and immunohistochemistry (IHC). Further, our analysis revealed that the function of AR and p53 in endometriosis is regulated by posttranscriptional changes and not by differential gene expression. Our integrative analysis provides new insights into the regulatory programs involved in endometriosis. PMID:26134036

  5. TP53 mutations, expression and interaction networks in human cancers

    PubMed Central

    Wang, Xiaosheng; Sun, Qingrong

    2017-01-01

    Although the associations of p53 dysfunction, p53 interaction networks and oncogenesis have been widely explored, a systematic analysis of TP53 mutations and its related interaction networks in various types of human cancers is lacking. Our study explored the associations of TP53 mutations, gene expression, clinical outcomes, and TP53 interaction networks across 33 cancer types using data from The Cancer Genome Atlas (TCGA). We show that TP53 is the most frequently mutated gene in a number of cancers, and its mutations appear to be early events in cancer initiation. We identified genes potentially repressed by p53, and genes whose expression correlates significantly with TP53 expression. These gene products may be especially important nodes in p53 interaction networks in human cancers. This study shows that while TP53-truncating mutations often result in decreased TP53 expression, other non-truncating TP53 mutations result in increased TP53 expression in some cancers. Survival analyses in a number of cancers show that patients with TP53 mutations are more likely to have worse prognoses than TP53-wildtype patients, and that elevated TP53 expression often leads to poor clinical outcomes. We identified a set of candidate synthetic lethal (SL) genes for TP53, and validated some of these SL interactions using data from the Cancer Cell Line Project. These predicted SL genes are promising candidates for experimental validation and the development of personalized therapeutics for patients with TP53-mutated cancers. PMID:27880943

  6. TP53 mutations, expression and interaction networks in human cancers.

    PubMed

    Wang, Xiaosheng; Sun, Qingrong

    2017-01-03

    Although the associations of p53 dysfunction, p53 interaction networks and oncogenesis have been widely explored, a systematic analysis of TP53 mutations and its related interaction networks in various types of human cancers is lacking. Our study explored the associations of TP53 mutations, gene expression, clinical outcomes, and TP53 interaction networks across 33 cancer types using data from The Cancer Genome Atlas (TCGA). We show that TP53 is the most frequently mutated gene in a number of cancers, and its mutations appear to be early events in cancer initiation. We identified genes potentially repressed by p53, and genes whose expression correlates significantly with TP53 expression. These gene products may be especially important nodes in p53 interaction networks in human cancers. This study shows that while TP53-truncating mutations often result in decreased TP53 expression, other non-truncating TP53 mutations result in increased TP53 expression in some cancers. Survival analyses in a number of cancers show that patients with TP53 mutations are more likely to have worse prognoses than TP53-wildtype patients, and that elevated TP53 expression often leads to poor clinical outcomes. We identified a set of candidate synthetic lethal (SL) genes for TP53, and validated some of these SL interactions using data from the Cancer Cell Line Project. These predicted SL genes are promising candidates for experimental validation and the development of personalized therapeutics for patients with TP53-mutated cancers.

  7. Identification of internal control genes for quantitative expression analysis by real-time PCR in bovine peripheral lymphocytes.

    PubMed

    Spalenza, Veronica; Girolami, Flavia; Bevilacqua, Claudia; Riondato, Fulvio; Rasero, Roberto; Nebbia, Carlo; Sacchi, Paola; Martin, Patrice

    2011-09-01

    Gene expression studies in blood cells, particularly lymphocytes, are useful for monitoring potential exposure to toxicants or environmental pollutants in humans and livestock species. Quantitative PCR is the method of choice for obtaining accurate quantification of mRNA transcripts although variations in the amount of starting material, enzymatic efficiency, and the presence of inhibitors can lead to evaluation errors. As a result, normalization of data is of crucial importance. The most common approach is the use of endogenous reference genes as an internal control, whose expression should ideally not vary among individuals and under different experimental conditions. The accurate selection of reference genes is therefore an important step in interpreting quantitative PCR studies. Since no systematic investigation in bovine lymphocytes has been performed, the aim of the present study was to assess the expression stability of seven candidate reference genes in circulating lymphocytes collected from 15 dairy cows. Following the characterization by flow cytometric analysis of the cell populations obtained from blood through a density gradient procedure, three popular softwares were used to evaluate the gene expression data. The results showed that two genes are sufficient for normalization of quantitative PCR studies in cattle lymphocytes and that YWAHZ, S24 and PPIA are the most stable genes. Copyright © 2010 Elsevier Ltd. All rights reserved.

  8. Comparative analysis of gene expression profiles of OPN signaling pathway in four kinds of liver diseases.

    PubMed

    Wang, Gaiping; Chen, Shasha; Zhao, Congcong; Li, Xiaofang; Zhao, Weiming; Yang, Jing; Chang, Cuifang; Xu, Cunshuan

    2016-09-01

    To explore the relevance of OPN signalling pathway to the occurrence and development of nonalcoholic fatty liver disease (NAFLD), liver cirrhosis (LC), hepatic cancer (HC) and acute hepatic failure (AHF) at transcriptional level, Rat Genome 230 2.0 Array was used to detect expression profiles of OPN signalling pathway-related genes in four kinds of liver diseases. The results showed that 23, 33, 59 and 74 genes were significantly changed in the above four kinds of liver diseases, respectively. H-clustering analysis showed that the expression profiles of OPN signalling-related genes were notably different in four kinds of liver diseases. Subsequently, a total of above-mentioned 147 genes were categorized into four clusters by k-means according to the similarity of gene expression, and expression analysis systematic explorer (EASE) functional enrichment analysis revealed that OPN signalling pathway-related genes were involved in cell adhesion and migration, cell proliferation, apoptosis, stress and inflammatory reaction, etc. Finally, ingenuity pathway analysis (IPA) software was used to predict the functions of OPN signalling-related genes, and the results indicated that the activities of ROS production, cell adhesion and migration, cell proliferation were remarkably increased, while that of apoptosis, stress and inflammatory reaction were reduced in four kinds of liver diseases. In summary, the above physiological activities changed more obviously in LC, HC and AHF than in NAFLD.

  9. Using scale and feather traits for module construction provides a functional approach to chicken epidermal development.

    PubMed

    Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H

    2017-11-01

    Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.

  10. Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species

    PubMed Central

    Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping

    2018-01-01

    The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice (Oryza sativa), maize (Zea mays), and Arabidopsis (Arabidopsis thaliana). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis, respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis, respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis. A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis, respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus, and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis. PMID:29690608

  11. Identification and expression analysis of the IPT and CKX gene families during axillary bud outgrowth in apple (Malus domestica Borkh.).

    PubMed

    Tan, Ming; Li, Guofang; Qi, Siyan; Liu, Xiaojie; Chen, Xilong; Ma, Juanjuan; Zhang, Dong; Han, Mingyu

    2018-04-20

    Cytokinins (CKs) play a crucial role in promoting axillary bud outgrowth and targeting the control of CK metabolism can be used to enhance branching in plants. CK levels are maintained mainly by CK biosynthesis (isopentenyl transferase, IPT) and degradation (dehydrogenase, CKX) genes in plants. A systematic study of the IPT and CKX gene families in apple, however, has not been conducted. In the present study, 12 MdIPTs and 12 MdCKXs were identified in the apple genome. Systematic phylogenetic, structural, and synteny analyses were performed. Expression analysis of these genes in different tissues was also assessed. MdIPT and MdCKX genes exhibit distinct expression patterns in different tissues. The response of MdIPT, MdCKX, and MdPIN1 genes to various treatments (6-BA, decapitation and Lovastatin, an inhibitor of CKs synthesis) that impact branching were also investigated. Results indicated that most of the MdIPT and MdCKX, and MdPIN1 genes were upregulated by 6-BA and decapitation treatment, but inhibited by Lovastatin, a compound that effectively suppresses axillary bud outgrowth induced by decapitation. These findings suggest that cytokinin biosynthesis is required for the activation of bud break and the export of auxin from buds in apple tree with intact primary shoot apex or decapitated apple tree. MdCKX8 and MdCKX10, however, exhibited little response to decapitation, but were significantly up-regulated by 6-BA and Lovastatin, a finding that warrants further investigation in order to understand their function in bud-outgrowth. Copyright © 2018 Elsevier B.V. All rights reserved.

  12. Genome-Wide Identification and Expression Analysis of the UGlcAE Gene Family in Tomato.

    PubMed

    Ding, Xing; Li, Jinhua; Pan, Yu; Zhang, Yue; Ni, Lei; Wang, Yaling; Zhang, Xingguo

    2018-05-27

    The UGlcAE has the capability of interconverting UDP-d-galacturonic acid and UDP-d-glucuronic acid, and UDP-d-galacturonic acid is an activated precursor for the synthesis of pectins in plants. In this study, we identified nine UGlcAE protein-encoding genes in tomato. The nine UGlcAE genes that were distributed on eight chromosomes in tomato, and the corresponding proteins contained one or two trans-membrane domains. The phylogenetic analysis showed that SlUGlcAE genes could be divided into seven groups, designated UGlcAE1 to UGlcAE6 , of which the UGlcAE2 were classified into two groups. Expression profile analysis revealed that the SlUGlcAE genes display diverse expression patterns in various tomato tissues. Selective pressure analysis indicated that all of the amino acid sites of SlUGlcAE proteins are undergoing purifying selection. Fifteen stress-, hormone-, and development-related elements were identified in the upstream regions (0.5 kb) of these SlUGlcAE genes. Furthermore, we investigated the expression patterns of SlUGlcAE genes in response to three hormones (indole-3-acetic acid (IAA), gibberellin (GA), and salicylic acid (SA)). We detected firmness, pectin contents, and expression levels of UGlcAE family genes during the development of tomato fruit. Here, we systematically summarize the general characteristics of the SlUGlcAE genes in tomato, which could provide a basis for further function studies of tomato UGlcAE genes.

  13. Identification and evaluation of reference genes for qRT-PCR studies in Lentinula edodes

    PubMed Central

    Qin, Peng; He, Maolan; Yu, Xiumei; Zhao, Ke; Zhang, Xiaoping; Ma, Menggen; Chen, Qiang; Chen, Xiaoqiong; Zeng, Xianfu; Gu, Yunfu

    2018-01-01

    Lentinula edodes (shiitake mushroom) is a common edible mushroom with a number of potential therapeutic and nutritional applications. It contains various medically important molecules, such as polysaccharides, terpenoids, sterols, and lipids, were contained in this mushroom. Quantitative real-time polymerase chain reaction (qRT-PCR) is a powerful tool to analyze the mechanisms underlying the biosynthetic pathways of these substances. qRT-PCR is used for accurate analyses of transcript levels owing to its rapidity, sensitivity, and reliability. However, its accuracy and reliability for the quantification of transcripts rely on the expression stability of the reference genes used for data normalization. To ensure the reliability of gene expression analyses using qRT-PCR in L. edodes molecular biology research, it is necessary to systematically evaluate reference genes. In the current study, ten potential reference genes were selected from L. edodes genomic data and their expression levels were measured by qRT-PCR using various samples. The expression stability of each candidate gene was analyzed by three commonly used software packages: geNorm, NormFinder, and BestKeeper. Base on the results, Rpl4 was the most stable reference gene across all experimental conditions, and Atu was the most stable gene among strains. 18S was found to be the best reference gene for different development stages, and Rpl4 was the most stably expressed gene under various nutrient conditions. The present work will contribute to qRT-PCR studies in L. edodes. PMID:29293626

  14. Identification and evaluation of reference genes for qRT-PCR studies in Lentinula edodes.

    PubMed

    Xiang, Quanju; Li, Jin; Qin, Peng; He, Maolan; Yu, Xiumei; Zhao, Ke; Zhang, Xiaoping; Ma, Menggen; Chen, Qiang; Chen, Xiaoqiong; Zeng, Xianfu; Gu, Yunfu

    2018-01-01

    Lentinula edodes (shiitake mushroom) is a common edible mushroom with a number of potential therapeutic and nutritional applications. It contains various medically important molecules, such as polysaccharides, terpenoids, sterols, and lipids, were contained in this mushroom. Quantitative real-time polymerase chain reaction (qRT-PCR) is a powerful tool to analyze the mechanisms underlying the biosynthetic pathways of these substances. qRT-PCR is used for accurate analyses of transcript levels owing to its rapidity, sensitivity, and reliability. However, its accuracy and reliability for the quantification of transcripts rely on the expression stability of the reference genes used for data normalization. To ensure the reliability of gene expression analyses using qRT-PCR in L. edodes molecular biology research, it is necessary to systematically evaluate reference genes. In the current study, ten potential reference genes were selected from L. edodes genomic data and their expression levels were measured by qRT-PCR using various samples. The expression stability of each candidate gene was analyzed by three commonly used software packages: geNorm, NormFinder, and BestKeeper. Base on the results, Rpl4 was the most stable reference gene across all experimental conditions, and Atu was the most stable gene among strains. 18S was found to be the best reference gene for different development stages, and Rpl4 was the most stably expressed gene under various nutrient conditions. The present work will contribute to qRT-PCR studies in L. edodes.

  15. Clustering gene expression data based on predicted differential effects of GV interaction.

    PubMed

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  16. Mechanical stimuli differentially control stem cell behavior: morphology, proliferation, and differentiation

    PubMed Central

    Maul, Timothy M.; Chew, Douglas W.; Nieponice, Alejandro

    2011-01-01

    Mesenchymal stem cell (MSC) therapy has demonstrated applications in vascular regenerative medicine. Although blood vessels exist in a mechanically dynamic environment, there has been no rigorous, systematic analysis of mechanical stimulation on stem cell differentiation. We hypothesize that mechanical stimuli, relevant to the vasculature, can differentiate MSCs toward smooth muscle (SMCs) and endothelial cells (ECs). This was tested using a unique experimental platform to differentially apply various mechanical stimuli in parallel. Three forces, cyclic stretch, cyclic pressure, and laminar shear stress, were applied independently to mimic several vascular physiologic conditions. Experiments were conducted using subconfluent MSCs for 5 days and demonstrated significant effects on morphology and proliferation depending upon the type, magnitude, frequency, and duration of applied stimulation. We have defined thresholds of cyclic stretch that potentiate SMC protein expression, but did not find EC protein expression under any condition tested. However, a second set of experiments performed at confluence and aimed to elicit the temporal gene expression response of a select magnitude of each stimulus revealed that EC gene expression can be increased with cyclic pressure and shear stress in a cell-contact-dependent manner. Further, these MSCs also appear to express genes from multiple lineages simultaneously which may warrant further investigation into post-transcriptional mechanisms for controlling protein expression. To our knowledge, this is the first systematic examination of the effects of mechanical stimulation on MSCs and has implications for the understanding of stem cell biology, as well as potential bioreactor designs for tissue engineering and cell therapy applications. PMID:21253809

  17. Circular RNA biogenesis can proceed through an exon-containing lariat precursor.

    PubMed

    Barrett, Steven P; Wang, Peter L; Salzman, Julia

    2015-06-09

    Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.

  18. A novel bioinformatics pipeline to discover genes related to arbuscular mycorrhizal symbiosis based on their evolutionary conservation pattern among higher plants.

    PubMed

    Favre, Patrick; Bapaume, Laure; Bossolini, Eligio; Delorenzi, Mauro; Falquet, Laurent; Reinhardt, Didier

    2014-12-03

    Genes involved in arbuscular mycorrhizal (AM) symbiosis have been identified primarily by mutant screens, followed by identification of the mutated genes (forward genetics). In addition, a number of AM-related genes has been identified by their AM-related expression patterns, and their function has subsequently been elucidated by knock-down or knock-out approaches (reverse genetics). However, genes that are members of functionally redundant gene families, or genes that have a vital function and therefore result in lethal mutant phenotypes, are difficult to identify. If such genes are constitutively expressed and therefore escape differential expression analyses, they remain elusive. The goal of this study was to systematically search for AM-related genes with a bioinformatics strategy that is insensitive to these problems. The central element of our approach is based on the fact that many AM-related genes are conserved only among AM-competent species. Our approach involves genome-wide comparisons at the proteome level of AM-competent host species with non-mycorrhizal species. Using a clustering method we first established orthologous/paralogous relationships and subsequently identified protein clusters that contain members only of the AM-competent species. Proteins of these clusters were then analyzed in an extended set of 16 plant species and ranked based on their relatedness among AM-competent monocot and dicot species, relative to non-mycorrhizal species. In addition, we combined the information on the protein-coding sequence with gene expression data and with promoter analysis. As a result we present a list of yet uncharacterized proteins that show a strongly AM-related pattern of sequence conservation, indicating that the respective genes may have been under selection for a function in AM. Among the top candidates are three genes that encode a small family of similar receptor-like kinases that are related to the S-locus receptor kinases involved in sporophytic self-incompatibility. We present a new systematic strategy of gene discovery based on conservation of the protein-coding sequence that complements classical forward and reverse genetics. This strategy can be applied to diverse other biological phenomena if species with established genome sequences fall into distinguished groups that differ in a defined functional trait of interest.

  19. Systematic Identification and Characterization of Novel Human Skin-Associated Genes Encoding Membrane and Secreted Proteins

    PubMed Central

    Buhren, Bettina Alexandra; Martinez, Cynthia; Schrumpf, Holger; Gasis, Marcia; Grether-Beck, Susanne; Krutmann, Jean

    2013-01-01

    Through bioinformatics analyses of a human gene expression database representing 105 different tissues and cell types, we identified 687 skin-associated genes that are selectively and highly expressed in human skin. Over 50 of these represent uncharacterized genes not previously associated with skin and include a subset that encode novel secreted and plasma membrane proteins. The high levels of skin-associated expression for eight of these novel therapeutic target genes were confirmed by semi-quantitative real time PCR, western blot and immunohistochemical analyses of normal skin and skin-derived cell lines. Four of these are expressed specifically by epidermal keratinocytes; two that encode G-protein-coupled receptors (GPR87 and GPR115), and two that encode secreted proteins (WFDC5 and SERPINB7). Further analyses using cytokine-activated and terminally differentiated human primary keratinocytes or a panel of common inflammatory, autoimmune or malignant skin diseases revealed distinct patterns of regulation as well as disease associations that point to important roles in cutaneous homeostasis and disease. Some of these novel uncharacterized skin genes may represent potential biomarkers or drug targets for the development of future diagnostics or therapeutics. PMID:23840300

  20. Microarray Data Mining for Potential Selenium Targets in Chemoprevention of Prostate Cancer

    PubMed Central

    ZHANG, HAITAO; DONG, YAN; ZHAO, HONGJUAN; BROOKS, JAMES D.; HAWTHORN, LESLEYANN; NOWAK, NORMA; MARSHALL, JAMES R.; GAO, ALLEN C.; IP, CLEMENT

    2008-01-01

    Background A previous clinical trial showed that selenium supplementation significantly reduced the incidence of prostate cancer. We report here a bioinformatics approach to gain new insights into selenium molecular targets that might be relevant to prostate cancer chemoprevention. Materials and Methods We first performed data mining analysis to identify genes which are consistently dysregulated in prostate cancer using published datasets from gene expression profiling of clinical prostate specimens. We then devised a method to systematically analyze three selenium microarray datasets from the LNCaP human prostate cancer cells, and to match the analysis to the cohort of genes implicated in prostate carcinogenesis. Moreover, we compared the selenium datasets with two datasets obtained from expression profiling of androgen-stimulated LNCaP cells. Results We found that selenium reverses the expression of genes implicated in prostate carcinogenesis. In addition, we found that selenium could counteract the effect of androgen on the expression of a subset obtained from androgen-regulated genes. Conclusions The above information provides us with a treasure of new clues to investigate the mechanism of selenium chemoprevention of prostate cancer. Furthermore, these selenium target genes could also serve as biomarkers in future clinical trials to gauge the efficacy of selenium intervention. PMID:18548127

  1. Transcriptome analysis of Petunia axillaris flowers reveals genes involved in morphological differentiation and metabolite transport

    PubMed Central

    Amano, Ikuko; Kitajima, Sakihito; Suzuki, Hideyuki; Koeduka, Takao

    2018-01-01

    The biosynthesis of plant secondary metabolites is associated with morphological and metabolic differentiation. As a consequence, gene expression profiles can change drastically, and primary and secondary metabolites, including intermediate and end-products, move dynamically within and between cells. However, little is known about the molecular mechanisms underlying differentiation and transport mechanisms. In this study, we performed a transcriptome analysis of Petunia axillaris subsp. parodii, which produces various volatiles in its corolla limbs and emits metabolites to attract pollinators. RNA-sequencing from leaves, buds, and limbs identified 53,243 unigenes. Analysis of differentially expressed genes, combined with gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses, showed that many biological processes were highly enriched in limbs. These included catabolic processes and signaling pathways of hormones, such as gibberellins, and metabolic pathways, including phenylpropanoids and fatty acids. Moreover, we identified five transporter genes that showed high expression in limbs, and we performed spatiotemporal expression analyses and homology searches to infer their putative functions. Our systematic analysis provides comprehensive transcriptomic information regarding morphological differentiation and metabolite transport in the Petunia flower and lays the foundation for establishing the specific mechanisms that control secondary metabolite biosynthesis in plants. PMID:29902274

  2. Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress.

    PubMed

    Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

    2017-04-21

    Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species.

  3. Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis

    PubMed Central

    Grassi, Elena; Damasco, Christian; Silengo, Lorenzo; Oti, Martin; Provero, Paolo; Di Cunto, Ferdinando

    2008-01-01

    Background Even in the post-genomic era, the identification of candidate genes within loci associated with human genetic diseases is a very demanding task, because the critical region may typically contain hundreds of positional candidates. Since genes implicated in similar phenotypes tend to share very similar expression profiles, high throughput gene expression data may represent a very important resource to identify the best candidates for sequencing. However, so far, gene coexpression has not been used very successfully to prioritize positional candidates. Methodology/Principal Findings We show that it is possible to reliably identify disease-relevant relationships among genes from massive microarray datasets by concentrating only on genes sharing similar expression profiles in both human and mouse. Moreover, we show systematically that the integration of human-mouse conserved coexpression with a phenotype similarity map allows the efficient identification of disease genes in large genomic regions. Finally, using this approach on 850 OMIM loci characterized by an unknown molecular basis, we propose high-probability candidates for 81 genetic diseases. Conclusion Our results demonstrate that conserved coexpression, even at the human-mouse phylogenetic distance, represents a very strong criterion to predict disease-relevant relationships among human genes. PMID:18369433

  4. Fungal Gene Expression on Demand: an Inducible, Tunable, and Metabolism-Independent Expression System for Aspergillus niger▿†

    PubMed Central

    Meyer, Vera; Wanka, Franziska; van Gent, Janneke; Arentshorst, Mark; van den Hondel, Cees A. M. J. J.; Ram, Arthur F. J.

    2011-01-01

    Filamentous fungi are the cause of serious human and plant diseases but are also exploited in biotechnology as production platforms. Comparative genomics has documented their genetic diversity, and functional genomics and systems biology approaches are under way to understand the functions and interaction of fungal genes and proteins. In these approaches, gene functions are usually inferred from deletion or overexpression mutants. However, studies at these extreme points give only limited information. Moreover, many overexpression studies use metabolism-dependent promoters, often causing pleiotropic effects and thus limitations in their significance. We therefore established and systematically evaluated a tunable expression system for Aspergillus niger that is independent of carbon and nitrogen metabolism and silent under noninduced conditions. The system consists of two expression modules jointly targeted to a defined genomic locus. One module ensures constitutive expression of the tetracycline-dependent transactivator rtTA2S-M2, and one module harbors the rtTA2S-M2-dependent promoter that controls expression of the gene of interest (the Tet-on system). We show here that the system is tight, responds within minutes after inducer addition, and allows fine-tuning based on the inducer concentration or gene copy number up to expression levels higher than the expression levels of the gpdA promoter. We also validate the Tet-on system for the generation of conditional overexpression mutants and demonstrate its power when combined with a gene deletion approach. Finally, we show that the system is especially suitable when the functions of essential genes must be examined. PMID:21378046

  5. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.

  6. Decreased expression of the stress protein HSP70 is an early event in murine erythroleukemic cell differentiation.

    PubMed Central

    Hensold, J O; Housman, D E

    1988-01-01

    Two-dimensional protein gels were used to systematically assess changes in gene expression in Friend erythroleukemia cells after exposure to inducers of differentiation. A rapid decrease in expression of the stress protein HSP70 was observed after exposure to inducers. The kinetics of this change suggest that it may be related to the cellular events that regulate the onset of differentiation. Images PMID:3164440

  7. Rejuvenation of Gene Expression Pattern of Aged Human Skin by Broadband Light Treatment: A Pilot Study

    PubMed Central

    Chang, Anne Lynn S; Bitter, Patrick H; Qu, Kun; Lin, Meihong; Rapicavoli, Nicole A; Chang, Howard Y

    2013-01-01

    Studies in model organisms suggest that aged cells can be functionally rejuvenated, but whether this concept applies to human skin is unclear. Here we apply 3′-end sequencing for expression quantification (“3-seq”) to discover the gene expression program associated with human photoaging and intrinsic skin aging (collectively termed “skin aging”), and the impact of broadband light (BBL) treatment. We find that skin aging was associated with a significantly altered expression level of 2,265 coding and noncoding RNAs, of which 1,293 became “rejuvenated” after BBL treatment; i.e., they became more similar to their expression level in youthful skin. Rejuvenated genes (RGs) included several known key regulators of organismal longevity and their proximal long noncoding RNAs. Skin aging is not associated with systematic changes in 3′-end mRNA processing. Hence, BBL treatment can restore gene expression pattern of photoaged and intrinsically aged human skin to resemble young skin. In addition, our data reveal, to our knowledge, a previously unreported set of targets that may lead to new insights into the human skin aging process. PMID:22931923

  8. Impaired Cytogenetic Damage Repair and Cell Cycle Regulation in Response to Ionizing Radiation in Human Fibroblast Cells with Individual Knock-down of 25 Genes

    NASA Technical Reports Server (NTRS)

    Zhang, Ye; Rohde, Larry; Emami, Kamal; Hammond, Dianne; Casey, Rachael; Mehta, Satish; Jeevarajan, Antony; Pierson, Duane; Wu, Honglu

    2008-01-01

    Changes of gene expression profile are one of the most important biological responses in living cells after ionizing radiation (IR) exposure. Although some studies have demonstrated that genes with upregulated expression induced by IR may play important roles in DNA damage sensing, cell cycle checkpoint and chromosomal repair, the relationship between the regulation of gene expression by IR and its impact on cytogenetic responses to ionizing radiation has not been systematically studied. In our present study, the expression of 25 genes selected based on their transcriptional changes in response to IR or from their known DNA repair roles were individually knocked down by siRNA transfection in human fibroblast cells. Chromosome aberrations (CA) and micronuclei (MN) formation were measured as the cytogenetic endpoints. Our results showed that the yield of MN and/or CA formation were significantly increased by suppressed expression of 5 genes that included Ku70 in the DSB repair pathway; XPA in the NER pathway; RPA1 in the MMR pathway; RAD17 and RBBP8 in cell cycle control. Knocked-down expression of 4 genes including MRE11A, RAD51 in the DSB pathway, and SESN1 and SUMO1 showed significant inhibition of cell cycle progression, possibly because of severe impairment of DNA damage repair. Furthermore, loss of XPA, p21 and MLH1 expression resulted in both enhanced cell cycle progression and significantly higher yield of cytogenetic damage, indicating the involvement of these gene products in both cell cycle control and DNA damage repair. Of these 11 genes that affected the cytogenetic response, 9 were up-regulated in the cells exposed to gamma radiation, suggesting that genes transcriptionally modulated by IR were critical to regulating the biological consequences after IR. Failure to express these IR-responsive genes, such as by gene mutation, could seriously change the outcome of the post IR scenario and lead to carcinogenesis.

  9. Selection of internal control genes for quantitative real-time RT-PCR studies during tomato development process

    PubMed Central

    Expósito-Rodríguez, Marino; Borges, Andrés A; Borges-Pérez, Andrés; Pérez, José A

    2008-01-01

    Background The elucidation of gene expression patterns leads to a better understanding of biological processes. Real-time quantitative RT-PCR has become the standard method for in-depth studies of gene expression. A biologically meaningful reporting of target mRNA quantities requires accurate and reliable normalization in order to identify real gene-specific variation. The purpose of normalization is to control several variables such as different amounts and quality of starting material, variable enzymatic efficiencies of retrotranscription from RNA to cDNA, or differences between tissues or cells in overall transcriptional activity. The validity of a housekeeping gene as endogenous control relies on the stability of its expression level across the sample panel being analysed. In the present report we describe the first systematic evaluation of potential internal controls during tomato development process to identify which are the most reliable for transcript quantification by real-time RT-PCR. Results In this study, we assess the expression stability of 7 traditional and 4 novel housekeeping genes in a set of 27 samples representing different tissues and organs of tomato plants at different developmental stages. First, we designed, tested and optimized amplification primers for real-time RT-PCR. Then, expression data from each candidate gene were evaluated with three complementary approaches based on different statistical procedures. Our analysis suggests that SGN-U314153 (CAC), SGN-U321250 (TIP41), SGN-U346908 ("Expressed") and SGN-U316474 (SAND) genes provide superior transcript normalization in tomato development studies. We recommend different combinations of these exceptionally stable housekeeping genes for suited normalization of different developmental series, including the complete tomato development process. Conclusion This work constitutes the first effort for the selection of optimal endogenous controls for quantitative real-time RT-PCR studies of gene expression during tomato development process. From our study a tool-kit of control genes emerges that outperform the traditional genes in terms of expression stability. PMID:19102748

  10. Genome-Wide Investigation and Expression Profiling of AP2/ERF Transcription Factor Superfamily in Foxtail Millet (Setaria italica L.)

    PubMed Central

    Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses. PMID:25409524

  11. Genome-wide investigation and expression profiling of AP2/ERF transcription factor superfamily in foxtail millet (Setaria italica L.).

    PubMed

    Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses.

  12. A Systematic Approach to Time-series Metabolite Profiling and RNA-seq Analysis of Chinese Hamster Ovary Cell Culture.

    PubMed

    Hsu, Han-Hsiu; Araki, Michihiro; Mochizuki, Masao; Hori, Yoshimi; Murata, Masahiro; Kahar, Prihardi; Yoshida, Takanobu; Hasunuma, Tomohisa; Kondo, Akihiko

    2017-03-02

    Chinese hamster ovary (CHO) cells are the primary host used for biopharmaceutical protein production. The engineering of CHO cells to produce higher amounts of biopharmaceuticals has been highly dependent on empirical approaches, but recent high-throughput "omics" methods are changing the situation in a rational manner. Omics data analyses using gene expression or metabolite profiling make it possible to identify key genes and metabolites in antibody production. Systematic omics approaches using different types of time-series data are expected to further enhance understanding of cellular behaviours and molecular networks for rational design of CHO cells. This study developed a systematic method for obtaining and analysing time-dependent intracellular and extracellular metabolite profiles, RNA-seq data (enzymatic mRNA levels) and cell counts from CHO cell cultures to capture an overall view of the CHO central metabolic pathway (CMP). We then calculated correlation coefficients among all the profiles and visualised the whole CMP by heatmap analysis and metabolic pathway mapping, to classify genes and metabolites together. This approach provides an efficient platform to identify key genes and metabolites in CHO cell culture.

  13. Generation of mammalian cells stably expressing multiple genes at predetermined levels.

    PubMed

    Liu, X; Constantinescu, S N; Sun, Y; Bogan, J S; Hirsch, D; Weinberg, R A; Lodish, H F

    2000-04-10

    Expression of cloned genes at desired levels in cultured mammalian cells is essential for studying protein function. Controlled levels of expression have been difficult to achieve, especially for cell lines with low transfection efficiency or when expression of multiple genes is required. An internal ribosomal entry site (IRES) has been incorporated into many types of expression vectors to allow simultaneous expression of two genes. However, there has been no systematic quantitative analysis of expression levels in individual cells of genes linked by an IRES, and thus the broad use of these vectors in functional analysis has been limited. We constructed a set of retroviral expression vectors containing an IRES followed by a quantitative selectable marker such as green fluorescent protein (GFP) or truncated cell surface proteins CD2 or CD4. The gene of interest is placed in a multiple cloning site 5' of the IRES sequence under the control of the retroviral long terminal repeat (LTR) promoter. These vectors exploit the approximately 100-fold differences in levels of expression of a retrovirus vector depending on its site of insertion in the host chromosome. We show that the level of expression of the gene downstream of the IRES and the expression level and functional activity of the gene cloned upstream of the IRES are highly correlated in stably infected target cells. This feature makes our vectors extremely useful for the rapid generation of stably transfected cell populations or clonal cell lines expressing specific amounts of a desired protein simply by fluorescent activated cell sorting (FACS) based on the level of expression of the gene downstream of the IRES. We show how these vectors can be used to generate cells expressing high levels of the erythropoietin receptor (EpoR) or a dominant negative Smad3 protein and to generate cells expressing two different cloned proteins, Ski and Smad4. Correlation of a biologic effect with the level of expression of the protein downstream of the IRES provides strong evidence for the function of the protein placed upstream of the IRES.

  14. GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

    PubMed Central

    Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.

    2011-01-01

    GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647

  15. Transcriptome display during tilapia sex determination and differentiation as revealed by RNA-Seq analysis.

    PubMed

    Tao, Wenjing; Chen, Jinlin; Tan, Dejie; Yang, Jing; Sun, Lina; Wei, Jing; Conte, Matthew A; Kocher, Thomas D; Wang, Deshou

    2018-05-15

    The factors determining sex in teleosts are diverse. Great efforts have been made to characterize the underlying genetic network in various species. However, only seven master sex-determining genes have been identified in teleosts. While the function of a few genes involved in sex determination and differentiation has been studied, we are far from fully understanding how genes interact to coordinate in this process. To enable systematic insights into fish sexual differentiation, we generated a dynamic co-expression network from tilapia gonadal transcriptomes at 5, 20, 30, 40, 90, and 180 dah (days after hatching), plus 45 and 90 dat (days after treatment) and linked gene expression profiles to both development and sexual differentiation. Transcriptomic profiles of female and male gonads at 5 and 20 dah exhibited high similarities except for a small number of genes that were involved in sex determination, while drastic changes were observed from 90 to 180 dah, with a group of differently expressed genes which were involved in gonadal differentiation and gametogenesis. Weighted gene correlation network analysis identified changes in the expression of Borealin, Gtsf1, tesk1, Zar1, Cdn15, and Rpl that were correlated with the expression of genes previously known to be involved in sex differentiation, such as Foxl2, Cyp19a1a, Gsdf, Dmrt1, and Amh. Global gonadal gene expression kinetics during sex determination and differentiation have been extensively profiled in tilapia. These findings provide insights into the genetic framework underlying sex determination and sexual differentiation, and expand our current understanding of developmental pathways during teleost sex determination.

  16. Genomic Organization, Phylogenetic and Expression Analysis of the B-BOX Gene Family in Tomato

    PubMed Central

    Chu, Zhuannan; Wang, Xin; Li, Ying; Yu, Huiyang; Li, Jinhua; Lu, Yongen; Li, Hanxia; Ouyang, Bo

    2016-01-01

    The B-BOX (BBX) proteins encode a class of zinc-finger transcription factors possessing one or two B-BOX domains and in some cases an additional CCT (CO, CO-like and TOC1) motif, which play important roles in regulating plant growth, development and stress response. Nevertheless, no systematic study of BBX genes has undertaken in tomato (Solanum lycopersicum). Here we present the results of a genome-wide analysis of the 29 BBX genes in this important vegetable species. Their structures, conserved domains, phylogenetic relationships, subcellular localizations, and promoter cis-regulatory elements were analyzed; their tissue expression profiles and expression patterns under various hormones and stress treatments were also investigated in detail. Tomato BBX genes can be divided into five subfamilies, and twelve of them were found to be segmentally duplicated. Real-time quantitative PCR analysis showed that most BBX genes exhibited different temporal and spatial expression patterns. The expression of most BBX genes can be induced by drought, polyethylene glycol-6000 or heat stress. Some BBX genes were induced strongly by phytohormones such as abscisic acid, gibberellic acid, or ethephon. The majority of tomato BBX proteins was predicted to be located in nuclei, and the transient expression assay using Arabidopsis mesophyll protoplasts demonstrated that all the seven BBX members tested (SlBBX5, 7, 15, 17, 20, 22, and 24) were localized in nucleus. Our analysis of tomato BBX genes on the genome scale would provide valuable information for future functional characterization of specific genes in this family. PMID:27807440

  17. Proteome and Transcriptome Analysis of Ovary, Intersex Gonads, and Testis Reveals Potential Key Sex Reversal/Differentiation Genes and Mechanism in Scallop Chlamys nobilis.

    PubMed

    Shi, Yu; Liu, Wenguang; He, Maoxian

    2018-04-01

    Bivalve mollusks exhibit hermaphroditism and sex reversal/differentiation. Studies generally focus on transcriptional profiling and specific genes related to sex determination and differentiation. Few studies on sex reversal/differentiation have been reported. A combination analysis of gonad proteomics and transcriptomics was conducted on Chlamys nobilis to provide a systematic understanding of sex reversal/differentiation in bivalves. We obtained 4258 unique peptides and 93,731 unigenes with good correlation between messenger RNA and protein levels. Candidate genes in sex reversal/differentiation were found: 15 genes differentially expressed between sexes were identified and 12 had obvious sexual functions. Three novel genes (foxl2, β-catenin, and sry) were expressed highly in intersex individuals and were likely involved in the control of gonadal sex in C. nobilis. High expression of foxl2 or β-catenin may inhibit sry and activate 5-HT receptor and vitellogenin to maintain female development. High expression of sry may inhibit foxl2 and β-catenin and activate dmrt2, fem-1, sfp2, sa6, Amy-1, APCP4, and PLK to maintain male function. High expression of sry, foxl2, and β-catenin in C. nobilis may be involved in promoting and maintaining sex reversal/differentiation. The downstream regulator may not be dimorphic expressed genes, but genes expressed in intersex individuals, males and females. Different expression patterns of sex-related genes and gonadal histological characteristics suggested that C. nobilis may change its sex from male to female. These findings suggest highly conserved sex reversal/differentiation with diverged regulatory pathways during C. nobilis evolution. This study provides valuable genetic resources for understanding sex reversal/differentiation (intersex) mechanisms and pathways underlying bivalve reproductive regulation.

  18. Expression profiling and bioinformatic analyses suggest new target genes and pathways for human hair follicle related microRNAs.

    PubMed

    Hochfeld, Lara M; Anhalt, Thomas; Reinbold, Céline S; Herrera-Rivero, Marisol; Fricker, Nadine; Nöthen, Markus M; Heilmann-Heimbach, Stefanie

    2017-02-22

    Human hair follicle (HF) cycling is characterised by the tight orchestration and regulation of signalling cascades. Research shows that micro(mi)RNAs are potent regulators of these pathways. However, knowledge of the expression of miRNAs and their target genes and pathways in the human HF is limited. The objective of this study was to improve understanding of the role of miRNAs and their regulatory interactions in the human HF. Expression levels of ten candidate miRNAs with reported functions in hair biology were assessed in HFs from 25 healthy male donors. MiRNA expression levels were correlated with mRNA-expression levels from the same samples. Identified target genes were tested for enrichment in biological pathways and accumulation in protein-protein interaction (PPI) networks. Expression in the human HF was confirmed for seven of the ten candidate miRNAs, and numerous target genes for miR-24, miR-31, and miR-106a were identified. While the latter include several genes with known functions in hair biology (e.g., ITGB1, SOX9), the majority have not been previously implicated (e.g., PHF1). Target genes were enriched in pathways of interest to hair biology, such as integrin and GnRH signalling, and the respective gene products showed accumulation in PPIs. Further investigation of miRNA expression in the human HF, and the identification of novel miRNA target genes and pathways via the systematic integration of miRNA and mRNA expression data, may facilitate the delineation of tissue-specific regulatory interactions, and improve our understanding of both normal hair growth and the pathobiology of hair loss disorders.

  19. Study on the Correlation between Gene Expression and Enzyme Activity of Seven Key Enzymes and Ginsenoside Content in Ginseng in Over Time in Ji'an, China.

    PubMed

    Yin, Juxin; Zhang, Daihui; Zhuang, Jianjian; Huang, Yi; Mu, Ying; Lv, Shaowu

    2017-12-11

    Panax ginseng is a traditional medicine. Fresh ginseng is one of the most important industries related to ginseng development, and fresh ginseng of varying ages has different medicinal properties. Previous research has not systematically reported the correlation between changes in key enzyme activity with changes in ginsenoside content in fresh ginseng over time. In this study, for the first time, we use ginseng samples of varying ages in Ji'an and systematically reported the changes in the activity of seven key enzymes (HMGR, FPS, SS, SE, DS, CYP450, and GT). We investigated the content of ginsenoside and gene expression of these key enzymes. Ginsenoside content was measured using HPLC. HPLC, GC-MS, and LC-MS were combined to measure the enzyme activity of the key enzymes. Quantitative PCR was used in the investigation of gene expression. By analyzing the correlation between the enzyme activity and the transcription level of the key enzymes with ginsenoside content, we found that DS and GT enzyme activities are significantly correlated with the ginsenoside content in different ages of ginseng. Our findings might provide a new strategy to discriminate between ginseng of different years. Meanwhile, this research provides important information for the in-depth study of ginsenoside biosynthesis.

  20. Network-based co-expression analysis for exploring the potential diagnostic biomarkers of metastatic melanoma.

    PubMed

    Wang, Li-Xin; Li, Yang; Chen, Guan-Zhi

    2018-01-01

    Metastatic melanoma is an aggressive skin cancer and is one of the global malignancies with high mortality and morbidity. It is essential to identify and verify diagnostic biomarkers of early metastatic melanoma. Previous studies have systematically assessed protein biomarkers and mRNA-based expression characteristics. However, molecular markers for the early diagnosis of metastatic melanoma have not been identified. To explore potential regulatory targets, we have analyzed the gene microarray expression profiles of malignant melanoma samples by co-expression analysis based on the network approach. The differentially expressed genes (DEGs) were screened by the EdgeR package of R software. A weighted gene co-expression network analysis (WGCNA) was used for the identification of DEGs in the special gene modules and hub genes. Subsequently, a protein-protein interaction network was constructed to extract hub genes associated with gene modules. Finally, twenty-four important hub genes (RASGRP2, IKZF1, CXCR5, LTB, BLK, LINGO3, CCR6, P2RY10, RHOH, JUP, KRT14, PLA2G3, SPRR1A, KRT78, SFN, CLDN4, IL1RN, PKP3, CBLC, KRT16, TMEM79, KLK8, LYPD3 and LYPD5) were treated as valuable factors involved in the immune response and tumor cell development in tumorigenesis. In addition, a transcriptional regulatory network was constructed for these specific modules or hub genes, and a few core transcriptional regulators were found to be mostly associated with our hub genes, including GATA1, STAT1, SP1, and PSG1. In summary, our findings enhance our understanding of the biological process of malignant melanoma metastasis, enabling us to identify specific genes to use for diagnostic and prognostic markers and possibly for targeted therapy.

  1. Genome-wide organization and expression profiling of the R2R3-MYB transcription factor family in pineapple (Ananas comosus).

    PubMed

    Liu, Chaoyang; Xie, Tao; Chen, Chenjie; Luan, Aiping; Long, Jianmei; Li, Chuhao; Ding, Yaqi; He, Yehua

    2017-07-01

    The MYB proteins comprise one of the largest families of plant transcription factors, which are involved in various plant physiological and biochemical processes. Pineapple (Ananas comosus) is one of three most important tropical fruits worldwide. The completion of pineapple genome sequencing provides a great opportunity to investigate the organization and evolutionary traits of pineapple MYB genes at the genome-wide level. In the present study, a total of 94 pineapple R2R3-MYB genes were identified and further phylogenetically classified into 26 subfamilies, as supported by the conserved gene structures and motif composition. Collinearity analysis indicated that the segmental duplication events played a crucial role in the expansion of pineapple MYB gene family. Further comparative phylogenetic analysis suggested that there have been functional divergences of MYB gene family during plant evolution. RNA-seq data from different tissues and developmental stages revealed distinct temporal and spatial expression profiles of the AcMYB genes. Further quantitative expression analysis showed the specific expression patterns of the selected putative stress-related AcMYB genes in response to distinct abiotic stress and hormonal treatments. The comprehensive expression analysis of the pineapple MYB genes, especially the tissue-preferential and stress-responsive genes, could provide valuable clues for further function characterization. In this work, we systematically identified AcMYB genes by analyzing the pineapple genome sequence using a set of bioinformatics approaches. Our findings provide a global insight into the organization, phylogeny and expression patterns of the pineapple R2R3-MYB genes, and hence contribute to the greater understanding of their biological roles in pineapple.

  2. Prenatal Nutritional Deficiency Reprogrammed Postnatal Gene Expression in Mammal Brains: Implications for Schizophrenia

    PubMed Central

    Xu, Jiawei; He, Guang; Zhu, Jingde; Zhou, Xinyao; St Clair, David; Wang, Teng; Xiang, Yuqian; Zhao, Qingzhu; Xing, Qinghe; Liu, Yun; Wang, Lei; Li, Qiaoli

    2015-01-01

    Background: Epidemiological studies have identified prenatal exposure to famine as a risk factor for schizophrenia, and animal models of prenatal malnutrition display structural and functional brain abnormalities implicated in schizophrenia. Methods: The offspring of the RLP50 rat, a recently developed animal model of prenatal famine malnutrition exposure, was used to investigate the changes of gene expression and epigenetic modifications in the brain regions. Microarray gene expression analysis was carried out in the prefrontal cortex and the hippocampus from 8 RLP50 offspring rats and 8 controls. MBD-seq was used to test the changes in DNA methylation in hippocampus depending on prenatal malnutrition exposure. Results: In the prefrontal cortex, offspring of RLP50 exhibit differences in neurotransmitters and olfactory-associated gene expression. In the hippocampus, the differentially-expressed genes are related to synaptic function and transcription regulation. DNA methylome profiling of the hippocampus also shows widespread but systematic epigenetic changes; in most cases (87%) this involves hypermethylation. Remarkably, genes encoded for the plasma membrane are significantly enriched for changes in both gene expression and DNA methylome profiling screens (p = 2.37×10–9 and 5.36×10–9, respectively). Interestingly, Mecp2 and Slc2a1, two genes associated with cognitive impairment, show significant down-regulation, and Slc2a1 is hypermethylated in the hippocampus of the RLP50 offspring. Conclusions: Collectively, our results indicate that prenatal exposure to malnutrition leads to the reprogramming of postnatal brain gene expression and that the epigenetic modifications contribute to the reprogramming. The process may impair learning and memory ability and result in higher susceptibility to schizophrenia. PMID:25522397

  3. Mapping the Shh long-range regulatory domain

    PubMed Central

    Anderson, Eve; Devenney, Paul S.; Hill, Robert E.; Lettice, Laura A.

    2014-01-01

    Coordinated gene expression controlled by long-distance enhancers is orchestrated by DNA regulatory sequences involving transcription factors and layers of control mechanisms. The Shh gene and well-established regulators are an example of genomic composition in which enhancers reside in a large desert extending into neighbouring genes to control the spatiotemporal pattern of expression. Exploiting the local hopping activity of the Sleeping Beauty transposon, the lacZ reporter gene was dispersed throughout the Shh region to systematically map the genomic features responsible for expression activity. We found that enhancer activities are retained inside a genomic region that corresponds to the topological associated domain (TAD) defined by Hi-C. This domain of approximately 900 kb is in an open conformation over its length and is generally susceptible to all Shh enhancers. Similar to the distal enhancers, an enhancer residing within the Shh second intron activates the reporter gene located at distances of hundreds of kilobases away, suggesting that both proximal and distal enhancers have the capacity to survey the Shh topological domain to recognise potential promoters. The widely expressed Rnf32 gene lying within the Shh domain evades enhancer activities by a process that may be common among other housekeeping genes that reside in large regulatory domains. Finally, the boundaries of the Shh TAD do not represent the absolute expression limits of enhancer activity, as expression activity is lost stepwise at a number of genomic positions at the verges of these domains. PMID:25252942

  4. A systematic evaluation of expression of HERV-W elements; influence of genomic context, viral structure and orientation

    PubMed Central

    2011-01-01

    Background One member of the W family of human endogenous retroviruses (HERV) appears to have been functionally adopted by the human host. Nevertheless, a highly diversified and regulated transcription from a range of HERV-W elements has been observed in human tissues and cells. Aberrant expression of members of this family has also been associated with human disease such as multiple sclerosis (MS) and schizophrenia. It is not known whether this broad expression of HERV-W elements represents transcriptional leakage or specific transcription initiated from the retroviral promoter in the long terminal repeat (LTR) region. Therefore, potential influences of genomic context, structure and orientation on the expression levels of individual HERV-W elements in normal human tissues were systematically investigated. Results Whereas intronic HERV-W elements with a pseudogene structure exhibited a strong anti-sense orientation bias, intronic elements with a proviral structure and solo LTRs did not. Although a highly variable expression across tissues and elements was observed, systematic effects of context, structure and orientation were also observed. Elements located in intronic regions appeared to be expressed at higher levels than elements located in intergenic regions. Intronic elements with proviral structures were expressed at higher levels than those elements bearing hallmarks of processed pseudogenes or solo LTRs. Relative to their corresponding genes, intronic elements integrated on the sense strand appeared to be transcribed at higher levels than those integrated on the anti-sense strand. Moreover, the expression of proviral elements appeared to be independent from that of their corresponding genes. Conclusions Intronic HERV-W provirus integrations on the sense strand appear to have elicited a weaker negative selection than pseudogene integrations of transcripts from such elements. Our current findings suggest that the previously observed diversified and tissue-specific expression of elements in the HERV-W family is the result of both directed transcription (involving both the LTR and internal sequence) and leaky transcription of HERV-W elements in normal human tissues. PMID:21226900

  5. Differential Responses to Wnt and PCP Disruption Predict Expression and Developmental Function of Conserved and Novel Genes in a Cnidarian

    PubMed Central

    Lapébie, Pascal; Ruggiero, Antonella; Barreau, Carine; Chevalier, Sandra; Chang, Patrick; Dru, Philippe; Houliston, Evelyn; Momose, Tsuyoshi

    2014-01-01

    We have used Digital Gene Expression analysis to identify, without bilaterian bias, regulators of cnidarian embryonic patterning. Transcriptome comparison between un-manipulated Clytia early gastrula embryos and ones in which the key polarity regulator Wnt3 was inhibited using morpholino antisense oligonucleotides (Wnt3-MO) identified a set of significantly over and under-expressed transcripts. These code for candidate Wnt signaling modulators, orthologs of other transcription factors, secreted and transmembrane proteins known as developmental regulators in bilaterian models or previously uncharacterized, and also many cnidarian-restricted proteins. Comparisons between embryos injected with morpholinos targeting Wnt3 and its receptor Fz1 defined four transcript classes showing remarkable correlation with spatiotemporal expression profiles. Class 1 and 3 transcripts tended to show sustained expression at “oral” and “aboral” poles respectively of the developing planula larva, class 2 transcripts in cells ingressing into the endodermal region during gastrulation, while class 4 gene expression was repressed at the early gastrula stage. The preferential effect of Fz1-MO on expression of class 2 and 4 transcripts can be attributed to Planar Cell Polarity (PCP) disruption, since it was closely matched by morpholino knockdown of the specific PCP protein Strabismus. We conclude that endoderm and post gastrula-specific gene expression is particularly sensitive to PCP disruption while Wnt-/β-catenin signaling dominates gene regulation along the oral-aboral axis. Phenotype analysis using morpholinos targeting a subset of transcripts indicated developmental roles consistent with expression profiles for both conserved and cnidarian-restricted genes. Overall our unbiased screen allowed systematic identification of regionally expressed genes and provided functional support for a shared eumetazoan developmental regulatory gene set with both predicted and previously unexplored members, but also demonstrated that fundamental developmental processes including axial patterning and endoderm formation in cnidarians can involve newly evolved (or highly diverged) genes. PMID:25233086

  6. Differential responses to Wnt and PCP disruption predict expression and developmental function of conserved and novel genes in a cnidarian.

    PubMed

    Lapébie, Pascal; Ruggiero, Antonella; Barreau, Carine; Chevalier, Sandra; Chang, Patrick; Dru, Philippe; Houliston, Evelyn; Momose, Tsuyoshi

    2014-09-01

    We have used Digital Gene Expression analysis to identify, without bilaterian bias, regulators of cnidarian embryonic patterning. Transcriptome comparison between un-manipulated Clytia early gastrula embryos and ones in which the key polarity regulator Wnt3 was inhibited using morpholino antisense oligonucleotides (Wnt3-MO) identified a set of significantly over and under-expressed transcripts. These code for candidate Wnt signaling modulators, orthologs of other transcription factors, secreted and transmembrane proteins known as developmental regulators in bilaterian models or previously uncharacterized, and also many cnidarian-restricted proteins. Comparisons between embryos injected with morpholinos targeting Wnt3 and its receptor Fz1 defined four transcript classes showing remarkable correlation with spatiotemporal expression profiles. Class 1 and 3 transcripts tended to show sustained expression at "oral" and "aboral" poles respectively of the developing planula larva, class 2 transcripts in cells ingressing into the endodermal region during gastrulation, while class 4 gene expression was repressed at the early gastrula stage. The preferential effect of Fz1-MO on expression of class 2 and 4 transcripts can be attributed to Planar Cell Polarity (PCP) disruption, since it was closely matched by morpholino knockdown of the specific PCP protein Strabismus. We conclude that endoderm and post gastrula-specific gene expression is particularly sensitive to PCP disruption while Wnt-/β-catenin signaling dominates gene regulation along the oral-aboral axis. Phenotype analysis using morpholinos targeting a subset of transcripts indicated developmental roles consistent with expression profiles for both conserved and cnidarian-restricted genes. Overall our unbiased screen allowed systematic identification of regionally expressed genes and provided functional support for a shared eumetazoan developmental regulatory gene set with both predicted and previously unexplored members, but also demonstrated that fundamental developmental processes including axial patterning and endoderm formation in cnidarians can involve newly evolved (or highly diverged) genes.

  7. Developmental genes significantly afflicted by aberrant promoter methylation and somatic mutation predict overall survival of late-stage colorectal cancer

    PubMed Central

    An, Ning; Yang, Xue; Cheng, Shujun; Wang, Guiqi; Zhang, Kaitai

    2015-01-01

    Carcinogenesis is an exceedingly complicated process, which involves multi-level dysregulations, including genomics (majorly caused by somatic mutation and copy number variation), DNA methylomics, and transcriptomics. Therefore, only looking into one molecular level of cancer is not sufficient to uncover the intricate underlying mechanisms. With the abundant resources of public available data in the Cancer Genome Atlas (TCGA) database, an integrative strategy was conducted to systematically analyze the aberrant patterns of colorectal cancer on the basis of DNA copy number, promoter methylation, somatic mutation and gene expression. In this study, paired samples in each genomic level were retrieved to identify differentially expressed genes with corresponding genetic or epigenetic dysregulations. Notably, the result of gene ontology enrichment analysis indicated that the differentially expressed genes with corresponding aberrant promoter methylation or somatic mutation were both functionally concentrated upon developmental process, suggesting the intimate association between development and carcinogenesis. Thus, by means of random walk with restart, 37 significant development-related genes were retrieved from a priori-knowledge based biological network. In five independent microarray datasets, Kaplan–Meier survival and Cox regression analyses both confirmed that the expression of these genes was significantly associated with overall survival of Stage III/IV colorectal cancer patients. PMID:26691761

  8. Developmental genes significantly afflicted by aberrant promoter methylation and somatic mutation predict overall survival of late-stage colorectal cancer.

    PubMed

    An, Ning; Yang, Xue; Cheng, Shujun; Wang, Guiqi; Zhang, Kaitai

    2015-12-22

    Carcinogenesis is an exceedingly complicated process, which involves multi-level dysregulations, including genomics (majorly caused by somatic mutation and copy number variation), DNA methylomics, and transcriptomics. Therefore, only looking into one molecular level of cancer is not sufficient to uncover the intricate underlying mechanisms. With the abundant resources of public available data in the Cancer Genome Atlas (TCGA) database, an integrative strategy was conducted to systematically analyze the aberrant patterns of colorectal cancer on the basis of DNA copy number, promoter methylation, somatic mutation and gene expression. In this study, paired samples in each genomic level were retrieved to identify differentially expressed genes with corresponding genetic or epigenetic dysregulations. Notably, the result of gene ontology enrichment analysis indicated that the differentially expressed genes with corresponding aberrant promoter methylation or somatic mutation were both functionally concentrated upon developmental process, suggesting the intimate association between development and carcinogenesis. Thus, by means of random walk with restart, 37 significant development-related genes were retrieved from a priori-knowledge based biological network. In five independent microarray datasets, Kaplan-Meier survival and Cox regression analyses both confirmed that the expression of these genes was significantly associated with overall survival of Stage III/IV colorectal cancer patients.

  9. The presence of both negative and positive elements in the 5'-flanking sequence of the rat Na,K-ATPase alpha 3 subunit gene are required for brain expression in transgenic mice.

    PubMed Central

    Pathak, B G; Neumann, J C; Croyle, M L; Lingrel, J B

    1994-01-01

    The Na,K-ATPase is an integral plasma membrane protein consisting of alpha and beta subunits, each of which has discrete isoforms expressed in a tissue-specific manner. Of the three functional alpha isoform genes, the one encoding the alpha 3 isoform is the most tissue-restricted in its expression, being found primarily in the brain. To identify regions of the alpha 3 isoform gene that are involved in directing expression in the brain, a 1.6 kb 5'-flanking sequence was attached to a reporter gene, chloramphenicol acetyltransferase (CAT). The alpha 3-CAT chimeric gene construct was microinjected into fertilized mouse eggs, and transgenic mice were produced. Analysis of adult transgenic mice from different lines revealed that the transgene is expressed primarily in the brain. To further delineate regions that are needed for conferring expression in this tissue, systematic deletions of the 5'-flanking sequence of the alpha 3-CAT fusion constructs were made and analyzed, again using transgenic mice. The results from these analyses indicate that DNA sequences required for mediating brain-specific expression of the alpha 3 isoform gene are present within 210 bp upstream of the transcription initiation site. alpha 3-CAT promoter constructs containing scanning mutations in this region were also assayed in transgenic mice. These studies have identified both a functional neural-restrictive silencer element as well as a positively acting cis element. Images PMID:7984427

  10. Systems analysis of cis-regulatory motifs in C4 photosynthesis genes using maize and rice leaf transcriptomic data during a process of de-etiolation

    PubMed Central

    Xu, Jiajia; Bräutigam, Andrea; Weber, Andreas P. M.; Zhu, Xin-Guang

    2016-01-01

    Identification of potential cis-regulatory motifs controlling the development of C4 photosynthesis is a major focus of current research. In this study, we used time-series RNA-seq data collected from etiolated maize and rice leaf tissues sampled during a de-etiolation process to systematically characterize the expression patterns of C4-related genes and to further identify potential cis elements in five different genomic regions (i.e. promoter, 5′UTR, 3′UTR, intron, and coding sequence) of C4 orthologous genes. The results demonstrate that although most of the C4 genes show similar expression patterns, a number of them, including chloroplast dicarboxylate transporter 1, aspartate aminotransferase, and triose phosphate transporter, show shifted expression patterns compared with their C3 counterparts. A number of conserved short DNA motifs between maize C4 genes and their rice orthologous genes were identified not only in the promoter, 5′UTR, 3′UTR, and coding sequences, but also in the introns of core C4 genes. We also identified cis-regulatory motifs that exist in maize C4 genes and also in genes showing similar expression patterns as maize C4 genes but that do not exist in rice C3 orthologs, suggesting a possible recruitment of pre-existing cis-elements from genes unrelated to C4 photosynthesis into C4 photosynthesis genes during C4 evolution. PMID:27436282

  11. Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants

    PubMed Central

    Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

    2015-01-01

    Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5′ transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5′UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5′end can modulate protein levels up to 160%–300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes. PMID:26176266

  12. Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants.

    PubMed

    Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

    2015-01-01

    Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5' transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5'UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5'end can modulate protein levels up to 160%-300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes.

  13. Complex chromosomal neighborhood effects determine the adaptive potential of a gene under selection.

    PubMed

    Steinrueck, Magdalena; Guet, Călin C

    2017-07-25

    How the organization of genes on a chromosome shapes adaptation is essential for understanding evolutionary paths. Here, we investigate how adaptation to rapidly increasing levels of antibiotic depends on the chromosomal neighborhood of a drug-resistance gene inserted at different positions of the Escherichia coli chromosome. Using a dual-fluorescence reporter that allows us to distinguish gene amplifications from other up-mutations, we track in real-time adaptive changes in expression of the drug-resistance gene. We find that the relative contribution of several mutation types differs systematically between loci due to properties of neighboring genes: essentiality, expression, orientation, termination, and presence of duplicates. These properties determine rate and fitness effects of gene amplification, deletions, and mutations compromising transcriptional termination. Thus, the adaptive potential of a gene under selection is a system-property with a complex genetic basis that is specific for each chromosomal locus, and it can be inferred from detailed functional and genomic data.

  14. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    PubMed

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  15. dbMDEGA: a database for meta-analysis of differentially expressed genes in autism spectrum disorder.

    PubMed

    Zhang, Shuyun; Deng, Libin; Jia, Qiyue; Huang, Shaoting; Gu, Junwang; Zhou, Fankun; Gao, Meng; Sun, Xinyi; Feng, Chang; Fan, Guangqin

    2017-11-16

    Autism spectrum disorders (ASD) are hereditary, heterogeneous and biologically complex neurodevelopmental disorders. Individual studies on gene expression in ASD cannot provide clear consensus conclusions. Therefore, a systematic review to synthesize the current findings from brain tissues and a search tool to share the meta-analysis results are urgently needed. Here, we conducted a meta-analysis of brain gene expression profiles in the current reported human ASD expression datasets (with 84 frozen male cortex samples, 17 female cortex samples, 32 cerebellum samples and 4 formalin fixed samples) and knock-out mouse ASD model expression datasets (with 80 collective brain samples). Then, we applied R language software and developed an interactive shared and updated database (dbMDEGA) displaying the results of meta-analysis of data from ASD studies regarding differentially expressed genes (DEGs) in the brain. This database, dbMDEGA ( https://dbmdega.shinyapps.io/dbMDEGA/ ), is a publicly available web-portal for manual annotation and visualization of DEGs in the brain from data from ASD studies. This database uniquely presents meta-analysis values and homologous forest plots of DEGs in brain tissues. Gene entries are annotated with meta-values, statistical values and forest plots of DEGs in brain samples. This database aims to provide searchable meta-analysis results based on the current reported brain gene expression datasets of ASD to help detect candidate genes underlying this disorder. This new analytical tool may provide valuable assistance in the discovery of DEGs and the elucidation of the molecular pathogenicity of ASD. This database model may be replicated to study other disorders.

  16. Identification of two integration sites in favor of transgene expression in Trichoderma reesei.

    PubMed

    Qin, Lina; Jiang, Xianzhang; Dong, Zhiyang; Huang, Jianzhong; Chen, Xiuzhen

    2018-01-01

    The ascomycete fungus Trichoderma reesei was widely used as a biotechnological workhorse for production of cellulases and recombinant proteins due to its large capacity of protein secretion. Transgenesis by random integration of a gene of interest (GOI) into the genome of T. reesei can generate series of strains that express different levels of the indicated transgene. The insertion site of the GOI plays an important role in the ultimate production of the targeted proteins. However, so far no systematic studies have been made to identify transgene integration loci for optimal expression of the GOI in T. reesei . Currently, only the locus of exocellobiohydrolases I encoding gene ( cbh1) is widely used as a promising integration site to lead to high expression level of the GOI. No additional sites associated with efficient gene expression have been characterized. To search for gene integration sites that benefit for the secreted expression of GOI, the food-and-mouth disease virus 2A protein was applied for co-expression of an Aspergillus niger lipA gene and Discosoma sp. DsRed1 gene in T. reesei, by random integration of the expression cassette into the genome. We demonstrated that the fluorescent intensity of RFP (red fluorescent protein) inside of the cell was well correlated with the secreted lipase yields, based on which, we successfully developed a high-throughput screening method to screen strains with relatively higher secreted expression of the GOI (in this study, lipase). The copy number and the insertion sites of the transgene were investigated among the selected highly expressed strains. Eventually, in addition to cbh1 gene locus, two other genome insertion loci that efficiently facilitate gene expression in T. reesei were identified. We have successfully developed a high-throughput screening method to screen strains with optimal expression of the indicated secreted proteins in T. reesei . Moreover, we identified two optimal genome loci for transgene expression, which could provide new approach to modulate gene expression levels while retaining the indicated promoter and culture conditions.

  17. Optimized Assembly of a Multifunctional RNA-Protein Nanostructure in a Cell-Free Gene Expression System.

    PubMed

    Schwarz-Schilling, Matthaeus; Dupin, Aurore; Chizzolini, Fabio; Krishnan, Swati; Mansy, Sheref S; Simmel, Friedrich C

    2018-04-11

    Molecular complexes composed of RNA molecules and proteins are promising multifunctional nanostructures for a wide variety of applications in biological cells or in artificial cellular systems. In this study, we systematically address some of the challenges associated with the expression and assembly of such hybrid structures using cell-free gene expression systems. As a model structure, we investigated a pRNA-derived RNA scaffold functionalized with four distinct aptamers, three of which bind to proteins, streptavidin and two fluorescent proteins, while one binds the small molecule dye malachite green (MG). Using MG fluorescence and Förster resonance energy transfer (FRET) between the RNA-scaffolded proteins, we assess critical assembly parameters such as chemical stability, binding efficiency, and also resource sharing effects within the reaction compartment. We then optimize simultaneous expression and coassembly of the RNA-protein nanostructure within a single-compartment cell-free gene expression system. We demonstrate expression and assembly of the multicomponent nanostructures inside of emulsion droplets and their aptamer-mediated localization onto streptavidin-coated substrates, plus the successful assembly of the hybrid structures inside of bacterial cells.

  18. In silico identification of genetically attenuated vaccine candidate genes for Plasmodium liver stage.

    PubMed

    Kumar, Hirdesh; Frischknecht, Friedrich; Mair, Gunnar R; Gomes, James

    2015-12-01

    Genetically attenuated parasites (GAPs) that lack genes essential for the liver stage of the malaria parasite, and therefore cause developmental arrest, have been developed as live vaccines in rodent malaria models and recently been tested in humans. The genes targeted for deletion were often identified by trial and error. Here we present a systematic gene - protein and transcript - expression analyses of several Plasmodium species with the aim to identify candidate genes for the generation of novel GAPs. With a lack of liver stage expression data for human malaria parasites, we used data available for liver stage development of Plasmodium yoelii, a rodent malaria model, to identify proteins expressed in the liver stage but absent from blood stage parasites. An orthology-based search was then employed to identify orthologous proteins in the human malaria parasite Plasmodium falciparum resulting in a total of 310 genes expressed in the liver stage but lacking evidence of protein expression in blood stage parasites. Among these 310 possible GAP candidates, we further studied Plasmodium liver stage proteins by phyletic distribution and functional domain analyses and shortlisted twenty GAP-candidates; these are: fabB/F, fabI, arp, 3 genes encoding subunits of the PDH complex, dnaJ, urm1, rS5, ancp, mcp, arh, gk, lisp2, valS, palm, and four conserved Plasmodium proteins of unknown function. Parasites lacking one or several of these genes might yield new attenuated malaria parasites for experimental vaccination studies. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies.

    PubMed

    Martini, Paolo; Risso, Davide; Sales, Gabriele; Romualdi, Chiara; Lanfranchi, Gerolamo; Cagnin, Stefano

    2011-04-11

    In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while, Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes. In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets. We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced region as an a priori step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach). In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not identify, while the meta-analysis approach on muscular disease discriminates between related pathologies and correlates similar ones from different studies. STEPath is a new method that integrates gene expression profiles, genomic co-expressed regions and the information about the biological function of genes. The usage of the STEPath-computed gene set scores overcomes batch effects in the meta-analysis approaches allowing the direct comparison of different pathologies and different studies on a gene set activation level.

  20. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava

    PubMed Central

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  1. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

    PubMed

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.

  2. Systematic gene tagging using CRISPR/Cas9 in human stem cells to illuminate cell organization

    PubMed Central

    Roberts, Brock; Haupt, Amanda; Tucker, Andrew; Grancharova, Tanya; Arakaki, Joy; Fuqua, Margaret A.; Nelson, Angelique; Hookway, Caroline; Ludmann, Susan A.; Mueller, Irina A.; Yang, Ruian; Horwitz, Rick; Rafelski, Susanne M.; Gunawardane, Ruwanthi N.

    2017-01-01

    We present a CRISPR/Cas9 genome-editing strategy to systematically tag endogenous proteins with fluorescent tags in human induced pluripotent stem cells (hiPSC). To date, we have generated multiple hiPSC lines with monoallelic green fluorescent protein tags labeling 10 proteins representing major cellular structures. The tagged proteins include alpha tubulin, beta actin, desmoplakin, fibrillarin, nuclear lamin B1, nonmuscle myosin heavy chain IIB, paxillin, Sec61 beta, tight junction protein ZO1, and Tom20. Our genome-editing methodology using Cas9/crRNA ribonuclear protein and donor plasmid coelectroporation, followed by fluorescence-based enrichment of edited cells, typically resulted in <0.1–4% homology-directed repair (HDR). Twenty-five percent of clones generated from each edited population were precisely edited. Furthermore, 92% (36/39) of expanded clonal lines displayed robust morphology, genomic stability, expression and localization of the tagged protein to the appropriate subcellular structure, pluripotency-marker expression, and multilineage differentiation. It is our conclusion that, if cell lines are confirmed to harbor an appropriate gene edit, pluripotency, differentiation potential, and genomic stability are typically maintained during the clonal line–generation process. The data described here reveal general trends that emerged from this systematic gene-tagging approach. Final clonal lines corresponding to each of the 10 cellular structures are now available to the research community. PMID:28814507

  3. Epigenetic mechanisms of nutrient-induced modulation of gene expression and cellular functions

    USDA-ARS?s Scientific Manuscript database

    Utilizing next-generation sequencing technology in combination with chromatin immunoprecipitation (ChIP) technology, our study provides systematic and novel insights into the relationships between nutrition and epigenetics. One paradigmatic example of nutrient-epigenetic-phenotype relationship is th...

  4. Unstable genomes elevate transcriptome dynamics

    PubMed Central

    Stevens, Joshua B.; Liu, Guo; Abdallah, Batoul Y.; Horne, Steven D.; Ye, Karen J.; Bremer, Steven W.; Ye, Christine J.; Krawetz, Stephen A.; Heng, Henry H.

    2015-01-01

    The challenge of identifying common expression signatures in cancer is well known, however the reason behind this is largely unclear. Traditionally variation in expression signatures has been attributed to technological problems, however recent evidence suggests that chromosome instability (CIN) and resultant karyotypic heterogeneity may be a large contributing factor. Using a well-defined model of immortalization, we systematically compared the pattern of genome alteration and expression dynamics during somatic evolution. Co-measurement of global gene expression and karyotypic alteration throughout the immortalization process reveals that karyotype changes influence gene expression as major structural and numerical karyotypic alterations result in large gene expression deviation. Replicate samples from stages with stable genomes are more similar to each other than are replicate samples with karyotypic heterogeneity. Karyotypic and gene expression change during immortalization is dynamic as each stage of progression has a unique expression pattern. This was further verified by comparing global expression in two replicates grown in one flask with known karyotypes. Replicates with higher karyotypic instability were found to be less similar than replicates with stable karyotypes. This data illustrates the karyotype, transcriptome, and transcriptome determined pathways are in constant flux during somatic cellular evolution (particularly during the macroevolutionary phase) and this flux is an inextricable feature of CIN and essential for cancer formation. The findings presented here underscore the importance of understanding the evolutionary process of cancer in order to design improved treatment modalities. PMID:24122714

  5. Evaluation of Reference Genes for Quantitative Real-Time PCR in Oil Palm Elite Planting Materials Propagated by Tissue Culture

    PubMed Central

    Chan, Pek-Lan; Rose, Ray J.; Abdul Murad, Abdul Munir; Zainal, Zamri; Leslie Low, Eng-Ti; Ooi, Leslie Cheng-Li; Ooi, Siew-Eng; Yahya, Suzaini; Singh, Rajinder

    2014-01-01

    Background The somatic embryogenesis tissue culture process has been utilized to propagate high yielding oil palm. Due to the low callogenesis and embryogenesis rates, molecular studies were initiated to identify genes regulating the process, and their expression levels are usually quantified using reverse transcription quantitative real-time PCR (RT-qPCR). With the recent release of oil palm genome sequences, it is crucial to establish a proper strategy for gene analysis using RT-qPCR. Selection of the most suitable reference genes should be performed for accurate quantification of gene expression levels. Results In this study, eight candidate reference genes selected from cDNA microarray study and literature review were evaluated comprehensively across 26 tissue culture samples using RT-qPCR. These samples were collected from two tissue culture lines and media treatments, which consisted of leaf explants cultures, callus and embryoids from consecutive developmental stages. Three statistical algorithms (geNorm, NormFinder and BestKeeper) confirmed that the expression stability of novel reference genes (pOP-EA01332, PD00380 and PD00569) outperformed classical housekeeping genes (GAPDH, NAD5, TUBULIN, UBIQUITIN and ACTIN). PD00380 and PD00569 were identified as the most stably expressed genes in total samples, MA2 and MA8 tissue culture lines. Their applicability to validate the expression profiles of a putative ethylene-responsive transcription factor 3-like gene demonstrated the importance of using the geometric mean of two genes for normalization. Conclusions Systematic selection of the most stably expressed reference genes for RT-qPCR was established in oil palm tissue culture samples. PD00380 and PD00569 were selected for accurate and reliable normalization of gene expression data from RT-qPCR. These data will be valuable to the research associated with the tissue culture process. Also, the method described here will facilitate the selection of appropriate reference genes in other oil palm tissues and in the expression profiling of genes relating to yield, biotic and abiotic stresses. PMID:24927412

  6. Selection of Suitable Reference Genes for RT-qPCR Normalization under Abiotic Stresses and Hormone Stimulation in Persimmon (Diospyros kaki Thunb)

    PubMed Central

    Wang, Peihong; Xiong, Aisheng; Gao, Zhihong; Yu, Xinyi; Li, Man; Hou, Yingjun; Sun, Chao; Qu, Shenchun

    2016-01-01

    The success of quantitative real-time reverse transcription polymerase chain reaction (RT-qPCR) to quantify gene expression depends on the stability of the reference genes used for data normalization. To date, systematic screening for reference genes in persimmon (Diospyros kaki Thunb) has never been reported. In this study, 13 candidate reference genes were cloned from 'Nantongxiaofangshi' using information available in the transcriptome database. Their expression stability was assessed by geNorm and NormFinder algorithms under abiotic stress and hormone stimulation. Our results showed that the most suitable reference genes across all samples were UBC and GAPDH, and not the commonly used persimmon reference gene ACT. In addition, UBC combined with RPII or TUA were found to be appropriate for the "abiotic stress" group and α-TUB combined with PP2A were found to be appropriate for the "hormone stimuli" group. For further validation, the transcript level of the DkDREB2C homologue under heat stress was studied with the selected genes (CYP, GAPDH, TUA, UBC, α-TUB, and EF1-α). The results suggested that it is necessary to choose appropriate reference genes according to the test materials or experimental conditions. Our study will be useful for future studies on gene expression in persimmon. PMID:27513755

  7. Tcof1-Related Molecular Networks in Treacher Collins Syndrome.

    PubMed

    Dai, Jiewen; Si, Jiawen; Wang, Minjiao; Huang, Li; Fang, Bing; Shi, Jun; Wang, Xudong; Shen, Guofang

    2016-09-01

    Treacher Collins syndrome (TCS) is a rare, autosomal-dominant disorder characterized by craniofacial deformities, and is primarily caused by mutations in the Tcof1 gene. This article was aimed to perform a comprehensive literature review and systematic bioinformatic analysis of Tcof1-related molecular networks in TCS. First, the up- and down-regulated genes in Tcof1 heterozygous haploinsufficient mutant mice embryos and Tcof1 knockdown and Tcof1 over-expressed neuroblastoma N1E-115 cells were obtained from the Gene Expression Omnibus database. The GeneDecks database was used to calculate the 500 genes most closely related to Tcof1. Then, the relationships between 4 gene sets (a predicted set and sets comparing the wildtype with the 3 Gene Expression Omnibus datasets) were analyzed using the DAVID, GeneMANIA and STRING databases. The analysis results showed that the Tcof1-related genes were enriched in various biological processes, including cell proliferation, apoptosis, cell cycle, differentiation, and migration. They were also enriched in several signaling pathways, such as the ribosome, p53, cell cycle, and WNT signaling pathways. Additionally, these genes clearly had direct or indirect interactions with Tcof1 and between each other. Literature review and bioinformatic analysis finds imply that special attention should be given to these pathways, as they may offer target points for TCS therapies.

  8. Genome-Wide Identification, Characterization and Expression Analysis of the Chalcone Synthase Family in Maize

    PubMed Central

    Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang

    2016-01-01

    Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. Our research was conducted on comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression analysis. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478

  9. Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues.

    PubMed

    Wheeler, Heather E; Shah, Kaanan P; Brenner, Jonathon; Garcia, Tzintzuni; Aquino-Michaels, Keston; Cox, Nancy J; Nicolae, Dan L; Im, Hae Kyung

    2016-11-01

    Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR < 0.1) in the DGN whole blood cohort. However, current sample sizes (n ≤ 922) do not allow us to compute distal h2. Bayesian Sparse Linear Mixed Model (BSLMM) analysis provides strong evidence that the genetic contribution to local expression traits is dominated by a handful of genetic variants rather than by the collective contribution of a large number of variants each of modest size. In other words, the local architecture of gene expression traits is sparse rather than polygenic across all 40 tissues (from DGN and GTEx) examined. This result is confirmed by the sparsity of optimal performing gene expression predictors via elastic net modeling. To further explore the tissue context specificity, we decompose the expression traits into cross-tissue and tissue-specific components using a novel Orthogonal Tissue Decomposition (OTD) approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan).

  10. Integrative analysis of DNA methylation and gene expression data identifies EPAS1 as a key regulator of COPD.

    PubMed

    Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Foronjy, Robert F; Feronjy, Robert; Spira, Avrum; Schadt, Eric E; Powell, Charles A; Zhu, Jun

    2015-01-01

    Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a 'causal' role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology.

  11. Age-related regulation of genes: slow homeostatic changes and age-dimension technology

    NASA Astrophysics Data System (ADS)

    Kurachi, Kotoku; Zhang, Kezhong; Huo, Jeffrey; Ameri, Afshin; Kuwahara, Mitsuhiro; Fontaine, Jean-Marc; Yamamoto, Kei; Kurachi, Sumiko

    2002-11-01

    Through systematic studies of pro- and anti-blood coagulation factors, we have determined molecular mechanisms involving two genetic elements, age-related stability element (ASE), GAGGAAG and age-related increase element (AIE), a unique stretch of dinucleotide repeats (AIE). ASE and AIE are essential for age-related patterns of stable and increased gene expression patterns, respectively. Such age-related gene regulatory mechanisms are also critical for explaining homeostasis in various physiological reactions as well as slow homeostatic changes in them. The age-related increase expression of the human factor IX (hFIX) gene requires the presence of both ASE and AIE, which apparently function additively. The anti-coagulant factor protein C (hPC) gene uses an ASE (CAGGAG) to produce age-related stable expression. Both ASE sequences (G/CAGAAG) share consensus sequence of the transcriptional factor PEA-3 element. No other similar sequences, including another PEA-3 consensus sequence, GAGGATG, function in conferring age-related gene regulation. The age-regulatory mechanisms involving ASE and AIE apparently function universally with different genes and across different animal species. These findings have led us to develop a new field of research and applications, which we named “age-dimension technology (ADT)”. ADT has exciting potential for modifying age-related expression of genes as well as associated physiological processes, and developing novel, more effective prophylaxis or treatments for age-related diseases.

  12. Mammalian polycistronic mRNAs and disease

    PubMed Central

    Karginov, Timofey A.; Hejazi Pastor, Daniel Parviz; Semler, Bert L.; Gomez, Christopher M.

    2016-01-01

    Our understanding of gene expression has come far since the “one-gene one-polypeptide” hypothesis proposed by Beadle and Tatum. This review addresses the gradual recognition that a growing number of polycistronic genes, originally discovered in viruses, are being identified within the mammalian genome, and that these may provide new insights into disease mechanisms and treatment. We have carried out a systematic literature review identifying 13 mammalian genes for which there is evidence for polycistronic expression via translation through an Internal Ribosome Entry Site (IRES). Although the canonical mechanism of translation initiation has been studied extensively, this review highlights a process of non-canonical translation, IRES-mediated translation, that is a growing source of understanding complex inheritance, elucidation of disease mechanisms, and discovery of novel therapeutic targets. Identification of additional polycistronic genes may provide new insights into disease therapy and allow for new discoveries of translational and disease mechanisms. PMID:28012572

  13. Sex Bias and Maternal Contribution to Gene Expression Divergence in Drosophila Blastoderm Embryos

    PubMed Central

    Paris, Mathilde; Villalta, Jacqueline E.; Eisen, Michael B.; Lott, Susan E.

    2015-01-01

    Early embryogenesis is a unique developmental stage where genetic control of development is handed off from mother to zygote. Yet the contribution of this transition to the evolution of gene expression is poorly understood. Here we study two aspects of gene expression specific to early embryogenesis in Drosophila: sex-biased gene expression prior to the onset of canonical X chromosomal dosage compensation, and the contribution of maternally supplied mRNAs. We sequenced mRNAs from individual unfertilized eggs and precisely staged and sexed blastoderm embryos, and compared levels between D. melanogaster, D. yakuba, D. pseudoobscura and D. virilis. First, we find that mRNA content is highly conserved for a given stage and that studies relying on pooled embryos likely systematically overstate the degree of gene expression divergence. Unlike studies done on larvae and adults where most species show a larger proportion of genes with male-biased expression, we find that transcripts in Drosophila embryos are largely female-biased in all species, likely due to incomplete dosage compensation prior to the activation of the canonical dosage compensation mechanism. The divergence of sex-biased gene expression across species is observed to be often due to lineage-specific decrease of expression; the most drastic example of which is the overall reduction of male expression from the neo-X chromosome in D. pseudoobscura, leading to a pervasive female-bias on this chromosome. We see no evidence for a faster evolution of expression on the X chromosome in embryos (no “faster-X” effect), unlike in adults, and contrary to a previous study on pooled non-sexed embryos. Finally, we find that most genes are conserved in regard to their maternal or zygotic origin of transcription, and present evidence that differences in maternal contribution to the blastoderm transcript pool may be due to species-specific divergence of transcript degradation rates. PMID:26485701

  14. Genome-Wide Identification and Analysis of Biotic and Abiotic Stress Regulation of C4 Photosynthetic Pathway Genes in Rice.

    PubMed

    Muthusamy, Senthilkumar K; Lenka, Sangram K; Katiyar, Amit; Chinnusamy, Viswanathan; Singh, Ashok K; Bansal, Kailash C

    2018-06-19

    Photosynthetic fixation of CO 2 is more efficient in C 4 than in C 3 plants. Rice is a C 3 plant and a potential target for genetic engineering of the C 4 pathway. It is known that genes encoding C 4 enzymes are present in C 3 plants. However, no systematic analysis has been conducted to determine if these C 4 gene family members are expressed in diverse rice genotypes. In this study, we identified 15 genes belonging to the five C 4 gene families in rice genome through BLAST search using known maize C 4 photosynthetic pathway genes. Phylogenetic relationship of rice C 4 photosynthetic pathway genes and their isoforms with other grass genomes (Brachypodium, maize, Sorghum and Setaria), showed that these genes were highly conserved across grass genomes. Spatiotemporal, hormone, and abiotic stress specific expression pattern of the identified genes revealed constitutive as well as inductive responses of the C 4 photosynthetic pathway in different tissues and developmental stages of rice. Expression levels of C 4 specific gene family members in flag leaf during tillering stage were quantitatively analyzed in five rice genotypes covering three species, viz. Oryza sativa, ssp. japonica (cv. Nipponbare), Oryza sativa, ssp. indica (cv IR64, Swarna), and two wild species Oryza barthii and Oryza australiensis. The results showed that all the identified genes expressed in rice and exhibited differential expression pattern during different growth stages, and in response to biotic and abiotic stress conditions and hormone treatments. Our study concludes that C 4 photosynthetic pathway genes present in rice play a crucial role in stress regulation and might act as targets for C 4 pathway engineering via CRISPR-mediated breeding.

  15. Aldehyde Dehydrogenase Gene Superfamily in Populus: Organization and Expression Divergence between Paralogous Gene Pairs.

    PubMed

    Tian, Feng-Xia; Zang, Jian-Lei; Wang, Tan; Xie, Yu-Li; Zhang, Jin; Hu, Jian-Jun

    2015-01-01

    Aldehyde dehydrogenases (ALDHs) constitute a superfamily of NAD(P)+-dependent enzymes that catalyze the irreversible oxidation of a wide range of reactive aldehydes to their corresponding nontoxic carboxylic acids. ALDHs have been studied in many organisms from bacteria to mammals; however, no systematic analyses incorporating genome organization, gene structure, expression profiles, and cis-acting elements have been conducted in the model tree species Populus trichocarpa thus far. In this study, a comprehensive analysis of the Populus ALDH gene superfamily was performed. A total of 26 Populus ALDH genes were found to be distributed across 12 chromosomes. Genomic organization analysis indicated that purifying selection may have played a pivotal role in the retention and maintenance of PtALDH gene families. The exon-intron organizations of PtALDHs were highly conserved within the same family, suggesting that the members of the same family also may have conserved functionalities. Microarray data and qRT-PCR analysis indicated that most PtALDHs had distinct tissue-specific expression patterns. The specificity of cis-acting elements in the promoter regions of the PtALDHs and the divergence of expression patterns between nine paralogous PtALDH gene pairs suggested that gene duplications may have freed the duplicate genes from the functional constraints. The expression levels of some ALDHs were up- or down-regulated by various abiotic stresses, implying that the products of these genes may be involved in the adaptation of Populus to abiotic stresses. Overall, the data obtained from our investigation contribute to a better understanding of the complexity of the Populus ALDH gene superfamily and provide insights into the function and evolution of ALDH gene families in vascular plants.

  16. NETWORK ASSISTED ANALYSIS TO REVEAL THE GENETIC BASIS OF AUTISM1

    PubMed Central

    Liu, Li; Lei, Jing; Roeder, Kathryn

    2016-01-01

    While studies show that autism is highly heritable, the nature of the genetic basis of this disorder remains illusive. Based on the idea that highly correlated genes are functionally interrelated and more likely to affect risk, we develop a novel statistical tool to find more potentially autism risk genes by combining the genetic association scores with gene co-expression in specific brain regions and periods of development. The gene dependence network is estimated using a novel partial neighborhood selection (PNS) algorithm, where node specific properties are incorporated into network estimation for improved statistical and computational efficiency. Then we adopt a hidden Markov random field (HMRF) model to combine the estimated network and the genetic association scores in a systematic manner. The proposed modeling framework can be naturally extended to incorporate additional structural information concerning the dependence between genes. Using currently available genetic association data from whole exome sequencing studies and brain gene expression levels, the proposed algorithm successfully identified 333 genes that plausibly affect autism risk. PMID:27134692

  17. Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress

    PubMed Central

    Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

    2017-01-01

    Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species. PMID:28429727

  18. Gene Expression Profiling Reveals a Massive, Aneuploidy-Dependent Transcriptional Deregulation and Distinct Differences between Lymph Node–Negative and Lymph Node–Positive Colon Carcinomas

    PubMed Central

    Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B.; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J.; Ghadimi, B. Michael; Ried, Thomas

    2016-01-01

    To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e–7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node–negative and lymph node–positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/β-catenin signaling cascade, suggesting similar pathogenic pathways. PMID:17210682

  19. Gene expression profiling reveals a massive, aneuploidy-dependent transcriptional deregulation and distinct differences between lymph node-negative and lymph node-positive colon carcinomas.

    PubMed

    Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J; Ghadimi, B Michael; Ried, Thomas

    2007-01-01

    To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e-7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node-negative and lymph node-positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/beta-catenin signaling cascade, suggesting similar pathogenic pathways.

  20. Predictive model for inflammation grades of chronic hepatitis B: Large-scale analysis of clinical parameters and gene expressions.

    PubMed

    Zhou, Weichen; Ma, Yanyun; Zhang, Jun; Hu, Jingyi; Zhang, Menghan; Wang, Yi; Li, Yi; Wu, Lijun; Pan, Yida; Zhang, Yitong; Zhang, Xiaonan; Zhang, Xinxin; Zhang, Zhanqing; Zhang, Jiming; Li, Hai; Lu, Lungen; Jin, Li; Wang, Jiucun; Yuan, Zhenghong; Liu, Jie

    2017-11-01

    Liver biopsy is the gold standard to assess pathological features (eg inflammation grades) for hepatitis B virus-infected patients although it is invasive and traumatic; meanwhile, several gene profiles of chronic hepatitis B (CHB) have been separately described in relatively small hepatitis B virus (HBV)-infected samples. We aimed to analyse correlations among inflammation grades, gene expressions and clinical parameters (serum alanine amino transaminase, aspartate amino transaminase and HBV-DNA) in large-scale CHB samples and to predict inflammation grades by using clinical parameters and/or gene expressions. We analysed gene expressions with three clinical parameters in 122 CHB samples by an improved regression model. Principal component analysis and machine-learning methods including Random Forest, K-nearest neighbour and support vector machine were used for analysis and further diagnosis models. Six normal samples were conducted to validate the predictive model. Significant genes related to clinical parameters were found enriching in the immune system, interferon-stimulated, regulation of cytokine production, anti-apoptosis, and etc. A panel of these genes with clinical parameters can effectively predict binary classifications of inflammation grade (area under the ROC curve [AUC]: 0.88, 95% confidence interval [CI]: 0.77-0.93), validated by normal samples. A panel with only clinical parameters was also valuable (AUC: 0.78, 95% CI: 0.65-0.86), indicating that liquid biopsy method for detecting the pathology of CHB is possible. This is the first study to systematically elucidate the relationships among gene expressions, clinical parameters and pathological inflammation grades in CHB, and to build models predicting inflammation grades by gene expressions and/or clinical parameters as well. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  1. Isolation, structural analysis, and expression characteristics of the maize (Zea mays L.) hexokinase gene family.

    PubMed

    Zhang, Zhongbao; Zhang, Jiewei; Chen, Yajuan; Li, Ruifen; Wang, Hongzhi; Ding, Liping; Wei, Jianhua

    2014-09-01

    Hexokinases (HXKs, EC 2.7.1.1) play important roles in metabolism, glucose (Glc) signaling, and phosphorylation of Glc and fructose and are ubiquitous in all organisms. Despite their physiological importance, the maize HXK (ZmHXK) genes have not been analyzed systematically. We isolated and characterized nine members of the ZmHXK gene family which were distributed on 3 of the 10 maize chromosomes. A multiple sequence alignment and motif analysis revealed that the maize ZmHXK proteins share three conserved domains. Phylogenetic analysis revealed that the ZmHXK family can be divided into four subfamilies. We identified putative cis-elements in the ZmHXK promoter sequences potentially involved in phytohormone and abiotic stress responses, sugar repression, light and circadian rhythm regulation, Ca(2+) responses, seed development and germination, and CO2-responsive transcriptional activation. To study the functions of maize HXK isoforms, we characterized the expression of the ZmHXK5 and ZmHXK6 genes, which are evolutionarily related to the OsHXK5 and OsHXK6 genes from rice. Analysis of tissue-specific expression patterns using quantitative real time-PCR showed that ZmHXK5 was highly expressed in tassels, while ZmHXK6 was expressed in both tassels and leaves. ZmHXK5 and ZmHXK6 expression levels were upregulated by phytohormones and by abiotic stress.

  2. Gene Expression in Wilms’ Tumor Mimics the Earliest Committed Stage in the Metanephric Mesenchymal-Epithelial Transition

    PubMed Central

    Li, Chi-Ming; Guo, Meirong; Borczuk, Alain; Powell, Charles A.; Wei, Michelle; Thaker, Harshwardhan M.; Friedman, Richard; Klein, Ulf; Tycko, Benjamin

    2002-01-01

    Wilms’ tumor (WT) has been considered a prototype for arrested cellular differentiation in cancer, but previous studies have relied on selected markers. We have now performed an unbiased survey of gene expression in WTs using oligonucleotide microarrays. Statistical criteria identified 357 genes as differentially expressed between WTs and fetal kidneys. This set contained 124 matches to genes on a microarray used by Stuart and colleagues (Stuart RO, Bush KT, Nigam SK: Changes in global gene expression patterns during development and maturation of the rat kidney. Proc Natl Acad Sci USA 2001, 98:5649–5654) to establish genes with stage-specific expression in the developing rat kidney. Mapping between the two data sets showed that WTs systematically overexpressed genes corresponding to the earliest stage of metanephric development, and underexpressed genes corresponding to later stages. Automated clustering identified a smaller group of 27 genes that were highly expressed in WTs compared to fetal kidney and heterologous tumor and normal tissues. This signature set was enriched in genes encoding transcription factors. Four of these, PAX2, EYA1, HBF2, and HOXA11, are essential for cell survival and proliferation in early metanephric development, whereas others, including SIX1, MOX1, and SALL2, are predicted to act at this stage. SIX1 and SALL2 proteins were expressed in the condensing mesenchyme in normal human fetal kidneys, but were absent (SIX1) or reduced (SALL2) in cells at other developmental stages. These data imply that the blastema in WTs has progressed to the committed stage in the mesenchymal-epithelial transition, where it is partially arrested in differentiation. The WT-signature set also contained the Wnt receptor FZD7, the tumor antigen PRAME, the imprinted gene NNAT and the metastasis-associated transcription factor E1AF. PMID:12057921

  3. Long Non-Coding RNAs (lncRNAs) of Sea Cucumber: Large-Scale Prediction, Expression Profiling, Non-Coding Network Construction, and lncRNA-microRNA-Gene Interaction Analysis of lncRNAs in Apostichopus japonicus and Holothuria glaberrima During LPS Challenge and Radial Organ Complex Regeneration.

    PubMed

    Mu, Chuang; Wang, Ruijia; Li, Tianqi; Li, Yuqiang; Tian, Meilin; Jiao, Wenqian; Huang, Xiaoting; Zhang, Lingling; Hu, Xiaoli; Wang, Shi; Bao, Zhenmin

    2016-08-01

    Long non-coding RNA (lncRNA) structurally resembles mRNA but cannot be translated into protein. Although the systematic identification and characterization of lncRNAs have been increasingly reported in model species, information concerning non-model species is still lacking. Here, we report the first systematic identification and characterization of lncRNAs in two sea cucumber species: (1) Apostichopus japonicus during lipopolysaccharide (LPS) challenge and in heathy tissues and (2) Holothuria glaberrima during radial organ complex regeneration, using RNA-seq datasets and bioinformatics analysis. We identified A. japonicus and H. glaberrima lncRNAs that were differentially expressed during LPS challenge and radial organ complex regeneration, respectively. Notably, the predicted lncRNA-microRNA-gene trinities revealed that, in addition to targeting protein-coding transcripts, miRNAs might also target lncRNAs, thereby participating in a potential novel layer of regulatory interactions among non-coding RNA classes in echinoderms. Furthermore, the constructed coding-non-coding network implied the potential involvement of lncRNA-gene interactions during the regulation of several important genes (e.g., Toll-like receptor 1 [TLR1] and transglutaminase-1 [TGM1]) in response to LPS challenge and radial organ complex regeneration in sea cucumbers. Overall, this pioneer systematic identification, annotation, and characterization of lncRNAs in echinoderm pave the way for similar studies and future genetic, genomic, and evolutionary research in non-model species.

  4. Insulators to improve expression of a 3(')IgH LCR-driven reporter gene in transgenic mouse models.

    PubMed

    Guglielmi, Laurence; Le Bert, Marc; Truffinet, Véronique; Cogné, Michel; Denizot, Yves

    2003-08-01

    A locus control region (LCR) containing four transcriptional enhancers lies downstream of the IgH chain locus. We studied transgenes carrying a 3(')IgH LCR-driven GFP reporter gene for expression and B cell differentiation stage specificity. We also compared transgenes that were or were not flanked by two copies of the beta-globin HS4 insulator, an element defined by its ability to protect transgenes from the influences of surrounding genes at the insertion site. Results indicate that insulators are instrumental in sustaining GFP expression in GFP-3(')LCR transgenic mice when they were included. Flow cytometry experiments reported a strictly B cell specific GFP expression from pre-B cells in bone marrow to mature B cells in spleen. Despite addition of 5(')HS4 insulators to the GFP-3(')LCR construct, complete transgene silencing occurred in some transgenic lines and was systematically observed in ageing animals from all lines.

  5. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    PubMed Central

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-napthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process. PMID:25870601

  6. Minimising Immunohistochemical False Negative ER Classification Using a Complementary 23 Gene Expression Signature of ER Status

    PubMed Central

    Li, Qiyuan; Eklund, Aron C.; Juul, Nicolai; Haibe-Kains, Benjamin; Workman, Christopher T.; Richardson, Andrea L.; Szallasi, Zoltan; Swanton, Charles

    2010-01-01

    Background Expression of the oestrogen receptor (ER) in breast cancer predicts benefit from endocrine therapy. Minimising the frequency of false negative ER status classification is essential to identify all patients with ER positive breast cancers who should be offered endocrine therapies in order to improve clinical outcome. In routine oncological practice ER status is determined by semi-quantitative methods such as immunohistochemistry (IHC) or other immunoassays in which the ER expression level is compared to an empirical threshold[1], [2]. The clinical relevance of gene expression-based ER subtypes as compared to IHC-based determination has not been systematically evaluated. Here we attempt to reduce the frequency of false negative ER status classification using two gene expression approaches and compare these methods to IHC based ER status in terms of predictive and prognostic concordance with clinical outcome. Methodology/Principal Findings Firstly, ER status was discriminated by fitting the bimodal expression of ESR1 to a mixed Gaussian model. The discriminative power of ESR1 suggested bimodal expression as an efficient way to stratify breast cancer; therefore we identified a set of genes whose expression was both strongly bimodal, mimicking ESR expression status, and highly expressed in breast epithelial cell lines, to derive a 23-gene ER expression signature-based classifier. We assessed our classifiers in seven published breast cancer cohorts by comparing the gene expression-based ER status to IHC-based ER status as a predictor of clinical outcome in both untreated and tamoxifen treated cohorts. In untreated breast cancer cohorts, the 23 gene signature-based ER status provided significantly improved prognostic power compared to IHC-based ER status (P = 0.006). In tamoxifen-treated cohorts, the 23 gene ER expression signature predicted clinical outcome (HR = 2.20, P = 0.00035). These complementary ER signature-based strategies estimated that between 15.1% and 21.8% patients of IHC-based negative ER status would be classified with ER positive breast cancer. Conclusion/Significance Expression-based ER status classification may complement IHC to minimise false negative ER status classification and optimise patient stratification for endocrine therapies. PMID:21152022

  7. Genome-wide Mapping Reveals Conservation of Promoter DNA Methylation Following Chicken Domestication

    PubMed Central

    Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

    2015-01-01

    It is well-known that environment influences DNA methylation, however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergo genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter region between the wild and domestic chicken breeds. Our data suggests a global preservation of DNA methylation between the wild and domestic chicken breeds in either a genome-wide or locus-specific scale in chick muscle tissues. PMID:25735894

  8. MicroRNA-181 promotes synaptogenesis and attenuates axonal outgrowth in cortical neurons

    PubMed Central

    Kos, Aron; Olde Loohuis, Nikkie; Meinhardt, Julia; van Bokhoven, Hans; Kaplan, Barry B; Martens, Gerard; Aschrafi, Armaz

    2016-01-01

    MicroRNAs (miRs) are non-coding gene transcripts abundantly expressed in both the developing and adult mammalian brain. They act as important modulators of complex gene regulatory networks during neuronal development and plasticity. miR-181c is highly abundant in cerebellar cortex and its expression is increased in autism patients as well as in an animal model of autism. To systematically identify putative targets of miR-181c, we repressed this miR in growing cortical neurons and found over 70 differentially expressed target genes using transcriptome profiling. Pathway analysis showed that the miR-181c-modulated genes converge on signaling cascades relevant to neurite and synapse developmental processes. To experimentally examine the significance of these data, we inhibited miR-181c during rat cortical neuronal maturation in vitro; this loss-of miR-181c function resulted in enhanced neurite sprouting and reduced synaptogenesis. Collectively, our findings suggest that miR-181c is a modulator of gene networks associated with cortical neuronal maturation. PMID:27017280

  9. Differential expression analysis of genes involved in high-temperature induced sex differentiation in Nile tilapia.

    PubMed

    Li, Chun Ge; Wang, Hui; Chen, Hong Ju; Zhao, Yan; Fu, Pei Sheng; Ji, Xiang Shan

    2014-01-01

    Nowadays, high temperature effects on the molecular pathways during sex differentiation in teleosts need to be deciphered. In this study, a systematic differential expression analysis of genes involved in high temperature-induced sex differentiation was done in the Nile tilapia gonad and brain. Our results showed that high temperature caused significant down-regulation of CYP19A1A in the gonad of both sexes in induction group, and FOXL2 in the ovary of the induction group. The expressions of GTHα, LHβ and ERα were also significantly down-regulated in the brain of both sexes in the induction and recovery groups. On the contrary, the expression of CYP11B2 was significantly up-regulated in the ovary, but not in the testis in both groups. Spearman rank correlation analysis showed that there are significant correlations between the expressions of CYP19A1A, FOXL2, or DMRT1 in the gonads and the expression of some genes in the brain. Another result in this study showed that high temperature up-regulated the expression level of DNMT1 in the testis of the induction group, and DNMT1 and DNMT3A in the female brain of both groups. The expression and correlation analysis of HSPs showed that high temperature action on tilapia HSPs might indirectly induce the expression changes of sex differentiation genes in the gonads. These findings provide new insights on TSD and suggest that sex differentiation related genes, heat shock proteins, and DNA methylation genes are new candidates for studying TSD in fish species. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. Genome-wide survey of Aux/IAA gene family members in potato (Solanum tuberosum): Identification, expression analysis, and evaluation of their roles in tuber development.

    PubMed

    Gao, Junpeng; Cao, Xiaoli; Shi, Shandang; Ma, Yuling; Wang, Kai; Liu, Shengjie; Chen, Dan; Chen, Qin; Ma, Haoli

    2016-03-04

    The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs were analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and in during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Selection of reference genes for RT-qPCR analysis in tumor tissues from male hepatocellular carcinoma patients with hepatitis B infection and cirrhosis.

    PubMed

    Liu, Shuang; Zhu, Pengfei; Zhang, Ling; Ding, Shanlong; Zheng, Sujun; Wang, Yang; Lu, Fengmin

    2013-01-01

    Reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) has been widely used to quantify relative gene expression because of the high specificity, sensitivity and accuracy of this technique. However, its reliability is strongly depends on the expression stability of reference gene used for data normalization. Therefore, identification of reliable and condition specific reference genes is critical for the success of RT-qPCR. Hepatitis B virus (HBV) infection, male gender and the presence of cirrhosis are widely recognized as the leading independent risk factors for the development of hepatocellular carcinoma (HCC). This study aimed to select reliable reference gene for RT-qPCR analysis in HCC patients with all of those risk factors. Six candidate reference genes were analyzed in 33 paired tumor and non-tumor tissues from untreated HCC patients. The genes expression stabilities were assessed by geNorm and NormFinder. C-terminal binding protein 1(CTBP1) was the most stable gene among the 6 candidate genes evaluated by both geNorm and NormFinder. The expression stability values were 0.08 for CTBP1 and UBC, 0.09 for HPRT1, 0.12 for HMBS, 0.14 for GAPDH and 0.18 for 18S with geNorm analysis. The stability values suggested by NormFinder software were CTBP1: 0.044, UBC: 0.063, HMBS: 0.072, HPRT1: 0.072, GAPDH: 0.098 and 18S rRNA: 0.161. This is the first systematic analysis which suggested CTBP1 as the highest expression-stable gene in human male HBV infection related-HCC with cirrhosis. We recommend CTBP1 as the best candidate reference gene when RT-qPCR was used to determine gene(s) expression in HCC. This may facilitate the relevant HBV related HCC studies in the future.

  12. Non-DBS DNA Repair Genes Regulate Radiation-induced Cytogenetic Damage Repair and Cell Cycle Progression

    NASA Technical Reports Server (NTRS)

    Zhang, Ye; Rohde, Larry H.; Emami, Kamal; Casey, Rachael; Wu, Honglu

    2008-01-01

    Changes of gene expression profile are one of the most important biological responses in living cells after ionizing radiation (IR) exposure. Although some studies have shown that genes up-regulated by IR may play important roles in DNA damage repair, the relationship between the regulation of gene expression by IR, particularly genes not known for their roles in DSB repair, and its impact on cytogenetic responses has not been systematically studied. In the present study, the expression of 25 genes selected on the basis of their transcriptional changes in response to IR was individually knocked down by transfection with small interfering RNA in human fibroblast cells. The purpose of this study is to identify new roles of these selected genes on regulating DSB repair and cell cycle progression , as measured in the micronuclei formation and chromosome aberration. In response to IR, the formation of MN was significantly increased by suppressed expression of 5 genes: Ku70 in the DSB repair pathway, XPA in the NER pathway, RPA1 in the MMR pathway, and RAD17 and RBBP8 in cell cycle control. Knocked-down expression of 4 genes (MRE11A, RAD51 in the DSB pathway, SESN1, and SUMO1) significantly inhibited cell cycle progression, possibly because of severe impairment of DNA damage repair. Furthermore, loss of XPA, P21, or MLH1 expression resulted in both significantly enhanced cell cycle progression and increased yields of chromosome aberrations, indicating that these gene products modulate both cell cycle control and DNA damage repair. Most of the 11 genes that affected cytogenetic responses are not known to have clear roles influencing DBS repair. Nine of these 11 genes were up-regulated in cells exposed to gamma radiation, suggesting that genes transcriptionally modulated by IR were critical to regulate the biological consequences after IR.

  13. Analysis of gene expression in a developmental context emphasizes distinct biological leitmotifs in human cancers

    PubMed Central

    Naxerova, Kamila; Bult, Carol J; Peaston, Anne; Fancher, Karen; Knowles, Barbara B; Kasif, Simon; Kohane, Isaac S

    2008-01-01

    Background In recent years, the molecular underpinnings of the long-observed resemblance between neoplastic and immature tissue have begun to emerge. Genome-wide transcriptional profiling has revealed similar gene expression signatures in several tumor types and early developmental stages of their tissue of origin. However, it remains unclear whether such a relationship is a universal feature of malignancy, whether heterogeneities exist in the developmental component of different tumor types and to which degree the resemblance between cancer and development is a tissue-specific phenomenon. Results We defined a developmental landscape by summarizing the main features of ten developmental time courses and projected gene expression from a variety of human tumor types onto this landscape. This comparison demonstrates a clear imprint of developmental gene expression in a wide range of tumors and with respect to different, even non-cognate developmental backgrounds. Our analysis reveals three classes of cancers with developmentally distinct transcriptional patterns. We characterize the biological processes dominating these classes and validate the class distinction with respect to a new time series of murine embryonic lung development. Finally, we identify a set of genes that are upregulated in most cancers and we show that this signature is active in early development. Conclusion This systematic and quantitative overview of the relationship between the neoplastic and developmental transcriptome spanning dozens of tissues provides a reliable outline of global trends in cancer gene expression, reveals potentially clinically relevant differences in the gene expression of different cancer types and represents a reference framework for interpretation of smaller-scale functional studies. PMID:18611264

  14. Gene expression during skeletal development in three osteopetrotic rat mutations. Evidence for osteoblast abnormalities.

    PubMed

    Shalhoub, V; Jackson, M E; Lian, J B; Stein, G S; Marks, S C

    1991-05-25

    Osteopetrosis is a group of metabolic bone diseases characterized by reductions in osteoclast development and/or function. These aspects of osteoclast biology are known to be influenced by osteoblasts and their products. To ascertain whether osteoblast dysfunction contributes to aberrations in the structural and functional properties of osteoclasts in osteopetrosis, we systematically examined gene expression as reflected by mRNA levels for a series of cell growth- and tissue-related genes associated with the osteoblast phenotype during skeletal development in normal and mutant rats of three different osteopetrotic stocks. We show that the methods used permit the reproducible isolation of undegraded total cellular RNA from bone and that mRNA levels can be reliably quantitated in these preparations. Each osteopetrotic mutation exhibits a distinct aberrant pattern of osteoblast gene expression that may be correlated with and explain some abnormalities in extracellular matrix composition, mineralization, osteoclast development, and effects of elevated serum levels of 1 alpha,25-dihydroxyvitamin D3, depending upon the mutation. Normal rats show minor variations in gene expression that reflect the genetic background (stock). This, the first comprehensive molecular analysis of osteoblast gene expression in osteopetrosis, suggests that some osteopetroses, particularly in the toothless rat, are associated with and potentially related to mechanisms associated with aberrations in osteoblast function. More generally, the present studies demonstrate alterations in gene expression as reflected by mRNA levels that are associated with functional properties of the osteoblast, particularly those contributing to the recruitment and/or differentiation of osteoclasts, thereby influencing skeletal modeling.

  15. Systems analysis of cis-regulatory motifs in C4 photosynthesis genes using maize and rice leaf transcriptomic data during a process of de-etiolation.

    PubMed

    Xu, Jiajia; Bräutigam, Andrea; Weber, Andreas P M; Zhu, Xin-Guang

    2016-09-01

    Identification of potential cis-regulatory motifs controlling the development of C4 photosynthesis is a major focus of current research. In this study, we used time-series RNA-seq data collected from etiolated maize and rice leaf tissues sampled during a de-etiolation process to systematically characterize the expression patterns of C4-related genes and to further identify potential cis elements in five different genomic regions (i.e. promoter, 5'UTR, 3'UTR, intron, and coding sequence) of C4 orthologous genes. The results demonstrate that although most of the C4 genes show similar expression patterns, a number of them, including chloroplast dicarboxylate transporter 1, aspartate aminotransferase, and triose phosphate transporter, show shifted expression patterns compared with their C3 counterparts. A number of conserved short DNA motifs between maize C4 genes and their rice orthologous genes were identified not only in the promoter, 5'UTR, 3'UTR, and coding sequences, but also in the introns of core C4 genes. We also identified cis-regulatory motifs that exist in maize C4 genes and also in genes showing similar expression patterns as maize C4 genes but that do not exist in rice C3 orthologs, suggesting a possible recruitment of pre-existing cis-elements from genes unrelated to C4 photosynthesis into C4 photosynthesis genes during C4 evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  16. Identification of Importin 8 (IPO8) as the most accurate reference gene for the clinicopathological analysis of lung specimens

    PubMed Central

    Nguewa, Paul A; Agorreta, Jackeline; Blanco, David; Lozano, Maria Dolores; Gomez-Roman, Javier; Sanchez, Blas A; Valles, Iñaki; Pajares, Maria J; Pio, Ruben; Rodriguez, Maria Jose; Montuenga, Luis M; Calvo, Alfonso

    2008-01-01

    Background The accurate normalization of differentially expressed genes in lung cancer is essential for the identification of novel therapeutic targets and biomarkers by real time RT-PCR and microarrays. Although classical "housekeeping" genes, such as GAPDH, HPRT1, and beta-actin have been widely used in the past, their accuracy as reference genes for lung tissues has not been proven. Results We have conducted a thorough analysis of a panel of 16 candidate reference genes for lung specimens and lung cell lines. Gene expression was measured by quantitative real time RT-PCR and expression stability was analyzed with the softwares GeNorm and NormFinder, mean of |ΔCt| (= |Ct Normal-Ct tumor|) ± SEM, and correlation coefficients among genes. Systematic comparison between candidates led us to the identification of a subset of suitable reference genes for clinical samples: IPO8, ACTB, POLR2A, 18S, and PPIA. Further analysis showed that IPO8 had a very low mean of |ΔCt| (0.70 ± 0.09), with no statistically significant differences between normal and malignant samples and with excellent expression stability. Conclusion Our data show that IPO8 is the most accurate reference gene for clinical lung specimens. In addition, we demonstrate that the commonly used genes GAPDH and HPRT1 are inappropriate to normalize data derived from lung biopsies, although they are suitable as reference genes for lung cell lines. We thus propose IPO8 as a novel reference gene for lung cancer samples. PMID:19014639

  17. Domestication-driven Gossypium profilin 1 (GhPRF1) gene transduces early flowering phenotype in tobacco by spatial alteration of apical/floral-meristem related gene expression.

    PubMed

    Pandey, Dhananjay K; Chaudhary, Bhupendra

    2016-05-13

    Plant profilin genes encode core cell-wall structural proteins and are evidenced for their up-regulation under cotton domestication. Notwithstanding striking discoveries in the genetics of cell-wall organization in plants, little is explicit about the manner in which profilin-mediated molecular interplay and corresponding networks are altered, especially during cellular signalling of apical meristem determinacy and flower development. Here we show that the ectopic expression of GhPRF1 gene in tobacco resulted in the hyperactivation of apical meristem and early flowering phenotype with increased flower number in comparison to the control plants. Spatial expression alteration in CLV1, a key meristem-determinacy gene, is induced by the GhPRF1 overexpression in a WUS-dependent manner and mediates cell signalling to promote flowering. But no such expression alterations are recorded in the GhPRF1-RNAi lines. The GhPRF1 transduces key positive flowering regulator AP1 gene via coordinated expression of FT4, SOC1, FLC1 and FT1 genes involved in the apical-to-floral meristem signalling cascade which is consistent with our in silico profilin interaction data. Remarkably, these positive and negative flowering regulators are spatially controlled by the Actin-Related Protein (ARP) genes, specifically ARP4 and ARP6 in proximate association with profilins. This study provides a novel and systematic link between GhPRF1 gene expression and the flower primordium initiation via up-regulation of the ARP genes, and an insight into the functional characterization of GhPRF1 gene acting upstream to the flowering mechanism. Also, the transgenic plants expressing GhPRF1 gene show an increase in the plant height, internode length, leaf size and plant vigor. Overexpression of GhPRF1 gene induced early and increased flowering in tobacco with enhanced plant vigor. During apical meristem determinacy and flower development, the GhPRF1 gene directly influences key flowering regulators through ARP-genes, indicating for its role upstream in the apical-to-floral meristem signalling cascade.

  18. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities

    PubMed Central

    Fang, Xin; Sastry, Anand; Mih, Nathan; Kim, Donghyuk; Tan, Justin; Lloyd, Colton J.; Gao, Ye; Yang, Laurence; Palsson, Bernhard O.

    2017-01-01

    Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types. PMID:28874552

  19. Identification and expression profiling analysis of TCP family genes involved in growth and development in maize.

    PubMed

    Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu

    2017-10-01

    The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring bHLH structure, which is implicated in DNA binding and protein-protein interactions and known as the TCP domain. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants, however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize ( Z . mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. What's more, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes represses the growth of axillary organs and enables the formation of female inflorescences. Altogether, this study presents a thorough overview of TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in development stage in plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.

  20. Identification of miRNA-Mediated Core Gene Module for Glioma Patient Prediction by Integrating High-Throughput miRNA, mRNA Expression and Pathway Structure

    PubMed Central

    Han, Junwei; Shang, Desi; Zhang, Yunpeng; Zhang, Wei; Yao, Qianlan; Han, Lei; Xu, Yanjun; Yan, Wei; Bao, Zhaoshi; You, Gan; Jiang, Tao; Kang, Chunsheng; Li, Xia

    2014-01-01

    The prognosis of glioma patients is usually poor, especially in patients with glioblastoma (World Health Organization (WHO) grade IV). The regulatory functions of microRNA (miRNA) on genes have important implications in glioma cell survival. However, there are not many studies that have investigated glioma survival by integrating miRNAs and genes while also considering pathway structure. In this study, we performed sample-matched miRNA and mRNA expression profilings to systematically analyze glioma patient survival. During this analytical process, we developed pathway-based random walk to identify a glioma core miRNA-gene module, simultaneously considering pathway structure information and multi-level involvement of miRNAs and genes. The core miRNA-gene module we identified was comprised of four apparent sub-modules; all four sub-modules displayed a significant correlation with patient survival in the testing set (P-values≤0.001). Notably, one sub-module that consisted of 6 miRNAs and 26 genes also correlated with survival time in the high-grade subgroup (WHO grade III and IV), P-value = 0.0062. Furthermore, the 26-gene expression signature from this sub-module had robust predictive power in four independent, publicly available glioma datasets. Our findings suggested that the expression signatures, which were identified by integration of miRNA and gene level, were closely associated with overall survival among the glioma patients with various grades. PMID:24809850

  1. A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis

    PubMed Central

    Guardia, Gabriela D. A.; Pires, Luís Ferreira; Vêncio, Ricardo Z. N.; Malmegrim, Kelen C. R.; de Farias, Cléver R. G.

    2015-01-01

    Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS) Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis. PMID:26207740

  2. Valine-glutamine (VQ) motif coding genes are ancient and non-plant-specific with comprehensive expression regulation by various biotic and abiotic stresses.

    PubMed

    Jiang, Shu-Ye; Sevugan, Mayalagu; Ramachandran, Srinivasan

    2018-05-09

    Valine-glutamine (VQ) motif containing proteins play important roles in abiotic and biotic stress responses in plants. However, little is known about the origin and evolution as well as comprehensive expression regulation of the VQ gene family. In this study, we systematically surveyed this gene family in 50 plant genomes from algae, moss, gymnosperm and angiosperm and explored their presence in other species from animals, bacteria, fungi and viruses. No VQs were detected in all tested algae genomes and all genomes from moss, gymnosperm and angiosperm encode varying numbers of VQs. Interestingly, some of fungi, lower animals and bacteria also encode single to a few VQs. Thus, they are not plant-specific and should be regarded as an ancient family. Their family expansion was mainly due to segmental duplication followed by tandem duplication and mobile elements. Limited contribution of gene conversion was detected to the family evolution. Generally, VQs were very much conserved in their motif coding region and were under purifying selection. However, positive selection was also observed during species divergence. Many VQs were up- or down-regulated by various abiotic / biotic stresses and phytohormones in rice and Arabidopsis. They were also co-expressed with some of other stress-related genes. All of the expression data suggest a comprehensive expression regulation of the VQ gene family. We provide new insights into gene expansion, divergence, evolution and their expression regulation of this VQ family. VQs were detectable not only in plants but also in some of fungi, lower animals and bacteria, suggesting the evolutionary conservation and the ancient origin. Overall, VQs are non-plant-specific and play roles in abiotic / biotic responses or other biological processes through comprehensive expression regulation.

  3. A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis.

    PubMed

    Guardia, Gabriela D A; Pires, Luís Ferreira; Vêncio, Ricardo Z N; Malmegrim, Kelen C R; de Farias, Cléver R G

    2015-01-01

    Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS) Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.

  4. The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison

    PubMed Central

    Sioson, Allan A; Mane, Shrinivasrao P; Li, Pinghua; Sha, Wei; Heath, Lenwood S; Bohnert, Hans J; Grene, Ruth

    2006-01-01

    Background Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data. Results The Expresso analysis using MIV data consistently identifies more genes as differentially expressed, when compared to Expresso analysis with IIV data. The typical TM4 normalization and filtering pipeline corrects systematic intensity-specific bias on a per microarray basis. Subsequent statistical analysis with Expresso or a TM4 t-test can effectively identify differentially expressed genes. The best agreement with qRT-PCR data is obtained through the use of Expresso analysis and MIV data. Conclusion The results of this research are of practical value to biologists who analyze microarray data sets. The TM4 normalization and filtering pipeline corrects microarray-specific systematic bias and complements the normalization stage in Expresso analysis. The results of Expresso using MIV data have the best agreement with qRT-PCR results. In one experiment, MIV is a better choice than IIV as input to data normalization and statistical analysis methods, as it yields as greater number of statistically significant differentially expressed genes; TM4 does not support the choice of MIV input data. Overall, the more flexible and extensive statistical models of Expresso achieve more accurate analytical results, when judged by the yardstick of qRT-PCR data, in the context of an experimental design of modest complexity. PMID:16626497

  5. New Insights into the Organization, Recombination, Expression and Functional Mechanism of Low Molecular Weight Glutenin Subunit Genes in Bread Wheat

    PubMed Central

    Fan, Huajie; Sun, Jiazhu; Zhang, Zhongjuan; Qin, Huanju; Li, Bin; Hao, Shanting; Li, Zhensheng; Wang, Daowen; Zhang, Aimin; Ling, Hong-Qing

    2010-01-01

    The bread-making quality of wheat is strongly influenced by multiple low molecular weight glutenin subunit (LMW-GS) proteins expressed in the seeds. However, the organization, recombination and expression of LMW-GS genes and their functional mechanism in bread-making are not well understood. Here we report a systematic molecular analysis of LMW-GS genes located at the orthologous Glu-3 loci (Glu-A3, B3 and D3) of bread wheat using complementary approaches (genome wide characterization of gene members, expression profiling, proteomic analysis). Fourteen unique LMW-GS genes were identified for Xiaoyan 54 (with superior bread-making quality). Molecular mapping and recombination analyses revealed that the three Glu-3 loci of Xiaoyan 54 harbored dissimilar numbers of LMW-GS genes and covered different genetic distances. The number of expressed LMW-GS in the seeds was higher in Xiaoyan 54 than in Jing 411 (with relatively poor bread-making quality). This correlated with the finding of higher numbers of active LMW-GS genes at the A3 and D3 loci in Xiaoyan 54. Association analysis using recombinant inbred lines suggested that positive interactions, conferred by genetic combinations of the Glu-3 locus alleles with more numerous active LMW-GS genes, were generally important for the recombinant progenies to attain high Zeleny sedimentation value (ZSV), an important indicator of bread-making quality. A higher number of active LMW-GS genes tended to lead to a more elevated ZSV, although this tendency was influenced by genetic background. This work provides substantial new insights into the genomic organization and expression of LMW-GS genes, and molecular genetic evidence suggesting that these genes contribute quantitatively to bread-making quality in hexaploid wheat. Our analysis also indicates that selection for high numbers of active LMW-GS genes can be used for improvement of bread-making quality in wheat breeding. PMID:20975830

  6. Circular RNA biogenesis can proceed through an exon-containing lariat precursor

    PubMed Central

    Barrett, Steven P; Wang, Peter L; Salzman, Julia

    2015-01-01

    Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical ‘backsplicing’ event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure. DOI: http://dx.doi.org/10.7554/eLife.07540.001 PMID:26057830

  7. Genome-wide characterization and analysis of bZIP transcription factor gene family related to abiotic stress in cassava.

    PubMed

    Hu, Wei; Yang, Hubiao; Yan, Yan; Wei, Yunxie; Tie, Weiwei; Ding, Zehong; Zuo, Jiao; Peng, Ming; Li, Kaimian

    2016-03-07

    The basic leucine zipper (bZIP) transcription factor family plays crucial roles in various aspects of biological processes. Currently, no information is available regarding the bZIP family in the important tropical crop cassava. Herein, 77 bZIP genes were identified from cassava. Evolutionary analysis indicated that MebZIPs could be divided into 10 subfamilies, which was further supported by conserved motif and gene structure analyses. Global expression analysis suggested that MebZIPs showed similar or distinct expression patterns in different tissues between cultivated variety and wild subspecies. Transcriptome analysis of three cassava genotypes revealed that many MebZIP genes were activated by drought in the root of W14 subspecies, indicating the involvement of these genes in the strong resistance of cassava to drought. Expression analysis of selected MebZIP genes in response to osmotic, salt, cold, ABA, and H2O2 suggested that they might participate in distinct signaling pathways. Our systematic analysis of MebZIPs reveals constitutive, tissue-specific and abiotic stress-responsive candidate MebZIP genes for further functional characterization in planta, yields new insights into transcriptional regulation of MebZIP genes, and lays a foundation for understanding of bZIP-mediated abiotic stress response.

  8. Identification and evaluation of reference genes for accurate gene expression normalization of fresh and frozen-thawed spermatozoa of water buffalo (Bubalus bubalis).

    PubMed

    Ashish, Shende; Bhure, S K; Harikrishna, Pillai; Ramteke, S S; Muhammed Kutty, V H; Shruthi, N; Ravi Kumar, G V P P S; Manish, Mahawar; Ghosh, S K; Mihir, Sarkar

    2017-04-01

    The quantitative real time PCR (qRT-PCR) has become an important tool for gene-expression analysis for a selected number of genes in life science. Although large dynamic range, sensitivity and reproducibility of qRT-PCR is good, the reliability majorly depend on the selection of proper reference genes (RGs) employed for normalization. Although, RGs expression has been reported to vary considerably within same cell type with different experimental treatments. No systematic study has been conducted to identify and evaluate the appropriate RGs in spermatozoa of domestic animals. Therefore, this study was conducted to analyze suitable stable RGs in fresh and frozen-thawed spermatozoa. We have assessed 13 candidate RGs (BACT, RPS18s, RPS15A, ATP5F1, HMBS, ATP2B4, RPL13, EEF2, TBP, EIF2B2, MDH1, B2M and GLUT5) of different functions and pathways using five algorithms. Regardless of the approach, the ranking of the most and the least candidate RGs remained almost same. The comprehensive ranking by RefFinder showed GLUT5, ATP2B4 and B2M, MDH1 as the top two stable and least stable RGs, respectively. The expression levels of four heat shock proteins (HSP) were employed as a target gene to evaluate RGs efficiency for normalization. The results demonstrated an exponential difference in expression levels of the four HSP genes upon normalization of the data with the most stable and the least stable RGs. Our study, provides a convenient RGs for normalization of gene-expression of key metabolic pathways effected during freezing and thawing of spermatozoa of buffalo and other closely related bovines. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Clinical Value of Prognosis Gene Expression Signatures in Colorectal Cancer: A Systematic Review

    PubMed Central

    Cordero, David; Riccadonna, Samantha; Solé, Xavier; Crous-Bou, Marta; Guinó, Elisabet; Sanjuan, Xavier; Biondo, Sebastiano; Soriano, Antonio; Jurman, Giuseppe; Capella, Gabriel; Furlanello, Cesare; Moreno, Victor

    2012-01-01

    Introduction The traditional staging system is inadequate to identify those patients with stage II colorectal cancer (CRC) at high risk of recurrence or with stage III CRC at low risk. A number of gene expression signatures to predict CRC prognosis have been proposed, but none is routinely used in the clinic. The aim of this work was to assess the prediction ability and potential clinical usefulness of these signatures in a series of independent datasets. Methods A literature review identified 31 gene expression signatures that used gene expression data to predict prognosis in CRC tissue. The search was based on the PubMed database and was restricted to papers published from January 2004 to December 2011. Eleven CRC gene expression datasets with outcome information were identified and downloaded from public repositories. Random Forest classifier was used to build predictors from the gene lists. Matthews correlation coefficient was chosen as a measure of classification accuracy and its associated p-value was used to assess association with prognosis. For clinical usefulness evaluation, positive and negative post-tests probabilities were computed in stage II and III samples. Results Five gene signatures showed significant association with prognosis and provided reasonable prediction accuracy in their own training datasets. Nevertheless, all signatures showed low reproducibility in independent data. Stratified analyses by stage or microsatellite instability status showed significant association but limited discrimination ability, especially in stage II tumors. From a clinical perspective, the most predictive signatures showed a minor but significant improvement over the classical staging system. Conclusions The published signatures show low prediction accuracy but moderate clinical usefulness. Although gene expression data may inform prognosis, better strategies for signature validation are needed to encourage their widespread use in the clinic. PMID:23145004

  10. Paired termini stabilize antisense RNAs and enhance conditional gene silencing in Escherichia coli

    PubMed Central

    Nakashima, Nobutaka; Tamura, Tomohiro; Good, Liam

    2006-01-01

    Reliable methods for conditional gene silencing in bacteria have been elusive. To improve silencing by expressed antisense RNAs (asRNAs), we systematically altered several design parameters and targeted multiple reporter and essential genes in Escherichia coli. A paired termini (PT) design, where flanking inverted repeats create paired dsRNA termini, proved effective. PTasRNAs targeted against the ackA gene within the acetate kinase-phosphotransacetylase operon (ackA-pta) triggered target mRNA decay and a 78% reduction in AckA activity with high genetic penetrance. PTasRNAs are abundant and stable and function through an RNase III independent mechanism that requires a large stoichiometric excess of asRNA. Conditional ackA silencing reduced carbon flux to acetate and increased heterologous gene expression. The PT design also improved silencing of the essential fabI gene. Full anti-fabI PTasRNA induction prevented growth and partial induction sensitized cells to a FabI inhibitor. PTasRNAs have potential for functional genomics, antimicrobial discovery and metabolic flux control. PMID:17062631

  11. Paired termini stabilize antisense RNAs and enhance conditional gene silencing in Escherichia coli.

    PubMed

    Nakashima, Nobutaka; Tamura, Tomohiro; Good, Liam

    2006-01-01

    Reliable methods for conditional gene silencing in bacteria have been elusive. To improve silencing by expressed antisense RNAs (asRNAs), we systematically altered several design parameters and targeted multiple reporter and essential genes in Escherichia coli. A paired termini (PT) design, where flanking inverted repeats create paired dsRNA termini, proved effective. PTasRNAs targeted against the ackA gene within the acetate kinase-phosphotransacetylase operon (ackA-pta) triggered target mRNA decay and a 78% reduction in AckA activity with high genetic penetrance. PTasRNAs are abundant and stable and function through an RNase III independent mechanism that requires a large stoichiometric excess of asRNA. Conditional ackA silencing reduced carbon flux to acetate and increased heterologous gene expression. The PT design also improved silencing of the essential fabI gene. Full anti-fabI PTasRNA induction prevented growth and partial induction sensitized cells to a FabI inhibitor. PTasRNAs have potential for functional genomics, antimicrobial discovery and metabolic flux control.

  12. From data towards knowledge: revealing the architecture of signaling systems by unifying knowledge mining and data mining of systematic perturbation data.

    PubMed

    Lu, Songjian; Jin, Bo; Cowart, L Ashley; Lu, Xinghua

    2013-01-01

    Genetic and pharmacological perturbation experiments, such as deleting a gene and monitoring gene expression responses, are powerful tools for studying cellular signal transduction pathways. However, it remains a challenge to automatically derive knowledge of a cellular signaling system at a conceptual level from systematic perturbation-response data. In this study, we explored a framework that unifies knowledge mining and data mining towards the goal. The framework consists of the following automated processes: 1) applying an ontology-driven knowledge mining approach to identify functional modules among the genes responding to a perturbation in order to reveal potential signals affected by the perturbation; 2) applying a graph-based data mining approach to search for perturbations that affect a common signal; and 3) revealing the architecture of a signaling system by organizing signaling units into a hierarchy based on their relationships. Applying this framework to a compendium of yeast perturbation-response data, we have successfully recovered many well-known signal transduction pathways; in addition, our analysis has led to many new hypotheses regarding the yeast signal transduction system; finally, our analysis automatically organized perturbed genes as a graph reflecting the architecture of the yeast signaling system. Importantly, this framework transformed molecular findings from a gene level to a conceptual level, which can be readily translated into computable knowledge in the form of rules regarding the yeast signaling system, such as "if genes involved in the MAPK signaling are perturbed, genes involved in pheromone responses will be differentially expressed."

  13. Heterogeneous activation of the TGFβ pathway in glioblastomas identified by gene expression-based classification using TGFβ-responsive genes

    PubMed Central

    Xu, Xie L; Kapoun, Ann M

    2009-01-01

    Background TGFβ has emerged as an attractive target for the therapeutic intervention of glioblastomas. Aberrant TGFβ overproduction in glioblastoma and other high-grade gliomas has been reported, however, to date, none of these reports has systematically examined the components of TGFβ signaling to gain a comprehensive view of TGFβ activation in large cohorts of human glioma patients. Methods TGFβ activation in mammalian cells leads to a transcriptional program that typically affects 5–10% of the genes in the genome. To systematically examine the status of TGFβ activation in high-grade glial tumors, we compiled a gene set of transcriptional response to TGFβ stimulation from tissue culture and in vivo animal studies. These genes were used to examine the status of TGFβ activation in high-grade gliomas including a large cohort of glioblastomas. Unsupervised and supervised classification analysis was performed in two independent, publicly available glioma microarray datasets. Results Unsupervised and supervised classification using the TGFβ-responsive gene list in two independent glial tumor gene expression data sets revealed various levels of TGFβ activation in these tumors. Among glioblastomas, one of the most devastating human cancers, two subgroups were identified that showed distinct TGFβ activation patterns as measured from transcriptional responses. Approximately 62% of glioblastoma samples analyzed showed strong TGFβ activation, while the rest showed a weak TGFβ transcriptional response. Conclusion Our findings suggest heterogeneous TGFβ activation in glioblastomas, which may cause potential differences in responses to anti-TGFβ therapies in these two distinct subgroups of glioblastomas patients. PMID:19192267

  14. Gene signatures of postoperative atrial fibrillation in atrial tissue after coronary artery bypass grafting surgery in patients receiving β-blockers.

    PubMed

    Kertai, Miklos D; Qi, Wenjing; Li, Yi-Ju; Lombard, Frederick W; Liu, Yutao; Smith, Michael P; Stafford-Smith, Mark; Newman, Mark F; Milano, Carmelo A; Mathew, Joseph P; Podgoreanu, Mihai V

    2016-03-01

    Atrial tissue gene expression profiling may help to determine how differentially expressed genes in the human atrium before cardiopulmonary bypass (CPB) are related to subsequent biologic pathway activation patterns, and whether specific expression profiles are associated with an increased risk for postoperative atrial fibrillation (AF) or altered response to β-blocker (BB) therapy after coronary artery bypass grafting (CABG) surgery. Right atrial appendage (RAA) samples were collected from 45 patients who were receiving perioperative BB treatment, and underwent CABG surgery. The isolated RNA samples were used for microarray gene expression analysis, to identify probes that were expressed differently in patients with and without postoperative AF. Gene expression analysis was performed to identify probes that were expressed differently in patients with and without postoperative AF. Gene set enrichment analysis (GSEA) was performed to determine how sets of genes might be systematically altered in patients with postoperative AF. Of the 45 patients studied, genomic DNA from 42 patients was used for target sequencing of 66 candidate genes potentially associated with AF, and 2,144 single-nucleotide polymorphisms (SNPs) were identified. We then performed expression quantitative trait loci (eQTL) analysis to determine the correlation between SNPs identified in the genotyped patients, and RAA expression. Probes that met a false discovery rate<0.25 were selected for eQTL analysis. Of the 17,678 gene expression probes analyzed, 2 probes met our prespecified significance threshold of false discovery rate<0.25. The most significant probe corresponded to vesicular overexpressed in cancer - prosurvival protein 1 gene (VOPP1; 1.83 fold change; P=3.47×10(-7)), and was up-regulated in patients with postoperative AF, whereas the second most significant probe, which corresponded to the LOC389286 gene (0.49 fold change; P=1.54×10(-5)), was down-regulated in patients with postoperative AF. GSEA highlighted the role of VOPP1 in pathways with biologic relevance to myocardial homeostasis, and oxidative stress and redox modulation. Candidate gene eQTL showed a trans-acting association between variants of G protein-coupled receptor kinase 5 gene, previously linked to altered BB response, and high expression of VOPP1. In patients undergoing CABG surgery, RAA gene expression profiling, and pathway and eQTL analysis suggested that VOPP1 plays a novel etiological role in postoperative AF despite perioperative BB therapy. Copyright © 2016. Published by Elsevier Ltd.

  15. Linking Genes to Cardiovascular Diseases: Gene Action and Gene–Environment Interactions

    PubMed Central

    2016-01-01

    A unique myocardial characteristic is its ability to grow/remodel in order to adapt; this is determined partly by genes and partly by the environment and the milieu intérieur. In the “post-genomic” era, a need is emerging to elucidate the physiologic functions of myocardial genes, as well as potential adaptive and maladaptive modulations induced by environmental/epigenetic factors. Genome sequencing and analysis advances have become exponential lately, with escalation of our knowledge concerning sometimes controversial genetic underpinnings of cardiovascular diseases. Current technologies can identify candidate genes variously involved in diverse normal/abnormal morphomechanical phenotypes, and offer insights into multiple genetic factors implicated in complex cardiovascular syndromes. The expression profiles of thousands of genes are regularly ascertained under diverse conditions. Global analyses of gene expression levels are useful for cataloging genes and correlated phenotypes, and for elucidating the role of genes in maladies. Comparative expression of gene networks coupled to complex disorders can contribute insights as to how “modifier genes” influence the expressed phenotypes. Increasingly, a more comprehensive and detailed systematic understanding of genetic abnormalities underlying, for example, various genetic cardiomyopathies is emerging. Implementing genomic findings in cardiology practice may well lead directly to better diagnosing and therapeutics. There is currently evolving a strong appreciation for the value of studying gene anomalies, and doing so in a non-disjointed, cohesive manner. However, it is challenging for many—practitioners and investigators—to comprehend, interpret, and utilize the clinically increasingly accessible and affordable cardiovascular genomics studies. This survey addresses the need for fundamental understanding in this vital area. PMID:26545598

  16. RNA deep sequencing as a tool for selection of cell lines for systematic subcellular localization of all human proteins.

    PubMed

    Danielsson, Frida; Wiking, Mikaela; Mahdessian, Diana; Skogs, Marie; Ait Blal, Hammou; Hjelmare, Martin; Stadler, Charlotte; Uhlén, Mathias; Lundberg, Emma

    2013-01-04

    One of the major challenges of a chromosome-centric proteome project is to explore in a systematic manner the potential proteins identified from the chromosomal genome sequence, but not yet characterized on a protein level. Here, we describe the use of RNA deep sequencing to screen human cell lines for RNA profiles and to use this information to select cell lines suitable for characterization of the corresponding gene product. In this manner, the subcellular localization of proteins can be analyzed systematically using antibody-based confocal microscopy. We demonstrate the usefulness of selecting cell lines with high expression levels of RNA transcripts to increase the likelihood of high quality immunofluorescence staining and subsequent successful subcellular localization of the corresponding protein. The results show a path to combine transcriptomics with affinity proteomics to characterize the proteins in a gene- or chromosome-centric manner.

  17. Integrative Analysis of DNA Methylation and Gene Expression Data Identifies EPAS1 as a Key Regulator of COPD

    PubMed Central

    Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Feronjy, Robert; Spira, Avrum; Schadt, Eric E.; Powell, Charles A.; Zhu, Jun

    2015-01-01

    Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a ‘causal’ role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology. PMID:25569234

  18. Deletion analysis of Streptococcus pneumoniae late competence genes distinguishes virulence determinants that are dependent or independent of competence induction.

    PubMed

    Zhu, Luchang; Lin, Jingjun; Kuang, Zhizhou; Vidal, Jorge E; Lau, Gee W

    2015-07-01

    The competence regulon of Streptococcus pneumoniae (pneumococcus) is crucial for genetic transformation. During competence development, the alternative sigma factor ComX is activated, which in turn, initiates transcription of 80 'late' competence genes. Interestingly, only 16 late genes are essential for genetic transformation. We hypothesized that these late genes that are dispensable for competence are beneficial to pneumococcal fitness during infection. These late genes were systematically deleted, and the resulting mutants were examined for their fitness during mouse models of bacteremia and acute pneumonia. Among these, 14 late genes were important for fitness in mice. Significantly, deletion of some late genes attenuated pneumococcal fitness to the same level in both wild-type and ComX-null genetic backgrounds, suggesting that the constitutive baseline expression of these genes was important for bacterial fitness. In contrast, some mutants were attenuated only in the wild-type genetic background but not in the ComX-null background, suggesting that specific expression of these genes during competence state contributed to pneumococcal fitness. Increased virulence during competence state was partially caused by the induction of allolytic enzymes that enhanced pneumolysin release. These results distinguish the role of basal expression versus competence induction in virulence functions encoded by ComX-regulated late competence genes. © 2015 John Wiley & Sons Ltd.

  19. Genome-Wide Survey of Flavonoid Biosynthesis Genes and Gene Expression Analysis between Black- and Yellow-Seeded Brassica napus

    PubMed Central

    Qu, Cunmin; Zhao, Huiyan; Fu, Fuyou; Wang, Zhen; Zhang, Kai; Zhou, Yan; Wang, Xin; Wang, Rui; Xu, Xinfu; Tang, Zhanglin; Lu, Kun; Li, Jia-Na

    2016-01-01

    Flavonoids, the compounds that impart color to fruits, flowers, and seeds, are the most widespread secondary metabolites in plants. However, a systematic analysis of these loci has not been performed in Brassicaceae. In this study, we isolated 649 nucleotide sequences related to flavonoid biosynthesis, i.e., the Transparent Testa (TT) genes, and their associated amino acid sequences in 17 Brassicaceae species, grouped into Arabidopsis or Brassicaceae subgroups. Moreover, 36 copies of 21 genes of the flavonoid biosynthesis pathway were identified in Arabidopsis thaliana, 53 were identified in Brassica rapa, 50 in Brassica oleracea, and 95 in B. napus, followed the genomic distribution, collinearity analysis and genes triplication of them among Brassicaceae species. The results showed that the extensive gene loss, whole genome triplication, and diploidization that occurred after divergence from the common ancestor. Using qRT-PCR methods, we analyzed the expression of 18 flavonoid biosynthesis genes in 6 yellow- and black-seeded B. napus inbred lines with different genetic background, found that 12 of which were preferentially expressed during seed development, whereas the remaining genes were expressed in all B. napus tissues examined. Moreover, 14 of these genes showed significant differences in expression level during seed development, and all but four of these (i.e., BnTT5, BnTT7, BnTT10, and BnTTG1) had similar expression patterns among the yellow- and black-seeded B. napus. Results showed that the structural genes (BnTT3, BnTT18, and BnBAN), regulatory genes (BnTTG2 and BnTT16) and three encoding transfer proteins (BnTT12, BnTT19, and BnAHA10) might play an crucial roles in the formation of different seed coat colors in B. napus. These data will be helpful for illustrating the molecular mechanisms of flavonoid biosynthesis in Brassicaceae species. PMID:27999578

  20. Directed Neural Differentiation of Mouse Embryonic Stem Cells Is a Sensitive System for the Identification of Novel Hox Gene Effectors

    PubMed Central

    Bami, Myrto; Episkopou, Vasso; Gavalas, Anthony; Gouti, Mina

    2011-01-01

    The evolutionarily conserved Hox family of homeodomain transcription factors plays fundamental roles in regulating cell specification along the anterior posterior axis during development of all bilaterian animals by controlling cell fate choices in a highly localized, extracellular signal and cell context dependent manner. Some studies have established downstream target genes in specific systems but their identification is insufficient to explain either the ability of Hox genes to direct homeotic transformations or the breadth of their patterning potential. To begin delineating Hox gene function in neural development we used a mouse ES cell based system that combines efficient neural differentiation with inducible Hoxb1 expression. Gene expression profiling suggested that Hoxb1 acted as both activator and repressor in the short term but predominantly as a repressor in the long run. Activated and repressed genes segregated in distinct processes suggesting that, in the context examined, Hoxb1 blocked differentiation while activating genes related to early developmental processes, wnt and cell surface receptor linked signal transduction and cell-to-cell communication. To further elucidate aspects of Hoxb1 function we used loss and gain of function approaches in the mouse and chick embryos. We show that Hoxb1 acts as an activator to establish the full expression domain of CRABPI and II in rhombomere 4 and as a repressor to restrict expression of Lhx5 and Lhx9. Thus the Hoxb1 patterning activity includes the regulation of the cellular response to retinoic acid and the delay of the expression of genes that commit cells to neural differentiation. The results of this study show that ES neural differentiation and inducible Hox gene expression can be used as a sensitive model system to systematically identify Hox novel target genes, delineate their interactions with signaling pathways in dictating cell fate and define the extent of functional overlap among different Hox genes. PMID:21637844

  1. Integration of zebrafish fin regeneration genes with expression data of human tumors in silico uncovers potential novel melanoma markers.

    PubMed

    Hagedorn, Martin; Siegfried, Géraldine; Hooks, Katarzyna B; Khatib, Abdel-Majid

    2016-11-01

    Tissue regeneration requires expression of a large, unknown number of genes to initiate and maintain cellular processes such as proliferation, extracellular matrix synthesis, differentiation and migration. A unique model to simulate this process in a controlled manner is the re-growth of the caudal fin of zebrafish after amputation. Within this tissue stem cells differentiate into fibroblasts, epithelial and endothelial cells as well as melanocytes. Many genes implicated in the regeneration process are deregulated in cancer. We therefore undertook a systematic gene expression study to identify genes upregulated during the re-growth of caudal fin tissue. By applying a high stringency cut-off value of 4-fold change, we identified 54 annotated genes significantly overexpressed in regenerating blastema. Further bioinformatics data mining studies showed that 22 out of the 54 regeneration genes where overexpressed in melanoma compared to normal skin or other cancers. Whereas the role of TNC (tenascin C) and FN1 (fibronectin 1) in melanoma development is well documented, implication of MARCKS, RCN3, BAMBI, PEA3/ETV4 and the FK506 family members FKBP7, FKBP10 and FKBP11 in melanoma progression is unclear. Corresponding proteins were detected in melanoma tissue but not in normal skin. High expression of FKBP7, DPYSL5 and MDK was significantly associated with poor survival. We discuss a potential role of these novel melanoma genes, which have promising potential as new therapeutic targets or diagnostic markers.

  2. Gene expression profiling of selenophosphate synthetase 2 knockdown in Drosophila melanogaster.

    PubMed

    Li, Gaopeng; Liu, Liying; Li, Ping; Chen, Luonan; Song, Haiyun; Zhang, Yan

    2016-03-01

    Selenium (Se) is an important trace element for many organisms and is incorporated into selenoproteins as selenocysteine (Sec). In eukaryotes, selenophosphate synthetase SPS2 is essential for Sec biosynthesis. In recent years, genetic disruptions of both Sec biosynthesis genes and selenoprotein genes have been investigated in different animal models, which provide important clues for understanding the Se metabolism and function in these organisms. However, a systematic study on the knockdown of SPS2 has not been performed in vivo. Herein, we conducted microarray experiments to study the transcriptome of fruit flies with knockdown of SPS2 in larval and adult stages. Several hundred differentially expressed genes were identified in each stage. In spite that the expression levels of other Sec biosynthesis genes and selenoprotein genes were not significantly changed, it is possible that selenoprotein translation might be reduced without impacting the mRNA level. Functional enrichment and network-based analyses revealed that although different sets of differentially expressed genes were obtained in each stage, they were both significantly enriched in the carbohydrate metabolism and redox processes. Furthermore, protein-protein interaction (PPI)-based network clustering analysis implied that several hub genes detected in the top modules, such as Nimrod C1 and regucalcin, could be considered as key regulators that are responsible for the complex responses caused by SPS2 knockdown. Overall, our data provide new insights into the relationship between Se utilization and several fundamental cellular processes as well as diseases.

  3. Candidate Reference Genes Selection and Application for RT-qPCR Analysis in Kenaf with Cytoplasmic Male Sterility Background

    PubMed Central

    Zhou, Bujin; Chen, Peng; Khan, Aziz; Zhao, Yanhong; Chen, Lihong; Liu, Dongmei; Liao, Xiaofang; Kong, Xiangjun; Zhou, Ruiyang

    2017-01-01

    Cytoplasmic male sterility (CMS) is a maternally inherited trait that results in the production of dysfunctional pollen. Based on reliable reference gene-normalized real-time quantitative PCR (RT-qPCR) data, examining gene expression profile can provide valuable information on the molecular mechanism of kenaf CMS. However, studies have not been conducted regarding selection of reference genes for normalizing RT-qPCR data in the CMS and maintainer lines of kenaf crop. Therefore, we studied 10 candidate reference genes (ACT3, ELF1A, G6PD, PEPKR1, TUB, TUA, CYP, GAPDH, H3, and 18S) to assess their expression stability at three stages of pollen development in CMS line 722A and maintainer line 722B of kenaf. Five computational statistical approaches (GeNorm, NormFinder, ΔCt, BestKeeper, and RefFinder) were used to evaluate the expression stability levels of these genes. According to RefFinder and GeNorm, the combination of TUB, CYP, and PEPKR1 was identified as an internal control for the accurate normalization across all sample set, which was further confirmed by validating the expression of HcPDIL5-2a. Furthermore, the combination of TUB, CYP, and PEPKR1 was used to differentiate the expression pattern of five mitochondria F1F0-ATPase subunit genes (atp1, atp4, atp6, atp8, and atp9) by RT-qPCR during pollen development in CMS line 722A and maintainer line 722B. We found that atp1, atp6, and atp9 exhibited significantly different expression patterns during pollen development in line 722A compared with line 722B. This is the first systematic study of reference genes selection for CMS and will provide useful information for future research on the gene expressions and molecular mechanisms underlying CMS in kenaf. PMID:28919905

  4. Effects of high temperature on photosynthesis and related gene expression in poplar

    PubMed Central

    2014-01-01

    Background High temperature, whether transitory or constant, causes physiological, biochemical and molecular changes that adversely affect tree growth and productivity by reducing photosynthesis. To elucidate the photosynthetic adaption response and examine the recovery capacity of trees under heat stress, we measured gas exchange, chlorophyll fluorescence, electron transport, water use efficiency, and reactive oxygen-producing enzyme activities in heat-stressed plants. Results We found that photosynthesis could completely recover after less than six hours of high temperature treatment, which might be a turning point in the photosynthetic response to heat stress. Genome-wide gene expression analysis at six hours of heat stress identified 29,896 differentially expressed genes (15,670 up-regulated and 14,226 down-regulated), including multiple classes of transcription factors. These interact with each other and regulate the expression of photosynthesis-related genes in response to heat stress, controlling carbon fixation and changes in stomatal conductance. Heat stress of more than twelve hours caused reduced electron transport, damaged photosystems, activated the glycolate pathway and caused H2O2 production; as a result, photosynthetic capacity did not recover completely. Conclusions This study provides a systematic physiological and global gene expression profile of the poplar photosynthetic response to heat stress and identifies the main limitations and threshold of photosynthesis under heat stress. It will expand our understanding of plant thermostability and provides a robust dataset for future studies. PMID:24774695

  5. Effects of high temperature on photosynthesis and related gene expression in poplar.

    PubMed

    Song, Yuepeng; Chen, Qingqing; Ci, Dong; Shao, Xinning; Zhang, Deqiang

    2014-04-28

    High temperature, whether transitory or constant, causes physiological, biochemical and molecular changes that adversely affect tree growth and productivity by reducing photosynthesis. To elucidate the photosynthetic adaption response and examine the recovery capacity of trees under heat stress, we measured gas exchange, chlorophyll fluorescence, electron transport, water use efficiency, and reactive oxygen-producing enzyme activities in heat-stressed plants. We found that photosynthesis could completely recover after less than six hours of high temperature treatment, which might be a turning point in the photosynthetic response to heat stress. Genome-wide gene expression analysis at six hours of heat stress identified 29,896 differentially expressed genes (15,670 up-regulated and 14,226 down-regulated), including multiple classes of transcription factors. These interact with each other and regulate the expression of photosynthesis-related genes in response to heat stress, controlling carbon fixation and changes in stomatal conductance. Heat stress of more than twelve hours caused reduced electron transport, damaged photosystems, activated the glycolate pathway and caused H2O2 production; as a result, photosynthetic capacity did not recover completely. This study provides a systematic physiological and global gene expression profile of the poplar photosynthetic response to heat stress and identifies the main limitations and threshold of photosynthesis under heat stress. It will expand our understanding of plant thermostability and provides a robust dataset for future studies.

  6. Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi

    We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less

  7. Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

    DOE PAGES

    Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi; ...

    2014-08-19

    We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less

  8. Scoring clustering solutions by their biological relevance.

    PubMed

    Gat-Viks, I; Sharan, R; Shamir, R

    2003-12-12

    A central step in the analysis of gene expression data is the identification of groups of genes that exhibit similar expression patterns. Clustering gene expression data into homogeneous groups was shown to be instrumental in functional annotation, tissue classification, regulatory motif identification, and other applications. Although there is a rich literature on clustering algorithms for gene expression analysis, very few works addressed the systematic comparison and evaluation of clustering results. Typically, different clustering algorithms yield different clustering solutions on the same data, and there is no agreed upon guideline for choosing among them. We developed a novel statistically based method for assessing a clustering solution according to prior biological knowledge. Our method can be used to compare different clustering solutions or to optimize the parameters of a clustering algorithm. The method is based on projecting vectors of biological attributes of the clustered elements onto the real line, such that the ratio of between-groups and within-group variance estimators is maximized. The projected data are then scored using a non-parametric analysis of variance test, and the score's confidence is evaluated. We validate our approach using simulated data and show that our scoring method outperforms several extant methods, including the separation to homogeneity ratio and the silhouette measure. We apply our method to evaluate results of several clustering methods on yeast cell-cycle gene expression data. The software is available from the authors upon request.

  9. Comprehensive analysis of SAUR gene family in citrus and its transcriptional correlation with fruitlet drop from abscission zone A.

    PubMed

    Xie, Rangjin; Dong, Cuicui; Ma, Yanyan; Deng, Lie; He, Shaolan; Yi, Shilai; Lv, Qiang; Zheng, Yongqiang

    2015-11-01

    Small auxin-up RNA (SAUR) gene family is large, and the members of which can be rapidly induced by auxin and encode highly unstable mRNAs. SAUR genes are involved in various developmental and physiological processes, such as leaf senescence, fruitlet abscission, and hypocotyl development. However, their modes of action in citrus remain unknown. Hereby, a systematic analysis of SAUR gene family in citrus was conducted through a genome-wide search. In this study, a total of 70 SAUR genes, referred to as CitSAURs, have been identified in citrus. The evolutionary relationship and the intro-exon organization were analyzed, revealing strong gene conservation and the expansion of particular functional genes during plant evolution. Expression analysis showed that the major of CitSAUR genes were expressed in at least one tissue and showed distinctive expression levels, indicating the SAUR gene family play important roles in the development and growth of citrus organs. However, there were more than 20 CitSAUR genes such as CitSARU36, CitSAUR37, and CitSAUR54 exhibiting very low expression level in all tissue tested. Twenty-three out of 70 CitSAUR genes were responded to indole-3-acetic acid (IAA) treatment, of which just CitSAUR19 was down-regulated. Additionally, 14 CitSAUR genes exhibited distinct changes during fruitlet abscission, however just 5 of them including CitSAUR06, CitSAUR08, CitSAUR44, CitSAUR61, and CitSAUR64 were associated with fruitlet abscission. The current study provides basic information for the citrus SAUR gene family and will pave the way for deciphering the precise role of SAURs in citrus development and growth as well as fruitlet abscission.

  10. Sequential Logic Model Deciphers Dynamic Transcriptional Control of Gene Expressions

    PubMed Central

    Yeo, Zhen Xuan; Wong, Sum Thai; Arjunan, Satya Nanda Vel; Piras, Vincent; Tomita, Masaru; Selvarajoo, Kumar; Giuliani, Alessandro; Tsuchiya, Masa

    2007-01-01

    Background Cellular signaling involves a sequence of events from ligand binding to membrane receptors through transcription factors activation and the induction of mRNA expression. The transcriptional-regulatory system plays a pivotal role in the control of gene expression. A novel computational approach to the study of gene regulation circuits is presented here. Methodology Based on the concept of finite state machine, which provides a discrete view of gene regulation, a novel sequential logic model (SLM) is developed to decipher control mechanisms of dynamic transcriptional regulation of gene expressions. The SLM technique is also used to systematically analyze the dynamic function of transcriptional inputs, the dependency and cooperativity, such as synergy effect, among the binding sites with respect to when, how much and how fast the gene of interest is expressed. Principal Findings SLM is verified by a set of well studied expression data on endo16 of Strongylocentrotus purpuratus (sea urchin) during the embryonic midgut development. A dynamic regulatory mechanism for endo16 expression controlled by three binding sites, UI, R and Otx is identified and demonstrated to be consistent with experimental findings. Furthermore, we show that during transition from specification to differentiation in wild type endo16 expression profile, SLM reveals three binary activities are not sufficient to explain the transcriptional regulation of endo16 expression and additional activities of binding sites are required. Further analyses suggest detailed mechanism of R switch activity where indirect dependency occurs in between UI activity and R switch during specification to differentiation stage. Conclusions/Significance The sequential logic formalism allows for a simplification of regulation network dynamics going from a continuous to a discrete representation of gene activation in time. In effect our SLM is non-parametric and model-independent, yet providing rich biological insight. The demonstration of the efficacy of this approach in endo16 is a promising step for further application of the proposed method. PMID:17712424

  11. Gene expression analysis of whole blood, peripheral blood mononuclear cells, and lymphoblastoid cell lines from the Framingham Heart Study

    PubMed Central

    Joehanes, Roby; Johnson, Andrew D.; Barb, Jennifer J.; Raghavachari, Nalini; Liu, Poching; Woodhouse, Kimberly A.; O'Donnell, Christopher J.; Munson, Peter J.

    2012-01-01

    Despite a growing number of reports of gene expression analysis from blood-derived RNA sources, there have been few systematic comparisons of various RNA sources in transcriptomic analysis or for biomarker discovery in the context of cardiovascular disease (CVD). As a pilot study of the Systems Approach to Biomarker Research (SABRe) in CVD Initiative, this investigation used Affymetrix Exon arrays to characterize gene expression of three blood-derived RNA sources: lymphoblastoid cell lines (LCL), whole blood using PAXgene tubes (PAX), and peripheral blood mononuclear cells (PBMC). Their performance was compared in relation to identifying transcript associations with sex and CVD risk factors, such as age, high-density lipoprotein, and smoking status, and the differential blood cell count. We also identified a set of exons that vary substantially between participants, but consistently in each RNA source. Such exons are thus stable phenotypes of the participant and may potentially become useful fingerprinting biomarkers. In agreement with previous studies, we found that each of the RNA sources is distinct. Unlike PAX and PBMC, LCL gene expression showed little association with the differential blood count. LCL, however, was able to detect two genes related to smoking status. PAX and PBMC identified Y-chromosome probe sets similarly and slightly better than LCL. PMID:22045913

  12. Genetic Contributions of Inflammation to Depression

    PubMed Central

    Barnes, Jacob; Mondelli, Valeria; Pariante, Carmine M

    2017-01-01

    This paper describes the effects of immune genes genetic variants and mRNA expression on depression's risk, severity, and response to antidepressant treatment, through a systematic review on all papers published between 2000 and 2016. Our results, based largely on case–control studies, suggest that common genetic variants and gene-expression pathways are involved in both immune activation and depression. The most replicated and relevant genetic variants include polymorphisms in the genes for interleukin (IL)-1β, IL-6, IL-10, monocyte chemoattractant protein-1, tumor necrosis factor-alpha, C-reactive protein, and phospholipase A2. Moreover, increased blood cytokines mRNA expression (especially of IL-1β) identifies patients that are less likely to respond to conventional antidepressants. However, even for the most replicated findings there are inconsistent results, not only between studies, but also between the immune effects of the genetic variants and the resulting effects on depression. We find evidence that these discrepant findings may be explained, at least in part, by the heterogeneity of the depression immunophenotype, by environmental influences and gene × environment interactions, and by the complex interfacing of genetic variants with gene expression. Indeed, some of the most robust findings have been obtained in patients developing depression in the context of treatment with interferon-alpha, a widely used model to mimic depression in the context of inflammation. Further ‘omics' approaches, through GWAS and transcriptomics, will finally shed light on the interaction between immune genes, their expression, and the influence of the environment, in the pathogenesis of depression. PMID:27555379

  13. Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders.

    PubMed

    Forero, Diego A; Prada, Carlos F; Perry, George

    2016-01-01

    In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD.

  14. Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders

    PubMed Central

    Forero, Diego A.; Prada, Carlos F.; Perry, George

    2016-01-01

    Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183

  15. Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis.

    PubMed

    dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

    2015-01-01

    Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis.

  16. Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

    PubMed Central

    dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

    2015-01-01

    Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928

  17. Myostatin regulates miR-431 expression via the Ras-Mek-Erk signaling pathway.

    PubMed

    Wu, Rimao; Li, Hu; Li, Tingting; Zhang, Yong; Zhu, Dahai

    2015-05-29

    MicroRNAs (miRNAs) play critical regulatory roles in controlling myogenic development both in vitro and in vivo; however, the molecular mechanisms underlying transcriptional regulation of miRNA genes in skeletal muscle cells are largely unknown. Here, using a microarray hybridization approach, we identified myostatin-regulated miRNA genes in skeletal muscle tissues by systematically searching miRNAs that are differentially expressed between wild-type and myostatin-null mice during development. We found that 116 miRNA genes were differentially expressed in muscles between these mice across different developmental stages. We further characterized myostatin-regulated miR-431 was upregulated in skeletal muscle tissues of myostatin-null mice. In functional studies, we found that overexpression of miR-431 in C2C12 myoblast cells attenuated myostatin-induced suppression of myogenic differentiation. Mechanistic studies further demonstrated that myostatin acted through the Ras-Mek-Erk signaling pathway to transcriptionally regulate miR-431 expression C2C12 cells. Our findings provide new insight into the mechanisms underlying transcriptional regulation of miRNA genes by myostatin during skeletal muscle development. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. Orthogonal control of expression mean and variance by epigenetic features at different genomic loci

    DOE PAGES

    Dey, Siddharth S.; Foley, Jonathan E.; Limsirichai, Prajit; ...

    2015-05-05

    While gene expression noise has been shown to drive dramatic phenotypic variations, the molecular basis for this variability in mammalian systems is not well understood. Gene expression has been shown to be regulated by promoter architecture and the associated chromatin environment. However, the exact contribution of these two factors in regulating expression noise has not been explored. Using a dual-reporter lentiviral model system, we deconvolved the influence of the promoter sequence to systematically study the contribution of the chromatin environment at different genomic locations in regulating expression noise. By integrating a large-scale analysis to quantify mRNA levels by smFISH andmore » protein levels by flow cytometry in single cells, we found that mean expression and noise are uncorrelated across genomic locations. Furthermore, we showed that this independence could be explained by the orthogonal control of mean expression by the transcript burst size and noise by the burst frequency. Finally, we showed that genomic locations displaying higher expression noise are associated with more repressed chromatin, thereby indicating the contribution of the chromatin environment in regulating expression noise.« less

  19. Effects of intense magnetic fields on sedimentation pattern and gene expression profile in budding yeast

    NASA Astrophysics Data System (ADS)

    Ikehata, Masateru; Iwasaka, Masakazu; Miyakoshi, Junji; Ueno, Shoogo; Koana, Takao

    2003-05-01

    Effects of magnetic fields (MFs) on biological systems are usually investigated using biological indices such as gene expression profiles. However, to precisely evaluate the biological effects of MF, the effects of intense MFs on systematic material transport processes including experimental environment must be seriously taken into consideration. In this study, a culture of the budding yeast, Saccharomyces cerevisiae, was used as a model for an in vitro biological test system. After exposure to 5 T static vertical MF, we found a difference in the sedimentation pattern of cells depending on the location of the dish in the magnet bore. Sedimented cells were localized in the center of the dish when they were placed in the lower part of the magnet bore while the sedimentation of the cells was uniform in dishes placed in the upper part of the bore because of the diamagnetic force. Genome wide gene expression profile of the yeast cells after exposure to 5 T static MF for 2 h suggested that the MF did not affect the expression level of any gene in yeast cells although the sedimentation pattern was altered. In addition, exposure to 10 T for 1 h and 5 T for 24 h also did not affect the gene expression. On the other hand, a slight change in expressions of several genes which are related to respiration was observed by exposure to a 14 T static MF for 24 h. The necessity of estimating the indirect effects of MFs on a study of its biological effect of MF in vitro will be discussed.

  20. Distributional fold change test – a statistical approach for detecting differential expression in microarray experiments

    PubMed Central

    2012-01-01

    Background Because of the large volume of data and the intrinsic variation of data intensity observed in microarray experiments, different statistical methods have been used to systematically extract biological information and to quantify the associated uncertainty. The simplest method to identify differentially expressed genes is to evaluate the ratio of average intensities in two different conditions and consider all genes that differ by more than an arbitrary cut-off value to be differentially expressed. This filtering approach is not a statistical test and there is no associated value that can indicate the level of confidence in the designation of genes as differentially expressed or not differentially expressed. At the same time the fold change by itself provide valuable information and it is important to find unambiguous ways of using this information in expression data treatment. Results A new method of finding differentially expressed genes, called distributional fold change (DFC) test is introduced. The method is based on an analysis of the intensity distribution of all microarray probe sets mapped to a three dimensional feature space composed of average expression level, average difference of gene expression and total variance. The proposed method allows one to rank each feature based on the signal-to-noise ratio and to ascertain for each feature the confidence level and power for being differentially expressed. The performance of the new method was evaluated using the total and partial area under receiver operating curves and tested on 11 data sets from Gene Omnibus Database with independently verified differentially expressed genes and compared with the t-test and shrinkage t-test. Overall the DFC test performed the best – on average it had higher sensitivity and partial AUC and its elevation was most prominent in the low range of differentially expressed features, typical for formalin-fixed paraffin-embedded sample sets. Conclusions The distributional fold change test is an effective method for finding and ranking differentially expressed probesets on microarrays. The application of this test is advantageous to data sets using formalin-fixed paraffin-embedded samples or other systems where degradation effects diminish the applicability of correlation adjusted methods to the whole feature set. PMID:23122055

  1. Global analysis of differential gene expression related to long-term sperm storage in oviduct of Chinese Soft-Shelled Turtle Pelodiscus sinensis

    PubMed Central

    Liu, Tengfei; Yang, Ping; Chen, Hong; Huang, Yufei; Liu, Yi; Waqas, Yasir; Ahmed, Nisar; Chu, Xiaoya; Chen, Qiusheng

    2016-01-01

    Important evolutionary and ecological consequences arise from the ability of female turtles to store viable spermatozoa for an extended period. Although previous morphological studies have observed the localization of spermatozoa in Pelodiscus sinensis oviduct, no systematic study on the identification of genes that are involved in long-term sperm storage has been performed. In this study, the oviduct of P. sinensis at different phases (reproductive and hibernation seasons) was prepared for RNA-Seq and gene expression profiling. In total, 2,662 differentially expressed genes (DEGs) including 1,224 up- and 1,438 down-regulated genes were identified from two cDNA libraries. Functional enrichment analysis indicated that many genes were predominantly involved in the immune response, apoptosis pathway and regulation of autophagy. RT-qPCR, ELISA, western blot and IHC analyses showed that the expression profiles of mRNA and protein in selected DEGs were in consistent with results from RNA-Seq analysis. Remarkably, TUNEL analysis revealed the reduced number of apoptotic cells during sperm storage. IHC and TEM analyses found that autophagy occurred in the oviduct epithelial cells, where the spermatozoa were closely attached. The outcomes of this study provide fundamental insights into the complex sperm storage regulatory process and facilitate elucidating the mechanism of sperm storage in P. sinensis. PMID:27628424

  2. Tuning Gene Activity by Inducible and Targeted Regulation of Gene Expression in Minimal Bacterial Cells.

    PubMed

    Mariscal, Ana M; Kakizawa, Shigeyuki; Hsu, Jonathan Y; Tanaka, Kazuki; González-González, Luis; Broto, Alicia; Querol, Enrique; Lluch-Senar, Maria; Piñero-Lambea, Carlos; Sun, Lijie; Weyman, Philip D; Wise, Kim S; Merryman, Chuck; Tse, Gavin; Moore, Adam J; Hutchison, Clyde A; Smith, Hamilton O; Tomita, Masaru; Venter, J Craig; Glass, John I; Piñol, Jaume; Suzuki, Yo

    2018-05-22

    Functional genomics studies in minimal mycoplasma cells enable unobstructed access to some of the most fundamental processes in biology. Conventional transposon bombardment and gene knockout approaches often fail to reveal functions of genes that are essential for viability, where lethality precludes phenotypic characterization. Conditional inactivation of genes is effective for characterizing functions central to cell growth and division, but tools are limited for this purpose in mycoplasmas. Here we demonstrate systems for inducible repression of gene expression based on clustered regularly interspaced short palindromic repeats-mediated interference (CRISPRi) in Mycoplasma pneumoniae and synthetic Mycoplasma mycoides, two organisms with reduced genomes actively used in systems biology studies. In the synthetic cell, we also demonstrate inducible gene expression for the first time. Time-course data suggest rapid kinetics and reversible engagement of CRISPRi. Targeting of six selected endogenous genes with this system results in lowered transcript levels or reduced growth rates that agree with lack or shortage of data in previous transposon bombardment studies, and now produces actual cells to analyze. The ksgA gene encodes a methylase that modifies 16S rRNA, rendering it vulnerable to inhibition by the antibiotic kasugamycin. Targeting the ksgA gene with CRISPRi removes the lethal effect of kasugamycin and enables cell growth, thereby establishing specific and effective gene modulation with our system. The facile methods for conditional gene activation and inactivation in mycoplasmas open the door to systematic dissection of genetic programs at the core of cellular life.

  3. Genome-wide identification and characterisation of F-box family in maize.

    PubMed

    Jia, Fengjuan; Wu, Bingjiang; Li, Hui; Huang, Jinguang; Zheng, Chengchao

    2013-11-01

    F-box-containing proteins, as the key components of the protein degradation machinery, are widely distributed in higher plants and are considered as one of the largest known families of regulatory proteins. The F-box protein family plays a crucial role in plant growth and development and in response to biotic and abiotic stresses. However, systematic analysis of the F-box family in maize (Zea mays) has not been reported yet. In this paper, we identified and characterised the maize F-box genes in a genome-wide scale, including phylogenetic analysis, chromosome distribution, gene structure, promoter analysis and gene expression profiles. A total of 359 F-box genes were identified and divided into 15 subgroups by phylogenetic analysis. The F-box domain was relatively conserved, whereas additional motifs outside the F-box domain may indicate the functional diversification of maize F-box genes. These genes were unevenly distributed in ten maize chromosomes, suggesting that they expanded in the maize genome because of tandem and segmental duplication events. The expression profiles suggested that the maize F-box genes had temporal and spatial expression patterns. Putative cis-acting regulatory DNA elements involved in abiotic stresses were observed in maize F-box gene promoters. The gene expression profiles under abiotic stresses also suggested that some genes participated in stress responsive pathways. Furthermore, ten genes were chosen for quantitative real-time PCR analysis under drought stress and the results were consistent with the microarray data. This study has produced a comparative genomics analysis of the maize ZmFBX gene family that can be used in further studies to uncover their roles in maize growth and development.

  4. Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don.

    PubMed

    Xiao, Zheng; Sun, Xiaobo; Liu, Xiaoqing; Li, Chang; He, Lisi; Chen, Shangping; Su, Jiale

    2016-01-01

    The quantitative real-time polymerase chain reaction (qRT-PCR) approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder, and BestKeeper. The results showed that EF1- α (elongation factor 1-alpha), 18S (18s ribosomal RNA), and RPL3 (ribosomal protein L3) were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin ( TUB ) was the least stable. ACT5 (actin), RPL3 , 18S , and EF1- α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle . Furthermore, the expression profiles of RmPSY (phytoene synthase) and RmPDS (phytoene dehydrogenase) were assessed using EF1- α, 18S , ACT5 , RPL3 , and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle .

  5. Characterization of reference genes for RT-qPCR in the desert moss Syntrichia caninervis in response to abiotic stress and desiccation/rehydration

    PubMed Central

    Li, Xiaoshuang; Zhang, Daoyuan; Li, Haiyan; Gao, Bei; Yang, Honglan; Zhang, Yuanming; Wood, Andrew J.

    2015-01-01

    Syntrichia caninervis is the dominant bryophyte of the biological soil crusts found in the Gurbantunggut desert. The extreme desert environment is characterized by prolonged drought, temperature extremes, high radiation and frequent cycles of hydration and dehydration. S. caninervis is an ideal organism for the identification and characterization of genes related to abiotic stress tolerance. Reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) expression analysis is a powerful analytical technique that requires the use of stable reference genes. Using available S. caninervis transcriptome data, we selected 15 candidate reference genes and analyzed their relative expression stabilities in S. caninervis gametophores exposed to a range of abiotic stresses or a hydration-desiccation-rehydration cycle. The programs geNorm, NormFinder, and RefFinder were used to assess and rank the expression stability of the 15 candidate genes. The stability ranking results of reference genes under each specific experimental condition showed high consistency using different algorithms. For abiotic stress treatments, the combination of two genes (α-TUB2 and CDPK) were sufficient for accurate normalization. For the hydration-desiccation-rehydration process, the combination of two genes (α-TUB1 and CDPK) were sufficient for accurate normalization. 18S was among the least stable genes in all of the experimental sets and was unsuitable as reference gene in S. caninervis. This is the first systematic investigation and comparison of reference gene selection for RT-qPCR work in S. caninervis. This research will facilitate gene expression studies in S. caninervis, related moss species from the Syntrichia complex and other mosses. PMID:25699066

  6. Long non-coding RNAs and mRNAs profiling during spleen development in pig.

    PubMed

    Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou

    2018-01-01

    Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.

  7. Deregulation of Rab and Rab Effector Genes in Bladder Cancer

    PubMed Central

    Ho, Joel R.; Chapeaublanc, Elodie; Kirkwood, Lisa; Nicolle, Remy; Benhamou, Simone; Lebret, Thierry; Allory, Yves; Southgate, Jennifer; Radvanyi, François; Goud, Bruno

    2012-01-01

    Growing evidence indicates that Rab GTPases, key regulators of intracellular transport in eukaryotic cells, play an important role in cancer. We analysed the deregulation at the transcriptional level of the genes encoding Rab proteins and Rab-interacting proteins in bladder cancer pathogenesis, distinguishing between the two main progression pathways so far identified in bladder cancer: the Ta pathway characterized by a high frequency of FGFR3 mutation and the carcinoma in situ pathway where no or infrequent FGFR3 mutations have been identified. A systematic literature search identified 61 genes encoding Rab proteins and 223 genes encoding Rab-interacting proteins. Transcriptomic data were obtained for normal urothelium samples and for two independent bladder cancer data sets corresponding to 152 and 75 tumors. Gene deregulation was analysed with the SAM (significant analysis of microarray) test or the binomial test. Overall, 30 genes were down-regulated, and 13 were up-regulated in the tumor samples. Five of these deregulated genes (LEPRE1, MICAL2, RAB23, STXBP1, SYTL1) were specifically deregulated in FGFR3-non-mutated muscle-invasive tumors. No gene encoding a Rab or Rab-interacting protein was found to be specifically deregulated in FGFR3-mutated tumors. Cluster analysis showed that the RAB27 gene cluster (comprising the genes encoding RAB27 and its interacting partners) was deregulated and that this deregulation was associated with both pathways of bladder cancer pathogenesis. Finally, we found that the expression of KIF20A and ZWINT was associated with that of proliferation markers and that the expression of MLPH, MYO5B, RAB11A, RAB11FIP1, RAB20 and SYTL2 was associated with that of urothelial cell differentiation markers. This systematic analysis of Rab and Rab effector gene deregulation in bladder cancer, taking relevant tumor subgroups into account, provides insight into the possible roles of Rab proteins and their effectors in bladder cancer pathogenesis. This approach is applicable to other group of genes and types of cancer. PMID:22724020

  8. Integrative analysis of RUNX1 downstream pathways and target genes

    PubMed Central

    Michaud, Joëlle; Simpson, Ken M; Escher, Robert; Buchet-Poyau, Karine; Beissbarth, Tim; Carmichael, Catherine; Ritchie, Matthew E; Schütz, Frédéric; Cannon, Ping; Liu, Marjorie; Shen, Xiaofeng; Ito, Yoshiaki; Raskind, Wendy H; Horwitz, Marshall S; Osato, Motomi; Turner, David R; Speed, Terence P; Kavallaris, Maria; Smyth, Gordon K; Scott, Hamish S

    2008-01-01

    Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease progression in both familial and sporadic leukemia as well as therapeutic implications. PMID:18671852

  9. In Vivo Genome-Wide Expression Study on Human Circulating B Cells Suggests a Novel ESR1 and MAPK3 Network for Postmenopausal Osteoporosis

    PubMed Central

    Xiao, Peng; Chen, Yuan; Jiang, Hui; Liu, Yao-Zhong; Pan, Feng; Yang, Tie-Lin; Tang, Zi-Hui; Larsen, Jennifer A; Lappe, Joan M; Recker, Robert R; Deng, Hong-Wen

    2008-01-01

    Introduction Osteoporosis is characterized by low BMD. Studies have shown that B cells may participate in osteoclastogenesis through expression of osteoclast-related factors, such as RANKL, transforming growth factor β (TGFB), and osteoprotegerin (OPG). However, the in vivo significance of B cells in human bone metabolism and osteoporosis is still largely unknown, particularly at the systematic gene expression level. Materials and Methods In this study, Affymetrix HG-U133A GeneChip arrays were used to identify genes differentially expressed in B cells between 10 low and 10 high BMD postmenopausal women. Significance of differential expression was tested by t-test and adjusted for multiple testing with the Benjamini and Hochberg (BH) procedure (adjusted p ≤ 0.05). Results Twenty-nine genes were downregulated in the low versus high BMD group. These genes were further analyzed using Ingenuity Pathways Analysis (Ingenuity Systems). A network involving estrogen receptor 1 (ESR1) and mitogen activated protein kinase 3 (MAPK3) was identified. Real-time RT-PCR confirmed differential expression of eight genes, including ESR1, MAPK3, methyl CpG binding protein 2 (MECP2), proline-serine-threonine phosphatase interacting protein 1 (PSTPIP1), Scr-like-adaptor (SLA), serine/threonine kinase 11 (STK11), WNK lysine-deficient protein kinase 1 (WNK1), and zinc finger protein 446 (ZNF446). Conclusions This is the first in vivo genome-wide expression study on human B cells in relation to osteoporosis. Our results highlight the significance of B cells in the etiology of osteoporosis and suggest a novel mechanism for postmenopausal osteoporosis (i.e., that downregulation of ESR1 and MAPK3 in B cells regulates secretion of factors, leading to increased osteoclastogenesis or decreased osteoblastogenesis). PMID:18433299

  10. Revealing cell cycle control by combining model-based detection of periodic expression with novel cis-regulatory descriptors

    PubMed Central

    Andersson, Claes R; Hvidsten, Torgeir R; Isaksson, Anders; Gustafsson, Mats G; Komorowski, Jan

    2007-01-01

    Background We address the issue of explaining the presence or absence of phase-specific transcription in budding yeast cultures under different conditions. To this end we use a model-based detector of gene expression periodicity to divide genes into classes depending on their behavior in experiments using different synchronization methods. While computational inference of gene regulatory circuits typically relies on expression similarity (clustering) in order to find classes of potentially co-regulated genes, this method instead takes advantage of known time profile signatures related to the studied process. Results We explain the regulatory mechanisms of the inferred periodic classes with cis-regulatory descriptors that combine upstream sequence motifs with experimentally determined binding of transcription factors. By systematic statistical analysis we show that periodic classes are best explained by combinations of descriptors rather than single descriptors, and that different combinations correspond to periodic expression in different classes. We also find evidence for additive regulation in that the combinations of cis-regulatory descriptors associated with genes periodically expressed in fewer conditions are frequently subsets of combinations associated with genes periodically expression in more conditions. Finally, we demonstrate that our approach retrieves combinations that are more specific towards known cell-cycle related regulators than the frequently used clustering approach. Conclusion The results illustrate how a model-based approach to expression analysis may be particularly well suited to detect biologically relevant mechanisms. Our new approach makes it possible to provide more refined hypotheses about regulatory mechanisms of the cell cycle and it can easily be adjusted to reveal regulation of other, non-periodic, cellular processes. PMID:17939860

  11. ALTERED GENE EXPRESSION PROFILES OF RAT LUNG IN RESPONSE TO AN EMISSION PARTICULATE AND ITS METAL CONSTITUENTS

    EPA Science Inventory

    Comprehensive and systematic approaches are needed to understand the molecular basis for the proposed adverse health effects of PM exposure reported in epidemiological studies. Due to the complex nature of the pollutant and the altered physiological conditions in the predisposed...

  12. A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research.

    PubMed

    Weidner, Christopher; Steinfath, Matthias; Wistorf, Elisa; Oelgeschläger, Michael; Schneider, Marlon R; Schönfelder, Gilbert

    2017-08-16

    Recent studies that compared transcriptomic datasets of human diseases with datasets from mouse models using traditional gene-to-gene comparison techniques resulted in contradictory conclusions regarding the relevance of animal models for translational research. A major reason for the discrepancies between different gene expression analyses is the arbitrary filtering of differentially expressed genes. Furthermore, the comparison of single genes between different species and platforms often is limited by technical variance, leading to misinterpretation of the con/discordance between data from human and animal models. Thus, standardized approaches for systematic data analysis are needed. To overcome subjective gene filtering and ineffective gene-to-gene comparisons, we recently demonstrated that gene set enrichment analysis (GSEA) has the potential to avoid these problems. Therefore, we developed a standardized protocol for the use of GSEA to distinguish between appropriate and inappropriate animal models for translational research. This protocol is not suitable to predict how to design new model systems a-priori, as it requires existing experimental omics data. However, the protocol describes how to interpret existing data in a standardized manner in order to select the most suitable animal model, thus avoiding unnecessary animal experiments and misleading translational studies.

  13. Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus.

    PubMed

    Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin

    2017-10-24

    The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed ( Brassica napus ). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B . napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B . napus and its parental lines and for molecular breeding studies of bZIP genes in B . napus .

  14. Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus

    PubMed Central

    Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Xu, Xinfu; Wang, Rui; Li, Jiana

    2017-01-01

    The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. PMID:29064393

  15. The Sucrose Synthase Gene Family in Chinese Pear (Pyrus bretschneideri Rehd.): Structure, Expression, and Evolution.

    PubMed

    Abdullah, Muhammad; Cao, Yungpeng; Cheng, Xi; Meng, Dandan; Chen, Yu; Shakoor, Awais; Gao, Junshan; Cai, Yongping

    2018-05-11

    Sucrose synthase (SS) is a key enzyme involved in sucrose metabolism that is critical in plant growth and development, and particularly quality of the fruit. Sucrose synthase gene families have been identified and characterized in plants various plants such as tobacco, grape, rice, and Arabidopsis . However, there is still lack of detailed information about sucrose synthase gene in pear. In the present study, we performed a systematic analysis of the pear ( Pyrus bretschneideri Rehd.) genome and reported 30 sucrose synthase genes. Subsequently, gene structure, phylogenetic relationship, chromosomal localization, gene duplications, promoter regions, collinearity, RNA-Seq data and qRT-PCR were conducted on these sucrose synthase genes. The transcript analysis revealed that 10 PbSSs genes (30%) were especially expressed in pear fruit development. Additionally, qRT-PCR analysis verified the RNA-seq data and shown that PbSS30 , PbSS24 , and PbSS15 have a potential role in the pear fruit development stages. This study provides important insights into the evolution of sucrose synthase gene family in pear and will provide assistance for further investigation of sucrose synthase genes functions in the process of fruit development, fruit quality and resistance to environmental stresses.

  16. HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease.

    PubMed

    Ward, Lucas D; Kellis, Manolis

    2016-01-04

    More than 90% of common variants associated with complex traits do not affect proteins directly, but instead the circuits that control gene expression. This has increased the urgency of understanding the regulatory genome as a key component for translating genetic results into mechanistic insights and ultimately therapeutics. To address this challenge, we developed HaploReg (http://compbio.mit.edu/HaploReg) to aid the functional dissection of genome-wide association study (GWAS) results, the prediction of putative causal variants in haplotype blocks, the prediction of likely cell types of action, and the prediction of candidate target genes by systematic mining of comparative, epigenomic and regulatory annotations. Since first launching the website in 2011, we have greatly expanded HaploReg, increasing the number of chromatin state maps to 127 reference epigenomes from ENCODE 2012 and Roadmap Epigenomics, incorporating regulator binding data, expanding regulatory motif disruption annotations, and integrating expression quantitative trait locus (eQTL) variants and their tissue-specific target genes from GTEx, Geuvadis, and other recent studies. We present these updates as HaploReg v4, and illustrate a use case of HaploReg for attention deficit hyperactivity disorder (ADHD)-associated SNPs with putative brain regulatory mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

    PubMed Central

    Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

    2016-01-01

    The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration. PMID:27241320

  18. Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

    NASA Astrophysics Data System (ADS)

    Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

    2016-05-01

    The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration.

  19. Landscape of X chromosome inactivation across human tissues.

    PubMed

    Tukiainen, Taru; Villani, Alexandra-Chloé; Yen, Angela; Rivas, Manuel A; Marshall, Jamie L; Satija, Rahul; Aguirre, Matt; Gauthier, Laura; Fleharty, Mark; Kirby, Andrew; Cummings, Beryl B; Castel, Stephane E; Karczewski, Konrad J; Aguet, François; Byrnes, Andrea; Lappalainen, Tuuli; Regev, Aviv; Ardlie, Kristin G; Hacohen, Nir; MacArthur, Daniel G

    2017-10-11

    X chromosome inactivation (XCI) silences transcription from one of the two X chromosomes in female mammalian cells to balance expression dosage between XX females and XY males. XCI is, however, incomplete in humans: up to one-third of X-chromosomal genes are expressed from both the active and inactive X chromosomes (Xa and Xi, respectively) in female cells, with the degree of 'escape' from inactivation varying between genes and individuals. The extent to which XCI is shared between cells and tissues remains poorly characterized, as does the degree to which incomplete XCI manifests as detectable sex differences in gene expression and phenotypic traits. Here we describe a systematic survey of XCI, integrating over 5,500 transcriptomes from 449 individuals spanning 29 tissues from GTEx (v6p release) and 940 single-cell transcriptomes, combined with genomic sequence data. We show that XCI at 683 X-chromosomal genes is generally uniform across human tissues, but identify examples of heterogeneity between tissues, individuals and cells. We show that incomplete XCI affects at least 23% of X-chromosomal genes, identify seven genes that escape XCI with support from multiple lines of evidence and demonstrate that escape from XCI results in sex biases in gene expression, establishing incomplete XCI as a mechanism that is likely to introduce phenotypic diversity. Overall, this updated catalogue of XCI across human tissues helps to increase our understanding of the extent and impact of the incompleteness in the maintenance of XCI.

  20. Modeling gene expression measurement error: a quasi-likelihood approach

    PubMed Central

    Strimmer, Korbinian

    2003-01-01

    Background Using suitable error models for gene expression measurements is essential in the statistical analysis of microarray data. However, the true probabilistic model underlying gene expression intensity readings is generally not known. Instead, in currently used approaches some simple parametric model is assumed (usually a transformed normal distribution) or the empirical distribution is estimated. However, both these strategies may not be optimal for gene expression data, as the non-parametric approach ignores known structural information whereas the fully parametric models run the risk of misspecification. A further related problem is the choice of a suitable scale for the model (e.g. observed vs. log-scale). Results Here a simple semi-parametric model for gene expression measurement error is presented. In this approach inference is based an approximate likelihood function (the extended quasi-likelihood). Only partial knowledge about the unknown true distribution is required to construct this function. In case of gene expression this information is available in the form of the postulated (e.g. quadratic) variance structure of the data. As the quasi-likelihood behaves (almost) like a proper likelihood, it allows for the estimation of calibration and variance parameters, and it is also straightforward to obtain corresponding approximate confidence intervals. Unlike most other frameworks, it also allows analysis on any preferred scale, i.e. both on the original linear scale as well as on a transformed scale. It can also be employed in regression approaches to model systematic (e.g. array or dye) effects. Conclusions The quasi-likelihood framework provides a simple and versatile approach to analyze gene expression data that does not make any strong distributional assumptions about the underlying error model. For several simulated as well as real data sets it provides a better fit to the data than competing models. In an example it also improved the power of tests to identify differential expression. PMID:12659637

  1. Prediction of Bacillus weihenstephanensis acid resistance: the use of gene expression patterns to select potential biomarkers.

    PubMed

    Desriac, N; Postollec, F; Coroller, L; Sohier, D; Abee, T; den Besten, H M W

    2013-10-01

    Exposure to mild stress conditions can activate stress adaptation mechanisms and provide cross-resistance towards otherwise lethal stresses. In this study, an approach was followed to select molecular biomarkers (quantitative gene expressions) to predict induced acid resistance after exposure to various mild stresses, i.e. exposure to sublethal concentrations of salt, acid and hydrogen peroxide during 5 min to 60 min. Gene expression patterns of unstressed and mildly stressed cells of Bacillus weihenstephanensis were correlated to their acid resistance (3D value) which was estimated after exposure to lethal acid conditions. Among the twenty-nine candidate biomarkers, 12 genes showed expression patterns that were correlated either linearly or non-linearly to acid resistance, while for the 17 other genes the correlation remains to be determined. The selected genes represented two types of biomarkers, (i) four direct biomarker genes (lexA, spxA, narL, bkdR) for which expression patterns upon mild stress treatment were linearly correlated to induced acid resistance; and (ii) nine long-acting biomarker genes (spxA, BcerKBAB4_0325, katA, trxB, codY, lacI, BcerKBAB4_1716, BcerKBAB4_2108, relA) which were transiently up-regulated during mild stress exposure and correlated to increased acid resistance over time. Our results highlight that mild stress induced transcripts can be linearly or non-linearly correlated to induced acid resistance and both approaches can be used to find relevant biomarkers. This quantitative and systematic approach opens avenues to select cellular biomarkers that could be incremented in mathematical models to predict microbial behaviour. Copyright © 2013 Elsevier B.V. All rights reserved.

  2. Construction of a β-galactosidase-gene-based fusion is convenient for screening candidate genes involved in regulation of pyrrolnitrin biosynthesis in Pseudomonas chlororaphis G05.

    PubMed

    Luo, Wangtai; Miao, Jing; Feng, Zhibin; Lu, Ruiyang; Sun, Xiaoqiang; Zhang, Baoshen; Ding, Weiqiu; Lu, Yang; Wang, Yanhua; Chi, Xiaoyan; Ge, Yihe

    2018-05-28

    In our recent work, we found that pyrrolnitrin, and not phenazines, pyrrolnitrin contributed to the suppression of the mycelia growth of Fusarium graminearum that causes heavy Fusarium head blight (FHB) disease in cereal crops. However, pyrrolnitrin production of Pseudomonas chlororaphis G05 in King's B medium was very low. Although a few regulatory genes mediating the prnABCD (the prn operon, pyrrolnitrin biosynthetic locus) expression have been identified, it is not enough for us to enhance pyrrolnitrin production by systematically constructing a genetically-engineered strain. To obtain new candidate genes involved in regulation of the prn operon expression, we successfully constructed a fusion mutant G05ΔphzΔprn::lacZ, in which most of the coding regions of the prn operon and the phzABCDEFG (the phz operon, phenazine biosynthetic locus) were deleted, and the promoter region plus the first thirty condons of the prnA was in-frame fused with the truncated lacZ gene on its chromosome. The expression of the fused lacZ reporter gene driven by the promoter of the prn operon made it easy for us to detect the level of the prn expression in terms of the color variation of colonies on LB agar plates supplemented with 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside (X-Gal). With this fusion mutant as a recipient strain, mini-Tn5-based random insertional mutagenesis was then conducted. By picking up colonies with color change, it is possible for us to screen and identify new candidate genes involved in regulation of the prn expression. Identification of additional regulatory genes in further work could reasonably be expected to increase pyrrolnitrin production in G05 and to improve its biological control function.

  3. Identification of an ICP27-responsive element in the coding region of a herpes simplex virus type 1 late gene.

    PubMed

    Sedlackova, Lenka; Perkins, Keith D; Meyer, Julia; Strain, Anna K; Goldman, Oksana; Rice, Stephen A

    2010-03-01

    During productive herpes simplex virus type 1 (HSV-1) infection, a subset of viral delayed-early (DE) and late (L) genes require the immediate-early (IE) protein ICP27 for their expression. However, the cis-acting regulatory sequences in DE and L genes that mediate their specific induction by ICP27 are unknown. One viral L gene that is highly dependent on ICP27 is that encoding glycoprotein C (gC). We previously demonstrated that this gene is posttranscriptionally transactivated by ICP27 in a plasmid cotransfection assay. Based on our past results, we hypothesized that the gC gene possesses a cis-acting inhibitory sequence and that ICP27 overcomes the effects of this sequence to enable efficient gC expression. To test this model, we systematically deleted sequences from the body of the gC gene and tested the resulting constructs for expression. In so doing, we identified a 258-bp "silencing element" (SE) in the 5' portion of the gC coding region. When present, the SE inhibits gC mRNA accumulation from a transiently transfected gC gene, unless ICP27 is present. Moreover, the SE can be transferred to another HSV-1 gene, where it inhibits mRNA accumulation in the absence of ICP27 and confers high-level expression in the presence of ICP27. Thus, for the first time, an ICP27-responsive sequence has been identified in a physiologically relevant ICP27 target gene. To see if the SE functions during viral infection, we engineered HSV-1 recombinants that lack the SE, either in a wild-type (WT) or ICP27-null genetic background. In an ICP27-null background, deletion of the SE led to ICP27-independent expression of the gC gene, demonstrating that the SE functions during viral infection. Surprisingly, the ICP27-independent gC expression seen with the mutant occurred even in the absence of viral DNA synthesis, indicating that the SE helps to regulate the tight DNA replication-dependent expression of gC.

  4. Selection and Evaluation of Potential Reference Genes for Gene Expression Analysis in the Brown Planthopper, Nilaparvata lugens (Hemiptera: Delphacidae) Using Reverse-Transcription Quantitative PCR

    PubMed Central

    Zhu, Xun; Wan, Hu; Shakeel, Muhammad; Zhan, Sha; Jin, Byung-Rae; Li, Jianhong

    2014-01-01

    The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is one of the most important rice pests. Abundant genetic studies on BPH have been conducted using reverse-transcription quantitative real-time PCR (qRT-PCR). Using qRT-PCR, the expression levels of target genes are calculated on the basis of endogenous controls. These genes need to be appropriately selected by experimentally assessing whether they are stably expressed under different conditions. However, such studies on potential reference genes in N. lugens are lacking. In this paper, we presented a systematic exploration of eight candidate reference genes in N. lugens, namely, actin 1 (ACT), muscle actin (MACT), ribosomal protein S11 (RPS11), ribosomal protein S15e (RPS15), alpha 2-tubulin (TUB), elongation factor 1 delta (EF), 18S ribosomal RNA (18S), and arginine kinase (AK) and used four alternative methods (BestKeeper, geNorm, NormFinder, and the delta Ct method) to evaluate the suitability of these genes as endogenous controls. We examined their expression levels among different experimental factors (developmental stage, body part, geographic population, temperature variation, pesticide exposure, diet change, and starvation) following the MIQE (Minimum Information for publication of Quantitative real time PCR Experiments) guidelines. Based on the results of RefFinder, which integrates four currently available major software programs to compare and rank the tested candidate reference genes, RPS15, RPS11, and TUB were found to be the most suitable reference genes in different developmental stages, body parts, and geographic populations, respectively. RPS15 was the most suitable gene under different temperature and diet conditions, while RPS11 was the most suitable gene under different pesticide exposure and starvation conditions. This work sheds light on establishing a standardized qRT-PCR procedure in N. lugens, and serves as a starting point for screening for reference genes for expression studies of related insects. PMID:24466124

  5. Genome-Wide Transcriptional Reorganization Associated with Senescence-to-Immortality Switch during Human Hepatocellular Carcinogenesis

    PubMed Central

    Konu, Ozlen; Yuzugullu, Haluk; Gursoy-Yuzugullu, Ozge; Ozturk, Nuri; Ozen, Cigdem; Ozdag, Hilal; Erdal, Esra; Karademir, Sedat; Sagol, Ozgul; Mizrak, Dilsa; Bozkaya, Hakan; Ilk, Hakki Gokhan; Ilk, Ozlem; Bilen, Biter; Cetin-Atalay, Rengul; Akar, Nejat; Ozturk, Mehmet

    2013-01-01

    Senescence is a permanent proliferation arrest in response to cell stress such as DNA damage. It contributes strongly to tissue aging and serves as a major barrier against tumor development. Most tumor cells are believed to bypass the senescence barrier (become “immortal”) by inactivating growth control genes such as TP53 and CDKN2A. They also reactivate telomerase reverse transcriptase. Senescence-to-immortality transition is accompanied by major phenotypic and biochemical changes mediated by genome-wide transcriptional modifications. This appears to happen during hepatocellular carcinoma (HCC) development in patients with liver cirrhosis, however, the accompanying transcriptional changes are virtually unknown. We investigated genome-wide transcriptional changes related to the senescence-to-immortality switch during hepatocellular carcinogenesis. Initially, we performed transcriptome analysis of senescent and immortal clones of Huh7 HCC cell line, and identified genes with significant differential expression to establish a senescence-related gene list. Through the analysis of senescence-related gene expression in different liver tissues we showed that cirrhosis and HCC display expression patterns compatible with senescent and immortal phenotypes, respectively; dysplasia being a transitional state. Gene set enrichment analysis revealed that cirrhosis/senescence-associated genes were preferentially expressed in non-tumor tissues, less malignant tumors, and differentiated or senescent cells. In contrast, HCC/immortality genes were up-regulated in tumor tissues, or more malignant tumors and progenitor cells. In HCC tumors and immortal cells genes involved in DNA repair, cell cycle, telomere extension and branched chain amino acid metabolism were up-regulated, whereas genes involved in cell signaling, as well as in drug, lipid, retinoid and glycolytic metabolism were down-regulated. Based on these distinctive gene expression features we developed a 15-gene hepatocellular immortality signature test that discriminated HCC from cirrhosis with high accuracy. Our findings demonstrate that senescence bypass plays a central role in hepatocellular carcinogenesis engendering systematic changes in the transcription of genes regulating DNA repair, proliferation, differentiation and metabolism. PMID:23691139

  6. Selection and evaluation of potential reference genes for gene expression analysis in the brown planthopper, Nilaparvata lugens (Hemiptera: Delphacidae) using reverse-transcription quantitative PCR.

    PubMed

    Yuan, Miao; Lu, Yanhui; Zhu, Xun; Wan, Hu; Shakeel, Muhammad; Zhan, Sha; Jin, Byung-Rae; Li, Jianhong

    2014-01-01

    The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is one of the most important rice pests. Abundant genetic studies on BPH have been conducted using reverse-transcription quantitative real-time PCR (qRT-PCR). Using qRT-PCR, the expression levels of target genes are calculated on the basis of endogenous controls. These genes need to be appropriately selected by experimentally assessing whether they are stably expressed under different conditions. However, such studies on potential reference genes in N. lugens are lacking. In this paper, we presented a systematic exploration of eight candidate reference genes in N. lugens, namely, actin 1 (ACT), muscle actin (MACT), ribosomal protein S11 (RPS11), ribosomal protein S15e (RPS15), alpha 2-tubulin (TUB), elongation factor 1 delta (EF), 18S ribosomal RNA (18S), and arginine kinase (AK) and used four alternative methods (BestKeeper, geNorm, NormFinder, and the delta Ct method) to evaluate the suitability of these genes as endogenous controls. We examined their expression levels among different experimental factors (developmental stage, body part, geographic population, temperature variation, pesticide exposure, diet change, and starvation) following the MIQE (Minimum Information for publication of Quantitative real time PCR Experiments) guidelines. Based on the results of RefFinder, which integrates four currently available major software programs to compare and rank the tested candidate reference genes, RPS15, RPS11, and TUB were found to be the most suitable reference genes in different developmental stages, body parts, and geographic populations, respectively. RPS15 was the most suitable gene under different temperature and diet conditions, while RPS11 was the most suitable gene under different pesticide exposure and starvation conditions. This work sheds light on establishing a standardized qRT-PCR procedure in N. lugens, and serves as a starting point for screening for reference genes for expression studies of related insects.

  7. Selection and assessment of reference genes for quantitative PCR normalization in migratory locust Locusta migratoria (Orthoptera: Acrididae).

    PubMed

    Yang, Qingpo; Li, Zhen; Cao, Jinjun; Zhang, Songdou; Zhang, Huaijiang; Wu, Xiaoyun; Zhang, Qingwen; Liu, Xiaoxia

    2014-01-01

    Locusta migratoria is a classic hemimetamorphosis insect and has caused widespread economic damage to crops as a migratory pest. Researches on the expression pattern of functional genes in L. migratoria have drawn focus in recent years, especially with the release of genome information. Real-time quantitative PCR is the most reproducible and sensitive approach for detecting transcript expression levels of target genes, but optimal internal standards are key factors for its accuracy and reliability. Therefore, it's necessary to provide a systematic stability assessment of internal control for well-performed tests of target gene expression profile. In this study, twelve candidate genes (Ach, Act, Cht2, EF1α, RPL32, Hsp70, Tub, RP49, SDH, GAPDH, 18S, and His) were analyzed with four statistical methods: the delta Ct approach, geNorm, Bestkeeper and NormFinder. The results from these analyses aimed to choose the best suitable reference gene across different experimental situations for gene profile study in L. migratoria. The result demonstrated that for different developmental stages, EF1α, Hsp70 and RPL32 exhibited the most stable expression status for all samples; EF1α and RPL32 were selected as the best reference genes for studies involving embryo and larvae stages, while SDH and RP49 were identified for adult stage. The best-ranked reference genes across different tissues are RPL32, Hsp70 and RP49. For abiotic treatments, the most appropriate genes we identified were as follows: Act and SDH for larvae subjected to different insecticides; RPL32 and Ach for larvae exposed to different temperature treatments; and Act and Ach for larvae suffering from starvation. The present report should facilitate future researches on gene expression in L. migratoria with accessibly optimal reference genes under different experimental contexts.

  8. Genes Whose Gain or Loss-Of-Function Increases Skeletal Muscle Mass in Mice: A Systematic Literature Review.

    PubMed

    Verbrugge, Sander A J; Schönfelder, Martin; Becker, Lore; Yaghoob Nezhad, Fakhreddin; Hrabě de Angelis, Martin; Wackerhage, Henning

    2018-01-01

    Skeletal muscle mass differs greatly in mice and humans and this is partially inherited. To identify muscle hypertrophy candidate genes we conducted a systematic review to identify genes whose experimental loss or gain-of-function results in significant skeletal muscle hypertrophy in mice. We found 47 genes that meet our search criteria and cause muscle hypertrophy after gene manipulation. They are from high to small effect size: Ski, Fst, Acvr2b, Akt1, Mstn, Klf10, Rheb, Igf1, Pappa, Ppard, Ikbkb, Fstl3, Atgr1a, Ucn3, Mcu, Junb, Ncor1, Gprasp1, Grb10, Mmp9, Dgkz, Ppargc1a (specifically the Ppargc1a4 isoform), Smad4, Ltbp4, Bmpr1a, Crtc2, Xiap, Dgat1, Thra, Adrb2, Asb15, Cast, Eif2b5, Bdkrb2, Tpt1, Nr3c1, Nr4a1, Gnas, Pld1, Crym, Camkk1, Yap1, Inhba, Tp53inp2, Inhbb, Nol3, Esr1 . Knock out, knock down, overexpression or a higher activity of these genes causes overall muscle hypertrophy as measured by an increased muscle weight or cross sectional area. The mean effect sizes range from 5 to 345% depending on the manipulated gene as well as the muscle size variable and muscle investigated. Bioinformatical analyses reveal that Asb15, Klf10, Tpt1 are most highly expressed hypertrophy genes in human skeletal muscle when compared to other tissues. Many of the muscle hypertrophy-regulating genes are involved in transcription and ubiquitination. Especially genes belonging to three signaling pathways are able to induce hypertrophy: (a) Igf1-Akt-mTOR pathway, (b) myostatin-Smad signaling, and (c) the angiotensin-bradykinin signaling pathway. The expression of several muscle hypertrophy-inducing genes and the phosphorylation of their protein products changes after human resistance and high intensity exercise, in maximally stimulated mouse muscle or in overloaded mouse plantaris.

  9. Interleukin gene polymorphisms and breast cancer: a case control study and systematic literature review

    PubMed Central

    Balasubramanian, SP; Azmy, IAF; Higham, SE; Wilson, AG; Cross, SS; Cox, A; Brown, NJ; Reed, MW

    2006-01-01

    Background Interleukins and cytokines play an important role in the pathogenesis of many solid cancers. Several single nucleotide polymorphisms (SNPs) identified in cytokine genes are thought to influence the expression or function of these proteins and many have been evaluated for their role in inflammatory disease and cancer predisposition. The aim of this study was to evaluate any role of specific SNPs in the interleukin genes IL1A, IL1B, IL1RN, IL4R, IL6 and IL10 in predisposition to breast cancer susceptibility and severity. Methods Candidate single nucleotide polymorphisms (SNPs) in key cytokine genes were genotyped in breast cancer patients and in appropriate healthy volunteers who were similar in age, race and sex. Genotyping was performed using a high throughput allelic discrimination method. Data on clinico-pathological details and survival were collected. A systematic review of Medline English literature was done to retrieve previous studies of these polymorphisms in breast cancer. Results None of the polymorphisms studied showed any overall predisposition to breast cancer susceptibility, severity or to time to death or occurrence of distant metastases. The results of the systematic review are summarised. Conclusion Polymorphisms within key interleukin genes (IL1A, IL1B, IL1RN, IL4R, IL6 and IL10 do not appear to play a significant overall role in breast cancer susceptibility or severity. PMID:16842617

  10. Engineering Extracellular Expression Systems in Escherichia coli Based on Transcriptome Analysis and Cell Growth State.

    PubMed

    Gao, Wen; Yin, Jun; Bao, Lichen; Wang, Qun; Hou, Shan; Yue, Yali; Yao, Wenbing; Gao, Xiangdong

    2018-05-18

    Escherichia coli extracellular expression systems have a number of advantages over other systems, such as lower pyrogen levels and a simple purification process. Various approaches, such as the generation of leaky mutants via chromosomal engineering, have been explored for this expression system. However, extracellular protein yields in leaky mutants are relatively low compared to that in intracellular expression systems and therefore need to be improved. In this work, we describe the construction, characterization, and mechanism of enhanced extracellular expression in Escherichia coli. On the basis of the localizations, functions, and transcription levels of cell envelope proteins, we systematically elucidated the effects of multiple gene deletions on cell growth and extracellular expression using modified CRISPR/Cas9-based genome editing and a FlAsH labeling assay. High extracellular yields of heterologous proteins of different sizes were obtained by screening multiple gene mutations. The enhancement of extracellular secretion was associated with the derepression of translation and translocation. This work utilized universal methods in the design of extracellular expression systems for genes not directly associated with protein synthesis that were used to generate strains with higher protein expression capability. We anticipate that extracellular expression systems may help to shed light on the poorly understood aspects of these secretion processes as well as to further assist in the construction of engineered prokaryotic cells for efficient extracellular production of heterologous proteins.

  11. Murine Hyperglycemic Vasculopathy and Cardiomyopathy: Whole-Genome Gene Expression Analysis Predicts Cellular Targets and Regulatory Networks Influenced by Mannose Binding Lectin

    PubMed Central

    Zou, Chenhui; La Bonte, Laura R.; Pavlov, Vasile I.; Stahl, Gregory L.

    2012-01-01

    Hyperglycemia, in the absence of type 1 or 2 diabetes, is an independent risk factor for cardiovascular disease. We have previously demonstrated a central role for mannose binding lectin (MBL)-mediated cardiac dysfunction in acute hyperglycemic mice. In this study, we applied whole-genome microarray data analysis to investigate MBL’s role in systematic gene expression changes. The data predict possible intracellular events taking place in multiple cellular compartments such as enhanced insulin signaling pathway sensitivity, promoted mitochondrial respiratory function, improved cellular energy expenditure and protein quality control, improved cytoskeleton structure, and facilitated intracellular trafficking, all of which may contribute to the organismal health of MBL null mice against acute hyperglycemia. Our data show a tight association between gene expression profile and tissue function which might be a very useful tool in predicting cellular targets and regulatory networks connected with in vivo observations, providing clues for further mechanistic studies. PMID:22375142

  12. Early signatures of regime shifts in gene expression dynamics

    NASA Astrophysics Data System (ADS)

    Pal, Mainak; Pal, Amit Kumar; Ghosh, Sayantari; Bose, Indrani

    2013-06-01

    Recently, a large number of studies have been carried out on the early signatures of sudden regime shifts in systems as diverse as ecosystems, financial markets, population biology and complex diseases. The signatures of regime shifts in gene expression dynamics are less systematically investigated. In this paper, we consider sudden regime shifts in the gene expression dynamics described by a fold-bifurcation model involving bistability and hysteresis. We consider two alternative models, models 1 and 2, of competence development in the bacterial population B. subtilis and determine some early signatures of the regime shifts between competence and noncompetence. We use both deterministic and stochastic formalisms for the purpose of our study. The early signatures studied include the critical slowing down as a transition point is approached, rising variance and the lag-1 autocorrelation function, skewness and a ratio of two mean first passage times. Some of the signatures could provide the experimental basis for distinguishing between bistability and excitability as the correct mechanism for the development of competence.

  13. Systematic Analysis of mRNA and miRNA Expression of 3D-Cultured Neural Stem Cells (NSCs) in Spaceflight.

    PubMed

    Cui, Yi; Han, Jin; Xiao, Zhifeng; Qi, Yiduo; Zhao, Yannan; Chen, Bing; Fang, Yongxiang; Liu, Sumei; Wu, Xianming; Dai, Jianwu

    2017-01-01

    Recently, with the development of the space program there are growing concerns about the influence of spaceflight on tissue engineering. The purpose of this study was thus to determine the variations of neural stem cells (NSCs) during spaceflight. RNA-Sequencing (RNA-Seq) based transcriptomic profiling of NSCs identified many differentially expressed mRNAs and miRNAs between space and earth groups. Subsequently, those genes with differential expression were subjected to bioinformatic evaluation using gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) and miRNA-mRNA network analyses. The results showed that NSCs maintain greater stemness ability during spaceflight although the growth rate of NSCs was slowed down. Furthermore, the results indicated that NSCs tended to differentiate into neuron in outer space conditions. Detailed genomic analyses of NSCs during spaceflight will help us to elucidate the molecular mechanisms behind their differentiation and proliferation when they are in outer space.

  14. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset ofmore » genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.« less

  15. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation.

    PubMed

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe

    2018-05-01

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.

  16. Systematic screening of isogenic cancer cells identifies DUSP6 as context-specific synthetic lethal target in melanoma

    PubMed Central

    Wittig-Blaich, Stephanie; Wittig, Rainer; Schmidt, Steffen; Lyer, Stefan; Bewerunge-Hudler, Melanie; Gronert-Sum, Sabine; Strobel-Freidekind, Olga; Müller, Carolin; List, Markus; Jaskot, Aleksandra; Christiansen, Helle; Hafner, Mathias; Schadendorf, Dirk; Block, Ines; Mollenhauer, Jan

    2017-01-01

    Next-generation sequencing has dramatically increased genome-wide profiling options and conceptually initiates the possibility for personalized cancer therapy. State-of-the-art sequencing studies yield large candidate gene sets comprising dozens or hundreds of mutated genes. However, few technologies are available for the systematic downstream evaluation of these results to identify novel starting points of future cancer therapies. We improved and extended a site-specific recombination-based system for systematic analysis of the individual functions of a large number of candidate genes. This was facilitated by a novel system for the construction of isogenic constitutive and inducible gain- and loss-of-function cell lines. Additionally, we demonstrate the construction of isogenic cell lines with combinations of the traits for advanced functional in vitro analyses. In a proof-of-concept experiment, a library of 108 isogenic melanoma cell lines was constructed and 8 genes were identified that significantly reduced viability in a discovery screen and in an independent validation screen. Here, we demonstrate the broad applicability of this recombination-based method and we proved its potential to identify new drug targets via the identification of the tumor suppressor DUSP6 as potential synthetic lethal target in melanoma cell lines with BRAF V600E mutations and high DUSP6 expression. PMID:28423600

  17. RNA-sequence data normalization through in silico prediction of reference genes: the bacterial response to DNA damage as case study.

    PubMed

    Berghoff, Bork A; Karlsson, Torgny; Källman, Thomas; Wagner, E Gerhart H; Grabherr, Manfred G

    2017-01-01

    Measuring how gene expression changes in the course of an experiment assesses how an organism responds on a molecular level. Sequencing of RNA molecules, and their subsequent quantification, aims to assess global gene expression changes on the RNA level (transcriptome). While advances in high-throughput RNA-sequencing (RNA-seq) technologies allow for inexpensive data generation, accurate post-processing and normalization across samples is required to eliminate any systematic noise introduced by the biochemical and/or technical processes. Existing methods thus either normalize on selected known reference genes that are invariant in expression across the experiment, assume that the majority of genes are invariant, or that the effects of up- and down-regulated genes cancel each other out during the normalization. Here, we present a novel method, moose 2 , which predicts invariant genes in silico through a dynamic programming (DP) scheme and applies a quadratic normalization based on this subset. The method allows for specifying a set of known or experimentally validated invariant genes, which guides the DP. We experimentally verified the predictions of this method in the bacterium Escherichia coli , and show how moose 2 is able to (i) estimate the expression value distances between RNA-seq samples, (ii) reduce the variation of expression values across all samples, and (iii) to subsequently reveal new functional groups of genes during the late stages of DNA damage. We further applied the method to three eukaryotic data sets, on which its performance compares favourably to other methods. The software is implemented in C++ and is publicly available from http://grabherr.github.io/moose2/. The proposed RNA-seq normalization method, moose 2 , is a valuable alternative to existing methods, with two major advantages: (i) in silico prediction of invariant genes provides a list of potential reference genes for downstream analyses, and (ii) non-linear artefacts in RNA-seq data are handled adequately to minimize variations between replicates.

  18. Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex.

    PubMed

    Konermann, Silvana; Brigham, Mark D; Trevino, Alexandro E; Joung, Julia; Abudayyeh, Omar O; Barcena, Clea; Hsu, Patrick D; Habib, Naomi; Gootenberg, Jonathan S; Nishimasu, Hiroshi; Nureki, Osamu; Zhang, Feng

    2015-01-29

    Systematic interrogation of gene function requires the ability to perturb gene expression in a robust and generalizable manner. Here we describe structure-guided engineering of a CRISPR-Cas9 complex to mediate efficient transcriptional activation at endogenous genomic loci. We used these engineered Cas9 activation complexes to investigate single-guide RNA (sgRNA) targeting rules for effective transcriptional activation, to demonstrate multiplexed activation of ten genes simultaneously, and to upregulate long intergenic non-coding RNA (lincRNA) transcripts. We also synthesized a library consisting of 70,290 guides targeting all human RefSeq coding isoforms to screen for genes that, upon activation, confer resistance to a BRAF inhibitor. The top hits included genes previously shown to be able to confer resistance, and novel candidates were validated using individual sgRNA and complementary DNA overexpression. A gene expression signature based on the top screening hits correlated with markers of BRAF inhibitor resistance in cell lines and patient-derived samples. These results collectively demonstrate the potential of Cas9-based activators as a powerful genetic perturbation technology.

  19. Quantitative system drift compensates for altered maternal inputs to the gap gene network of the scuttle fly Megaselia abdita

    PubMed Central

    Wotton, Karl R; Jiménez-Guri, Eva; Crombach, Anton; Janssens, Hilde; Alcaine-Colet, Anna; Lemke, Steffen; Schmidt-Ott, Urs; Jaeger, Johannes

    2015-01-01

    The segmentation gene network in insects can produce equivalent phenotypic outputs despite differences in upstream regulatory inputs between species. We investigate the mechanistic basis of this phenomenon through a systems-level analysis of the gap gene network in the scuttle fly Megaselia abdita (Phoridae). It combines quantification of gene expression at high spatio-temporal resolution with systematic knock-downs by RNA interference (RNAi). Initiation and dynamics of gap gene expression differ markedly between M. abdita and Drosophila melanogaster, while the output of the system converges to equivalent patterns at the end of the blastoderm stage. Although the qualitative structure of the gap gene network is conserved, there are differences in the strength of regulatory interactions between species. We term such network rewiring ‘quantitative system drift’. It provides a mechanistic explanation for the developmental hourglass model in the dipteran lineage. Quantitative system drift is likely to be a widespread mechanism for developmental evolution. DOI: http://dx.doi.org/10.7554/eLife.04785.001 PMID:25560971

  20. Systematically labeling developmental stage-specific genes for the study of pancreatic β-cell differentiation from human embryonic stem cells.

    PubMed

    Liu, Haisong; Yang, Huan; Zhu, Dicong; Sui, Xin; Li, Juan; Liang, Zhen; Xu, Lei; Chen, Zeyu; Yao, Anzhi; Zhang, Long; Zhang, Xi; Yi, Xing; Liu, Meng; Xu, Shiqing; Zhang, Wenjian; Lin, Hua; Xie, Lan; Lou, Jinning; Zhang, Yong; Xi, Jianzhong; Deng, Hongkui

    2014-10-01

    The applications of human pluripotent stem cell (hPSC)-derived cells in regenerative medicine has encountered a long-standing challenge: how can we efficiently obtain mature cell types from hPSCs? Attempts to address this problem are hindered by the complexity of controlling cell fate commitment and the lack of sufficient developmental knowledge for guiding hPSC differentiation. Here, we developed a systematic strategy to study hPSC differentiation by labeling sequential developmental genes to encompass the major developmental stages, using the directed differentiation of pancreatic β cells from hPSCs as a model. We therefore generated a large panel of pancreas-specific mono- and dual-reporter cell lines. With this unique platform, we visualized the kinetics of the entire differentiation process in real time for the first time by monitoring the expression dynamics of the reporter genes, identified desired cell populations at each differentiation stage and demonstrated the ability to isolate these cell populations for further characterization. We further revealed the expression profiles of isolated NGN3-eGFP(+) cells by RNA sequencing and identified sushi domain-containing 2 (SUSD2) as a novel surface protein that enriches for pancreatic endocrine progenitors and early endocrine cells both in human embryonic stem cells (hESC)-derived pancreatic cells and in the developing human pancreas. Moreover, we captured a series of cell fate transition events in real time, identified multiple cell subpopulations and unveiled their distinct gene expression profiles, among heterogeneous progenitors for the first time using our dual reporter hESC lines. The exploration of this platform and our new findings will pave the way to obtain mature β cells in vitro.

  1. A CRISPR-Based Toolbox for Studying T Cell Signal Transduction

    PubMed Central

    Chi, Shen; Weiss, Arthur; Wang, Haopeng

    2016-01-01

    CRISPR/Cas9 system is a powerful technology to perform genome editing in a variety of cell types. To facilitate the application of Cas9 in mapping T cell signaling pathways, we generated a toolbox for large-scale genetic screens in human Jurkat T cells. The toolbox has three different Jurkat cell lines expressing distinct Cas9 variants, including wild-type Cas9, dCas9-KRAB, and sunCas9. We demonstrated that the toolbox allows us to rapidly disrupt endogenous gene expression at the DNA level and to efficiently repress or activate gene expression at the transcriptional level. The toolbox, in combination with multiple currently existing genome-wide sgRNA libraries, will be useful to systematically investigate T cell signal transduction using both loss-of-function and gain-of-function genetic screens. PMID:27057542

  2. Synergistic and Dose-Controlled Regulation of Cellulase Gene Expression in Penicillium oxalicum.

    PubMed

    Li, Zhonghai; Yao, Guangshan; Wu, Ruimei; Gao, Liwei; Kan, Qinbiao; Liu, Meng; Yang, Piao; Liu, Guodong; Qin, Yuqi; Song, Xin; Zhong, Yaohua; Fang, Xu; Qu, Yinbo

    2015-09-01

    Filamentous fungus Penicillium oxalicum produces diverse lignocellulolytic enzymes, which are regulated by the combinations of many transcription factors. Here, a single-gene disruptant library for 470 transcription factors was constructed and systematically screened for cellulase production. Twenty transcription factors (including ClrB, CreA, XlnR, Ace1, AmyR, and 15 unknown proteins) were identified to play putative roles in the activation or repression of cellulase synthesis. Most of these regulators have not been characterized in any fungi before. We identified the ClrB, CreA, XlnR, and AmyR transcription factors as critical dose-dependent regulators of cellulase expression, the core regulons of which were identified by analyzing several transcriptomes and/or secretomes. Synergistic and additive modes of combinatorial control of each cellulase gene by these regulatory factors were achieved, and cellulase expression was fine-tuned in a proper and controlled manner. With one of these targets, the expression of the major intracellular β-glucosidase Bgl2 was found to be dependent on ClrB. The Bgl2-deficient background resulted in a substantial gene activation by ClrB and proved to be closely correlated with the relief of repression mediated by CreA and AmyR during cellulase induction. Our results also signify that probing the synergistic and dose-controlled regulation mechanisms of cellulolytic regulators and using it for reconstruction of expression regulation network (RERN) may be a promising strategy for cellulolytic fungi to develop enzyme hyper-producers. Based on our data, ClrB was identified as focal point for the synergistic activation regulation of cellulase expression by integrating cellulolytic regulators and their target genes, which refined our understanding of transcriptional-regulatory network as a "seesaw model" in which the coordinated regulation of cellulolytic genes is established by counteracting activators and repressors.

  3. Synergistic and Dose-Controlled Regulation of Cellulase Gene Expression in Penicillium oxalicum

    PubMed Central

    Li, Zhonghai; Yao, Guangshan; Wu, Ruimei; Gao, Liwei; Kan, Qinbiao; Liu, Meng; Yang, Piao; Liu, Guodong; Qin, Yuqi; Song, Xin; Zhong, Yaohua; Fang, Xu; Qu, Yinbo

    2015-01-01

    Filamentous fungus Penicillium oxalicum produces diverse lignocellulolytic enzymes, which are regulated by the combinations of many transcription factors. Here, a single-gene disruptant library for 470 transcription factors was constructed and systematically screened for cellulase production. Twenty transcription factors (including ClrB, CreA, XlnR, Ace1, AmyR, and 15 unknown proteins) were identified to play putative roles in the activation or repression of cellulase synthesis. Most of these regulators have not been characterized in any fungi before. We identified the ClrB, CreA, XlnR, and AmyR transcription factors as critical dose-dependent regulators of cellulase expression, the core regulons of which were identified by analyzing several transcriptomes and/or secretomes. Synergistic and additive modes of combinatorial control of each cellulase gene by these regulatory factors were achieved, and cellulase expression was fine-tuned in a proper and controlled manner. With one of these targets, the expression of the major intracellular β-glucosidase Bgl2 was found to be dependent on ClrB. The Bgl2-deficient background resulted in a substantial gene activation by ClrB and proved to be closely correlated with the relief of repression mediated by CreA and AmyR during cellulase induction. Our results also signify that probing the synergistic and dose-controlled regulation mechanisms of cellulolytic regulators and using it for reconstruction of expression regulation network (RERN) may be a promising strategy for cellulolytic fungi to develop enzyme hyper-producers. Based on our data, ClrB was identified as focal point for the synergistic activation regulation of cellulase expression by integrating cellulolytic regulators and their target genes, which refined our understanding of transcriptional-regulatory network as a “seesaw model” in which the coordinated regulation of cellulolytic genes is established by counteracting activators and repressors. PMID:26360497

  4. Characterization of the definitive classical calpain family of vertebrates using phylogenetic, evolutionary and expression analyses.

    PubMed

    Macqueen, Daniel J; Wilcox, Alexander H

    2014-04-09

    The calpains are a superfamily of proteases with extensive relevance to human health and welfare. Vast research attention is given to the vertebrate 'classical' subfamily, making it surprising that the evolutionary origins, distribution and relationships of these genes is poorly characterized. Consequently, there exists uncertainty about the conservation of gene family structure, function and expression that has been principally defined from work with mammals. Here, more than 200 vertebrate classical calpains were incorporated in phylogenetic analyses spanning an unprecedented range of taxa, including jawless and cartilaginous fish. We demonstrate that the common vertebrate ancestor had at least six classical calpains, including a single gene that gave rise to CAPN11, 1, 2 and 8 in the early jawed fish lineage, plus CAPN3, 9, 12, 13 and a novel calpain gene, hereafter named CAPN17. We reveal that while all vertebrate classical calpains have been subject to persistent purifying selection during evolution, the degree and nature of selective pressure has often been lineage-dependent. The tissue expression of the complete classic calpain family was assessed in representative teleost fish, amphibians, reptiles and mammals. This highlighted systematic divergence in expression across vertebrate taxa, with most classic calpain genes from fish and amphibians having more extensive tissue distribution than in amniotes. Our data suggest that classical calpain functions have frequently diverged during vertebrate evolution and challenge the ongoing value of the established system of classifying calpains by expression.

  5. Characterization of the definitive classical calpain family of vertebrates using phylogenetic, evolutionary and expression analyses

    PubMed Central

    Macqueen, Daniel J.; Wilcox, Alexander H.

    2014-01-01

    The calpains are a superfamily of proteases with extensive relevance to human health and welfare. Vast research attention is given to the vertebrate ‘classical’ subfamily, making it surprising that the evolutionary origins, distribution and relationships of these genes is poorly characterized. Consequently, there exists uncertainty about the conservation of gene family structure, function and expression that has been principally defined from work with mammals. Here, more than 200 vertebrate classical calpains were incorporated in phylogenetic analyses spanning an unprecedented range of taxa, including jawless and cartilaginous fish. We demonstrate that the common vertebrate ancestor had at least six classical calpains, including a single gene that gave rise to CAPN11, 1, 2 and 8 in the early jawed fish lineage, plus CAPN3, 9, 12, 13 and a novel calpain gene, hereafter named CAPN17. We reveal that while all vertebrate classical calpains have been subject to persistent purifying selection during evolution, the degree and nature of selective pressure has often been lineage-dependent. The tissue expression of the complete classic calpain family was assessed in representative teleost fish, amphibians, reptiles and mammals. This highlighted systematic divergence in expression across vertebrate taxa, with most classic calpain genes from fish and amphibians having more extensive tissue distribution than in amniotes. Our data suggest that classical calpain functions have frequently diverged during vertebrate evolution and challenge the ongoing value of the established system of classifying calpains by expression. PMID:24718597

  6. Convergent evidence from systematic analysis of GWAS revealed genetic basis of esophageal cancer.

    PubMed

    Gao, Xue-Xin; Gao, Lei; Wang, Jiu-Qiang; Qu, Su-Su; Qu, Yue; Sun, Hong-Lei; Liu, Si-Dang; Shang, Ying-Li

    2016-07-12

    Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with risk of esophageal cancer (EC). However, investigation of genetic basis from the perspective of systematic biology and integrative genomics remains scarce.In this study, we explored genetic basis of EC based on GWAS data and implemented a series of bioinformatics methods including functional annotation, expression quantitative trait loci (eQTL) analysis, pathway enrichment analysis and pathway grouped network analysis.Two hundred and thirteen risk SNPs were identified, in which 44 SNPs were found to have significantly differential gene expression in esophageal tissues by eQTL analysis. By pathway enrichment analysis, 170 risk genes mapped by risk SNPs were enriched into 38 significant GO terms and 17 significant KEGG pathways, which were significantly grouped into 9 sub-networks by pathway grouped network analysis. The 9 groups of interconnected pathways were mainly involved with muscle cell proliferation, cellular response to interleukin-6, cell adhesion molecules, and ethanol oxidation, which might participate in the development of EC.Our findings provide genetic evidence and new insight for exploring the molecular mechanisms of EC.

  7. Comprehensive single cell-resolution analysis of the role of chromatin regulators in early C. elegans embryogenesis.

    PubMed

    Krüger, Angela V; Jelier, Rob; Dzyubachyk, Oleh; Zimmerman, Timo; Meijering, Erik; Lehner, Ben

    2015-02-15

    Chromatin regulators are widely expressed proteins with diverse roles in gene expression, nuclear organization, cell cycle regulation, pluripotency, physiology and development, and are frequently mutated in human diseases such as cancer. Their inhibition often results in pleiotropic effects that are difficult to study using conventional approaches. We have developed a semi-automated nuclear tracking algorithm to quantify the divisions, movements and positions of all nuclei during the early development of Caenorhabditis elegans and have used it to systematically study the effects of inhibiting chromatin regulators. The resulting high dimensional datasets revealed that inhibition of multiple regulators, including F55A3.3 (encoding FACT subunit SUPT16H), lin-53 (RBBP4/7), rba-1 (RBBP4/7), set-16 (MLL2/3), hda-1 (HDAC1/2), swsn-7 (ARID2), and let-526 (ARID1A/1B) affected cell cycle progression and caused chromosome segregation defects. In contrast, inhibition of cir-1 (CIR1) accelerated cell division timing in specific cells of the AB lineage. The inhibition of RNA polymerase II also accelerated these division timings, suggesting that normal gene expression is required to delay cell cycle progression in multiple lineages in the early embryo. Quantitative analyses of the dataset suggested the existence of at least two functionally distinct SWI/SNF chromatin remodeling complex activities in the early embryo, and identified a redundant requirement for the egl-27 and lin-40 MTA orthologs in the development of endoderm and mesoderm lineages. Moreover, our dataset also revealed a characteristic rearrangement of chromatin to the nuclear periphery upon the inhibition of multiple general regulators of gene expression. Our systematic, comprehensive and quantitative datasets illustrate the power of single cell-resolution quantitative tracking and high dimensional phenotyping to investigate gene function. Furthermore, the results provide an overview of the functions of essential chromatin regulators during the early development of an animal. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. Systematic analyses reveal long non-coding RNA (PTAF)-mediated promotion of EMT and invasion-metastasis in serous ovarian cancer.

    PubMed

    Liang, Haihai; Zhao, Xiaoguang; Wang, Chengyu; Sun, Jian; Chen, Yingzhun; Wang, Guoyuan; Fang, Lei; Yang, Rui; Yu, Mengxue; Gu, Yunyan; Shan, Hongli

    2018-06-21

    A deeper mechanistic understanding of epithelial-to-mesenchymal transition (EMT) regulation is needed to improve current anti-metastasis strategies in ovarian cancer (OvCa). This study was designed to investigate the role of lncRNAs in EMT regulation during process of invasion-metastasis in serous OvCa to improve current anti-metastasis strategies for OvCa. We systematically analyzes high-throughput gene expression profiles of both lncRNAs and protein-coding genes in OvCa samples with integrated epithelial (iE) subtype and integrated mesenchymal (iM) subtype labels. Mouse models, cytobiology, molecular biology assays and clinical samples were performed to elucidate the function and underlying mechanisms of lncRNA PTAF-mediated promotion of EMT and invasion-metastasis in serous OvCa. We constructed a lncRNA-mediated competing endogenous RNA (ceRNA) regulatory network that affects the expression of many EMT-related protein-coding genes in mesenchymal OvCa. Using a combination of in vitro and in vivo studies, we provided evidence that the lncRNA PTAF-miR-25-SNAI2 axis controlled EMT in OvCa. Our results revealed that up-regulated PTAF induced elevated SNAI2 expression by competitively binding to miR-25, which in turn promoted OvCa cell EMT and invasion. Moreover, we found that silencing of PTAF inhibited tumor progression and metastasis in an orthotopic mouse model of OvCa. We then observed a significant correlation between PTAF expression and EMT markers in OvCa patients. The lncRNA PTAF, a mediator of TGF-β signaling, can predispose OvCa patients to metastases and may serve as a potential target for anti-metastatic therapies for mesenchymal OvCa patients.

  9. Accumulated Expression Level of Cytosolic Glutamine Synthetase 1 Gene (OsGS1;1 or OsGS1;2) Alter Plant Development and the Carbon-Nitrogen Metabolic Status in Rice

    PubMed Central

    Bao, Aili; Zhao, Zhuqing; Ding, Guangda; Shi, Lei; Xu, Fangsen; Cai, Hongmei

    2014-01-01

    Maintaining an appropriate balance of carbon to nitrogen metabolism is essential for rice growth and yield. Glutamine synthetase is a key enzyme for ammonium assimilation. In this study, we systematically analyzed the growth phenotype, carbon-nitrogen metabolic status and gene expression profiles in GS1;1-, GS1;2-overexpressing rice and wildtype plants. Our results revealed that the GS1;1-, GS1;2-overexpressing plants exhibited a poor plant growth phenotype and yield and decreased carbon/nitrogen ratio in the stem caused by the accumulation of nitrogen in the stem. In addition, the leaf SPAD value and photosynthetic parameters, soluble proteins and carbohydrates varied greatly in the GS1;1-, GS1;2-overexpressing plants. Furthermore, metabolite profile and gene expression analysis demonstrated significant changes in individual sugars, organic acids and free amino acids, and gene expression patterns in GS1;1-, GS1;2-overexpressing plants, which also indicated the distinct roles that these two GS1 genes played in rice nitrogen metabolism, particularly when sufficient nitrogen was applied in the environment. Thus, the unbalanced carbon-nitrogen metabolic status and poor ability of nitrogen transportation from stem to leaf in GS1;1-, GS1;2-overexpressing plants may explain the poor growth and yield. PMID:24743556

  10. Allelic Imbalance Is a Prevalent and Tissue-Specific Feature of the Mouse Transcriptome

    PubMed Central

    Pinter, Stefan F.; Colognori, David; Beliveau, Brian J.; Sadreyev, Ruslan I.; Payer, Bernhard; Yildirim, Eda; Wu, Chao-ting; Lee, Jeannie T.

    2015-01-01

    In mammals, several classes of monoallelic genes have been identified, including those subject to X-chromosome inactivation (XCI), genomic imprinting, and random monoallelic expression (RMAE). However, the extent to which these epigenetic phenomena are influenced by underlying genetic variation is unknown. Here we perform a systematic classification of allelic imbalance in mouse hybrids derived from reciprocal crosses of divergent strains. We observe that deviation from balanced biallelic expression is common, occurring in ∼20% of the mouse transcriptome in a given tissue. Allelic imbalance attributed to genotypic variation is by far the most prevalent class and typically is tissue-specific. However, some genotype-based imbalance is maintained across tissues and is associated with greater genetic variation, especially in 5′ and 3′ termini of transcripts. We further identify novel random monoallelic and imprinted genes and find that genotype can modify penetrance of parental origin even in the setting of large imprinted regions. Examination of nascent transcripts in single cells from inbred parental strains reveals that genes showing genotype-based imbalance in hybrids can also exhibit monoallelic expression in isogenic backgrounds. This surprising observation may suggest a competition between alleles and/or reflect the combined impact of cis- and trans-acting variation on expression of a given gene. Our findings provide novel insights into gene regulation and may be relevant to human genetic variation and disease. PMID:25858912

  11. Mapping photothermally induced gene expression in living cells and tissues by nanorod-locked nucleic acid complexes.

    PubMed

    Riahi, Reza; Wang, Shue; Long, Min; Li, Na; Chiou, Pei-Yu; Zhang, Donna D; Wong, Pak Kin

    2014-04-22

    The photothermal effect of plasmonic nanostructures has numerous applications, such as cancer therapy, photonic gene circuit, large cargo delivery, and nanostructure-enhanced laser tweezers. The photothermal operation can also induce unwanted physical and biochemical effects, which potentially alter the cell behaviors. However, there is a lack of techniques for characterizing the dynamic cell responses near the site of photothermal operation with high spatiotemporal resolution. In this work, we show that the incorporation of locked nucleic acid probes with gold nanorods allows photothermal manipulation and real-time monitoring of gene expression near the area of irradiation in living cells and animal tissues. The multimodal gold nanorod serves as an endocytic delivery reagent to transport the probes into the cells, a fluorescence quencher and a binding competitor to detect intracellular mRNA, and a plasmonic photothermal transducer to induce cell ablation. We demonstrate the ability of the gold nanorod-locked nucleic acid complex for detecting the spatiotemporal gene expression in viable cells and tissues and inducing photothermal ablation of single cells. Using the gold nanorod-locked nucleic acid complex, we systematically characterize the dynamic cellular heat shock responses near the site of photothermal operation. The gold nanorod-locked nucleic acid complex enables mapping of intracellular gene expressions and analyzes the photothermal effects of nanostructures toward various biomedical applications.

  12. A Populus TIR1 gene family survey reveals differential expression patterns and responses to 1-naphthaleneacetic acid and stress treatments

    PubMed Central

    Shu, Wenbo; Liu, Yingli; Guo, Yinghua; Zhou, Houjun; Zhang, Jin; Zhao, Shutang; Lu, Mengzhu

    2015-01-01

    The plant hormone auxin is a central regulator of plant growth. TRANSPORT INHIBITOR RESPONSE 1/AUXIN SIGNALING F-BOX (TIR1/AFB) is a component of the E3 ubiquitin ligase complex SCFTIR1/AFB and acts as an auxin co-receptor for nuclear auxin signaling. The SCFTIR1/AFB-proteasome machinery plays a central regulatory role in development-related gene transcription. Populus trichocarpa, as a model tree, has a unique fast-growth trait to which auxin signaling may contribute. However, no systematic analyses of the genome organization, gene structure, and expression of TIR1-like genes have been undertaken in this woody model plant. In this study, we identified a total of eight TIR1 genes in the Populus genome that are phylogenetically clustered into four subgroups, PtrFBL1/PtrFBL2, PtrFBL3/PtrFBL4, PtrFBL5/PtrFBL6, and PtrFBL7/PtrFBL8, representing four paralogous pairs. In addition, the gene structure and motif composition were relatively conserved in each paralogous pair and all of the PtrFBL members were localized in the nucleus. Different sets of PtrFBLs were strongly expressed in the leaves, stems, roots, cambial zones, and immature xylem of Populus. Interestingly, PtrFBL1 and 7 were expressed mainly in vascular and cambial tissues, respectively, indicating their potential but different roles in wood formation. Furthermore, Populus FBLs responded differentially upon exposure to various stresses. Finally, over-expression studies indicated a role of FBL1 in poplar stem growth and response to drought stress. Collectively, these observations lay the foundation for further investigations into the potential roles of PtrFBL genes in tree growth and development. PMID:26442033

  13. Physiologically Shrinking the Solution Space of a Saccharomyces cerevisiae Genome-Scale Model Suggests the Role of the Metabolic Network in Shaping Gene Expression Noise.

    PubMed

    Chi, Baofang; Tao, Shiheng; Liu, Yanlin

    2015-01-01

    Sampling the solution space of genome-scale models is generally conducted to determine the feasible region for metabolic flux distribution. Because the region for actual metabolic states resides only in a small fraction of the entire space, it is necessary to shrink the solution space to improve the predictive power of a model. A common strategy is to constrain models by integrating extra datasets such as high-throughput datasets and C13-labeled flux datasets. However, studies refining these approaches by performing a meta-analysis of massive experimental metabolic flux measurements, which are closely linked to cellular phenotypes, are limited. In the present study, experimentally identified metabolic flux data from 96 published reports were systematically reviewed. Several strong associations among metabolic flux phenotypes were observed. These phenotype-phenotype associations at the flux level were quantified and integrated into a Saccharomyces cerevisiae genome-scale model as extra physiological constraints. By sampling the shrunken solution space of the model, the metabolic flux fluctuation level, which is an intrinsic trait of metabolic reactions determined by the network, was estimated and utilized to explore its relationship to gene expression noise. Although no correlation was observed in all enzyme-coding genes, a relationship between metabolic flux fluctuation and expression noise of genes associated with enzyme-dosage sensitive reactions was detected, suggesting that the metabolic network plays a role in shaping gene expression noise. Such correlation was mainly attributed to the genes corresponding to non-essential reactions, rather than essential ones. This was at least partially, due to regulations underlying the flux phenotype-phenotype associations. Altogether, this study proposes a new approach in shrinking the solution space of a genome-scale model, of which sampling provides new insights into gene expression noise.

  14. Discovery of error-tolerant biclusters from noisy gene expression data.

    PubMed

    Gupta, Rohit; Rao, Navneet; Kumar, Vipin

    2011-11-24

    An important analysis performed on microarray gene-expression data is to discover biclusters, which denote groups of genes that are coherently expressed for a subset of conditions. Various biclustering algorithms have been proposed to find different types of biclusters from these real-valued gene-expression data sets. However, these algorithms suffer from several limitations such as inability to explicitly handle errors/noise in the data; difficulty in discovering small bicliusters due to their top-down approach; inability of some of the approaches to find overlapping biclusters, which is crucial as many genes participate in multiple biological processes. Association pattern mining also produce biclusters as their result and can naturally address some of these limitations. However, traditional association mining only finds exact biclusters, which limits its applicability in real-life data sets where the biclusters may be fragmented due to random noise/errors. Moreover, as they only work with binary or boolean attributes, their application on gene-expression data require transforming real-valued attributes to binary attributes, which often results in loss of information. Many past approaches have tried to address the issue of noise and handling real-valued attributes independently but there is no systematic approach that addresses both of these issues together. In this paper, we first propose a novel error-tolerant biclustering model, 'ET-bicluster', and then propose a bottom-up heuristic-based mining algorithm to sequentially discover error-tolerant biclusters directly from real-valued gene-expression data. The efficacy of our proposed approach is illustrated by comparing it with a recent approach RAP in the context of two biological problems: discovery of functional modules and discovery of biomarkers. For the first problem, two real-valued S.Cerevisiae microarray gene-expression data sets are used to demonstrate that the biclusters obtained from ET-bicluster approach not only recover larger set of genes as compared to those obtained from RAP approach but also have higher functional coherence as evaluated using the GO-based functional enrichment analysis. The statistical significance of the discovered error-tolerant biclusters as estimated by using two randomization tests, reveal that they are indeed biologically meaningful and statistically significant. For the second problem of biomarker discovery, we used four real-valued Breast Cancer microarray gene-expression data sets and evaluate the biomarkers obtained using MSigDB gene sets. The results obtained for both the problems: functional module discovery and biomarkers discovery, clearly signifies the usefulness of the proposed ET-bicluster approach and illustrate the importance of explicitly incorporating noise/errors in discovering coherent groups of genes from gene-expression data.

  15. Non-functional genes repaired at the RNA level.

    PubMed

    Burger, Gertraud

    2016-01-01

    Genomes and genes continuously evolve. Gene sequences undergo substitutions, deletions or nucleotide insertions; mobile genetic elements invade genomes and interleave in genes; chromosomes break, even within genes, and pieces reseal in reshuffled order. To maintain functional gene products and assure an organism's survival, two principal strategies are used - either repair of the gene itself or of its product. I will introduce common types of gene aberrations and how gene function is restored secondarily, and then focus on systematically fragmented genes found in a poorly studied protist group, the diplonemids. Expression of their broken genes involves restitching of pieces at the RNA-level, and substantial RNA editing, to compensate for point mutations. I will conclude with thoughts on how such a grotesquely unorthodox system may have evolved, and why this group of organisms persists and thrives since tens of millions of years. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.

  16. Expression stability and selection of optimal reference genes for gene expression normalization in early life stage rainbow trout exposed to cadmium and copper.

    PubMed

    Shekh, Kamran; Tang, Song; Niyogi, Som; Hecker, Markus

    2017-09-01

    Gene expression analysis represents a powerful approach to characterize the specific mechanisms by which contaminants interact with organisms. One of the key considerations when conducting gene expression analyses using quantitative real-time reverse transcription-polymerase chain reaction (qPCR) is the selection of appropriate reference genes, which is often overlooked. Specifically, to reach meaningful conclusions when using relative quantification approaches, expression levels of reference genes must be highly stable and cannot vary as a function of experimental conditions. However, to date, information on the stability of commonly used reference genes across developmental stages, tissues and after exposure to contaminants such as metals is lacking for many vertebrate species including teleost fish. Therefore, in this study, we assessed the stability of expression of 8 reference gene candidates in the gills and skin of three different early life-stages of rainbow trout after acute exposure (24h) to two metals, cadmium (Cd) and copper (Cu) using qPCR. Candidate housekeeping genes were: beta actin (b-actin), DNA directed RNA polymerase II subunit I (DRP2), elongation factor-1 alpha (EF1a), glyceraldehyde 3-phosphate dehydrogenase (GAPDH), glucose-6-phosphate dehydrogenase (G6PD), hypoxanthine phosphoribosyltransferase (HPRT), ribosomal protein L8 (RPL8), and 18S ribosomal RNA (18S). Four algorithms, geNorm, NormFinder, BestKeeper, and the comparative ΔCt method were employed to systematically evaluate the expression stability of these candidate genes under control and exposed conditions as well as across three different life-stages. Finally, stability of genes was ranked by taking geometric means of the ranks established by the different methods. Stability of reference genes was ranked in the following order (from lower to higher stability): HPRT

  17. Evaluation of New Reference Genes in Papaya for Accurate Transcript Normalization under Different Experimental Conditions

    PubMed Central

    Chen, Weixin; Chen, Jianye; Lu, Wangjin; Chen, Lei; Fu, Danwen

    2012-01-01

    Real-time reverse transcription PCR (RT-qPCR) is a preferred method for rapid and accurate quantification of gene expression studies. Appropriate application of RT-qPCR requires accurate normalization though the use of reference genes. As no single reference gene is universally suitable for all experiments, thus reference gene(s) validation under different experimental conditions is crucial for RT-qPCR analysis. To date, only a few studies on reference genes have been done in other plants but none in papaya. In the present work, we selected 21 candidate reference genes, and evaluated their expression stability in 246 papaya fruit samples using three algorithms, geNorm, NormFinder and RefFinder. The samples consisted of 13 sets collected under different experimental conditions, including various tissues, different storage temperatures, different cultivars, developmental stages, postharvest ripening, modified atmosphere packaging, 1-methylcyclopropene (1-MCP) treatment, hot water treatment, biotic stress and hormone treatment. Our results demonstrated that expression stability varied greatly between reference genes and that different suitable reference gene(s) or combination of reference genes for normalization should be validated according to the experimental conditions. In general, the internal reference genes EIF (Eukaryotic initiation factor 4A), TBP1 (TATA binding protein 1) and TBP2 (TATA binding protein 2) genes had a good performance under most experimental conditions, whereas the most widely present used reference genes, ACTIN (Actin 2), 18S rRNA (18S ribosomal RNA) and GAPDH (Glyceraldehyde-3-phosphate dehydrogenase) were not suitable in many experimental conditions. In addition, two commonly used programs, geNorm and Normfinder, were proved sufficient for the validation. This work provides the first systematic analysis for the selection of superior reference genes for accurate transcript normalization in papaya under different experimental conditions. PMID:22952972

  18. Systematic genetic dissection of chitin degradation and uptake in Vibrio cholerae.

    PubMed

    Hayes, Chelsea A; Dalia, Triana N; Dalia, Ankur B

    2017-10-01

    Vibrio cholerae is a natural resident of the aquatic environment, where a common nutrient is the chitinous exoskeletons of microscopic crustaceans. Chitin utilization requires chitinases, which degrade this insoluble polymer into soluble chitin oligosaccharides. These oligosaccharides also serve as an inducing cue for natural transformation in Vibrio species. There are 7 predicted endochitinase-like genes in the V. cholerae genome. Here, we systematically dissect the contribution of each gene to growth on chitin as well as induction of natural transformation. Specifically, we created a strain that lacks all 7 putative chitinases and from this strain, generated a panel of strains where each expresses a single chitinase. We also generated expression plasmids to ectopically express all 7 chitinases in our chitinase deficient strain. Through this analysis, we found that low levels of chitinase activity are sufficient for natural transformation, while growth on insoluble chitin as a sole carbon source requires more robust and concerted chitinase activity. We also assessed the role that the three uptake systems for the chitin degradation products GlcNAc, (GlcNAc) 2 and (GlcN) 2 , play in chitin utilization and competence induction. Cumulatively, this study provides mechanistic details for how this pathogen utilizes chitin to thrive and evolve in its environmental reservoir. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  19. Systematic Investigation of Expression of G2/M Transition Genes Reveals CDC25 Alteration in Nonfunctioning Pituitary Adenomas.

    PubMed

    Butz, Henriett; Németh, Kinga; Czenke, Dóra; Likó, István; Czirják, Sándor; Zivkovic, Vladimir; Baghy, Kornélia; Korbonits, Márta; Kovalszky, Ilona; Igaz, Péter; Rácz, Károly; Patócs, Attila

    2017-07-01

    Dysregulation of G1/S checkpoint of cell cycle has been reported in pituitary adenomas. In addition, our previous finding showing that deregulation of Wee1 kinase by microRNAs together with other studies demonstrating alteration of G2/M transition in nonfunctioning pituitary adenomas (NFPAs) suggest that G2/M transition may also be important in pituitary tumorigenesis. To systematically study the expression of members of the G2/M transition in NFPAs and to investigate potential microRNA (miRNA) involvement. Totally, 80 NFPA and 14 normal pituitary (NP) tissues were examined. Expression of 46 genes encoding members of the G2/M transition was profiled on 34 NFPA and 10 NP samples on TaqMan Low Density Array. Expression of CDC25A and two miRNAs targeting CDC25A were validated by individual quantitative real time PCR using TaqMan assays. Protein expression of CDC25A, CDC25C, CDK1 and phospho-CDK1 (Tyr-15) was investigated on tissue microarray and immunohistochemistry. Several genes' expression alteration were observed in NFPA compared to normal tissues by transcription profiling. On protein level CDC25A and both the total and the phospho-CDK1 were overexpressed in adenoma tissues. CDC25A correlated with nuclear localized CDK1 (nCDK1) and with tumor size and nCDK1 with Ki-67 index. Comparing primary vs. recurrent adenomas we found that Ki-67 proliferation index was higher and phospho-CDK1 (inactive form) was downregulated in recurrent tumors compared to primary adenomas. Investigating the potential causes behind CDC25A overexpression we could not find copy number variation at the coding region nor expression alteration of CDC25A regulating transcription factors however CDC25A targeting miRNAs were downregulated in NFPA and negatively correlated with CDC25A expression. Our results suggest that among alterations of G2/M transition of the cell cycle, overexpression of the CDK1 and CDC25A may have a role in the pathogenesis of the NFPA and that CDC25A is potentially regulated by miRNAs.

  20. A proposed metric for assessing the measurement quality of individual microarrays

    PubMed Central

    Kim, Kyoungmi; Page, Grier P; Beasley, T Mark; Barnes, Stephen; Scheirer, Katherine E; Allison, David B

    2006-01-01

    Background High-density microarray technology is increasingly applied to study gene expression levels on a large scale. Microarray experiments rely on several critical steps that may introduce error and uncertainty in analyses. These steps include mRNA sample extraction, amplification and labeling, hybridization, and scanning. In some cases this may be manifested as systematic spatial variation on the surface of microarray in which expression measurements within an individual array may vary as a function of geographic position on the array surface. Results We hypothesized that an index of the degree of spatiality of gene expression measurements associated with their physical geographic locations on an array could indicate the summary of the physical reliability of the microarray. We introduced a novel way to formulate this index using a statistical analysis tool. Our approach regressed gene expression intensity measurements on a polynomial response surface of the microarray's Cartesian coordinates. We demonstrated this method using a fixed model and presented results from real and simulated datasets. Conclusion We demonstrated the potential of such a quantitative metric for assessing the reliability of individual arrays. Moreover, we showed that this procedure can be incorporated into laboratory practice as a means to set quality control specifications and as a tool to determine whether an array has sufficient quality to be retained in terms of spatial correlation of gene expression measurements. PMID:16430768

  1. Methods to increase reproducibility in differential gene expression via meta-analysis

    PubMed Central

    Sweeney, Timothy E.; Haynes, Winston A.; Vallania, Francesco; Ioannidis, John P.; Khatri, Purvesh

    2017-01-01

    Findings from clinical and biological studies are often not reproducible when tested in independent cohorts. Due to the testing of a large number of hypotheses and relatively small sample sizes, results from whole-genome expression studies in particular are often not reproducible. Compared to single-study analysis, gene expression meta-analysis can improve reproducibility by integrating data from multiple studies. However, there are multiple choices in designing and carrying out a meta-analysis. Yet, clear guidelines on best practices are scarce. Here, we hypothesized that studying subsets of very large meta-analyses would allow for systematic identification of best practices to improve reproducibility. We therefore constructed three very large gene expression meta-analyses from clinical samples, and then examined meta-analyses of subsets of the datasets (all combinations of datasets with up to N/2 samples and K/2 datasets) compared to a ‘silver standard’ of differentially expressed genes found in the entire cohort. We tested three random-effects meta-analysis models using this procedure. We showed relatively greater reproducibility with more-stringent effect size thresholds with relaxed significance thresholds; relatively lower reproducibility when imposing extraneous constraints on residual heterogeneity; and an underestimation of actual false positive rate by Benjamini–Hochberg correction. In addition, multivariate regression showed that the accuracy of a meta-analysis increased significantly with more included datasets even when controlling for sample size. PMID:27634930

  2. Immunogene and viral transcript dynamics during parasitic Varroa destructor mite infection of developing honey bee (Apis mellifera) pupae.

    PubMed

    Kuster, Ryan D; Boncristiani, Humberto F; Rueppell, Olav

    2014-05-15

    The ectoparasitic Varroa destructor mite is a major contributor to the ongoing honey bee health crisis. Varroa interacts with honey bee viruses, exacerbating their pathogenicity. In addition to vectoring viruses, immunosuppression of the developing honey bee hosts by Varroa has been proposed to explain the synergy between viruses and mites. However, the evidence for honey bee immune suppression by V. destructor is contentious. We systematically studied the quantitative effects of experimentally introduced V. destructor mites on immune gene expression at five specific time points during the development of the honey bee hosts. Mites reproduced normally and were associated with increased titers of deformed wing virus in the developing bees. Our data on different immune genes show little evidence for immunosuppression of honey bees by V. destructor. Experimental wounding of developing bees increases relative immune gene expression and deformed wing virus titers. Combined, these results suggest that mite feeding activity itself and not immunosuppression may contribute to the synergy between viruses and mites. However, our results also suggest that increased expression of honey bee immune genes decreases mite reproductive success, which may be explored to enhance mite control strategies. Finally, our expression data for multiple immune genes across developmental time and different experimental treatments indicates co-regulation of several of these genes and thus improves our understanding of the understudied honey bee immune system. © 2014. Published by The Company of Biologists Ltd.

  3. The consequences of chromosomal aneuploidy on the transcriptome of cancer cells☆

    PubMed Central

    Ried, Thomas; Hu, Yue; Difilippantonio, Michael J.; Ghadimi, B. Michael; Grade, Marian; Camps, Jordi

    2016-01-01

    Chromosomal aneuploidies are a defining feature of carcinomas, i.e., tumors of epithelial origin. Such aneuploidies result in tumor specific genomic copy number alterations. The patterns of genomic imbalances are tumor specific, and to a certain extent specific for defined stages of tumor development. Genomic imbalances occur already in premalignant precursor lesions, i.e., before the transition to invasive disease, and their distribution is maintained in metastases, and in cell lines derived from primary tumors. These observations are consistent with the interpretation that tumor specific genomic imbalances are drivers of malignant transformation. Naturally, this precipitates the question of how such imbalances influence the expression of resident genes. A number of laboratories have systematically integrated copy number alterations with gene expression changes in primary tumors and metastases, cell lines, and experimental models of aneuploidy to address the question as to whether genomic imbalances deregulate the expression of one or few key genes, or rather affect the cancer transcriptome more globally. The majority of these studies showed that gene expression levels follow genomic copy number. Therefore, gross genomic copy number changes, including aneuploidies of entire chromosome arms and chromosomes, result in a massive deregulation of the transcriptome of cancer cells. This article is part of a Special Issue entitled: Chromatin in time and space. PMID:22426433

  4. Evaluation of reference genes for insect olfaction studies.

    PubMed

    Omondi, Bonaventure Aman; Latorre-Estivalis, Jose Manuel; Rocha Oliveira, Ivana Helena; Ignell, Rickard; Lorenzo, Marcelo Gustavo

    2015-04-22

    Quantitative reverse transcription PCR (qRT-PCR) is a robust and accessible method to assay gene expression and to infer gene regulation. Being a chain of procedures, this technique is subject to systematic error due to biological and technical limitations mainly set by the starting material and downstream procedures. Thus, rigorous data normalization is critical to grant reliability and repeatability of gene expression quantification by qRT-PCR. A number of 'housekeeping genes', involved in basic cellular functions, have been commonly used as internal controls for this normalization process. However, these genes could themselves be regulated and must therefore be tested a priori. We evaluated eight potential reference genes for their stability as internal controls for RT-qPCR studies of olfactory gene expression in the antennae of Rhodnius prolixus, a Chagas disease vector. The set of genes included were: α-tubulin; β-actin; Glyceraldehyde-3-phosphate dehydrogenase; Eukaryotic initiation factor 1A; Glutathione-S-transferase; Serine protease; Succinate dehydrogenase; and Glucose-6-phosphate dehydrogenase. Five experimental conditions, including changes in age,developmental stage and feeding status were tested in both sexes. We show that the evaluation of candidate reference genes is necessary for each combination of sex, tissue and physiological condition analyzed in order to avoid inconsistent results and conclusions. Although, Normfinder and geNorm software yielded different results between males and females, five genes (SDH, Tub, GAPDH, Act and G6PDH) appeared in the first positions in all rankings obtained. By using gene expression data of a single olfactory coreceptor gene as an example, we demonstrated the extent of changes expected using different internal standards. This work underlines the need for a rigorous selection of internal standards to grant the reliability of normalization processes in qRT-PCR studies. Furthermore, we show that particular physiological or developmental conditions require independent evaluation of a diverse set of potential reference genes.

  5. Mechanical Stretching Promotes Skin Tissue Regeneration via Enhancing Mesenchymal Stem Cell Homing and Transdifferentiation.

    PubMed

    Liang, Xiao; Huang, Xiaolu; Zhou, Yiwen; Jin, Rui; Li, Qingfeng

    2016-07-01

    Skin tissue expansion is a clinical procedure for skin regeneration to reconstruct cutaneous defects that can be accompanied by severe complications. The transplantation of mesenchymal stem cells (MSCs) has been proven effective in promoting skin expansion and helping to ameliorate complications; however, systematic understanding of its mechanism remains unclear. MSCs from luciferase-Tg Lewis rats were intravenously transplanted into a rat tissue expansion model to identify homing and transdifferentiation. To clarify underlying mechanisms, a systematic approach was used to identify the differentially expressed genes between mechanically stretched human MSCs and controls. The biological significance of these changes was analyzed through bioinformatic methods. We further investigated genes and pathways of interest to disclose their potential role in mechanical stretching-induced skin regeneration. Cross sections of skin samples from the expanded group showed significantly more luciferase(+) and stromal cell-derived factor 1α (SDF-1α)(+), luciferase(+)keratin 14(+), and luciferase(+)CD31(+) cells than the control group, indicating MSC transdifferentiation into epidermal basal cells and endothelial cells after SDF-1α-mediated homing. Microarray analysis suggested upregulation of genes related to hypoxia, vascularization, and cell proliferation in the stretched human MSCs. Further investigation showed that the homing of MSCs was blocked by short interfering RNA targeted against matrix metalloproteinase 2, and that mechanical stretching-induced vascular endothelial growth factor A upregulation was related to the Janus kinase/signal transducer and activator of transcription (Jak-STAT) and Wnt signaling pathways. This study determines that mechanical stretching might promote skin regeneration by upregulating MSC expression of genes related to hypoxia, vascularization, and cell proliferation; enhancing transplanted MSC homing to the expanded skin; and transdifferentiation into epidermal basal cells and endothelial cells. Skin tissue expansion is a clinical procedure for skin regeneration to cover cutaneous defects that can be accompanied by severe complications. The transplantation of mesenchymal stem cells (MSCs) has been proven effective in promoting skin expansion and ameliorating complications. This study, which sought to provide a systematic understanding of the mechanism, determined that mechanical stretching could upregulate MSC expression of genes related to hypoxia, vascularization, and cell proliferation; enhance transplanted MSC homing to the expanded skin tissue; and promote their transdifferentiation into epidermal basal cells and endothelial cells. ©AlphaMed Press.

  6. RiceFOX: a database of Arabidopsis mutant lines overexpressing rice full-length cDNA that contains a wide range of trait information to facilitate analysis of gene function.

    PubMed

    Sakurai, Tetsuya; Kondou, Youichi; Akiyama, Kenji; Kurotani, Atsushi; Higuchi, Mieko; Ichikawa, Takanari; Kuroda, Hirofumi; Kusano, Miyako; Mori, Masaki; Saitou, Tsutomu; Sakakibara, Hitoshi; Sugano, Shoji; Suzuki, Makoto; Takahashi, Hideki; Takahashi, Shinya; Takatsuji, Hiroshi; Yokotani, Naoki; Yoshizumi, Takeshi; Saito, Kazuki; Shinozaki, Kazuo; Oda, Kenji; Hirochika, Hirohiko; Matsui, Minami

    2011-02-01

    Identification of gene function is important not only for basic research but also for applied science, especially with regard to improvements in crop production. For rapid and efficient elucidation of useful traits, we developed a system named FOX hunting (Full-length cDNA Over-eXpressor gene hunting) using full-length cDNAs (fl-cDNAs). A heterologous expression approach provides a solution for the high-throughput characterization of gene functions in agricultural plant species. Since fl-cDNAs contain all the information of functional mRNAs and proteins, we introduced rice fl-cDNAs into Arabidopsis plants for systematic gain-of-function mutation. We generated >30,000 independent Arabidopsis transgenic lines expressing rice fl-cDNAs (rice FOX Arabidopsis mutant lines). These rice FOX Arabidopsis lines were screened systematically for various criteria such as morphology, photosynthesis, UV resistance, element composition, plant hormone profile, metabolite profile/fingerprinting, bacterial resistance, and heat and salt tolerance. The information obtained from these screenings was compiled into a database named 'RiceFOX'. This database contains around 18,000 records of rice FOX Arabidopsis lines and allows users to search against all the observed results, ranging from morphological to invisible traits. The number of searchable items is approximately 100; moreover, the rice FOX Arabidopsis lines can be searched by rice and Arabidopsis gene/protein identifiers, sequence similarity to the introduced rice fl-cDNA and traits. The RiceFOX database is available at http://ricefox.psc.riken.jp/.

  7. RiceFOX: A Database of Arabidopsis Mutant Lines Overexpressing Rice Full-Length cDNA that Contains a Wide Range of Trait Information to Facilitate Analysis of Gene Function

    PubMed Central

    Sakurai, Tetsuya; Kondou, Youichi; Akiyama, Kenji; Kurotani, Atsushi; Higuchi, Mieko; Ichikawa, Takanari; Kuroda, Hirofumi; Kusano, Miyako; Mori, Masaki; Saitou, Tsutomu; Sakakibara, Hitoshi; Sugano, Shoji; Suzuki, Makoto; Takahashi, Hideki; Takahashi, Shinya; Takatsuji, Hiroshi; Yokotani, Naoki; Yoshizumi, Takeshi; Saito, Kazuki; Shinozaki, Kazuo; Oda, Kenji; Hirochika, Hirohiko; Matsui, Minami

    2011-01-01

    Identification of gene function is important not only for basic research but also for applied science, especially with regard to improvements in crop production. For rapid and efficient elucidation of useful traits, we developed a system named FOX hunting (Full-length cDNA Over-eXpressor gene hunting) using full-length cDNAs (fl-cDNAs). A heterologous expression approach provides a solution for the high-throughput characterization of gene functions in agricultural plant species. Since fl-cDNAs contain all the information of functional mRNAs and proteins, we introduced rice fl-cDNAs into Arabidopsis plants for systematic gain-of-function mutation. We generated >30,000 independent Arabidopsis transgenic lines expressing rice fl-cDNAs (rice FOX Arabidopsis mutant lines). These rice FOX Arabidopsis lines were screened systematically for various criteria such as morphology, photosynthesis, UV resistance, element composition, plant hormone profile, metabolite profile/fingerprinting, bacterial resistance, and heat and salt tolerance. The information obtained from these screenings was compiled into a database named ‘RiceFOX’. This database contains around 18,000 records of rice FOX Arabidopsis lines and allows users to search against all the observed results, ranging from morphological to invisible traits. The number of searchable items is approximately 100; moreover, the rice FOX Arabidopsis lines can be searched by rice and Arabidopsis gene/protein identifiers, sequence similarity to the introduced rice fl-cDNA and traits. The RiceFOX database is available at http://ricefox.psc.riken.jp/. PMID:21186176

  8. MicroRNA profiling of the murine hematopoietic system

    PubMed Central

    Monticelli, Silvia; Ansel, K Mark; Xiao, Changchun; Socci, Nicholas D; Krichevsky, Anna M; Thai, To-Ha; Rajewsky, Nikolaus; Marks, Debora S; Sander, Chris; Rajewsky, Klaus; Rao, Anjana; Kosik, Kenneth S

    2005-01-01

    Background MicroRNAs (miRNAs) are a class of recently discovered noncoding RNA genes that post-transcriptionally regulate gene expression. It is becoming clear that miRNAs play an important role in the regulation of gene expression during development. However, in mammals, expression data are principally based on whole tissue analysis and are still very incomplete. Results We used oligonucleotide arrays to analyze miRNA expression in the murine hematopoietic system. Complementary oligonucleotides capable of hybridizing to 181 miRNAs were immobilized on a membrane and probed with radiolabeled RNA derived from low molecular weight fractions of total RNA from several different hematopoietic and neuronal cells. This method allowed us to analyze cell type-specific patterns of miRNA expression and to identify miRNAs that might be important for cell lineage specification and/or cell effector functions. Conclusion This is the first report of systematic miRNA gene profiling in cells of the hematopoietic system. As expected, miRNA expression patterns were very different between hematopoietic and non-hematopoietic cells, with further subtle differences observed within the hematopoietic group. Interestingly, the most pronounced similarities were observed among fully differentiated effector cells (Th1 and Th2 lymphocytes and mast cells) and precursors at comparable stages of differentiation (double negative thymocytes and pro-B cells), suggesting that in addition to regulating the process of commitment to particular cellular lineages, miRNAs might have an important general role in the mechanism of cell differentiation and maintenance of cell identity. PMID:16086853

  9. Molecular mechanisms of floral organ specification by MADS domain proteins.

    PubMed

    Yan, Wenhao; Chen, Dijun; Kaufmann, Kerstin

    2016-02-01

    Flower development is a model system to understand organ specification in plants. The identities of different types of floral organs are specified by homeotic MADS transcription factors that interact in a combinatorial fashion. Systematic identification of DNA-binding sites and target genes of these key regulators show that they have shared and unique sets of target genes. DNA binding by MADS proteins is not based on 'simple' recognition of a specific DNA sequence, but depends on DNA structure and combinatorial interactions. Homeotic MADS proteins regulate gene expression via alternative mechanisms, one of which may be to modulate chromatin structure and accessibility in their target gene promoters. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Integrative analysis and expression profiling of secondary cell wall genes in C4 biofuel model Setaria italica reveals targets for lignocellulose bioengineering

    PubMed Central

    Muthamilarasan, Mehanathan; Khan, Yusuf; Jaishankar, Jananee; Shweta, Shweta; Lata, Charu; Prasad, Manoj

    2015-01-01

    Several underutilized grasses have excellent potential for use as bioenergy feedstock due to their lignocellulosic biomass. Genomic tools have enabled identification of lignocellulose biosynthesis genes in several sequenced plants. However, the non-availability of whole genome sequence of bioenergy grasses hinders the study on bioenergy genomics and their genomics-assisted crop improvement. Foxtail millet (Setaria italica L.; Si) is a model crop for studying systems biology of bioenergy grasses. In the present study, a systematic approach has been used for identification of gene families involved in cellulose (CesA/Csl), callose (Gsl) and monolignol biosynthesis (PAL, C4H, 4CL, HCT, C3H, CCoAOMT, F5H, COMT, CCR, CAD) and construction of physical map of foxtail millet. Sequence alignment and phylogenetic analysis of identified proteins showed that monolignol biosynthesis proteins were highly diverse, whereas CesA/Csl and Gsl proteins were homologous to rice and Arabidopsis. Comparative mapping of foxtail millet lignocellulose biosynthesis genes with other C4 panicoid genomes revealed maximum homology with switchgrass, followed by sorghum and maize. Expression profiling of candidate lignocellulose genes in response to different abiotic stresses and hormone treatments showed their differential expression pattern, with significant higher expression of SiGsl12, SiPAL2, SiHCT1, SiF5H2, and SiCAD6 genes. Further, due to the evolutionary conservation of grass genomes, the insights gained from the present study could be extrapolated for identifying genes involved in lignocellulose biosynthesis in other biofuel species for further characterization. PMID:26583030

  11. Integrative analysis and expression profiling of secondary cell wall genes in C4 biofuel model Setaria italica reveals targets for lignocellulose bioengineering.

    PubMed

    Muthamilarasan, Mehanathan; Khan, Yusuf; Jaishankar, Jananee; Shweta, Shweta; Lata, Charu; Prasad, Manoj

    2015-01-01

    Several underutilized grasses have excellent potential for use as bioenergy feedstock due to their lignocellulosic biomass. Genomic tools have enabled identification of lignocellulose biosynthesis genes in several sequenced plants. However, the non-availability of whole genome sequence of bioenergy grasses hinders the study on bioenergy genomics and their genomics-assisted crop improvement. Foxtail millet (Setaria italica L.; Si) is a model crop for studying systems biology of bioenergy grasses. In the present study, a systematic approach has been used for identification of gene families involved in cellulose (CesA/Csl), callose (Gsl) and monolignol biosynthesis (PAL, C4H, 4CL, HCT, C3H, CCoAOMT, F5H, COMT, CCR, CAD) and construction of physical map of foxtail millet. Sequence alignment and phylogenetic analysis of identified proteins showed that monolignol biosynthesis proteins were highly diverse, whereas CesA/Csl and Gsl proteins were homologous to rice and Arabidopsis. Comparative mapping of foxtail millet lignocellulose biosynthesis genes with other C4 panicoid genomes revealed maximum homology with switchgrass, followed by sorghum and maize. Expression profiling of candidate lignocellulose genes in response to different abiotic stresses and hormone treatments showed their differential expression pattern, with significant higher expression of SiGsl12, SiPAL2, SiHCT1, SiF5H2, and SiCAD6 genes. Further, due to the evolutionary conservation of grass genomes, the insights gained from the present study could be extrapolated for identifying genes involved in lignocellulose biosynthesis in other biofuel species for further characterization.

  12. Co-acting gene networks predict TRAIL responsiveness of tumour cells with high accuracy.

    PubMed

    O'Reilly, Paul; Ortutay, Csaba; Gernon, Grainne; O'Connell, Enda; Seoighe, Cathal; Boyce, Susan; Serrano, Luis; Szegezdi, Eva

    2014-12-19

    Identification of differentially expressed genes from transcriptomic studies is one of the most common mechanisms to identify tumor biomarkers. This approach however is not well suited to identify interaction between genes whose protein products potentially influence each other, which limits its power to identify molecular wiring of tumour cells dictating response to a drug. Due to the fact that signal transduction pathways are not linear and highly interlinked, the biological response they drive may be better described by the relative amount of their components and their functional relationships than by their individual, absolute expression. Gene expression microarray data for 109 tumor cell lines with known sensitivity to the death ligand cytokine tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) was used to identify genes with potential functional relationships determining responsiveness to TRAIL-induced apoptosis. The machine learning technique Random Forest in the statistical environment "R" with backward elimination was used to identify the key predictors of TRAIL sensitivity and differentially expressed genes were identified using the software GeneSpring. Gene co-regulation and statistical interaction was assessed with q-order partial correlation analysis and non-rejection rate. Biological (functional) interactions amongst the co-acting genes were studied with Ingenuity network analysis. Prediction accuracy was assessed by calculating the area under the receiver operator curve using an independent dataset. We show that the gene panel identified could predict TRAIL-sensitivity with a very high degree of sensitivity and specificity (AUC=0·84). The genes in the panel are co-regulated and at least 40% of them functionally interact in signal transduction pathways that regulate cell death and cell survival, cellular differentiation and morphogenesis. Importantly, only 12% of the TRAIL-predictor genes were differentially expressed highlighting the importance of functional interactions in predicting the biological response. The advantage of co-acting gene clusters is that this analysis does not depend on differential expression and is able to incorporate direct- and indirect gene interactions as well as tissue- and cell-specific characteristics. This approach (1) identified a descriptor of TRAIL sensitivity which performs significantly better as a predictor of TRAIL sensitivity than any previously reported gene signatures, (2) identified potential novel regulators of TRAIL-responsiveness and (3) provided a systematic view highlighting fundamental differences between the molecular wiring of sensitive and resistant cell types.

  13. Evaluation of Bias-Variance Trade-Off for Commonly Used Post-Summarizing Normalization Procedures in Large-Scale Gene Expression Studies

    PubMed Central

    Qiu, Xing; Hu, Rui; Wu, Zhixin

    2014-01-01

    Normalization procedures are widely used in high-throughput genomic data analyses to remove various technological noise and variations. They are known to have profound impact to the subsequent gene differential expression analysis. Although there has been some research in evaluating different normalization procedures, few attempts have been made to systematically evaluate the gene detection performances of normalization procedures from the bias-variance trade-off point of view, especially with strong gene differentiation effects and large sample size. In this paper, we conduct a thorough study to evaluate the effects of normalization procedures combined with several commonly used statistical tests and MTPs under different configurations of effect size and sample size. We conduct theoretical evaluation based on a random effect model, as well as simulation and biological data analyses to verify the results. Based on our findings, we provide some practical guidance for selecting a suitable normalization procedure under different scenarios. PMID:24941114

  14. Systematic transcriptome-wide analysis of mRNA-miRNA interactions reveals the involvement of miR-142-5p and its target (FOXO3) in skeletal muscle growth in chickens.

    PubMed

    Li, Zhenhui; Abdalla, Bahareldin Ali; Zheng, Ming; He, Xiaomei; Cai, Bolin; Han, Peigong; Ouyang, Hongjia; Chen, Biao; Nie, Qinghua; Zhang, Xiquan

    2018-02-01

    The goal of this study was to perform a systematic transcriptome-wide analysis of mRNA-miRNA interactions and to identify candidates involved in the interplay between miRNAs and mRNAs that regulate chicken muscle growth. We used our previously published mRNA (GSE72424) and miRNA (GSE62971) deep sequencing data from two-tailed samples [i.e., the highest (h) and lowest (l) body weights] of Recessive White Rock (WRR) and Xinghua (XH) chickens to conduct integrative analyses of the miRNA-mRNA interactions involved in chicken skeletal muscle growth. A total of 162, 15, 173, and 27 miRNA-mRNA pairs with negatively correlated expression patterns were identified in miRNA-mRNA networks constructed on the basis of the WRR h vs. XH h , WRR h vs. WRR l , WRR l vs. XH l , and XH h vs. XH l comparisons, respectively. Ingenuity Pathway Analysis revealed that gene networks identified for the WRR h vs. XH h contrast were associated with developmental disorders. Importantly, the WRR h vs. XH h contrast miRNA-mRNA network was enriched in IGF-1 signaling pathway genes, including FOXO3. A dual-luciferase reporter assay showed that FOXO3 was a target of miR-142-5p. Furthermore, miR-142-5p overexpression significantly decreased FOXO3 mRNA levels and promoted the expression of growth-related genes. These data demonstrated that miR-142-5p targets FOXO3 and promotes growth-related gene expression and regulates skeletal muscle growth in chicken. Comprehensive analysis facilitated the identification of miRNAs and target genes that might contribute to the regulation of skeletal muscle development. Our results provide new clues for understanding the molecular basis of chicken growth.

  15. The transfer and transformation of collective network information in gene-matched networks.

    PubMed

    Kitsukawa, Takashi; Yagi, Takeshi

    2015-10-09

    Networks, such as the human society network, social and professional networks, and biological system networks, contain vast amounts of information. Information signals in networks are distributed over nodes and transmitted through intricately wired links, making the transfer and transformation of such information difficult to follow. Here we introduce a novel method for describing network information and its transfer using a model network, the Gene-matched network (GMN), in which nodes (neurons) possess attributes (genes). In the GMN, nodes are connected according to their expression of common genes. Because neurons have multiple genes, the GMN is cluster-rich. We show that, in the GMN, information transfer and transformation were controlled systematically, according to the activity level of the network. Furthermore, information transfer and transformation could be traced numerically with a vector using genes expressed in the activated neurons, the active-gene array, which was used to assess the relative activity among overlapping neuronal groups. Interestingly, this coding style closely resembles the cell-assembly neural coding theory. The method introduced here could be applied to many real-world networks, since many systems, including human society and various biological systems, can be represented as a network of this type.

  16. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression.

    PubMed

    Fairfax, Benjamin P; Humburg, Peter; Makino, Seiko; Naranbhai, Vivek; Wong, Daniel; Lau, Evelyn; Jostins, Luke; Plant, Katharine; Andrews, Robert; McGee, Chris; Knight, Julian C

    2014-03-07

    To systematically investigate the impact of immune stimulation upon regulatory variant activity, we exposed primary monocytes from 432 healthy Europeans to interferon-γ (IFN-γ) or differing durations of lipopolysaccharide and mapped expression quantitative trait loci (eQTLs). More than half of cis-eQTLs identified, involving hundreds of genes and associated pathways, are detected specifically in stimulated monocytes. Induced innate immune activity reveals multiple master regulatory trans-eQTLs including the major histocompatibility complex (MHC), coding variants altering enzyme and receptor function, an IFN-β cytokine network showing temporal specificity, and an interferon regulatory factor 2 (IRF2) transcription factor-modulated network. Induced eQTL are significantly enriched for genome-wide association study loci, identifying context-specific associations to putative causal genes including CARD9, ATM, and IRF8. Thus, applying pathophysiologically relevant immune stimuli assists resolution of functional genetic variants.

  17. [Weighted gene co-expression network analysis in biomedicine research].

    PubMed

    Liu, Wei; Li, Li; Ye, Hua; Tu, Wei

    2017-11-25

    High-throughput biological technologies are now widely applied in biology and medicine, allowing scientists to monitor thousands of parameters simultaneously in a specific sample. However, it is still an enormous challenge to mine useful information from high-throughput data. The emergence of network biology provides deeper insights into complex bio-system and reveals the modularity in tissue/cellular networks. Correlation networks are increasingly used in bioinformatics applications. Weighted gene co-expression network analysis (WGCNA) tool can detect clusters of highly correlated genes. Therefore, we systematically reviewed the application of WGCNA in the study of disease diagnosis, pathogenesis and other related fields. First, we introduced principle, workflow, advantages and disadvantages of WGCNA. Second, we presented the application of WGCNA in disease, physiology, drug, evolution and genome annotation. Then, we indicated the application of WGCNA in newly developed high-throughput methods. We hope this review will help to promote the application of WGCNA in biomedicine research.

  18. A combinatorial code for pattern formation in Drosophila oogenesis.

    PubMed

    Yakoby, Nir; Bristow, Christopher A; Gong, Danielle; Schafer, Xenia; Lembong, Jessica; Zartman, Jeremiah J; Halfon, Marc S; Schüpbach, Trudi; Shvartsman, Stanislav Y

    2008-11-01

    Two-dimensional patterning of the follicular epithelium in Drosophila oogenesis is required for the formation of three-dimensional eggshell structures. Our analysis of a large number of published gene expression patterns in the follicle cells suggests that they follow a simple combinatorial code based on six spatial building blocks and the operations of union, difference, intersection, and addition. The building blocks are related to the distribution of inductive signals, provided by the highly conserved epidermal growth factor receptor and bone morphogenetic protein signaling pathways. We demonstrate the validity of the code by testing it against a set of patterns obtained in a large-scale transcriptional profiling experiment. Using the proposed code, we distinguish 36 distinct patterns for 81 genes expressed in the follicular epithelium and characterize their joint dynamics over four stages of oogenesis. The proposed combinatorial framework allows systematic analysis of the diversity and dynamics of two-dimensional transcriptional patterns and guides future studies of gene regulation.

  19. Transcriptional master regulator analysis in breast cancer genetic networks.

    PubMed

    Tovar, Hugo; García-Herrera, Rodrigo; Espinal-Enríquez, Jesús; Hernández-Lemus, Enrique

    2015-12-01

    Gene regulatory networks account for the delicate mechanisms that control gene expression. Under certain circumstances, gene regulatory programs may give rise to amplification cascades. Such transcriptional cascades are events in which activation of key-responsive transcription factors called master regulators trigger a series of gene expression events. The action of transcriptional master regulators is then important for the establishment of certain programs like cell development and differentiation. However, such cascades have also been related with the onset and maintenance of cancer phenotypes. Here we present a systematic implementation of a series of algorithms aimed at the inference of a gene regulatory network and analysis of transcriptional master regulators in the context of primary breast cancer cells. Such studies were performed in a highly curated database of 880 microarray gene expression experiments on biopsy-captured tissue corresponding to primary breast cancer and healthy controls. Biological function and biochemical pathway enrichment analyses were also performed to study the role that the processes controlled - at the transcriptional level - by such master regulators may have in relation to primary breast cancer. We found that transcription factors such as AGTR2, ZNF132, TFDP3 and others are master regulators in this gene regulatory network. Sets of genes controlled by these regulators are involved in processes that are well-known hallmarks of cancer. This kind of analyses may help to understand the most upstream events in the development of phenotypes, in particular, those regarding cancer biology. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Integrative analyses of leprosy susceptibility genes indicate a common autoimmune profile.

    PubMed

    Zhang, Deng-Feng; Wang, Dong; Li, Yu-Ye; Yao, Yong-Gang

    2016-04-01

    Leprosy is an ancient chronic infection in the skin and peripheral nerves caused by Mycobacterium leprae. The development of leprosy depends on genetic background and the immune status of the host. However, there is no systematic view focusing on the biological pathways, interaction networks and overall expression pattern of leprosy-related immune and genetic factors. To identify the hub genes in the center of leprosy genetic network and to provide an insight into immune and genetic factors contributing to leprosy. We retrieved all reported leprosy-related genes and performed integrative analyses covering gene expression profiling, pathway analysis, protein-protein interaction network, and evolutionary analyses. A list of 123 differentially expressed leprosy related genes, which were enriched in activation and regulation of immune response, was obtained in our analyses. Cross-disorder analysis showed that the list of leprosy susceptibility genes was largely shared by typical autoimmune diseases such as lupus erythematosus and arthritis, suggesting that similar pathways might be affected in leprosy and autoimmune diseases. Protein-protein interaction (PPI) and positive selection analyses revealed a co-evolution network of leprosy risk genes. Our analyses showed that leprosy associated genes constituted a co-evolution network and might undergo positive selection driven by M. leprae. We suggested that leprosy may be a kind of autoimmune disease and the development of leprosy is a matter of defect or over-activation of body immunity. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  1. Functional Brachyury Binding Sites Establish a Temporal Read-out of Gene Expression in the Ciona Notochord

    PubMed Central

    Passamaneck, Yale J.; Gazdoiu, Stefan; José-Edwards, Diana S.; Kugler, Jamie E.; Oda-Ishii, Izumi; Imai, Janice H.; Nibu, Yutaka; Di Gregorio, Anna

    2013-01-01

    The appearance of the notochord represented a milestone in Deuterostome evolution. The notochord is necessary for the development of the chordate body plan and for the formation of the vertebral column and numerous organs. It is known that the transcription factor Brachyury is required for notochord formation in all chordates, and that it controls transcription of a large number of target genes. However, studies of the structure of the cis-regulatory modules (CRMs) through which this control is exerted are complicated in vertebrates by the genomic complexity and the pan-mesodermal expression territory of Brachyury. We used the ascidian Ciona, in which the single-copy Brachyury is notochord-specific and CRMs are easily identifiable, to carry out a systematic characterization of Brachyury-downstream notochord CRMs. We found that Ciona Brachyury (Ci-Bra) controls most of its targets directly, through non-palindromic binding sites that function either synergistically or individually to activate early- and middle-onset genes, respectively, while late-onset target CRMs are controlled indirectly, via transcriptional intermediaries. These results illustrate how a transcriptional regulator can efficiently shape a shallow gene regulatory network into a multi-tiered transcriptional output, and provide insights into the mechanisms that establish temporal read-outs of gene expression in a fast-developing chordate embryo. PMID:24204212

  2. Functional Brachyury binding sites establish a temporal read-out of gene expression in the Ciona notochord.

    PubMed

    Katikala, Lavanya; Aihara, Hitoshi; Passamaneck, Yale J; Gazdoiu, Stefan; José-Edwards, Diana S; Kugler, Jamie E; Oda-Ishii, Izumi; Imai, Janice H; Nibu, Yutaka; Di Gregorio, Anna

    2013-10-01

    The appearance of the notochord represented a milestone in Deuterostome evolution. The notochord is necessary for the development of the chordate body plan and for the formation of the vertebral column and numerous organs. It is known that the transcription factor Brachyury is required for notochord formation in all chordates, and that it controls transcription of a large number of target genes. However, studies of the structure of the cis-regulatory modules (CRMs) through which this control is exerted are complicated in vertebrates by the genomic complexity and the pan-mesodermal expression territory of Brachyury. We used the ascidian Ciona, in which the single-copy Brachyury is notochord-specific and CRMs are easily identifiable, to carry out a systematic characterization of Brachyury-downstream notochord CRMs. We found that Ciona Brachyury (Ci-Bra) controls most of its targets directly, through non-palindromic binding sites that function either synergistically or individually to activate early- and middle-onset genes, respectively, while late-onset target CRMs are controlled indirectly, via transcriptional intermediaries. These results illustrate how a transcriptional regulator can efficiently shape a shallow gene regulatory network into a multi-tiered transcriptional output, and provide insights into the mechanisms that establish temporal read-outs of gene expression in a fast-developing chordate embryo.

  3. Practical applications of the bioinformatics toolbox for narrowing quantitative trait loci.

    PubMed

    Burgess-Herbert, Sarah L; Cox, Allison; Tsaih, Shirng-Wern; Paigen, Beverly

    2008-12-01

    Dissecting the genes involved in complex traits can be confounded by multiple factors, including extensive epistatic interactions among genes, the involvement of epigenetic regulators, and the variable expressivity of traits. Although quantitative trait locus (QTL) analysis has been a powerful tool for localizing the chromosomal regions underlying complex traits, systematically identifying the causal genes remains challenging. Here, through its application to plasma levels of high-density lipoprotein cholesterol (HDL) in mice, we demonstrate a strategy for narrowing QTL that utilizes comparative genomics and bioinformatics techniques. We show how QTL detected in multiple crosses are subjected to both combined cross analysis and haplotype block analysis; how QTL from one species are mapped to the concordant regions in another species; and how genomewide scans associating haplotype groups with their phenotypes can be used to prioritize the narrowed regions. Then we illustrate how these individual methods for narrowing QTL can be systematically integrated for mouse chromosomes 12 and 15, resulting in a significantly reduced number of candidate genes, often from hundreds to <10. Finally, we give an example of how additional bioinformatics resources can be combined with experiments to determine the most likely quantitative trait genes.

  4. Orphan nuclear receptor chicken ovalbumin upstream promoter-transcription factor II (COUP-TFII) protein negatively regulates bone morphogenetic protein 2-induced osteoblast differentiation through suppressing runt-related gene 2 (Runx2) activity.

    PubMed

    Lee, Kkot-Nim; Jang, Won-Gu; Kim, Eun-Jung; Oh, Sin-Hye; Son, Hye-Ju; Kim, Sun-Hun; Franceschi, Renny; Zhang, Xiao-Kun; Lee, Shee-Eun; Koh, Jeong-Tae

    2012-06-01

    Chicken ovalbumin upstream promoter-transcription factor II (COUP-TFII) is an orphan nuclear receptor of the steroid-thyroid hormone receptor superfamily. COUP-TFII is widely expressed in multiple tissues and organs throughout embryonic development and has been shown to regulate cellular growth, differentiation, and organ development. However, the role of COUP-TFII in osteoblast differentiation has not been systematically evaluated. In the present study, COUP-TFII was strongly expressed in multipotential mesenchymal cells, and the endogenous expression level decreased during osteoblast differentiation. Overexpression of COUP-TFII inhibited bone morphogenetic protein 2 (BMP2)-induced osteoblastic gene expression. The results of alkaline phosphatase, Alizarin Red staining, and osteocalcin production assay showed that COUP-TFII overexpression blocks BMP2-induced osteoblast differentiation. In contrast, the down-regulation of COUP-TFII synergistically induced the expression of BMP2-induced osteoblastic genes and osteoblast differentiation. Furthermore, the immunoprecipitation assay showed that COUP-TFII and Runx2 physically interacted and COUP-TFII significantly impaired the Runx2-dependent activation of the osteocalcin promoter. From the ChIP assay, we found that COUP-TFII repressed DNA binding of Runx2 to the osteocalcin gene, whereas Runx2 inhibited COUP-TFII expression via direct binding to the COUP-TFII promoter. Taken together, these findings demonstrate that COUP-TFII negatively regulates osteoblast differentiation via interaction with Runx2, and during the differentiation state, BMP2-induced Runx2 represses COUP-TFII expression and promotes osteoblast differentiation.

  5. DNA methylome signature in rheumatoid arthritis.

    PubMed

    Nakano, Kazuhisa; Whitaker, John W; Boyle, David L; Wang, Wei; Firestein, Gary S

    2013-01-01

    Epigenetics can influence disease susceptibility and severity. While DNA methylation of individual genes has been explored in autoimmunity, no unbiased systematic analyses have been reported. Therefore, a genome-wide evaluation of DNA methylation loci in fibroblast-like synoviocytes (FLS) isolated from the site of disease in rheumatoid arthritis (RA) was performed. Genomic DNA was isolated from six RA and five osteoarthritis (OA) FLS lines and evaluated using the Illumina HumanMethylation450 chip. Cluster analysis of data was performed and corrected using Benjamini-Hochberg adjustment for multiple comparisons. Methylation was confirmed by pyrosequencing and gene expression was determined by qPCR. Pathway analysis was performed using the Kyoto Encyclopedia of Genes and Genomes. RA and control FLS segregated based on DNA methylation, with 1859 differentially methylated loci. Hypomethylated loci were identified in key genes relevant to RA, such as CHI3L1, CASP1, STAT3, MAP3K5, MEFV and WISP3. Hypermethylation was also observed, including TGFBR2 and FOXO1. Hypomethylation of individual genes was associated with increased gene expression. Grouped analysis identified 207 hypermethylated or hypomethylated genes with multiple differentially methylated loci, including COL1A1, MEFV and TNF. Hypomethylation was increased in multiple pathways related to cell migration, including focal adhesion, cell adhesion, transendothelial migration and extracellular matrix interactions. Confirmatory studies with OA and normal FLS also demonstrated segregation of RA from control FLS based on methylation pattern. Differentially methylated genes could alter FLS gene expression and contribute to the pathogenesis of RA. DNA methylation of critical genes suggests that RA FLS are imprinted and implicate epigenetic contributions to inflammatory arthritis.

  6. Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu

    PubMed Central

    Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui

    2015-01-01

    Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat. PMID:26132381

  7. Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu.

    PubMed

    Zhang, Yanlin; Luo, Guangbin; Liu, Dongcheng; Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui

    2015-01-01

    Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat.

  8. Profiling deleterious non-synonymous SNPs of smoker's gene CYP1A1.

    PubMed

    Ramesh, A Sai; Khan, Imran; Farhan, Md; Thiagarajan, Padma

    2013-01-01

    CYP1A1 gene belongs to the cytochrome P450 family and is known better as smokers' gene due to its hyperactivation as a consequence of long term smoking. The expression of CYP1A1 induces polycyclic aromatic hydrocarbon production in the lungs, which when over expressed, is known to cause smoking related diseases, such as cardiovascular pathologies, cancer, and diabetes. Single nucleotide polymorphisms (SNPs) are the simplest form of genetic variations that occur at a higher frequency, and are denoted as synonymous and non-synonymous SNPs on the basis of their effects on the amino acids. This study adopts a systematic in silico approach to predict the deleterious SNPs that are associated with disease conditions. It is inferred that four SNPs are highly deleterious, among which the SNP with rs17861094 is commonly predicted to be harmful by all tools. Hydrophobic (isoleucine) to hydrophilic (serine) amino acid variation was observed in the candidate gene. Hence, this investigation aims to characterize a candidate gene from 159 SNPs of CYP1A1.

  9. SZGR 2.0: a one-stop shop of schizophrenia candidate genes

    PubMed Central

    Jia, Peilin; Han, Guangchun; Zhao, Junfei; Lu, Pinyi; Zhao, Zhongming

    2017-01-01

    SZGR 2.0 is a comprehensive resource of candidate variants and genes for schizophrenia, covering genetic, epigenetic, transcriptomic, translational and many other types of evidence. By systematic review and curation of multiple lines of evidence, we included almost all variants and genes that have ever been reported to be associated with schizophrenia. In particular, we collected ∼4200 common variants reported in genome-wide association studies, ∼1000 de novo mutations discovered by large-scale sequencing of family samples, 215 genes spanning rare and replication copy number variations, 99 genes overlapping with linkage regions, 240 differentially expressed genes, 4651 differentially methylated genes and 49 genes as antipsychotic drug targets. To facilitate interpretation, we included various functional annotation data, especially brain eQTL, methylation QTL, brain expression featured in deep categorization of brain areas and developmental stages and brain-specific promoter and enhancer annotations. Furthermore, we conducted cross-study, cross-data type and integrative analyses of the multidimensional data deposited in SZGR 2.0, and made the data and results available through a user-friendly interface. In summary, SZGR 2.0 provides a one-stop shop of schizophrenia variants and genes and their function and regulation, providing an important resource in the schizophrenia and other mental disease community. SZGR 2.0 is available at https://bioinfo.uth.edu/SZGR/. PMID:27733502

  10. Geometric Morphometrics on Gene Expression Patterns Within Phenotypes: A Case Example on Limb Development

    PubMed Central

    Martínez-Abadías, Neus; Mateu, Roger; Niksic, Martina; Russo, Lucia; Sharpe, James

    2016-01-01

    How the genotype translates into the phenotype through development is critical to fully understand the evolution of phenotypes. We propose a novel approach to directly assess how changes in gene expression patterns are associated with changes in morphology using the limb as a case example. Our method combines molecular biology techniques, such as whole-mount in situ hybridization, with image and shape analysis, extending the use of Geometric Morphometrics to the analysis of nonanatomical shapes, such as gene expression domains. Elliptical Fourier and Procrustes-based semilandmark analyses were used to analyze the variation and covariation patterns of the limb bud shape with the expression patterns of two relevant genes for limb morphogenesis, Hoxa11 and Hoxa13. We devised a multiple thresholding method to semiautomatically segment gene domains at several expression levels in large samples of limb buds from C57Bl6 mouse embryos between 10 and 12 postfertilization days. Besides providing an accurate phenotyping tool to quantify the spatiotemporal dynamics of gene expression patterns within developing structures, our morphometric analyses revealed high, non-random, and gene-specific variation undergoing canalization during limb development. Our results demonstrate that Hoxa11 and Hoxa13, despite being paralogs with analogous functions in limb patterning, show clearly distinct dynamic patterns, both in shape and size, and are associated differently with the limb bud shape. The correspondence between our results and already well-established molecular processes underlying limb development confirms that this morphometric approach is a powerful tool to extract features of development regulating morphogenesis. Such multilevel analyses are promising in systems where not so much molecular information is available and will advance our understanding of the genotype–phenotype map. In systematics, this knowledge will increase our ability to infer how evolution modified a common developmental pattern to generate a wide diversity of morphologies, as in the vertebrate limb. PMID:26377442

  11. Toxicogenomic outcomes predictive of forestomach carcinogenesis following exposure to benzo(a)pyrene: Relevance to human cancer risk

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Labib, Sarah, E-mail: Sarah.Labib@hc-sc.gc.ca; Guo, Charles H., E-mail: Charles.Guo@hc-sc.gc.ca; Williams, Andrew, E-mail: Andrew.Williams@hc-sc.gc.ca

    2013-12-01

    Forestomach tumors are observed in mice exposed to environmental carcinogens. However, the relevance of this data to humans is controversial because humans lack a forestomach. We hypothesize that an understanding of early molecular changes after exposure to a carcinogen in the forestomach will provide mode-of-action information to evaluate the applicability of forestomach cancers to human cancer risk assessment. In the present study we exposed mice to benzo(a)pyrene (BaP), an environmental carcinogen commonly associated with tumors of the rodent forestomach. Toxicogenomic tools were used to profile gene expression response in the forestomach. Adult Muta™Mouse males were orally exposed to 25, 50,more » and 75 mg BaP/kg-body-weight/day for 28 consecutive days. The forestomach was collected three days post-exposure. DNA microarrays, real-time RT-qPCR arrays, and protein analyses were employed to characterize responses in the forestomach. Microarray results showed altered expression of 414 genes across all treatment groups (± 1.5 fold; false discovery rate adjusted P ≤ 0.05). Significant downregulation of genes associated with phase II xenobiotic metabolism and increased expression of genes implicated in antigen processing and presentation, immune response, chemotaxis, and keratinocyte differentiation were observed in treated groups in a dose-dependent manner. A systematic comparison of the differentially expressed genes in the forestomach from the present study to differentially expressed genes identified in human diseases including human gastrointestinal tract cancers using the NextBio Human Disease Atlas showed significant commonalities between the two models. Our results provide molecular evidence supporting the use of the mouse forestomach model to evaluate chemically-induced gastrointestinal carcinogenesis in humans. - Highlights: • Benzo(a)pyrene-mediated transcriptomic response in the forestomach was examined. • The immunoproteosome subunits and MHC class I pathway were the most affected. • Keratinocyte differentiation associated gene expression changes were dose-dependent. • Molecular similarities exist between cancers of the forestomach and human stomach.« less

  12. Alternative Polyadenylation Directs Tissue-Specific miRNA Targeting in Caenorhabditis elegans Somatic Tissues

    PubMed Central

    Blazie, Stephen M.; Geissel, Heather C.; Wilky, Henry; Joshi, Rajan; Newbern, Jason; Mangone, Marco

    2017-01-01

    mRNA expression dynamics promote and maintain the identity of somatic tissues in living organisms; however, their impact in post-transcriptional gene regulation in these processes is not fully understood. Here, we applied the PAT-Seq approach to systematically isolate, sequence, and map tissue-specific mRNA from five highly studied Caenorhabditis elegans somatic tissues: GABAergic and NMDA neurons, arcade and intestinal valve cells, seam cells, and hypodermal tissues, and studied their mRNA expression dynamics. The integration of these datasets with previously profiled transcriptomes of intestine, pharynx, and body muscle tissues, precisely assigns tissue-specific expression dynamics for 60% of all annotated C. elegans protein-coding genes, providing an important resource for the scientific community. The mapping of 15,956 unique high-quality tissue-specific polyA sites in all eight somatic tissues reveals extensive tissue-specific 3′untranslated region (3′UTR) isoform switching through alternative polyadenylation (APA) . Almost all ubiquitously transcribed genes use APA and harbor miRNA targets in their 3′UTRs, which are commonly lost in a tissue-specific manner, suggesting widespread usage of post-transcriptional gene regulation modulated through APA to fine tune tissue-specific protein expression. Within this pool, the human disease gene C. elegans orthologs rack-1 and tct-1 use APA to switch to shorter 3′UTR isoforms in order to evade miRNA regulation in the body muscle tissue, resulting in increased protein expression needed for proper body muscle function. Our results highlight a major positive regulatory role for APA, allowing genes to counteract miRNA regulation on a tissue-specific basis. PMID:28348061

  13. Alternative Polyadenylation Directs Tissue-Specific miRNA Targeting in Caenorhabditis elegans Somatic Tissues.

    PubMed

    Blazie, Stephen M; Geissel, Heather C; Wilky, Henry; Joshi, Rajan; Newbern, Jason; Mangone, Marco

    2017-06-01

    mRNA expression dynamics promote and maintain the identity of somatic tissues in living organisms; however, their impact in post-transcriptional gene regulation in these processes is not fully understood. Here, we applied the PAT-Seq approach to systematically isolate, sequence, and map tissue-specific mRNA from five highly studied Caenorhabditis elegans somatic tissues: GABAergic and NMDA neurons, arcade and intestinal valve cells, seam cells, and hypodermal tissues, and studied their mRNA expression dynamics. The integration of these datasets with previously profiled transcriptomes of intestine, pharynx, and body muscle tissues, precisely assigns tissue-specific expression dynamics for 60% of all annotated C. elegans protein-coding genes, providing an important resource for the scientific community. The mapping of 15,956 unique high-quality tissue-specific polyA sites in all eight somatic tissues reveals extensive tissue-specific 3'untranslated region (3'UTR) isoform switching through alternative polyadenylation (APA) . Almost all ubiquitously transcribed genes use APA and harbor miRNA targets in their 3'UTRs, which are commonly lost in a tissue-specific manner, suggesting widespread usage of post-transcriptional gene regulation modulated through APA to fine tune tissue-specific protein expression. Within this pool, the human disease gene C. elegans orthologs rack-1 and tct-1 use APA to switch to shorter 3'UTR isoforms in order to evade miRNA regulation in the body muscle tissue, resulting in increased protein expression needed for proper body muscle function. Our results highlight a major positive regulatory role for APA, allowing genes to counteract miRNA regulation on a tissue-specific basis. Copyright © 2017 Blazie et al.

  14. The antimicrobial resistance patterns and associated determinants in Streptococcus suis isolated from humans in southern Vietnam, 1997-2008

    PubMed Central

    2011-01-01

    Background Streptococcus suis is an emerging zoonotic pathogen and is the leading cause of bacterial meningitis in adults in Vietnam. Systematic data on the antimicrobial susceptibility profiles of S. suis strains isolated from human cases are lacking. We studied antimicrobial resistance and associated resistance determinants in S. suis isolated from patients with meningitis in southern Vietnam. Methods S. suis strains isolated between 1997 and 2008 were investigated for their susceptibility to six antimicrobial agents. Strains were screened for the presence and expression of tetracycline and erythromycin resistance determinants and the association of tet(M) genes with Tn916- like transposons. The localization of tetracycline resistance gene tet(L) was determined by pulse field gel electrophoresis and Southern blotting. Results We observed a significant increase in resistance to tetracycline and chloramphenicol, which was concurrent with an increase in multi-drug resistance. In tetracycline resistance strains, we identified tet(M), tet(O), tet(W) and tet(L) and confirmed their expression. All tet(M) genes were associated with a Tn916-like transposon. The co-expression of tet(L) and other tetracycline resistance gene(s) encoding for ribosomal protection protein(s) was only detected in strains with a minimum inhibitory concentration (MIC) of tetracycline of ≥ 64 mg/L Conclusions We demonstrated that multi-drug resistance in S. suis causing disease in humans in southern Vietnam has increased over the 11-year period studied. We report the presence and expression of tet(L) in S. suis strains and our data suggest that co-expression of multiple genes encoding distinct mechanism is required for an MIC ≥ 64 mg/L to tetracycline. PMID:21208459

  15. The antimicrobial resistance patterns and associated determinants in Streptococcus suis isolated from humans in southern Vietnam, 1997-2008.

    PubMed

    Hoa, Ngo T; Chieu, Tran T B; Nghia, Ho D T; Mai, Nguyen T H; Anh, Pham H; Wolbers, Marcel; Baker, Stephen; Campbell, James I; Chau, Nguyen V V; Hien, Tran T; Farrar, Jeremy; Schultsz, Constance

    2011-01-06

    Streptococcus suis is an emerging zoonotic pathogen and is the leading cause of bacterial meningitis in adults in Vietnam. Systematic data on the antimicrobial susceptibility profiles of S. suis strains isolated from human cases are lacking. We studied antimicrobial resistance and associated resistance determinants in S. suis isolated from patients with meningitis in southern Vietnam. S. suis strains isolated between 1997 and 2008 were investigated for their susceptibility to six antimicrobial agents. Strains were screened for the presence and expression of tetracycline and erythromycin resistance determinants and the association of tet(M) genes with Tn916- like transposons. The localization of tetracycline resistance gene tet(L) was determined by pulse field gel electrophoresis and Southern blotting. We observed a significant increase in resistance to tetracycline and chloramphenicol, which was concurrent with an increase in multi-drug resistance. In tetracycline resistance strains, we identified tet(M), tet(O), tet(W) and tet(L) and confirmed their expression. All tet(M) genes were associated with a Tn916-like transposon. The co-expression of tet(L) and other tetracycline resistance gene(s) encoding for ribosomal protection protein(s) was only detected in strains with a minimum inhibitory concentration (MIC) of tetracycline of ≥ 64 mg/L. We demonstrated that multi-drug resistance in S. suis causing disease in humans in southern Vietnam has increased over the 11-year period studied. We report the presence and expression of tet(L) in S. suis strains and our data suggest that co-expression of multiple genes encoding distinct mechanism is required for an MIC ≥ 64 mg/L to tetracycline.

  16. Early bovine embryos regulate oviduct epithelial cell gene expression during in vitro co-culture.

    PubMed

    Schmaltz-Panneau, Barbara; Cordova, Amanda; Dhorne-Pollet, Sophie; Hennequet-Antier, Christelle; Uzbekova, Sveltlana; Martinot, Emmanuelle; Doret, Sarah; Martin, Patrice; Mermillod, Pascal; Locatelli, Yann

    2014-10-01

    In mammals, the oviduct may participate to the regulation of early embryo development. In vitro co-culture of early bovine embryos with bovine oviduct epithelial cells (BOEC) has been largely used to mimic the maternal environment. However, the mechanisms of BOEC action have not been clearly elucidated yet. The aim of this study was to determine the response of BOEC cultures to the presence of developing bovine embryos. A 21,581-element bovine oligonucleotide array was used compare the gene expression profiles of confluent BOEC cultured for 8 days with or without embryos. This study revealed 34 differentially expressed genes (DEG). Of these 34 genes, IFI6, ISG15, MX1, IFI27, IFI44, RSAD2, IFITM1, EPSTI1, USP18, IFIT5, and STAT1 expression increased to the greatest extent due to the presence of embryos with a major impact on antiviral and immune response. Among the mRNAs at least 25 are already described as induced by interferons. In addition, transcript levels of new candidate genes involved in the regulation of transcription, modulation of the maternal immune system and endometrial remodeling were found to be increased. We selected 7 genes and confirmed their differential expression by quantitative RT-PCR. The immunofluorescence imaging of cellular localization of STAT1 protein in BOEC showed a nuclear translocation in the presence of embryos, suggesting the activation of interferon signaling pathway. This first systematic study of BOEC transcriptome changes in response to the presence of embryos in cattle provides some evidences that these cells are able to adapt their transcriptomic profile in response to embryo signaling. Copyright © 2014 Elsevier B.V. All rights reserved.

  17. Genome-wide characterization and expression analysis enables identification of abiotic stress-responsive MYB transcription factors in cassava (Manihot esculenta).

    PubMed

    Ruan, Meng-Bin; Guo, Xin; Wang, Bin; Yang, Yi-Ling; Li, Wen-Qi; Yu, Xiao-Ling; Zhang, Peng; Peng, Ming

    2017-06-15

    The myeloblastosis (MYB) transcription factor superfamily is the largest transcription factor family in plants, playing different roles during stress response. However, abiotic stress-responsive MYB transcription factors have not been systematically studied in cassava (Manihot esculenta), an important tropical tuber root crop. In this study, we used a genome-wide transcriptome analysis to predict 299 putative MeMYB genes in the cassava genome. Under drought and cold stresses, many MeMYB genes exhibited different expression patterns in cassava leaves, indicating that these genes might play a role in abiotic stress responses. We found that several stress-responsive MeMYB genes responded to abscisic acid (ABA) in cassava leaves. We characterize four MeMYBs, namely MeMYB1, MeMYB2, MeMYB4, and MeMYB9, as R2R3-MYB transcription factors. Furthermore, RNAi-driven repression of MeMYB2 resulted in drought and cold tolerance in transgenic cassava. Gene expression assays in wild-type and MeMYB2-RNAi cassava plants revealed that MeMYB2 may affect other MeMYBs as well as MeWRKYs under drought and cold stress, suggesting crosstalk between MYB and WRKY family genes under stress conditions in cassava. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  18. Microarray Meta-Analysis of RNA-Binding Protein Functions in Alternative Polyadenylation

    PubMed Central

    Hu, Wenchao; Liu, Yuting; Yan, Jun

    2014-01-01

    Alternative polyadenylation (APA) is a post-transcriptional mechanism to generate diverse mRNA transcripts with different 3′UTRs from the same gene. In this study, we systematically searched for the APA events with differential expression in public mouse microarray data. Hundreds of genes with over-represented differential APA events and the corresponding experiments were identified. We further revealed that global APA differential expression occurred prevalently in tissues such as brain comparing to peripheral tissues, and biological processes such as development, differentiation and immune responses. Interestingly, we also observed widespread differential APA events in RNA-binding protein (RBP) genes such as Rbm3, Eif4e2 and Elavl1. Given the fact that RBPs are considered as the main regulators of differential APA expression, we constructed a co-expression network between APAs and RBPs using the microarray data. Further incorporation of CLIP-seq data of selected RBPs showed that Nova2 represses and Mbnl1 promotes the polyadenylation of closest poly(A) sites respectively. Altogether, our study is the first microarray meta-analysis in a mammal on the regulation of APA by RBPs that integrated massive mRNA expression data under a wide-range of biological conditions. Finally, we present our results as a comprehensive resource in an online website for the research community. PMID:24622240

  19. A shell regeneration assay to identify biomineralization candidate genes in mytilid mussels.

    PubMed

    Hüning, Anne K; Lange, Skadi M; Ramesh, Kirti; Jacob, Dorrit E; Jackson, Daniel J; Panknin, Ulrike; Gutowska, Magdalena A; Philipp, Eva E R; Rosenstiel, Philip; Lucassen, Magnus; Melzner, Frank

    2016-06-01

    Biomineralization processes in bivalve molluscs are still poorly understood. Here we provide an analysis of specifically expressed sequences from a mantle transcriptome of the blue mussel, Mytilus edulis. We then developed a novel, integrative shell injury assay to test, whether biomineralization candidate genes highly expressed in marginal and pallial mantle could be induced in central mantle tissue underlying the damaged shell areas. This experimental approach makes it possible to identify gene products that control the chemical micro-environment during calcification as well as organic matrix components. This is unlike existing methodological approaches that work retroactively to characterize calcification relevant molecules and are just able to examine organic matrix components that are present in completed shells. In our assay an orthogonal array of nine 1mm holes was drilled into the left valve, and mussels were suspended in net cages for 20, 29 and 36days to regenerate. Structural observations using stereo-microscopy, SEM and Raman spectroscopy revealed organic sheet synthesis (day 20) as the first step of shell-repair followed by the deposition of calcite crystals (days 20 and 29) and aragonite tablets (day 36). The regeneration period was characterized by time-dependent shifts in gene expression in left central mantle tissue underlying the injured shell, (i) increased expression of two tyrosinase isoforms (TYR3: 29-fold and TYR6: 5-fold) at day 20 with a decline thereafter, (ii) an increase in expression of a gene encoding a nacrein-like protein (max. 100-fold) on day 29. The expression of an acidic Asp-Ser-rich protein was enhanced during the entire regeneration process. This proof-of-principle study demonstrates that genes that are specifically expressed in pallial and marginal mantle tissue can be induced (4 out of 10 genes) in central mantle following experimental injury of the overlying shell. Our findings suggest that regeneration assays can be used systematically to better characterize gene products that are essential for distinct phases of the shell formation process, particularly those that are not incorporated into the organic shell matrix. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. Nutrigenomic Analysis of Diet-Gene Interactions on Functional Supplements for Weight Management

    PubMed Central

    Lau, Francis C; Bagchi, Manashi; Sen, Chandan; Roy, Sashwati; Bagchi, Debasis

    2008-01-01

    Recent advances in molecular biology combined with the wealth of information generated by the Human Genome Project have fostered the emergence of nutrigenomics, a new discipline in the field of nutritional research. Nutrigenomics may provide the strategies for the development of safe and effective dietary interventions against the obesity epidemic. According to the World Health Organization, more than 60% of the global disease burden will be attributed to chronic disorders associated with obesity by 2020. Meanwhile in the US, the prevalence of obesity has doubled in adults and tripled in children during the past three decades. In this regard, a number of natural dietary supplements and micronutrients have been studied for their potential in weight management. Among these supplements, (–)-hydroxycitric acid (HCA), a natural extract isolated from the dried fruit rind of Garcinia cambogia, and the micronutrient niacin-bound chromium(III) (NBC) have been shown to be safe and efficacious for weight loss. Utilizing cDNA microarrays, we demonstrated for the first time that HCA-supplementation altered the expression of genes involved in lipolytic and adipogenic pathways in adipocytes from obese women and up-regulated the expression of serotonin receptor gene in the abdominal fat of rats. Similarly, we showed that NBC-supplementation up-regulated the expression of myogenic genes while suppressed the expression of genes that are highly expressed in brown adipose tissue in diabetic obese mice. The potential biological mechanisms underlying the observed beneficial effects of these supplements as elucidated by the state-of-the-art nutrigenomic technologies will be systematically discussed in this review. PMID:19452041

  1. Identifying novel glioma associated pathways based on systems biology level meta-analysis.

    PubMed

    Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong

    2013-01-01

    With recent advances in microarray technology, including genomics, proteomics, and metabolomics, it brings a great challenge for integrating this "-omics" data to analysis complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecule mechanism underlying glioma remains very important. To date, most studies focus on detecting the differentially expressed genes in glioma. However, the meta-analysis for pathway analysis based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach by integrating three types of omics data to identify common pathways in glioma. Firstly, the meta-analysis has been performed to study the overlapping of signatures at different levels based on the microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways were found in GeneGO database that shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for the further pathway enrichment analysis. As a result, we suggest 5 of these pathways could be served as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that the meta-analysis based on systems biology level provide a more useful approach to study the molecule mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggest some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.

  2. A systematic study on drug-response associated genes using baseline gene expressions of the Cancer Cell Line Encyclopedia

    NASA Astrophysics Data System (ADS)

    Liu, Xiaoming; Yang, Jiasheng; Zhang, Yi; Fang, Yun; Wang, Fayou; Wang, Jun; Zheng, Xiaoqi; Yang, Jialiang

    2016-03-01

    We have studied drug-response associated (DRA) gene expressions by applying a systems biology framework to the Cancer Cell Line Encyclopedia data. More than 4,000 genes are inferred to be DRA for at least one drug, while the number of DRA genes for each drug varies dramatically from almost 0 to 1,226. Functional enrichment analysis shows that the DRA genes are significantly enriched in genes associated with cell cycle and plasma membrane. Moreover, there might be two patterns of DRA genes between genders. There are significantly shared DRA genes between male and female for most drugs, while very little DRA genes tend to be shared between the two genders for a few drugs targeting sex-specific cancers (e.g., PD-0332991 for breast cancer and ovarian cancer). Our analyses also show substantial difference for DRA genes between young and old samples, suggesting the necessity of considering the age effects for personalized medicine in cancers. Lastly, differential module and key driver analyses confirm cell cycle related modules as top differential ones for drug sensitivity. The analyses also reveal the role of TSPO, TP53, and many other immune or cell cycle related genes as important key drivers for DRA network modules. These key drivers provide new drug targets to improve the sensitivity of cancer therapy.

  3. Transcriptome Profiling of Wheat Inflorescence Development from Spikelet Initiation to Floral Patterning Identified Stage-Specific Regulatory Genes1[OPEN

    PubMed Central

    Feng, Nan; Song, Gaoyuan; Guan, Jiantao; Chen, Kai; Jia, Meiling; Huang, Dehua; Wu, Jiajie; Zhang, Lichao; Kong, Xiuying; Geng, Shuaifeng

    2017-01-01

    Early reproductive development in cereals is crucial for final grain number per spike and hence the yield potential of the crop. To date, however, no systematic analyses of gene expression profiles during this important process have been conducted for common wheat (Triticum aestivum). Here, we studied the transcriptome profiles at four stages of early wheat reproductive development, from spikelet initiation to floral organ differentiation. K-means clustering and stage-specific transcript identification detected dynamically expressed homeologs of important transcription regulators in spikelet and floral meristems that may be involved in spikelet initiation, floret meristem specification, and floral organ patterning, as inferred from their homologs in model plants. Small RNA transcriptome sequencing discovered key microRNAs that were differentially expressed during wheat inflorescence development alongside their target genes, suggesting that miRNA-mediated regulatory mechanisms for floral development may be conserved in cereals and Arabidopsis. Our analysis was further substantiated by the functional characterization of the ARGONAUTE1d (AGO1d) gene, which was initially expressed in stamen primordia and later in the tapetum during anther maturation. In agreement with its stage-specific expression pattern, the loss of function of the predominantly expressed B homeolog of AGO1d in a tetraploid durum wheat mutant resulted in smaller anthers with more infertile pollens than the wild type and a reduced grain number per spike. Together, our work provides a first glimpse of the gene regulatory networks in wheat inflorescence development that may be pivotal for floral and grain development, highlighting potential targets for genetic manipulation to improve future wheat yields. PMID:28515146

  4. Modulation of immunity and inflammatory gene expression in the gut, in inflammatory diseases of the gut and in the liver by probiotics

    PubMed Central

    Plaza-Diaz, Julio; Gomez-Llorente, Carolina; Fontana, Luis; Gil, Angel

    2014-01-01

    The potential for the positive manipulation of the gut microbiome through the introduction of beneficial microbes, as also known as probiotics, is currently an active area of investigation. The FAO/WHO define probiotics as live microorganisms that confer a health benefit to the host when administered in adequate amounts. However, dead bacteria and bacterial molecular components may also exhibit probiotic properties. The results of clinical studies have demonstrated the clinical potential of probiotics in many pathologies, such as allergic diseases, diarrhea, inflammatory bowel disease and viral infection. Several mechanisms have been proposed to explain the beneficial effects of probiotics, most of which involve gene expression regulation in specific tissues, particularly the intestine and liver. Therefore, the modulation of gene expression mediated by probiotics is an important issue that warrants further investigation. In the present paper, we performed a systematic review of the probiotic-mediated modulation of gene expression that is associated with the immune system and inflammation. Between January 1990 to February 2014, PubMed was searched for articles that were published in English using the MeSH terms “probiotics" and "gene expression" combined with “intestines", "liver", "enterocytes", "antigen-presenting cells", "dendritic cells", "immune system", and "inflammation". Two hundred and five original articles matching these criteria were initially selected, although only those articles that included specific gene expression results (77) were later considered for this review and separated into three major topics: the regulation of immunity and inflammatory gene expression in the gut, in inflammatory diseases of the gut and in the liver. Particular strains of Bifidobacteria, Lactobacilli, Escherichia coli, Propionibacterium, Bacillus and Saccharomyces influence the gene expression of mucins, Toll-like receptors, caspases, nuclear factor-κB, and interleukins and lead mainly to an anti-inflammatory response in cultured enterocytes. In addition, the interaction of commensal bacteria and probiotics with the surface of antigen-presenting cells in vitro results in the downregulation of pro-inflammatory genes that are linked to inflammatory signaling pathways, whereas other anti-inflammatory genes are upregulated. The effects of probiotics have been extensively investigated in animal models ranging from fish to mice, rats and piglets. These bacteria induce a tolerogenic and hyporesponsive immune response in which many genes that are related to the immune system, in particular those genes expressing anti-inflammatory cytokines, are upregulated. By contrast, information related to gene expression in human intestinal cells mediated by the action of probiotics is scarce. There is a need for further clinical studies that evaluate the mechanism of action of probiotics both in healthy humans and in patients with chronic diseases. These types of clinical studies are necessary for addressing the influence of these microorganisms in gene expression for different pathways, particularly those that are associated with the immune response, and to better understand the role that probiotics might have in the prevention and treatment of disease. PMID:25400447

  5. CovRS-Regulated Transcriptome Analysis of a Hypervirulent M23 Strain of Group A Streptococcus pyogenes Provides New Insights into Virulence Determinants.

    PubMed

    Bao, Yun-Juan; Liang, Zhong; Mayfield, Jeffrey A; Lee, Shaun W; Ploplis, Victoria A; Castellino, Francis J

    2015-10-01

    The two-component control of virulence (Cov) regulator (R)-sensor (S) (CovRS) regulates the virulence of Streptococcus pyogenes (group A Streptococcus [GAS]). Inactivation of CovS during infection switches the pathogenicity of GAS to a more invasive form by regulating transcription of diverse virulence genes via CovR. However, the manner in which CovRS controls virulence through expression of extended gene families has not been fully determined. In the current study, the CovS-regulated gene expression profiles of a hypervirulent emm23 GAS strain (M23ND/CovS negative [M23ND/CovS(-)]) and a noninvasive isogenic strain (M23ND/CovS(+)), under different growth conditions, were investigated. RNA sequencing identified altered expression of ∼ 349 genes (18% of the chromosome). The data demonstrated that M23ND/CovS(-) achieved hypervirulence by allowing enhanced expression of genes responsible for antiphagocytosis (e.g., hasABC), by abrogating expression of toxin genes (e.g., speB), and by compromising gene products with dispensable functions (e.g., sfb1). Among these genes, several (e.g., parE and parC) were not previously reported to be regulated by CovRS. Furthermore, the study revealed that CovS also modulated the expression of a broad spectrum of metabolic genes that maximized nutrient utilization and energy metabolism during growth and dissemination, where the bacteria encounter large variations in available nutrients, thus restructuring metabolism of GAS for adaption to diverse growth environments. From constructing a genome-scale metabolic model, we identified 16 nonredundant metabolic gene modules that constitute unique nutrient sources. These genes were proposed to be essential for pathogen growth and are likely associated with GAS virulence. The genome-wide prediction of genes associated with virulence identifies new candidate genes that potentially contribute to GAS virulence. The CovRS system modulates transcription of ∼ 18% of the genes in the Streptococcus pyogenes genome. Mutations that inactivate CovR or CovS enhance the virulence of this bacterium. We determined complete transcriptomes of a naturally CovS-inactivated invasive deep tissue isolate of an emm23 strain of S. pyogenes (M23ND) and its complemented avirulent variant (CovS(+)). We identified diverse virulence genes whose altered expression revealed a genetic switching of a nonvirulent form of M23ND to a highly virulent strain. Furthermore, we also systematically uncovered for the first time the comparative levels of expression of a broad spectrum of metabolic genes, which reflected different metabolic needs of the bacterium as it invaded deeper tissue of the human host. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  6. Systematic analysis of gene expression pattern in has-miR-197 over-expressed human uterine leiomyoma cells.

    PubMed

    Ling, Jing; Wu, Xiaoli; Fu, Ziyi; Tan, Jie; Xu, Qing

    2015-10-01

    Our previous study showed that the expression of miR-197 in leiomyoma was down-regulated compared with myometrium. Further, miR-197 has been identified to affect uterine leiomyoma cell proliferation, apoptosis, and metastasis ability, though the responsible molecular mechanism has not been well elucidated. In this study, we sought to determine the expression patterns of miR-197 targeted genes and to explore their potential functions, participating Pathways and the networks that are involved in the biological behavior of human uterine leiomyoma. After transfection of human uterine leiomyoma cells with miR-197, we confirmed the expression level of miR-197 using quantitative real-time PCR (qRT-PCR), and we detected the gene expression profiles after miR-197 over-expression through DNA microarray analysis. Further, we performed GO and Pathway analysis. The dominantly dys-regulated genes, which were up- or down-regulated by more than 10-fold, compared with parental cells, were confirmed using qRT-PCR technology. Compared with the control group, miR-197 was up-regulated by 30-fold after miR-197 lentiviral transfection. The microarray data showed that 872 genes were dys-regulated by more than 2-fold in human uterine leiomyoma cells after miR-197 overexpression, including 537 up-regulated and 335 down-regulated genes. The GO analysis indicated that the dys-regulated genes were primarily involved in response to stimuli, multicellular organ processes, and the signaling of biological progression. Further, Pathway analysis data showed that these genes participated in regulating several signaling Pathways, including the JAK/STAT signaling Pathway, the Toll-like receptor signaling Pathway, and cytokine-cytokine receptor interaction. The qRT-PCR results confirmed that 17 of the 66 selected genes, which were up- or down-regulated more than 10-fold by miR-197, were consistent with the microarray results, including tumorigenesis-related genes, such as DRT7, SLC549, SFMBT2, FLJ37956, FBLN2, C10orf35, HOXD12, CACNG7, and LOC100134279. Our study explored gene expression patterns after miR-197 overexpression and confirmed 17 dominantly dys-regulated genes, which could expand the insights into the function of miR-197 and the molecular mechanisms during the development and progression of uterine leiomyomas. This study might afford new clues for understanding the pathogenesis of uterine leiomyomas, and it could likely provide a unique method for diagnosing or predicting prognosis in the clinical treatment of leiomyoma. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  7. Natural genetic variation of the cardiac transcriptome in non-diseased donors and patients with dilated cardiomyopathy.

    PubMed

    Heinig, Matthias; Adriaens, Michiel E; Schafer, Sebastian; van Deutekom, Hanneke W M; Lodder, Elisabeth M; Ware, James S; Schneider, Valentin; Felkin, Leanne E; Creemers, Esther E; Meder, Benjamin; Katus, Hugo A; Rühle, Frank; Stoll, Monika; Cambien, François; Villard, Eric; Charron, Philippe; Varro, Andras; Bishopric, Nanette H; George, Alfred L; Dos Remedios, Cristobal; Moreno-Moral, Aida; Pesce, Francesco; Bauerfeind, Anja; Rüschendorf, Franz; Rintisch, Carola; Petretto, Enrico; Barton, Paul J; Cook, Stuart A; Pinto, Yigal M; Bezzina, Connie R; Hubner, Norbert

    2017-09-14

    Genetic variation is an important determinant of RNA transcription and splicing, which in turn contributes to variation in human traits, including cardiovascular diseases. Here we report the first in-depth survey of heart transcriptome variation using RNA-sequencing in 97 patients with dilated cardiomyopathy and 108 non-diseased controls. We reveal extensive differences of gene expression and splicing between dilated cardiomyopathy patients and controls, affecting known as well as novel dilated cardiomyopathy genes. Moreover, we show a widespread effect of genetic variation on the regulation of transcription, isoform usage, and allele-specific expression. Systematic annotation of genome-wide association SNPs identifies 60 functional candidate genes for heart phenotypes, representing 20% of all published heart genome-wide association loci. Focusing on the dilated cardiomyopathy phenotype we found that eQTL variants are also enriched for dilated cardiomyopathy genome-wide association signals in two independent cohorts. RNA transcription, splicing, and allele-specific expression are each important determinants of the dilated cardiomyopathy phenotype and are controlled by genetic factors. Our results represent a powerful resource for the field of cardiovascular genetics.

  8. RNA-Sequencing Analyses Demonstrate the Involvement of Canonical Transient Receptor Potential Channels in Rat Tooth Germ Development

    PubMed Central

    Yang, Jun; Cai, Wenping; Lu, Xi; Liu, Shangfeng; Zhao, Shouliang

    2017-01-01

    Tooth development depends on multiple molecular interactions between the dental epithelium and mesenchyme, which are derived from ectodermal and ectomesenchymal cells, respectively. We report on a systematic RNA sequencing analysis of transcriptional expression levels from the bud to hard tissue formation stages of rat tooth germ development. We found that GNAO1, ENO1, EFNB1, CALM1, SIAH2, ATP6V0A1, KDELR2, GTPBP1, POLR2C, SORT1, and members of the canonical transient receptor potential (TRPC) channel family are involved in tooth germ development. Furthermore, Cell Counting Kit 8 (CCK8) and Transwell migration assays were performed to explore the effects of these differentially expressed genes (DEGs) on the proliferation and migration of dental pulp stem cells. Immunostaining revealed that TRPC channels are expressed at varying levels during odontogenesis. The identified genes represent novel candidates that are likely to be vital for rat tooth germ development. Together, the results provide a valuable resource to elucidate the gene regulatory mechanisms underlying mammalian tooth germ development. PMID:28706494

  9. A systems biology approach to defining regulatory mechanisms for cartilage and tendon cell phenotypes.

    PubMed

    Mueller, A J; Tew, S R; Vasieva, O; Clegg, P D; Canty-Laird, E G

    2016-09-27

    Phenotypic plasticity of adult somatic cells has provided emerging avenues for the development of regenerative therapeutics. In musculoskeletal biology the mechanistic regulatory networks of genes governing the phenotypic plasticity of cartilage and tendon cells has not been considered systematically. Additionally, a lack of strategies to effectively reproduce in vitro functional models of cartilage and tendon is retarding progress in this field. De- and redifferentiation represent phenotypic transitions that may contribute to loss of function in ageing musculoskeletal tissues. Applying a systems biology network analysis approach to global gene expression profiles derived from common in vitro culture systems (monolayer and three-dimensional cultures) this study demonstrates common regulatory mechanisms governing de- and redifferentiation transitions in cartilage and tendon cells. Furthermore, evidence of convergence of gene expression profiles during monolayer expansion of cartilage and tendon cells, and the expression of key developmental markers, challenges the physiological relevance of this culture system. The study also suggests that oxidative stress and PI3K signalling pathways are key modulators of in vitro phenotypes for cells of musculoskeletal origin.

  10. Systematic gene tagging using CRISPR/Cas9 in human stem cells to illuminate cell organization.

    PubMed

    Roberts, Brock; Haupt, Amanda; Tucker, Andrew; Grancharova, Tanya; Arakaki, Joy; Fuqua, Margaret A; Nelson, Angelique; Hookway, Caroline; Ludmann, Susan A; Mueller, Irina A; Yang, Ruian; Horwitz, Rick; Rafelski, Susanne M; Gunawardane, Ruwanthi N

    2017-10-15

    We present a CRISPR/Cas9 genome-editing strategy to systematically tag endogenous proteins with fluorescent tags in human induced pluripotent stem cells (hiPSC). To date, we have generated multiple hiPSC lines with monoallelic green fluorescent protein tags labeling 10 proteins representing major cellular structures. The tagged proteins include alpha tubulin, beta actin, desmoplakin, fibrillarin, nuclear lamin B1, nonmuscle myosin heavy chain IIB, paxillin, Sec61 beta, tight junction protein ZO1, and Tom20. Our genome-editing methodology using Cas9/crRNA ribonuclear protein and donor plasmid coelectroporation, followed by fluorescence-based enrichment of edited cells, typically resulted in <0.1-4% homology-directed repair (HDR). Twenty-five percent of clones generated from each edited population were precisely edited. Furthermore, 92% (36/39) of expanded clonal lines displayed robust morphology, genomic stability, expression and localization of the tagged protein to the appropriate subcellular structure, pluripotency-marker expression, and multilineage differentiation. It is our conclusion that, if cell lines are confirmed to harbor an appropriate gene edit, pluripotency, differentiation potential, and genomic stability are typically maintained during the clonal line-generation process. The data described here reveal general trends that emerged from this systematic gene-tagging approach. Final clonal lines corresponding to each of the 10 cellular structures are now available to the research community. © 2017 Roberts, Haupt, et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  11. Transcriptome Analysis Reveals Markers of Aberrantly Activated Innate Immunity in Vitiligo Lesional and Non-Lesional Skin

    PubMed Central

    Huang, Yuanshen; Wang, Yang; Yu, Jie; Gao, Min; Levings, Megan; Wei, Shencai; Zhang, Shengquan; Xu, Aie; Su, Mingwan; Dutz, Jan; Zhang, Xuejun; Zhou, Youwen

    2012-01-01

    Background Vitiligo is characterized by the death of melanocytes in the skin. This is associated with the presence of T cell infiltrates in the lesional borders. However, at present, there is no detailed and systematic characterization on whether additional cellular or molecular changes are present inside vitiligo lesions. Further, it is unknown if the normal appearing non-lesional skin of vitiligo patients is in fact normal. The purpose of this study is to systematically characterize the molecular and cellular characteristics of the lesional and non-lesional skin of vitiligo patients. Methods and Materials Paired lesional and non-lesional skin biopsies from twenty-three vitiligo patients and normal skin biopsies from sixteen healthy volunteers were obtained with informed consent. The following aspects were analyzed: (1) transcriptome changes present in vitiligo skin using DNA microarrays and qRT-PCR; (2) abnormal cellular infiltrates in vitiligo skin explant cultures using flow cytometry; and (3) distribution of the abnormal cellular infiltrates in vitiligo skin using immunofluorescence microscopy. Results Compared with normal skin, vitiligo lesional skin contained 17 genes (mostly melanocyte-specific genes) whose expression was decreased or absent. In contrast, the relative expression of 13 genes was up-regulated. The up-regulated genes point to aberrant activity of the innate immune system, especially natural killer cells in vitiligo. Strikingly, the markers of heightened innate immune responses were also found to be up-regulated in the non-lesional skin of vitiligo patients. Conclusions and Clinical Implications As the first systematic transcriptome characterization of the skin in vitiligo patients, this study revealed previously unknown molecular markers that strongly suggest aberrant innate immune activation in the microenvironment of vitiligo skin. Since these changes involve both lesional and non-lesional skin, our results suggest that therapies targeting the entire skin surface may improve treatment outcomes. Finally, this study revealed novel mediators that may facilitate future development of vitiligo therapies. PMID:23251420

  12. Genome-wide coexpression dynamics: Theory and application

    PubMed Central

    Li, Ker-Chau

    2002-01-01

    High-throughput expression profiling enables the global study of gene activities. Genes with positively correlated expression profiles are likely to encode functionally related proteins. However, all biological processes are interlocked, and each protein may play multiple cellular roles. Thus the coexpression of any two functionally related genes may depend on the constantly varying, yet often-unknown cellular state. To initiate a systematic study on this issue, a theory of coexpression dynamics is presented. This theory is used to rationalize a strategy of conducting a genome-wide search for the most critical cellular players that may affect the coexpression pattern of any two genes. In one example, using a yeast data set, our method reveals how the enzymes associated with the urea cycle are expressed to ensure proper mass flow of the involved metabolites. The correlation between ARG2 and CAR2 is found to change from positive to negative as the expression level of CPA2 increases. This delicate interplay in correlation signifies a remarkable control on the influx and efflux of ornithine and reflects well the intrinsic cellular demand for arginine. In addition to the urea cycle, our examples include SCH9 and CYR1 (both implicated in a recent longevity study), cytochrome c1 (mitochondrial electron transport), calmodulin (main calcium-binding protein), PFK1 and PFK2 (glycolysis), and two genes, ECM1 and YNL101W, the functions of which are newly revealed. The complexity in computation is eased by a new result from mathematical statistics. PMID:12486219

  13. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge

    PubMed Central

    Wagner, Florian

    2015-01-01

    Method Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370

  14. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.

    PubMed

    Wagner, Florian

    2015-01-01

    Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.

  15. Exploring the molecular mechanisms of Traditional Chinese Medicine components using gene expression signatures and connectivity map.

    PubMed

    Yoo, Minjae; Shin, Jimin; Kim, Hyunmin; Kim, Jihye; Kang, Jaewoo; Tan, Aik Choon

    2018-04-04

    Traditional Chinese Medicine (TCM) has been practiced over thousands of years in China and other Asian countries for treating various symptoms and diseases. However, the underlying molecular mechanisms of TCM are poorly understood, partly due to the "multi-component, multi-target" nature of TCM. To uncover the molecular mechanisms of TCM, we perform comprehensive gene expression analysis using connectivity map. We interrogated gene expression signatures obtained 102 TCM components using the next generation Connectivity Map (CMap) resource. We performed systematic data mining and analysis on the mechanism of action (MoA) of these TCM components based on the CMap results. We clustered the 102 TCM components into four groups based on their MoAs using next generation CMap resource. We performed gene set enrichment analysis on these components to provide additional supports for explaining these molecular mechanisms. We also provided literature evidence to validate the MoAs identified through this bioinformatics analysis. Finally, we developed the Traditional Chinese Medicine Drug Repurposing Hub (TCM Hub) - a connectivity map resource to facilitate the elucidation of TCM MoA for drug repurposing research. TCMHub is freely available in http://tanlab.ucdenver.edu/TCMHub. Molecular mechanisms of TCM could be uncovered by using gene expression signatures and connectivity map. Through this analysis, we identified many of the TCM components possess diverse MoAs, this may explain the applications of TCM in treating various symptoms and diseases. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  16. Differential gene expression detection and sample classification using penalized linear regression models.

    PubMed

    Wu, Baolin

    2006-02-15

    Differential gene expression detection and sample classification using microarray data have received much research interest recently. Owing to the large number of genes p and small number of samples n (p > n), microarray data analysis poses big challenges for statistical analysis. An obvious problem owing to the 'large p small n' is over-fitting. Just by chance, we are likely to find some non-differentially expressed genes that can classify the samples very well. The idea of shrinkage is to regularize the model parameters to reduce the effects of noise and produce reliable inferences. Shrinkage has been successfully applied in the microarray data analysis. The SAM statistics proposed by Tusher et al. and the 'nearest shrunken centroid' proposed by Tibshirani et al. are ad hoc shrinkage methods. Both methods are simple, intuitive and prove to be useful in empirical studies. Recently Wu proposed the penalized t/F-statistics with shrinkage by formally using the (1) penalized linear regression models for two-class microarray data, showing good performance. In this paper we systematically discussed the use of penalized regression models for analyzing microarray data. We generalize the two-class penalized t/F-statistics proposed by Wu to multi-class microarray data. We formally derive the ad hoc shrunken centroid used by Tibshirani et al. using the (1) penalized regression models. And we show that the penalized linear regression models provide a rigorous and unified statistical framework for sample classification and differential gene expression detection.

  17. Expression quantitative trait loci (eQTL) mapping in Puerto Rican children.

    PubMed

    Chen, Wei; Brehm, John M; Lin, Jerome; Wang, Ting; Forno, Erick; Acosta-Pérez, Edna; Boutaoui, Nadia; Canino, Glorisa; Celedón, Juan C

    2015-01-01

    Expression quantitative trait loci (eQTL) have been identified using tissue or cell samples from diverse human populations, thus enhancing our understanding of regulation of gene expression. However, few studies have attempted to identify eQTL in racially admixed populations such as Hispanics. We performed a systematic eQTL study to identify regulatory variants of gene expression in whole blood from 121 Puerto Rican children with (n = 63) and without (n = 58) asthma. Genome-wide genotyping was conducted using the Illumina Omni2.5M Bead Chip, and gene expression was assessed using the Illumina HT-12 microarray. After completing quality control, we performed a pair-wise genome analysis of ~15 K transcripts and ~1.3 M SNPs for both local and distal effects. This analysis was conducted under a regression framework adjusting for age, gender and principal components derived from both genotypic and mRNA data. We used a false discovery rate (FDR) approach to identify significant eQTL signals, which were next compared to top eQTL signals from existing eQTL databases. We then performed a pathway analysis for our top genes. We identified 36,720 local pairs in 3,391 unique genes and 1,851 distal pairs in 446 unique genes at FDR <0.05, corresponding to unadjusted P values lower than 1.5x10-4 and 4.5x10-9, respectively. A significant proportion of genes identified in our study overlapped with those identified in previous studies. We also found an enrichment of disease-related genes in our eQTL list. We present results from the first eQTL study in Puerto Rican children, who are members of a unique Hispanic cohort disproportionately affected with asthma, prematurity, obesity and other common diseases. Our study confirmed eQTL signals identified in other ethnic groups, while also detecting additional eQTLs unique to our study population. The identified eQTLs will help prioritize findings from future genome-wide association studies in Puerto Ricans.

  18. DNetDB: The human disease network database based on dysfunctional regulation mechanism.

    PubMed

    Yang, Jing; Wu, Su-Juan; Yang, Shao-You; Peng, Jia-Wei; Wang, Shi-Nuo; Wang, Fu-Yan; Song, Yu-Xing; Qi, Ting; Li, Yi-Xue; Li, Yuan-Yuan

    2016-05-21

    Disease similarity study provides new insights into disease taxonomy, pathogenesis, which plays a guiding role in diagnosis and treatment. The early studies were limited to estimate disease similarities based on clinical manifestations, disease-related genes, medical vocabulary concepts or registry data, which were inevitably biased to well-studied diseases and offered small chance of discovering novel findings in disease relationships. In other words, genome-scale expression data give us another angle to address this problem since simultaneous measurement of the expression of thousands of genes allows for the exploration of gene transcriptional regulation, which is believed to be crucial to biological functions. Although differential expression analysis based methods have the potential to explore new disease relationships, it is difficult to unravel the upstream dysregulation mechanisms of diseases. We therefore estimated disease similarities based on gene expression data by using differential coexpression analysis, a recently emerging method, which has been proved to be more potential to capture dysfunctional regulation mechanisms than differential expression analysis. A total of 1,326 disease relationships among 108 diseases were identified, and the relevant information constituted the human disease network database (DNetDB). Benefiting from the use of differential coexpression analysis, the potential common dysfunctional regulation mechanisms shared by disease pairs (i.e. disease relationships) were extracted and presented. Statistical indicators, common disease-related genes and drugs shared by disease pairs were also included in DNetDB. In total, 1,326 disease relationships among 108 diseases, 5,598 pathways, 7,357 disease-related genes and 342 disease drugs are recorded in DNetDB, among which 3,762 genes and 148 drugs are shared by at least two diseases. DNetDB is the first database focusing on disease similarity from the viewpoint of gene regulation mechanism. It provides an easy-to-use web interface to search and browse the disease relationships and thus helps to systematically investigate etiology and pathogenesis, perform drug repositioning, and design novel therapeutic interventions.Database URL: http://app.scbit.org/DNetDB/ #.

  19. Genome-wide identification and characterization of five MyD88 duplication genes in Yesso scallop (Patinopecten yessoensis) and expression changes in response to bacterial challenge.

    PubMed

    Ning, Xianhui; Wang, Ruijia; Li, Xue; Wang, Shuyue; Zhang, Mengran; Xing, Qiang; Sun, Yan; Wang, Shi; Zhang, Lingling; Hu, Xiaoli; Bao, Zhenmin

    2015-10-01

    Myeloid differentiation factor 88 (MyD88) is a pivotal adaptor in the TLR/IL-1R signaling pathway, which plays an important role in activating the innate immune system. Although MyD88 genes have been identified in a variety of species, they have not been systematically characterized in scallops. In this study, five MyD88 genes were identified in Yesso scallop (Patinopecten yessoensis), PyMyD88-1, PyMyD88-2a, PyMyD88-2b, PyMyD88-3 and PyMyD88-4, which consisted of two pairs of tandem duplications located on the same chromosome. To our knowledge, this is the largest number of MyD88 genes found in an invertebrate. Phylogenetic and protein structural analyses were carried out to determine the identities and evolutionary relationships of these genes. PyMyD88s have highly conserved structures compared to MyD88 genes from other invertebrate species, except for PyMyD88-4, which contains only a DD domain, suggesting the evolutionarily conserved form of this particular gene member. We investigated the expression profiles of PyMyD88 genes at different developmental stages and in healthy adult tissues and hemocytes after Micrococcus luteus and Vibrio anguillarum infection using quantitative real-time PCR (qRT-PCR). The expression of most PyMyD88s was significantly induced in the acute phase (3-6 h) after infection with both gram-positive (M. luteus) and gram-negative (V. anguillarum) bacteria, with much more dramatic changes in PyMyD88 expression being observed after V. anguillarum challenge. Collectively, the abundance of MyD88s and their specific expression patterns provide insight into their versatile roles in the response of the bivalve innate immune system to gram-negative bacterial pathogens. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Hey bHLH transcription factors.

    PubMed

    Weber, David; Wiese, Cornelia; Gessler, Manfred

    2014-01-01

    Hey bHLH transcription factors are direct targets of canonical Notch signaling. The three mammalian Hey proteins are closely related to Hes proteins and they primarily repress target genes by either directly binding to core promoters or by inhibiting other transcriptional activators. Individual candidate gene approaches and systematic screens identified a number of Hey target genes, which often encode other transcription factors involved in various developmental processes. Here, we review data on interaction partners and target genes and conclude with a model for Hey target gene regulation. Furthermore, we discuss how expression of Hey proteins affects processes like cell fate decisions and differentiation, e.g., in cardiovascular, skeletal, and neural development or oncogenesis and how this relates to the observed developmental defects and phenotypes observed in various knockout mice. © 2014 Elsevier Inc. All rights reserved.

  1. Kinetic models of gene expression including non-coding RNAs

    NASA Astrophysics Data System (ADS)

    Zhdanov, Vladimir P.

    2011-03-01

    In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.

  2. Comparative Analysis of mRNA Isoform Expression in Cardiac Hypertrophy and Development Reveals Multiple Post-Transcriptional Regulatory Modules

    PubMed Central

    Park, Ji Yeon; Li, Wencheng; Zheng, Dinghai; Zhai, Peiyong; Zhao, Yun; Matsuda, Takahisa; Vatner, Stephen F.; Sadoshima, Junichi; Tian, Bin

    2011-01-01

    Cardiac hypertrophy is enlargement of the heart in response to physiological or pathological stimuli, chiefly involving growth of myocytes in size rather than in number. Previous studies have shown that the expression pattern of a group of genes in hypertrophied heart induced by pressure overload resembles that at the embryonic stage of heart development, a phenomenon known as activation of the “fetal gene program”. Here, using a genome-wide approach we systematically defined genes and pathways regulated in short- and long-term cardiac hypertrophy conditions using mice with transverse aortic constriction (TAC), and compared them with those regulated at different stages of embryonic and postnatal development. In addition, exon-level analysis revealed widespread mRNA isoform changes during cardiac hypertrophy resulting from alternative usage of terminal or internal exons, some of which are also developmentally regulated and may be attributable to decreased expression of Fox-1 protein in cardiac hypertrophy. Genes with functions in certain pathways, such as cell adhesion and cell morphology, are more likely to be regulated by alternative splicing. Moreover, we found 3′UTRs of mRNAs were generally shortened through alternative cleavage and polyadenylation in hypertrophy, and microRNA target genes were generally de-repressed, suggesting coordinated mechanisms to increase mRNA stability and protein production during hypertrophy. Taken together, our results comprehensively delineated gene and mRNA isoform regulation events in cardiac hypertrophy and revealed their relations to those in development, and suggested that modulation of mRNA isoform expression plays an importance role in heart remodeling under pressure overload. PMID:21799842

  3. The top skin-associated genes: a comparative analysis of human and mouse skin transcriptomes.

    PubMed

    Gerber, Peter Arne; Buhren, Bettina Alexandra; Schrumpf, Holger; Homey, Bernhard; Zlotnik, Albert; Hevezi, Peter

    2014-06-01

    The mouse represents a key model system for the study of the physiology and biochemistry of skin. Comparison of skin between mouse and human is critical for interpretation and application of data from mouse experiments to human disease. Here, we review the current knowledge on structure and immunology of mouse and human skin. Moreover, we present a systematic comparison of human and mouse skin transcriptomes. To this end, we have recently used a genome-wide database of human gene expression to identify genes highly expressed in skin, with no, or limited expression elsewhere - human skin-associated genes (hSAGs). Analysis of our set of hSAGs allowed us to generate a comprehensive molecular characterization of healthy human skin. Here, we used a similar database to generate a list of mouse skin-associated genes (mSAGs). A comparative analysis between the top human (n=666) and mouse (n=873) skin-associated genes (SAGs) revealed a total of only 30.2% identity between the two lists. The majority of shared genes encode proteins that participate in structural and barrier functions. Analysis of the top functional annotation terms revealed an overlap for morphogenesis, cell adhesion, structure, and signal transduction. The results of this analysis, discussed in the context of published data, illustrate the diversity between the molecular make up of skin of both species and grants a probable explanation, why results generated in murine in vivo models often fail to translate into the human.

  4. LPL is the strongest prognostic factor in a comparative analysis of RNA-based markers in early chronic lymphocytic leukemia.

    PubMed

    Kaderi, Mohd Arifin; Kanduri, Meena; Buhl, Anne Mette; Sevov, Marie; Cahill, Nicola; Gunnarsson, Rebeqa; Jansson, Mattias; Smedby, Karin Ekström; Hjalgrim, Henrik; Jurlander, Jesper; Juliusson, Gunnar; Mansouri, Larry; Rosenquist, Richard

    2011-08-01

    The expression levels of LPL, ZAP70, TCL1A, CLLU1 and MCL1 have recently been proposed as prognostic factors in chronic lymphocytic leukemia. However, few studies have systematically compared these different RNA-based markers. Using real-time quantitative PCR, we measured the mRNA expression levels of these genes in unsorted samples from 252 newly diagnosed chronic lymphocytic leukemia patients and correlated our data with established prognostic markers (for example Binet stage, CD38, IGHV gene mutational status and genomic aberrations) and clinical outcome. High expression levels of all RNA-based markers, except MCL1, predicted shorter overall survival and time to treatment, with LPL being the most significant. In multivariate analysis including the RNA-based markers, LPL expression was the only independent prognostic marker for overall survival and time to treatment. When studying LPL expression and the established markers, LPL expression retained its independent prognostic strength for overall survival. All of the RNA-based markers, albeit with varying ability, added prognostic information to established markers, with LPL expression giving the most significant results. Notably, high LPL expression predicted a worse outcome in good-prognosis subgroups, such as patients with mutated IGHV genes, Binet stage A, CD38 negativity or favorable cytogenetics. In particular, the combination of LPL expression and CD38 could further stratify Binet stage A patients. LPL expression is the strongest RNA-based prognostic marker in chronic lymphocytic leukemia that could potentially be applied to predict outcome in the clinical setting, particularly in the large group of patients with favorable prognosis.

  5. A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.

    PubMed

    Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin

    2018-04-26

    Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.

  6. A Multipurpose Toolkit to Enable Advanced Genome Engineering in Plants[OPEN

    PubMed Central

    Gil-Humanes, Javier; Čegan, Radim; Kono, Thomas J.Y.; Konečná, Eva; Belanto, Joseph J.; Starker, Colby G.

    2017-01-01

    We report a comprehensive toolkit that enables targeted, specific modification of monocot and dicot genomes using a variety of genome engineering approaches. Our reagents, based on transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 system, are systematized for fast, modular cloning and accommodate diverse regulatory sequences to drive reagent expression. Vectors are optimized to create either single or multiple gene knockouts and large chromosomal deletions. Moreover, integration of geminivirus-based vectors enables precise gene editing through homologous recombination. Regulation of transcription is also possible. A Web-based tool streamlines vector selection and construction. One advantage of our platform is the use of the Csy-type (CRISPR system yersinia) ribonuclease 4 (Csy4) and tRNA processing enzymes to simultaneously express multiple guide RNAs (gRNAs). For example, we demonstrate targeted deletions in up to six genes by expressing 12 gRNAs from a single transcript. Csy4 and tRNA expression systems are almost twice as effective in inducing mutations as gRNAs expressed from individual RNA polymerase III promoters. Mutagenesis can be further enhanced 2.5-fold by incorporating the Trex2 exonuclease. Finally, we demonstrate that Cas9 nickases induce gene targeting at frequencies comparable to native Cas9 when they are delivered on geminivirus replicons. The reagents have been successfully validated in tomato (Solanum lycopersicum), tobacco (Nicotiana tabacum), Medicago truncatula, wheat (Triticum aestivum), and barley (Hordeum vulgare). PMID:28522548

  7. A multi-purpose toolkit to enable advanced genome engineering in plants

    DOE PAGES

    Cermak, Tomas; Curtin, Shaun J.; Gil-Humanes, Javier; ...

    2017-05-18

    Here, we report a comprehensive toolkit that enables targeted, specific modification of monocot and dicot genomes using a variety of genome engineering approaches. Our reagents, based on Transcription Activator-Like Effector Nucleases TALENs and the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas9 system, are systematized for fast, modular cloning and accommodate diverse regulatory sequences to drive reagent expression. Vectors are optimized to create either single or multiple gene knockouts and large chromosomal deletions. Moreover, integration of geminivirus-based vectors enables precise gene editing through homologous recombination. Regulation of transcription is also possible. A web-based tool streamlines vector selection and construction. One advantagemore » of our platform is the use of the Csy-type (CRISPR system yersinia) ribonuclease 4 Csy4 and tRNA processing enzymes to simultaneously express multiple guide RNAs (gRNAs). For example, we demonstrate targeted deletions in up to six genes by expressing twelve gRNAs from a single transcript. Csy4 and tRNA expression systems are almost twice as effective in inducing mutations as gRNAs expressed from individual RNA polymerase III promoters. Mutagenesis can be further enhanced 2.5-fold by incorporating the Trex2 exonuclease. Finally, we demonstrate that Cas9 nickases induce gene targeting at frequencies comparable to native Cas9 when they are delivered on geminivirus replicons. The reagents have been successfully validated in tomato (Solanum lycopersicum), tobacco (Nicotiana tabacum), Medicago truncatula, wheat (Triticum aestivum), and barley (Hordeum vulgare).« less

  8. A multi-purpose toolkit to enable advanced genome engineering in plants

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cermak, Tomas; Curtin, Shaun J.; Gil-Humanes, Javier

    Here, we report a comprehensive toolkit that enables targeted, specific modification of monocot and dicot genomes using a variety of genome engineering approaches. Our reagents, based on Transcription Activator-Like Effector Nucleases TALENs and the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas9 system, are systematized for fast, modular cloning and accommodate diverse regulatory sequences to drive reagent expression. Vectors are optimized to create either single or multiple gene knockouts and large chromosomal deletions. Moreover, integration of geminivirus-based vectors enables precise gene editing through homologous recombination. Regulation of transcription is also possible. A web-based tool streamlines vector selection and construction. One advantagemore » of our platform is the use of the Csy-type (CRISPR system yersinia) ribonuclease 4 Csy4 and tRNA processing enzymes to simultaneously express multiple guide RNAs (gRNAs). For example, we demonstrate targeted deletions in up to six genes by expressing twelve gRNAs from a single transcript. Csy4 and tRNA expression systems are almost twice as effective in inducing mutations as gRNAs expressed from individual RNA polymerase III promoters. Mutagenesis can be further enhanced 2.5-fold by incorporating the Trex2 exonuclease. Finally, we demonstrate that Cas9 nickases induce gene targeting at frequencies comparable to native Cas9 when they are delivered on geminivirus replicons. The reagents have been successfully validated in tomato (Solanum lycopersicum), tobacco (Nicotiana tabacum), Medicago truncatula, wheat (Triticum aestivum), and barley (Hordeum vulgare).« less

  9. Genome-Wide Identification, Evolutionary Expansion, and Expression Profile of Homeodomain-Leucine Zipper Gene Family in Poplar (Populus trichocarpa)

    PubMed Central

    Hu, Ruibo; Chi, Xiaoyuan; Chai, Guohua; Kong, Yingzhen; He, Guo; Wang, Xiaoyu; Shi, Dachuan; Zhang, Dongyuan; Zhou, Gongke

    2012-01-01

    Background Homeodomain-leucine zipper (HD-ZIP) proteins are plant-specific transcriptional factors known to play crucial roles in plant development. Although sequence phylogeny analysis of Populus HD-ZIPs was carried out in a previous study, no systematic analysis incorporating genome organization, gene structure, and expression compendium has been conducted in model tree species Populus thus far. Principal Findings In this study, a comprehensive analysis of Populus HD-ZIP gene family was performed. Sixty-three full-length HD-ZIP genes were found in Populus genome. These Populus HD-ZIP genes were phylogenetically clustered into four distinct subfamilies (HD-ZIP I–IV) and predominately distributed across 17 linkage groups (LG). Fifty genes from 25 Populus paralogous pairs were located in the duplicated blocks of Populus genome and then preferentially retained during the sequential evolutionary courses. Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus HD-ZIP gene family. Microarray analysis has shown that 21 Populus paralogous pairs have been differentially expressed across different tissues and under various stresses, with five paralogous pairs showing nearly identical expression patterns, 13 paralogous pairs being partially redundant and three paralogous pairs diversifying significantly. Quantitative real-time RT-PCR (qRT-PCR) analysis performed on 16 selected Populus HD-ZIP genes in different tissues and under both drought and salinity stresses confirms their tissue-specific and stress-inducible expression patterns. Conclusions Genomic organizations indicated that segmental duplications contributed significantly to the expansion of Populus HD-ZIP gene family. Exon/intron organization and conserved motif composition of Populus HD-ZIPs are highly conservative in the same subfamily, suggesting the members in the same subfamilies may also have conservative functionalities. Microarray and qRT-PCR analyses showed that 89% (56 out of 63) of Populus HD-ZIPs were duplicate genes that might have been retained by substantial subfunctionalization. Taken together, these observations may lay the foundation for future functional analysis of Populus HD-ZIP genes to unravel their biological roles. PMID:22359569

  10. Genome-wide characterization of Toll-like receptor gene family in common carp (Cyprinus carpio) and their involvement in host immune response to Aeromonas hydrophila infection.

    PubMed

    Gong, Yiwen; Feng, Shuaisheng; Li, Shangqi; Zhang, Yan; Zhao, Zixia; Hu, Mou; Xu, Peng; Jiang, Yanliang

    2017-12-01

    The Toll-like receptor (TLR) gene family is a class of conserved pattern recognition receptors, which play an essential role in innate immunity providing efficient defense against invading microbial pathogens. Although TLRs have been extensively characterized in both invertebrates and vertebrates, a comprehensive analysis of TLRs in common carp is lacking. In the present study, we have conducted the first genome-wide systematic analysis of common carp (Cyprinus carpio) TLR genes. A set of 27 common carp TLR genes were identified and characterized. Sequence similarity analysis, functional domain prediction and phylogenetic analysis supported their annotation and orthologies. By examining the gene copy number of TLR genes across several vertebrates, gene duplications and losses were observed. The expression patterns of TLR genes were examined during early developmental stages and in various healthy tissues, and the results showed that TLR genes were ubiquitously expressed, indicating a likely role in maintaining homeostasis. Moreover, the differential expression of TLRs was examined after Aeromons hydrophila infection, and showed that most TLR genes were induced, with diverse patterns. TLR1, TLR4-2, TLR4-3, TLR22-2, TLR22-3 were significantly up-regulated at minimum one timepoint, whereas TLR2-1, TLR4-1, TLR7-1 and TLR7-2 were significantly down-regulated. Our results suggested that TLR genes play critical roles in the common carp immune response. Collectively, our findings provide fundamental genomic resources for future studies on fish disease management and disease-resistance selective breeding strategy development. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Impact of missing data imputation methods on gene expression clustering and classification.

    PubMed

    de Souto, Marcilio C P; Jaskowiak, Pablo A; Costa, Ivan G

    2015-02-26

    Several missing value imputation methods for gene expression data have been proposed in the literature. In the past few years, researchers have been putting a great deal of effort into presenting systematic evaluations of the different imputation algorithms. Initially, most algorithms were assessed with an emphasis on the accuracy of the imputation, using metrics such as the root mean squared error. However, it has become clear that the success of the estimation of the expression value should be evaluated in more practical terms as well. One can consider, for example, the ability of the method to preserve the significant genes in the dataset, or its discriminative/predictive power for classification/clustering purposes. We performed a broad analysis of the impact of five well-known missing value imputation methods on three clustering and four classification methods, in the context of 12 cancer gene expression datasets. We employed a statistical framework, for the first time in this field, to assess whether different imputation methods improve the performance of the clustering/classification methods. Our results suggest that the imputation methods evaluated have a minor impact on the classification and downstream clustering analyses. Simple methods such as replacing the missing values by mean or the median values performed as well as more complex strategies. The datasets analyzed in this study are available at http://costalab.org/Imputation/ .

  12. Identification and Expression Analysis of Cytokinin Metabolic Genes in Soybean under Normal and Drought Conditions in Relation to Cytokinin Levels

    PubMed Central

    Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Vankova, Radomira; Tanaka, Maho; Seki, Motoaki; Ham, Le Huy; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan

    2012-01-01

    Cytokinins (CKs) mediate cellular responses to drought stress and targeted control of CK metabolism can be used to develop drought-tolerant plants. Aiming to manipulate CK levels to improve drought tolerance of soybean cultivars through genetic engineering of CK metabolic genes, we surveyed the soybean genome and identified 14 CK biosynthetic (isopentenyltransferase, GmIPT) and 17 CK degradative (CK dehydrogenase, GmCKX) genes. Comparative analyses of GmIPTs and GmCKXs with Arabidopsis counterparts revealed their similar architecture. The average numbers of abiotic stress-inducible cis-elements per promoter were 0.4 and 1.2 for GmIPT and GmCKX genes, respectively, suggesting that upregulation of GmCKXs, thereby reduction of CK levels, maybe the major events under abiotic stresses. Indeed, the expression of 12 GmCKX genes was upregulated by dehydration in R2 roots. Overall, the expressions of soybean CK metabolic genes in various tissues at various stages were highly responsive to drought. CK contents in various organs at the reproductive (R2) stage were also determined under well-watered and drought stress conditions. Although tRNA-type GmIPT genes were highly expressed in soybean, cis-zeatin and its derivatives were found at low concentrations. Moreover, reduction of total CK content in R2 leaves under drought was attributable to the decrease in dihydrozeatin levels, suggesting a role of this molecule in regulating soybean's responses to drought stress. Our systematic analysis of the GmIPT and GmCKX families has provided an insight into CK metabolism in soybean under drought stress and a solid foundation for in-depth characterization and future development of improved drought-tolerant soybean cultivars by manipulation of CK levels via biotechnological approach. PMID:22900018

  13. Identification of Suitable Reference Genes for Gene Expression Normalization in qRT-PCR Analysis in Watermelon

    PubMed Central

    Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

    2014-01-01

    Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT–PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT–PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT–PCR analyses involving watermelon. PMID:24587403

  14. Identification of suitable reference genes for gene expression normalization in qRT-PCR analysis in watermelon.

    PubMed

    Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

    2014-01-01

    Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT-PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT-PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT-PCR analyses involving watermelon.

  15. Localization of proteasomes and proteasomal proteolysis in the mammalian interphase cell nucleus by systematic application of immunocytochemistry.

    PubMed

    Scharf, Andrea; Rockel, Thomas Dino; von Mikecz, Anna

    2007-06-01

    Proteasomes are ATP-driven, multisubunit proteolytic machines that degrade endogenous proteins into peptides and play a crucial role in cellular events such as the cell cycle, signal transduction, maintenance of proper protein folding and gene expression. Recent evidence indicates that the ubiquitin-proteasome system is an active component of the cell nucleus. A characteristic feature of the nucleus is its organization into distinct domains that have a unique composition of macromolecules and dynamically form as a response to the requirements of nuclear function. Here, we show by systematic application of different immunocytochemical procedures and comparison with signature proteins of nuclear domains that during interphase endogenous proteasomes are localized diffusely throughout the nucleoplasm, in speckles, in nuclear bodies, and in nucleoplasmic foci. Proteasomes do not occur in the nuclear envelope region or the nucleolus, unless nucleoplasmic invaginations expand into this nuclear body. Confirmedly, proteasomal proteolysis is detected in nucleoplasmic foci, but is absent from the nuclear envelope or nucleolus. The results underpin the idea that the ubiquitin-proteasome system is not only located, but also proteolytically active in distinct nuclear domains and thus may be directly involved in gene expression, and nuclear quality control.

  16. Genome-wide identification and expression profiling reveal tissue-specific expression and differentially-regulated genes involved in gibberellin metabolism between Williams banana and its dwarf mutant.

    PubMed

    Chen, Jingjing; Xie, Jianghui; Duan, Yajie; Hu, Huigang; Hu, Yulin; Li, Weiming

    2016-05-27

    Dwarfism is one of the most valuable traits in banana breeding because semi-dwarf cultivars show good resistance to damage by wind and rain. Moreover, these cultivars present advantages of convenient cultivation, management, and so on. We obtained a dwarf mutant '8818-1' through EMS (ethyl methane sulphonate) mutagenesis of Williams banana 8818 (Musa spp. AAA group). Our research have shown that gibberellins (GAs) content in 8818-1 false stems was significantly lower than that in its parent 8818 and the dwarf type of 8818-1 could be restored by application of exogenous GA3. Although GA exerts important impacts on the 8818-1 dwarf type, our understanding of the regulation of GA metabolism during banana dwarf mutant development remains limited. Genome-wide screening revealed 36 candidate GA metabolism genes were systematically identified for the first time; these genes included 3 MaCPS, 2 MaKS, 1 MaKO, 2 MaKAO, 10 MaGA20ox, 4 MaGA3ox, and 14 MaGA2ox genes. Phylogenetic tree and conserved protein domain analyses showed sequence conservation and divergence. GA metabolism genes exhibited tissue-specific expression patterns. Early GA biosynthesis genes were constitutively expressed but presented differential regulation in different tissues in Williams banana. GA oxidase family genes were mainly transcribed in young fruits, thus suggesting that young fruits were the most active tissue involved in GA metabolism, followed by leaves, bracts, and finally approximately mature fruits. Expression patterns between 8818 and 8818-1 revealed that MaGA20ox4, MaGA20ox5, and MaGA20ox7 of the MaGA20ox gene family and MaGA2ox7, MaGA2ox12, and MaGA2ox14 of the MaGA2ox gene family exhibited significant differential expression and high-expression levels in false stems. These genes are likely to be responsible for the regulation of GAs content in 8818-1 false stems. Overall, phylogenetic evolution, tissue specificity and differential expression analyses of GA metabolism genes can provide a better understanding of GA-regulated development in banana. The present results revealed that MaGA20ox4, MaGA20ox5, MaGA20ox7, MaGA2ox7, MaGA2ox12, and MaGA2ox14 were the main genes regulating GA content difference between 8818 and 8818-1. All of these genes may perform important functions in the developmental processes of banana, but each gene may perform different functions in different tissues or during different developmental stages.

  17. The auxin response factor gene family in banana: genome-wide identification and expression analyses during development, ripening, and abiotic stress

    PubMed Central

    Hu, Wei; Zuo, Jiao; Hou, Xiaowan; Yan, Yan; Wei, Yunxie; Liu, Juhua; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

    2015-01-01

    Auxin signaling regulates various auxin-responsive genes via two types of transcriptional regulators, Auxin Response Factors (ARF) and Aux/IAA. ARF transcription factors act as critical components of auxin signaling that play important roles in modulating various biological processes. However, limited information about this gene family in fruit crops is currently available. Herein, 47 ARF genes were identified in banana based on its genome sequence. Phylogenetic analysis of the ARFs from banana, rice, and Arabidopsis suggested that the ARFs could be divided into four subgroups, among which most ARFs from the banana showed a closer relationship with those from rice than those from Arabidopsis. Conserved motif analysis showed that all identified MaARFs had typical DNA-binding and ARF domains, but 12 members lacked the dimerization domain. Gene structure analysis showed that the number of exons in MaARF genes ranged from 5 to 21, suggesting large variation amongst banana ARF genes. The comprehensive expression profiles of MaARF genes yielded useful information about their involvement in diverse tissues, different stages of fruit development and ripening, and responses to abiotic stresses in different varieties. Interaction networks and co-expression assays indicated the strong transcriptional response of banana ARFs and ARF-mediated networks in early fruit development for different varieties. Our systematic analysis of MaARFs revealed robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MaARF genes for further functional assays in planta. These findings could lead to potential applications in the genetic improvement of banana cultivars, and yield new insights into the complexity of the control of MaARF gene expression at the transcriptional level. Finally, they support the hypothesis that ARFs are a crucial component of the auxin signaling pathway, which regulates a wide range of physiological processes. PMID:26442055

  18. DFP: a Bioconductor package for fuzzy profile identification and gene reduction of microarray data

    PubMed Central

    Glez-Peña, Daniel; Álvarez, Rodrigo; Díaz, Fernando; Fdez-Riverola, Florentino

    2009-01-01

    Background Expression profiling assays done by using DNA microarray technology generate enormous data sets that are not amenable to simple analysis. The greatest challenge in maximizing the use of this huge amount of data is to develop algorithms to interpret and interconnect results from different genes under different conditions. In this context, fuzzy logic can provide a systematic and unbiased way to both (i) find biologically significant insights relating to meaningful genes, thereby removing the need for expert knowledge in preliminary steps of microarray data analyses and (ii) reduce the cost and complexity of later applied machine learning techniques being able to achieve interpretable models. Results DFP is a new Bioconductor R package that implements a method for discretizing and selecting differentially expressed genes based on the application of fuzzy logic. DFP takes advantage of fuzzy membership functions to assign linguistic labels to gene expression levels. The technique builds a reduced set of relevant genes (FP, Fuzzy Pattern) able to summarize and represent each underlying class (pathology). A last step constructs a biased set of genes (DFP, Discriminant Fuzzy Pattern) by intersecting existing fuzzy patterns in order to detect discriminative elements. In addition, the software provides new functions and visualisation tools that summarize achieved results and aid in the interpretation of differentially expressed genes from multiple microarray experiments. Conclusion DFP integrates with other packages of the Bioconductor project, uses common data structures and is accompanied by ample documentation. It has the advantage that its parameters are highly configurable, facilitating the discovery of biologically relevant connections between sets of genes belonging to different pathologies. This information makes it possible to automatically filter irrelevant genes thereby reducing the large volume of data supplied by microarray experiments. Based on these contributions GENECBR, a successful tool for cancer diagnosis using microarray datasets, has recently been released. PMID:19178723

  19. DFP: a Bioconductor package for fuzzy profile identification and gene reduction of microarray data.

    PubMed

    Glez-Peña, Daniel; Alvarez, Rodrigo; Díaz, Fernando; Fdez-Riverola, Florentino

    2009-01-29

    Expression profiling assays done by using DNA microarray technology generate enormous data sets that are not amenable to simple analysis. The greatest challenge in maximizing the use of this huge amount of data is to develop algorithms to interpret and interconnect results from different genes under different conditions. In this context, fuzzy logic can provide a systematic and unbiased way to both (i) find biologically significant insights relating to meaningful genes, thereby removing the need for expert knowledge in preliminary steps of microarray data analyses and (ii) reduce the cost and complexity of later applied machine learning techniques being able to achieve interpretable models. DFP is a new Bioconductor R package that implements a method for discretizing and selecting differentially expressed genes based on the application of fuzzy logic. DFP takes advantage of fuzzy membership functions to assign linguistic labels to gene expression levels. The technique builds a reduced set of relevant genes (FP, Fuzzy Pattern) able to summarize and represent each underlying class (pathology). A last step constructs a biased set of genes (DFP, Discriminant Fuzzy Pattern) by intersecting existing fuzzy patterns in order to detect discriminative elements. In addition, the software provides new functions and visualisation tools that summarize achieved results and aid in the interpretation of differentially expressed genes from multiple microarray experiments. DFP integrates with other packages of the Bioconductor project, uses common data structures and is accompanied by ample documentation. It has the advantage that its parameters are highly configurable, facilitating the discovery of biologically relevant connections between sets of genes belonging to different pathologies. This information makes it possible to automatically filter irrelevant genes thereby reducing the large volume of data supplied by microarray experiments. Based on these contributions GENECBR, a successful tool for cancer diagnosis using microarray datasets, has recently been released.

  20. Identification of genes and gene pathways associated with major depressive disorder by integrative brain analysis of rat and human prefrontal cortex transcriptomes

    PubMed Central

    Malki, K; Pain, O; Tosto, M G; Du Rietz, E; Carboni, L; Schalkwyk, L C

    2015-01-01

    Despite moderate heritability estimates, progress in uncovering the molecular substrate underpinning major depressive disorder (MDD) has been slow. In this study, we used prefrontal cortex (PFC) gene expression from a genetic rat model of MDD to inform probe set prioritization in PFC in a human post-mortem study to uncover genes and gene pathways associated with MDD. Gene expression differences between Flinders sensitive (FSL) and Flinders resistant (FRL) rat lines were statistically evaluated using the RankProd, non-parametric algorithm. Top ranking probe sets in the rat study were subsequently used to prioritize orthologous selection in a human PFC in a case–control post-mortem study on MDD from the Stanley Brain Consortium. Candidate genes in the human post-mortem study were then tested against a matched control sample using the RankProd method. A total of 1767 probe sets were differentially expressed in the PFC between FSL and FRL rat lines at (q⩽0.001). A total of 898 orthologous probe sets was found on Affymetrix's HG-U95A chip used in the human study. Correcting for the number of multiple, non-independent tests, 20 probe sets were found to be significantly dysregulated between human cases and controls at q⩽0.05. These probe sets tagged the expression profile of 18 human genes (11 upregulated and seven downregulated). Using an integrative rat–human study, a number of convergent genes that may have a role in pathogenesis of MDD were uncovered. Eighty percent of these genes were functionally associated with a key stress response signalling cascade, involving NF-κB (nuclear factor kappa-light-chain-enhancer of activated B cells), AP-1 (activator protein 1) and ERK/MAPK, which has been systematically associated with MDD, neuroplasticity and neurogenesis. PMID:25734512

  1. Divergently expressed gene identification and interaction prediction of long noncoding RNA and mRNA involved in duck reproduction.

    PubMed

    Ren, Jindong; Du, Xue; Zeng, Tao; Chen, Li; Shen, Junda; Lu, Lizhi; Hu, Jianhong

    2017-10-01

    Long noncoding RNAs (lncRNAs) and divergently expressed genes exist widely in different tissues of mammals and birds, in which they are involved in various biological processes. However, there is limited information on their role in the regulation of normal biological processes during differentiation, development, and reproduction in birds. In this study, whole transcriptome strand-specific RNA sequencing of the ovary from young ducks (60days), first-laying ducks (160days), and old ducks, i.e., ducks that stopped laying eggs (490days) was performed. The lncRNAs and mRNAs from these ducks were systematically analyzed and identified by duck genome sequencing in the three study groups. The transcriptome from the duck ovary comprised 15,011 protein-coding genes and 2905 lncRNAs; all the lncRNAs were identified as novel long noncoding transcripts. The comparison of transcriptome data from different study groups identified 2240 divergent transcription genes and 135 divergently expressed lncRNAs, which differed among the groups; most of them were significantly downregulated with age. Among the divergent genes, 38 genes were related to the reproductive process and 6 genes were upregulated. Further prediction analysis revealed that 52 lncRNAs were closely correlated with divergent reproductive mRNAs. More importantly, 6 remarkable lncRNAs were correlated significantly with the conversion of the ovary in different phases. Our results aid in the understanding of the divergent transcriptome of duck ovary in different phases and the underlying mechanisms that drive the specificity of protein-coding genes and lncRNAs in duck ovary. Copyright © 2017. Published by Elsevier B.V.

  2. The BTB and CNC homology 1 (BACH1) target genes are involved in the oxidative stress response and in control of the cell cycle.

    PubMed

    Warnatz, Hans-Jörg; Schmidt, Dominic; Manke, Thomas; Piccini, Ilaria; Sultan, Marc; Borodina, Tatiana; Balzereit, Daniela; Wruck, Wasco; Soldatov, Alexey; Vingron, Martin; Lehrach, Hans; Yaspo, Marie-Laure

    2011-07-01

    The regulation of gene expression in response to environmental signals and metabolic imbalances is a key step in maintaining cellular homeostasis. BTB and CNC homology 1 (BACH1) is a heme-binding transcription factor repressing the transcription from a subset of MAF recognition elements at low intracellular heme levels. Upon heme binding, BACH1 is released from the MAF recognition elements, resulting in increased expression of antioxidant response genes. To systematically address the gene regulatory networks involving BACH1, we combined chromatin immunoprecipitation sequencing analysis of BACH1 target genes in HEK 293 cells with knockdown of BACH1 using three independent types of small interfering RNAs followed by transcriptome profiling using microarrays. The 59 BACH1 target genes identified by chromatin immunoprecipitation sequencing were found highly enriched in genes showing expression changes after BACH1 knockdown, demonstrating the impact of BACH1 repression on transcription. In addition to known and new BACH1 targets involved in heme degradation (HMOX1, FTL, FTH1, ME1, and SLC48A1) and redox regulation (GCLC, GCLM, and SLC7A11), we also discovered BACH1 target genes affecting cell cycle and apoptosis pathways (ITPR2, CALM1, SQSTM1, TFE3, EWSR1, CDK6, BCL2L11, and MAFG) as well as subcellular transport processes (CLSTN1, PSAP, MAPT, and vault RNA). The newly identified impact of BACH1 on genes involved in neurodegenerative processes and proliferation provides an interesting basis for future dissection of BACH1-mediated gene repression in neurodegeneration and virus-induced cancerogenesis.

  3. A microarray analysis of sexual dimorphism of adipose tissues in high-fat-diet-induced obese mice

    PubMed Central

    Grove, KL; Fried, SK; Greenberg, AS; Xiao, XQ; Clegg, DJ

    2013-01-01

    Objective A sexual dimorphism exists in body fat distribution; females deposit relatively more fat in subcutaneous/inguinal depots whereas males deposit more fat in the intra-abdominal/gonadal depot. Our objective was to systematically document depot- and sex-related differences in the accumulation of adipose tissue and gene expression, comparing differentially expressed genes in diet-induced obese mice with mice maintained on a chow diet. Research Design and Methods We used a microarray approach to determine whether there are sexual dimorphisms in gene expression in age-matched male, female or ovariectomized female (OVX) C57/BL6 mice maintained on a high-fat (HF) diet. We then compared expression of validated genes between the sexes on a chow diet. Results After exposure to a high fat diet for 12 weeks, females gained less weight than males. The microarray analyses indicate in intra-abdominal/gonadal adipose tissue in females 1642 genes differ by at least twofold between the depots, whereas 706 genes differ in subcutaneous/inguinal adipose tissue when compared with males. Only 138 genes are commonly regulated in both sexes and adipose tissue depots. Inflammatory genes (cytokine–cytokine receptor interactions and acute-phase protein synthesis) are upregulated in males when compared with females, and there is a partial reversal after OVX, where OVX adipose tissue gene expression is more ′male-like′. This pattern is not observed in mice maintained on chow. Histology of male gonadal white adipose tissue (GWAT) shows more crown-like structures than females, indicative of inflammation and adipose tissue remodeling. In addition, genes related to insulin signaling and lipid synthesis are higher in females than males, regardless of dietary exposure. Conclusions These data suggest that male and female adipose tissue differ between the sexes regardless of diet. Moreover, HF diet exposure elicits a much greater inflammatory response in males when compared with females. This data set underscores the importance of analyzing depot-, sex- and steroid-dependent regulation of adipose tissue distribution and function. PMID:20157318

  4. Identification of differentially expressed genes and pathways for intramuscular fat deposition in pectoralis major tissues of fast-and slow-growing chickens.

    PubMed

    Cui, Huan-Xian; Liu, Ran-Ran; Zhao, Gui-Ping; Zheng, Mai-Qing; Chen, Ji-Lan; Wen, Jie

    2012-05-30

    Intramuscular fat (IMF) is one of the important factors influencing meat quality, however, for chickens, the molecular regulatory mechanisms underlying this trait have not yet been determined. In this study, a systematic identification of candidate genes and new pathways related to IMF deposition in chicken breast tissue has been made using gene expression profiles of two distinct breeds: Beijing-you (BJY), a slow-growing Chinese breed possessing high meat quality and Arbor Acres (AA), a commercial fast-growing broiler line. Agilent cDNA microarray analyses were conducted to determine gene expression profiles of breast muscle sampled at different developmental stages of BJY and AA chickens. Relative to d 1 when there is no detectable IMF, breast muscle at d 21, d 42, d 90 and d 120 (only for BJY) contained 1310 differentially expressed genes (DEGs) in BJY and 1080 DEGs in AA. Of these, 34-70 DEGs related to lipid metabolism or muscle development processes were examined further in each breed based on Gene Ontology (GO) analysis. The expression of several DEGs was correlated, positively or negatively, with the changing patterns of lipid content or breast weight across the ages sampled, indicating that those genes may play key roles in these developmental processes. In addition, based on KEGG pathway analysis of DEGs in both BJY and AA chickens, it was found that in addition to pathways affecting lipid metabolism (pathways for MAPK & PPAR signaling), cell junction-related pathways (tight junction, ECM-receptor interaction, focal adhesion, regulation of actin cytoskeleton), which play a prominent role in maintaining the integrity of tissues, could contribute to the IMF deposition. The results of this study identified potential candidate genes associated with chicken IMF deposition and imply that IMF deposition in chicken breast muscle is regulated and mediated not only by genes and pathways related to lipid metabolism and muscle development, but also by others involved in cell junctions. These findings establish the groundwork and provide new clues for deciphering the molecular mechanisms underlying IMF deposition in poultry. Further studies at the translational and posttranslational level are now required to validate the genes and pathways identified here.

  5. The significance of translation regulation in the stress response

    PubMed Central

    2013-01-01

    Background The stress response in bacteria involves the multistage control of gene expression but is not entirely understood. To identify the translational response of bacteria in stress conditions and assess its contribution to the regulation of gene expression, the translational states of all mRNAs were compared under optimal growth condition and during nutrient (isoleucine) starvation. Results A genome-scale study of the translational response to nutritional limitation was performed in the model bacterium Lactococcus lactis. Two measures were used to assess the translational status of each individual mRNA: the fraction engaged in translation (ribosome occupancy) and ribosome density (number of ribosomes per 100 nucleotides). Under isoleucine starvation, half of the mRNAs considered were translationally down-regulated mainly due to decreased ribosome density. This pattern concerned genes involved in growth-related functions such as translation, transcription, and the metabolism of fatty acids, phospholipids and bases, contributing to the slowdown of growth. Only 4% of the mRNAs were translationally up-regulated, mostly related to prophagic expression in response to stress. The remaining genes exhibited antagonistic regulations of the two markers of translation. Ribosome occupancy increased significantly for all the genes involved in the biosynthesis of isoleucine, although their ribosome density had decreased. The results revealed complex translational regulation of this pathway, essential to cope with isoleucine starvation. To elucidate the regulation of global gene expression more generally, translational regulation was compared to transcriptional regulation under isoleucine starvation and to other post-transcriptional regulations related to mRNA degradation and mRNA dilution by growth. Translational regulation appeared to accentuate the effects of transcriptional changes for down-regulated growth-related functions under isoleucine starvation although mRNA stabilization and lower dilution by growth counterbalanced this effect. Conclusions We show that the contribution of translational regulation to the control of gene expression is significant in the stress response. Post-transcriptional regulation is complex and not systematically co-directional with transcription regulation. Post-transcriptional regulation is important to the understanding of gene expression control. PMID:23985063

  6. Shrinkage estimation of effect sizes as an alternative to hypothesis testing followed by estimation in high-dimensional biology: applications to differential gene expression.

    PubMed

    Montazeri, Zahra; Yanofsky, Corey M; Bickel, David R

    2010-01-01

    Research on analyzing microarray data has focused on the problem of identifying differentially expressed genes to the neglect of the problem of how to integrate evidence that a gene is differentially expressed with information on the extent of its differential expression. Consequently, researchers currently prioritize genes for further study either on the basis of volcano plots or, more commonly, according to simple estimates of the fold change after filtering the genes with an arbitrary statistical significance threshold. While the subjective and informal nature of the former practice precludes quantification of its reliability, the latter practice is equivalent to using a hard-threshold estimator of the expression ratio that is not known to perform well in terms of mean-squared error, the sum of estimator variance and squared estimator bias. On the basis of two distinct simulation studies and data from different microarray studies, we systematically compared the performance of several estimators representing both current practice and shrinkage. We find that the threshold-based estimators usually perform worse than the maximum-likelihood estimator (MLE) and they often perform far worse as quantified by estimated mean-squared risk. By contrast, the shrinkage estimators tend to perform as well as or better than the MLE and never much worse than the MLE, as expected from what is known about shrinkage. However, a Bayesian measure of performance based on the prior information that few genes are differentially expressed indicates that hard-threshold estimators perform about as well as the local false discovery rate (FDR), the best of the shrinkage estimators studied. Based on the ability of the latter to leverage information across genes, we conclude that the use of the local-FDR estimator of the fold change instead of informal or threshold-based combinations of statistical tests and non-shrinkage estimators can be expected to substantially improve the reliability of gene prioritization at very little risk of doing so less reliably. Since the proposed replacement of post-selection estimates with shrunken estimates applies as well to other types of high-dimensional data, it could also improve the analysis of SNP data from genome-wide association studies.

  7. Comparison of normalization methods for the analysis of metagenomic gene abundance data.

    PubMed

    Pereira, Mariana Buongermino; Wallroth, Mikael; Jonsson, Viktor; Kristiansson, Erik

    2018-04-20

    In shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated. Here, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance. This study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead to incorrect or obfuscated biological interpretation.

  8. EGRINs (Environmental Gene Regulatory Influence Networks) in Rice That Function in the Response to Water Deficit, High Temperature, and Agricultural Environments[OPEN

    PubMed Central

    Hafemeister, Christoph; Nicotra, Adrienne B.; Jagadish, S.V. Krishna; Bonneau, Richard; Purugganan, Michael

    2016-01-01

    Environmental gene regulatory influence networks (EGRINs) coordinate the timing and rate of gene expression in response to environmental signals. EGRINs encompass many layers of regulation, which culminate in changes in accumulated transcript levels. Here, we inferred EGRINs for the response of five tropical Asian rice (Oryza sativa) cultivars to high temperatures, water deficit, and agricultural field conditions by systematically integrating time-series transcriptome data, patterns of nucleosome-free chromatin, and the occurrence of known cis-regulatory elements. First, we identified 5447 putative target genes for 445 transcription factors (TFs) by connecting TFs with genes harboring known cis-regulatory motifs in nucleosome-free regions proximal to their transcriptional start sites. We then used network component analysis to estimate the regulatory activity for each TF based on the expression of its putative target genes. Finally, we inferred an EGRIN using the estimated transcription factor activity (TFA) as the regulator. The EGRINs include regulatory interactions between 4052 target genes regulated by 113 TFs. We resolved distinct regulatory roles for members of the heat shock factor family, including a putative regulatory connection between abiotic stress and the circadian clock. TFA estimation using network component analysis is an effective way of incorporating multiple genome-scale measurements into network inference. PMID:27655842

  9. Identification and characterization of plant-specific NAC gene family in canola (Brassica napus L.) reveal novel members involved in cell death.

    PubMed

    Wang, Boya; Guo, Xiaohua; Wang, Chen; Ma, Jieyu; Niu, Fangfang; Zhang, Hanfeng; Yang, Bo; Liang, Wanwan; Han, Feng; Jiang, Yuan-Qing

    2015-03-01

    NAC transcription factors are plant-specific and play important roles in plant development processes, response to biotic and abiotic cues and hormone signaling. However, to date, little is known about the NAC genes in canola (or oilseed rape, Brassica napus L.). In this study, a total of 60 NAC genes were identified from canola through a systematical analysis and mining of expressed sequence tags. Among these, the cDNA sequences of 41 NAC genes were successfully cloned. The translated protein sequences of canola NAC genes with the NAC genes from representative species were phylogenetically clustered into three major groups and multiple subgroups. The transcriptional activities of these BnaNAC proteins were assayed in yeast. In addition, by quantitative real-time RT-PCR, we further observed that some of these BnaNACs were regulated by different hormone stimuli or abiotic stresses. Interestingly, we successfully identified two novel BnaNACs, BnaNAC19 and BnaNAC82, which could elicit hypersensitive response-like cell death when expressed in Nicotiana benthamiana leaves, which was mediated by accumulation of reactive oxygen species. Overall, our work has laid a solid foundation for further characterization of this important NAC gene family in canola.

  10. Coexpression network analysis of the genes regulated by two types of resistance responses to powdery mildew in wheat

    PubMed Central

    Zhang, Juncheng; Zheng, Hongyuan; Li, Yiwen; Li, Hongjie; Liu, Xin; Qin, Huanju; Dong, Lingli; Wang, Daowen

    2016-01-01

    Powdery mildew disease caused by Blumeria graminis f. sp. tritici (Bgt) inflicts severe economic losses in wheat crops. A systematic understanding of the molecular mechanisms involved in wheat resistance to Bgt is essential for effectively controlling the disease. Here, using the diploid wheat Triticum urartu as a host, the genes regulated by immune (IM) and hypersensitive reaction (HR) resistance responses to Bgt were investigated through transcriptome sequencing. Four gene coexpression networks (GCNs) were developed using transcriptomic data generated for 20 T. urartu accessions showing IM, HR or susceptible responses. The powdery mildew resistance regulated (PMRR) genes whose expression was significantly correlated with Bgt resistance were identified, and they tended to be hubs and enriched in six major modules. A wide occurrence of negative regulation of PMRR genes was observed. Three new candidate immune receptor genes (TRIUR3_13045, TRIUR3_01037 and TRIUR3_06195) positively associated with Bgt resistance were discovered. Finally, the involvement of TRIUR3_01037 in Bgt resistance was tentatively verified through cosegregation analysis in a F2 population and functional expression assay in Bgt susceptible leaf cells. This research provides insights into the global network properties of PMRR genes. Potential molecular differences between IM and HR resistance responses to Bgt are discussed. PMID:27033636

  11. Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome

    PubMed Central

    Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing

    2007-01-01

    Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628

  12. Deregulation of RB1 expression by loss of imprinting in human hepatocellular carcinoma.

    PubMed

    Anwar, Sumadi Lukman; Krech, Till; Hasemeier, Britta; Schipper, Elisa; Schweitzer, Nora; Vogel, Arndt; Kreipe, Hans; Lehmann, Ulrich

    2014-08-01

    The tumour suppressor gene RB1 is frequently silenced in many different types of human cancer, including hepatocellular carcinoma (HCC). However, mutations of the RB1 gene are relatively rare in HCC. A systematic screen for the identification of imprinted genes deregulated in human HCC revealed that RB1 shows imprint abnormalities in a high proportion of primary patient samples. Altogether, 40% of the HCC specimens (16/40) showed hyper- or hypomethylation at the CpG island in intron 2 of the RB1 gene. Re-analysis of publicly available genome-wide DNA methylation data confirmed these findings in two independent HCC cohorts. Loss of correct DNA methylation patterns at the RB1 locus leads to the aberrant expression of an alternative RB1-E2B transcript, as measured by quantitative real-time PCR. Demethylation at the intron 2 CpG island by DNMT1 knock-down or aza-deoxycytidine (DAC) treatment stimulated expression of the RB1-E2B transcript, accompanied by diminished RB1 main transcript expression. No aberrant DNA methylation was found at the RB1 locus in hepatocellular adenoma (HCA, n = 10), focal nodular hyperplasia (FNH, n = 5) and their corresponding adjacent liver tissue specimens. Deregulated RB1 expression due to hyper- or hypomethylation in intron 2 of the RB1 gene is found in tumours without loss of heterozygosity and is associated with a decrease in overall survival (p = 0.032) if caused by hypermethylation of CpG85. This unequivocally demonstrates that loss of imprinting represents an important additional mechanism for RB1 pathway inactivation in human HCC, complementing well-described molecular defects. Copyright © 2014 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.

  13. Molecular pathology of brain edema after severe burns in forensic autopsy cases with special regard to the importance of reference gene selection.

    PubMed

    Wang, Qi; Ishikawa, Takaki; Michiue, Tomomi; Zhu, Bao-Li; Guan, Da-Wei; Maeda, Hitoshi

    2013-09-01

    Brain edema is believed to be linked to high mortality incidence after severe burns. The present study investigated the molecular pathology of brain damage and responses involving brain edema in forensic autopsy cases of fire fatality (n = 55) compared with sudden cardiac death (n = 11), mechanical asphyxia (n = 13), and non-brain injury cases (n = 22). Postmortem mRNA and immunohistochemical expressions of aquaporins (AQPs), claudin5 (CLDN5), and matrix metalloproteinases (MMPs) were examined. Prolonged deaths due to severe burns showed an increase in brain water content, but relative mRNA quantification, using different normalization methods, showed inconsistent results: in prolonged deaths due to severe burns, higher expression levels were detected for all markers when three previously validated reference genes, PES1, POLR2A, and IPO8, were used for normalization, higher for AQP1 and MMP9 when GAPDH alone was used for normalization and higher for MMP9, but lower for MMP2 when B2M alone was used for normalization. Additionally, when B2M alone was used for normalization, higher expression of AQP4 was detected in acute fire deaths. Furthermore, the expression stability values of these five reference genes calculated by geNorm demonstrated that B2M was the least stable one, followed by GAPDH. In immunostaining, only AQP1 and MMP9 showed differences among the causes of death: they were evident in most prolonged deaths due to severe burns. These findings suggest that systematic analysis of gene expressions using real-time PCR might be a useful procedure in forensic death investigation, and validation of reference genes is crucial.

  14. Transcriptome-wide identification and expression profiles of the WRKY transcription factor family in Broomcorn millet (Panicum miliaceum L.).

    PubMed

    Yue, Hong; Wang, Meng; Liu, Siyan; Du, Xianghong; Song, Weining; Nie, Xiaojun

    2016-05-10

    WRKY genes, as the most pivotal transcription factors in plants, play the indispensable roles in regulating various physiological processes, including plant growth and development as well as in response to stresses. Broomcorn millet is one of the most important crops in drought areas worldwide. However, the WRKY gene family in broomcorn millet remains unknown. A total of 32 PmWRKY genes were identified in this study using computational prediction method. Structural analysis found that PmWRKY proteins contained a highly conserved motif WRKYGQK and two common variant motifs, namely WRKYGKK and WRKYGEK. Phylogenetic analysis of PmWRKYs together with the homologous genes from the representative species could classify them into three groups, with the number of 1, 15, and 16, respectively. Finally, the transcriptional profiles of these 32 PmWRKY genes in various tissues or under different abiotic stresses were systematically investigated using qRT-PCR analysis. Results showed that the expression level of 22 PmWRKY genes varied significantly under one or more abiotic stress treatments, which could be defined as abiotic stress-responsive genes. This was the first study to identify the organization and transcriptional profiles of PmWRKY genes, which not only facilitates the functional analysis of the PmWRKY genes, and also lays the foundation to reveal the molecular mechanism of stress tolerance in this important crop.

  15. SZGR 2.0: a one-stop shop of schizophrenia candidate genes.

    PubMed

    Jia, Peilin; Han, Guangchun; Zhao, Junfei; Lu, Pinyi; Zhao, Zhongming

    2017-01-04

    SZGR 2.0 is a comprehensive resource of candidate variants and genes for schizophrenia, covering genetic, epigenetic, transcriptomic, translational and many other types of evidence. By systematic review and curation of multiple lines of evidence, we included almost all variants and genes that have ever been reported to be associated with schizophrenia. In particular, we collected ∼4200 common variants reported in genome-wide association studies, ∼1000 de novo mutations discovered by large-scale sequencing of family samples, 215 genes spanning rare and replication copy number variations, 99 genes overlapping with linkage regions, 240 differentially expressed genes, 4651 differentially methylated genes and 49 genes as antipsychotic drug targets. To facilitate interpretation, we included various functional annotation data, especially brain eQTL, methylation QTL, brain expression featured in deep categorization of brain areas and developmental stages and brain-specific promoter and enhancer annotations. Furthermore, we conducted cross-study, cross-data type and integrative analyses of the multidimensional data deposited in SZGR 2.0, and made the data and results available through a user-friendly interface. In summary, SZGR 2.0 provides a one-stop shop of schizophrenia variants and genes and their function and regulation, providing an important resource in the schizophrenia and other mental disease community. SZGR 2.0 is available at https://bioinfo.uth.edu/SZGR/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Systematic characterization of the peroxidase gene family provides new insights into fungal pathogenicity in Magnaporthe oryzae

    PubMed Central

    Mir, Albely Afifa; Park, Sook-Young; Sadat, Md. Abu; Kim, Seongbeom; Choi, Jaeyoung; Jeon, Junhyun; Lee, Yong-Hwan

    2015-01-01

    Fungal pathogens have evolved antioxidant defense against reactive oxygen species produced as a part of host innate immunity. Recent studies proposed peroxidases as components of antioxidant defense system. However, the role of fungal peroxidases during interaction with host plants has not been explored at the genomic level. Here, we systematically identified peroxidase genes and analyzed their impact on fungal pathogenesis in a model plant pathogenic fungus, Magnaporthe oryzae. Phylogeny reconstruction placed 27 putative peroxidase genes into 15 clades. Expression profiles showed that majority of them are responsive to in planta condition and in vitro H2O2. Our analysis of individual deletion mutants for seven selected genes including MoPRX1 revealed that these genes contribute to fungal development and/or pathogenesis. We identified significant and positive correlations among sensitivity to H2O2, peroxidase activity and fungal pathogenicity. In-depth analysis of MoPRX1 demonstrated that it is a functional ortholog of thioredoxin peroxidase in Saccharomyces cerevisiae and is required for detoxification of the oxidative burst within host cells. Transcriptional profiling of other peroxidases in ΔMoprx1 suggested interwoven nature of the peroxidase-mediated antioxidant defense system. The results from this study provide insight into the infection strategy built on evolutionarily conserved peroxidases in the rice blast fungus. PMID:26134974

  17. Transactivation of the proximal promoter of human oxytocin gene by TR4 orphan receptor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, C.-P.; Lee, Y.-F.; Chang, C.

    2006-12-08

    The human testicular receptor 4 (TR4) shares structural homology with members of the nuclear receptor superfamily. Some other members of this superfamily were able to regulate the transcriptional activity of the human oxytocin (OXT) promoter by binding to the first DR0 regulatory site. However, little investigation was conducted systematically in the study of the second dDR4 site of OXT proximal promoter, and the relationship between the first and the second sites of OXT promoter. Here, we demonstrated for the first time that TR4 could increase the proximal promoter activity of the human OXT gene via DR0, dDR4, and OXT (bothmore » DR0 and dDR4) elements, respectively. TR4 might induce OXT gene expression through the OXT element in a dose-dependent manner. However, there is no synergistic effect between DR0 and dDR4 elements during TR4 transactivation. Taken together, these results suggested that TR4 should be one of important regulators of OXT gene expression.« less

  18. Hunting for genes for hypertension: the Millennium Genome Project for Hypertension.

    PubMed

    Tabara, Yasuharu; Kohara, Katsuhiko; Miki, Tetsuro

    2012-06-01

    The Millennium Genome Project for Hypertension was started in 2000 to identify genetic variants conferring susceptibility to hypertension, with the aim of furthering the understanding of the pathogenesis of this condition and realizing genome-based personalized medical care. Two different approaches were launched, genome-wide association analysis using single-nucleotide polymorphisms (SNPs) and microsatellite markers, and systematic candidate gene analysis, under the hypothesis that common variants have an important role in the etiology of common diseases. These multilateral approaches identified ATP2B1 as a gene responsible for hypertension in not only Japanese but also Caucasians. The high blood pressure susceptibility conferred by certain alleles of ATP2B1 has been widely replicated in various populations. Ex vivo mRNA expression analysis in umbilical artery smooth muscle cells indicated that reduced expression of this gene associated with the risk allele may be an underlying mechanism relating the ATP2B1 variant to hypertension. However, the effect size of a SNP was too small to clarify the entire picture of the genetic basis of hypertension. Further, dense genome analysis with accurate phenotype data may be required.

  19. Efficient Reverse-Engineering of a Developmental Gene Regulatory Network

    PubMed Central

    Cicin-Sain, Damjan; Ashyraliyev, Maksat; Jaeger, Johannes

    2012-01-01

    Understanding the complex regulatory networks underlying development and evolution of multi-cellular organisms is a major problem in biology. Computational models can be used as tools to extract the regulatory structure and dynamics of such networks from gene expression data. This approach is called reverse engineering. It has been successfully applied to many gene networks in various biological systems. However, to reconstitute the structure and non-linear dynamics of a developmental gene network in its spatial context remains a considerable challenge. Here, we address this challenge using a case study: the gap gene network involved in segment determination during early development of Drosophila melanogaster. A major problem for reverse-engineering pattern-forming networks is the significant amount of time and effort required to acquire and quantify spatial gene expression data. We have developed a simplified data processing pipeline that considerably increases the throughput of the method, but results in data of reduced accuracy compared to those previously used for gap gene network inference. We demonstrate that we can infer the correct network structure using our reduced data set, and investigate minimal data requirements for successful reverse engineering. Our results show that timing and position of expression domain boundaries are the crucial features for determining regulatory network structure from data, while it is less important to precisely measure expression levels. Based on this, we define minimal data requirements for gap gene network inference. Our results demonstrate the feasibility of reverse-engineering with much reduced experimental effort. This enables more widespread use of the method in different developmental contexts and organisms. Such systematic application of data-driven models to real-world networks has enormous potential. Only the quantitative investigation of a large number of developmental gene regulatory networks will allow us to discover whether there are rules or regularities governing development and evolution of complex multi-cellular organisms. PMID:22807664

  20. Identification of Differentially Expressed Genes and Pathways for Myofiber Characteristics in Soleus Muscles between Chicken Breeds Differing in Meat Quality.

    PubMed

    Du, Y F; Ding, Q L; Li, Y M; Fang, W R

    2017-04-03

    In the modern chicken industry, fast-growing broilers have undergone strong artificial selection for muscle growth, which has led to remarkable phenotypic variations compared with slow-growing chickens. However, the molecular mechanism underlying these phenotypes differences remains unknown. In this study, a systematic identification of candidate genes and new pathways related to myofiber development and composition in chicken Soleus muscle (SOL) has been made using gene expression profiles of two distinct breeds: Qingyuan partridge (QY), a slow-growing Chinese breed possessing high meat quality and Cobb 500 (CB), a commercial fast-growing broiler line. Agilent cDNA microarray analyses were conducted to determine gene expression profiles of soleus muscle sampled at sexual maturity age of QY (112 d) and CB (42 d). The 1318 genes with at least 2-fold differences were identified (P < 0.05, FDR <0.05, FC ≥ 2) in SOL muscles of QY and CB chickens. Differentially expressed genes (DEGs) related to muscle development, energy metabolism or lipid metabolism processes were examined further in each breed based on Gene Ontology (GO) analysis, and 11 genes involved in these processes were selected for further validation studies by qRT-PCR. In addition, based on KEGG pathway analysis of DEGs in both QY and CB chickens, it was found that in addition to pathways affecting myogenic fibre-type development and differentiation (pathways for Hedgehog & Calcium signaling), energy metabolism (Phosphatidylinositol signaling system, VEGF signaling pathway, Purine metabolism, Pyrimidine metabolism) were also enriched and might form a network with pathways related to muscle metabolism to influence the development of myofibers. This study is the first stage in the understanding of molecular mechanisms underlying variations in poultry meat quality. Large scale analyses are now required to validate the role of the genes identified and ultimately to find molecular markers that can be used for selection or to optimize rearing practices.

  1. miRNA-Mediated Relationships between Cis-SNP Genotypes and Transcript Intensities in Lymphocyte Cell Lines

    PubMed Central

    Zhang, Wensheng; Edwards, Andrea; Zhu, Dongxiao; Flemington, Erik K.; Deininger, Prescott; Zhang, Kun

    2012-01-01

    In metazoans, miRNAs regulate gene expression primarily through binding to target sites in the 3′ UTRs (untranslated regions) of messenger RNAs (mRNAs). Cis-acting variants within, or close to, a gene are crucial in explaining the variability of gene expression measures. Single nucleotide polymorphisms (SNPs) in the 3′ UTRs of genes can affect the base-pairing between miRNAs and mRNAs, and hence disrupt existing target sites (in the reference sequence) or create novel target sites, suggesting a possible mechanism for cis regulation of gene expression. Moreover, because the alleles of different SNPs within a DNA sequence of limited length tend to be in strong linkage disequilibrium (LD), we hypothesize the variants of miRNA target sites caused by SNPs potentially function as bridges linking the documented cis-SNP markers to the expression of the associated genes. A large-scale analysis was herein performed to test this hypothesis. By systematically integrating multiple latest information sources, we found 21 significant gene-level SNP-involved miRNA-mediated post-transcriptional regulation modules (SNP-MPRMs) in the form of SNP-miRNA-mRNA triplets in lymphocyte cell lines for the CEU and YRI populations. Among the cognate genes, six including ALG8, DGKE, GNA12, KLF11, LRPAP1, and MMAB are related to multiple genetic diseases such as depressive disorder and Type-II diabetes. Furthermore, we found that ∼35% of the documented transcript intensity-related cis-SNPs (∼950) in a recent publication are identical to, or in significant linkage disequilibrium (LD) (p<0.01) with, one or multiple SNPs located in miRNA target sites. Based on these associations (or identities), 69 significant exon-level SNP-MPRMs and 12 disease genes were further determined for two populations. These results provide concrete in silico evidence for the proposed hypothesis. The discovered modules warrant additional follow-up in independent laboratory studies. PMID:22348086

  2. Hsp70 gene expansions in the scallop Patinopecten yessoensis and their expression regulation after exposure to the toxic dinoflagellate Alexandrium catenella.

    PubMed

    Cheng, Jie; Xun, Xiaogang; Kong, Yifan; Wang, Shuyue; Yang, Zhihui; Li, Yajuan; Kong, Dexu; Wang, Shi; Zhang, Lingling; Hu, Xiaoli; Bao, Zhenmin

    2016-11-01

    Heat shock protein 70 (Hsp70s) family members are present in virtually all living organisms and perform a fundamental role against different types of environmental stressors and pathogenic organisms. Marine bivalves live in highly dynamic environments and may accumulate paralytic shellfish toxins (PSTs), a class of well-known neurotoxins closely associated with harmful algal blooms (HABs). Here, we provide a systematic analysis of Hsp70 genes (PyHsp70s) in the genome of Yesso scallop (Patinopecten yessoensis), an important aquaculture species in China, through in silico analysis using transcriptome and genome databases. Phylogenetic analyses indicated extensive expansion of Hsp70 genes from the Hspa12 sub-family in the Yesso scallop and also the bivalve lineages, with gene duplication events before or after the split between the Yesso scallop and the Pacific oyster. In addition, we determined the expression patterns of PyHsp70s after exposure to Alexandrium catenella, the dinoflagellate producing PSTs. Our results confirmed the inducible expression patterns of PyHsp70s under PSTs stress, and the responses to the toxic stress may have arisen through the adaptive recruitment of tandem duplication of Hsp70 genes. These findings provide a thorough overview of the evolution and modification of the Hsp70 family, which will gain insights into the functional characteristics of scallop Hsp70 genes in response to different stresses. Copyright © 2016. Published by Elsevier Ltd.

  3. Transcriptome and Small RNA Deep Sequencing Reveals Deregulation of miRNA Biogenesis in Human Glioma

    PubMed Central

    Moore, Lynette M.; Kivinen, Virpi; Liu, Yuexin; Annala, Matti; Cogdell, David; Liu, Xiuping; Liu, Chang-Gong; Sawaya, Raymond; Yli-Harja, Olli; Shmulevich, Ilya; Fuller, Gregory N.; Zhang, Wei; Nykter, Matti

    2013-01-01

    Altered expression of oncogenic and tumor-suppressing microRNAs (miRNAs) is widely associated with tumorigenesis. However, the regulatory mechanisms underlying these alterations are poorly understood. We sought to shed light on the deregulation of miRNA biogenesis promoting the aberrant miRNA expression profiles identified in these tumors. Using sequencing technology to perform both whole-transcriptome and small RNA sequencing of glioma patient samples, we examined precursor and mature miRNAs to directly evaluate the miRNA maturation process, and interrogated expression profiles for genes involved in the major steps of miRNA biogenesis. We found that ratios of mature to precursor forms of a large number of miRNAs increased with the progression from normal brain to low-grade and then to high-grade gliomas. The expression levels of genes involved in each of the three major steps of miRNA biogenesis (nuclear processing, nucleo-cytoplasmic transport, and cytoplasmic processing) were systematically altered in glioma tissues. Survival analysis of an independent data set demonstrated that the alteration of genes involved in miRNA maturation correlates with survival in glioma patients. Direct quantification of miRNA maturation with deep sequencing demonstrated that deregulation of the miRNA biogenesis pathway is a hallmark for glioma genesis and progression. PMID:23007860

  4. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    PubMed

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.

  5. Direct isolation of differentially expressed genes from a specific chromosome region of common wheat: application of the amplified fragment length polymorphism-based mRNA fingerprinting (AMF) method in combination with a deletion line of wheat.

    PubMed

    Kojima, T; Habu, Y; Iida, S; Ogihara, Y

    2000-05-01

    The amplified restriction fragment length polymorphism (AFLP)-based mRNA fingerprinting (AMF) method makes it possible systematically and conveniently to identify differentially expressed cDNAs with high reproducibility. We have applied the AMF method to the cloning of the Q gene of common wheat, which is located on the long arm of chromosome 5A and pleiotropically controls the spike morphology and the threshing character of seeds. Using the AMF method, we compared the fingerprints of mRNA samples extracted from the young spikes of Triticum aestivum cv. Chinese Spring (CS) carrying the Q gene to those of a chromosome deletion line of CS, namely, q5, which lacks 15% of 5AL including the Q gene. Approximately 12,200 fragments were produced after PCR with 256 primer combinations. Of these, 92 fragments were differentially expressed between CS and q5. Northern and Southern analyses showed that 16 fragments gave specific or relatively stronger transcript signals in CS, and these clones were present in single copy or in low copy numbers in the wheat genome. Four clones were genetically mapped to the region deleted in q5. Subsequently, one clone, pTaQ22, was mapped at the same locus as the Q gene, indicating that pTaQ22 corresponds to the Q gene or is tightly linked to it. DNA sequence data showed that pTaQ22 had no homology to any known genes, thus suggesting a novel function for this gene in flower morphogenesis. This AMF method might provide a straightforward method for isolating genes in the hexaploid background of common wheat.

  6. Systematic identification of an integrative network module during senescence from time-series gene expression.

    PubMed

    Park, Chihyun; Yun, So Jeong; Ryu, Sung Jin; Lee, Soyoung; Lee, Young-Sam; Yoon, Youngmi; Park, Sang Chul

    2017-03-15

    Cellular senescence irreversibly arrests growth of human diploid cells. In addition, recent studies have indicated that senescence is a multi-step evolving process related to important complex biological processes. Most studies analyzed only the genes and their functions representing each senescence phase without considering gene-level interactions and continuously perturbed genes. It is necessary to reveal the genotypic mechanism inferred by affected genes and their interaction underlying the senescence process. We suggested a novel computational approach to identify an integrative network which profiles an underlying genotypic signature from time-series gene expression data. The relatively perturbed genes were selected for each time point based on the proposed scoring measure denominated as perturbation scores. Then, the selected genes were integrated with protein-protein interactions to construct time point specific network. From these constructed networks, the conserved edges across time point were extracted for the common network and statistical test was performed to demonstrate that the network could explain the phenotypic alteration. As a result, it was confirmed that the difference of average perturbation scores of common networks at both two time points could explain the phenotypic alteration. We also performed functional enrichment on the common network and identified high association with phenotypic alteration. Remarkably, we observed that the identified cell cycle specific common network played an important role in replicative senescence as a key regulator. Heretofore, the network analysis from time series gene expression data has been focused on what topological structure was changed over time point. Conversely, we focused on the conserved structure but its context was changed in course of time and showed it was available to explain the phenotypic changes. We expect that the proposed method will help to elucidate the biological mechanism unrevealed by the existing approaches.

  7. Kinase impact assessment in the landscape of fusion genes that retain kinase domains: a pan-cancer study

    PubMed Central

    Kim, Pora; Jia, Peilin; Zhao, Zhongming

    2018-01-01

    Abstract Assessing the impact of kinase in gene fusion is essential for both identifying driver fusion genes (FGs) and developing molecular targeted therapies. Kinase domain retention is a crucial factor in kinase fusion genes (KFGs), but such a systematic investigation has not been done yet. To this end, we analyzed kinase domain retention (KDR) status in chimeric protein sequences of 914 KFGs covering 312 kinases across 13 major cancer types. Based on 171 kinase domain-retained KFGs including 101 kinases, we studied their recurrence, kinase groups, fusion partners, exon-based expression depth, short DNA motifs around the break points and networks. Our results, such as more KDR than 5′-kinase fusion genes, combinatorial effects between 3′-KDR kinases and their 5′-partners and a signal transduction-specific DNA sequence motif in the break point intronic sequences, supported positive selection on 3′-kinase fusion genes in cancer. We introduced a degree-of-frequency (DoF) score to measure the possible number of KFGs of a kinase. Interestingly, kinases with high DoF scores tended to undergo strong gene expression alteration at the break points. Furthermore, our KDR gene fusion network analysis revealed six of the seven kinases with the highest DoF scores (ALK, BRAF, MET, NTRK1, NTRK3 and RET) were all observed in thyroid carcinoma. Finally, we summarized common features of ‘effective’ (highly recurrent) kinases in gene fusions such as expression alteration at break point, redundant usage in multiple cancer types and 3′-location tendency. Collectively, our findings are useful for prioritizing driver kinases and FGs and provided insights into KFGs’ clinical implications. PMID:28013235

  8. Transcriptomic Analysis of Paulownia Infected by Paulownia Witches'-Broom Phytoplasma

    PubMed Central

    Zhu, Shui-Fang; Lin, Cai-Li; Tian, Guo-Zhong; Xu, Xia; Zhao, Wen-Jun

    2013-01-01

    Phytoplasmas are plant pathogenic bacteria that have no cell wall and are responsible for major crop losses throughout the world. Phytoplasma-infected plants show a variety of symptoms and the mechanisms they use to physiologically alter the host plants are of considerable interest, but poorly understood. In this study we undertook a detailed analysis of Paulownia infected by Paulownia witches’-broom (PaWB) Phytoplasma using high-throughput mRNA sequencing (RNA-Seq) and digital gene expression (DGE). RNA-Seq analysis identified 74,831 unigenes, which were subsequently used as reference sequences for DGE analysis of diseased and healthy Paulownia in field grown and tissue cultured plants. Our study revealed that dramatic changes occurred in the gene expression profile of Paulownia after PaWB Phytoplasma infection. Genes encoding key enzymes in cytokinin biosynthesis, such as isopentenyl diphosphate isomerase and isopentenyltransferase, were significantly induced in the infected Paulownia. Genes involved in cell wall biosynthesis and degradation were largely up-regulated and genes related to photosynthesis were down-regulated after PaWB Phytoplasma infection. Our systematic analysis provides comprehensive transcriptomic data about plants infected by Phytoplasma. This information will help further our understanding of the detailed interaction mechanisms between plants and Phytoplasma. PMID:24130859

  9. Expression cloning and characterization of a novel gene that encodes the RNA-binding protein FAU-1 from Pyrococcus furiosus.

    PubMed Central

    Kanai, Akio; Oida, Hanako; Matsuura, Nana; Doi, Hirofumi

    2003-01-01

    We systematically screened a genomic DNA library to identify proteins of the hyperthermophilic archaeon Pyrococcus furiosus using an expression cloning method. One gene product, which we named FAU-1 (P. furiosus AU-binding), demonstrated the strongest binding activity of all the genomic library-derived proteins tested against an AU-rich RNA sequence. The protein was purified to near homogeneity as a 54 kDa single polypeptide, and the gene locus corresponding to this FAU-1 activity was also sequenced. The FAU-1 gene encoded a 472-amino-acid protein that was characterized by highly charged domains consisting of both acidic and basic amino acids. The N-terminal half of the gene had a degree of similarity (25%) with RNase E from Escherichia coli. Five rounds of RNA-binding-site selection and footprinting analysis showed that the FAU-1 protein binds specifically to the AU-rich sequence in a loop region of a possible RNA ligand. Moreover, we demonstrated that the FAU-1 protein acts as an oligomer, and mainly as a trimer. These results showed that the FAU-1 protein is a novel heat-stable protein with an RNA loop-binding characteristic. PMID:12614195

  10. Proteomics Perspectives in Rotator Cuff Research: A Systematic Review of Gene Expression and Protein Composition in Human Tendinopathy

    PubMed Central

    Sejersen, Maria Hee Jung; Frost, Poul; Hansen, Torben Bæk; Deutch, Søren Rasmussen; Svendsen, Susanne Wulff

    2015-01-01

    Background Rotator cuff tendinopathy including tears is a cause of significant morbidity. The molecular pathogenesis of the disorder is largely unknown. This review aimed to present an overview of the literature on gene expression and protein composition in human rotator cuff tendinopathy and other tendinopathies, and to evaluate perspectives of proteomics – the comprehensive study of protein composition - in tendon research. Materials and Methods We conducted a systematic search of the literature published between 1 January 1990 and 18 December 2012 in PubMed, Embase, and Web of Science. We included studies on objectively quantified differential gene expression and/or protein composition in human rotator cuff tendinopathy and other tendinopathies as compared to control tissue. Results We identified 2199 studies, of which 54 were included; 25 studies focussed on rotator cuff or biceps tendinopathy. Most of the included studies quantified prespecified mRNA molecules and proteins using polymerase chain reactions and immunoassays, respectively. There was a tendency towards an increase of collagen I (11 of 15 studies) and III (13 of 14), metalloproteinase (MMP)-1 (6 of 12), -9 (7 of 7), -13 (4 of 7), tissue inhibitor of metalloproteinase (TIMP)-1 (4 of 7), and vascular endothelial growth factor (4 of 7), and a decrease in MMP-3 (10 of 12). Fourteen proteomics studies of tendon tissues/cells failed inclusion, mostly because they were conducted in animals or in vitro. Conclusions Based on methods, which only allowed simultaneous quantification of a limited number of prespecified mRNA molecules or proteins, several proteins appeared to be differentially expressed/represented in rotator cuff tendinopathy and other tendinopathies. No proteomics studies fulfilled our inclusion criteria, although proteomics technologies may be a way to identify protein profiles (including non-prespecified proteins) that characterise specific tendon disorders or stages of tendinopathy. Thus, our results suggested an untapped potential for proteomics in tendon research. PMID:25879758

  11. Proteomics perspectives in rotator cuff research: a systematic review of gene expression and protein composition in human tendinopathy.

    PubMed

    Sejersen, Maria Hee Jung; Frost, Poul; Hansen, Torben Bæk; Deutch, Søren Rasmussen; Svendsen, Susanne Wulff

    2015-01-01

    Rotator cuff tendinopathy including tears is a cause of significant morbidity. The molecular pathogenesis of the disorder is largely unknown. This review aimed to present an overview of the literature on gene expression and protein composition in human rotator cuff tendinopathy and other tendinopathies, and to evaluate perspectives of proteomics--the comprehensive study of protein composition--in tendon research. We conducted a systematic search of the literature published between 1 January 1990 and 18 December 2012 in PubMed, Embase, and Web of Science. We included studies on objectively quantified differential gene expression and/or protein composition in human rotator cuff tendinopathy and other tendinopathies as compared to control tissue. We identified 2199 studies, of which 54 were included; 25 studies focussed on rotator cuff or biceps tendinopathy. Most of the included studies quantified prespecified mRNA molecules and proteins using polymerase chain reactions and immunoassays, respectively. There was a tendency towards an increase of collagen I (11 of 15 studies) and III (13 of 14), metalloproteinase (MMP)-1 (6 of 12), -9 (7 of 7), -13 (4 of 7), tissue inhibitor of metalloproteinase (TIMP)-1 (4 of 7), and vascular endothelial growth factor (4 of 7), and a decrease in MMP-3 (10 of 12). Fourteen proteomics studies of tendon tissues/cells failed inclusion, mostly because they were conducted in animals or in vitro. Based on methods, which only allowed simultaneous quantification of a limited number of prespecified mRNA molecules or proteins, several proteins appeared to be differentially expressed/represented in rotator cuff tendinopathy and other tendinopathies. No proteomics studies fulfilled our inclusion criteria, although proteomics technologies may be a way to identify protein profiles (including non-prespecified proteins) that characterise specific tendon disorders or stages of tendinopathy. Thus, our results suggested an untapped potential for proteomics in tendon research.

  12. Driver Fusions and Their Implications in the Development and Treatment of Human Cancers.

    PubMed

    Gao, Qingsong; Liang, Wen-Wei; Foltz, Steven M; Mutharasu, Gnanavel; Jayasinghe, Reyka G; Cao, Song; Liao, Wen-Wei; Reynolds, Sheila M; Wyczalkowski, Matthew A; Yao, Lijun; Yu, Lihua; Sun, Sam Q; Chen, Ken; Lazar, Alexander J; Fields, Ryan C; Wendl, Michael C; Van Tine, Brian A; Vij, Ravi; Chen, Feng; Nykter, Matti; Shmulevich, Ilya; Ding, Li

    2018-04-03

    Gene fusions represent an important class of somatic alterations in cancer. We systematically investigated fusions in 9,624 tumors across 33 cancer types using multiple fusion calling tools. We identified a total of 25,664 fusions, with a 63% validation rate. Integration of gene expression, copy number, and fusion annotation data revealed that fusions involving oncogenes tend to exhibit increased expression, whereas fusions involving tumor suppressors have the opposite effect. For fusions involving kinases, we found 1,275 with an intact kinase domain, the proportion of which varied significantly across cancer types. Our study suggests that fusions drive the development of 16.5% of cancer cases and function as the sole driver in more than 1% of them. Finally, we identified druggable fusions involving genes such as TMPRSS2, RET, FGFR3, ALK, and ESR1 in 6.0% of cases, and we predicted immunogenic peptides, suggesting that fusions may provide leads for targeted drug and immune therapy. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Molecular time-course and the metabolic basis of entry into dauer in Caenorhabditis elegans.

    PubMed

    Jeong, Pan-Young; Kwon, Min-Seok; Joo, Hyoe-Jin; Paik, Young-Ki

    2009-01-01

    When Caenorhabditis elegans senses dauer pheromone (daumone), signaling inadequate growth conditions, it enters the dauer state, which is capable of long-term survival. However, the molecular pathway of dauer entry in C. elegans has remained elusive. To systematically monitor changes in gene expression in dauer paths, we used a DNA microarray containing 22,625 gene probes corresponding to 22,150 unique genes from C. elegans. We employed two different paths: direct exposure to daumone (Path 1) and normal growth media plus liquid culture (Path 2). Our data reveal that entry into dauer is accomplished through the multi-step process, which appears to be compartmentalized in time and according to metabolic flux. That is, a time-course of dauer entry in Path 1 shows that dauer larvae formation begins at post-embryonic stage S4 (48 h) and is complete at S6 (72 h). Our results also suggest the presence of a unique adaptive metabolic control mechanism that requires both stage-specific expression of specific genes and tight regulation of different modes of fuel metabolite utilization to sustain the energy balance in the context of prolonged survival under adverse growth conditions. It is apparent that worms entering dauer stage may rely heavily on carbohydrate-based energy reserves, whereas dauer larvae utilize fat or glyoxylate cycle-based energy sources. We created a comprehensive web-based dauer metabolic database for C. elegans (www.DauerDB.org) that makes it possible to search any gene and compare its relative expression at a specific stage, or evaluate overall patterns of gene expression in both paths. This database can be accessed by the research community and could be widely applicable to other related nematodes as a molecular atlas.

  14. Involvements of PCD and changes in gene expression profile during self-pruning of spring shoots in sweet orange (Citrus sinensis).

    PubMed

    Zhang, Jin-Zhi; Zhao, Kun; Ai, Xiao-Yan; Hu, Chun-Gen

    2014-10-13

    Citrus shoot tips abscise at an anatomically distinct abscission zone (AZ) that separates the top part of the shoots into basal and apical portions (citrus self-pruning). Cell separation occurs only at the AZ, which suggests its cells have distinctive molecular regulation. Although several studies have looked into the morphological aspects of self-pruning process, the underlying molecular mechanisms remain unknown. In this study, the hallmarks of programmed cell death (PCD) were identified by TUNEL experiments, transmission electron microscopy (TEM) and histochemical staining for reactive oxygen species (ROS) during self-pruning of the spring shoots in sweet orange. Our results indicated that PCD occurred systematically and progressively and may play an important role in the control of self-pruning of citrus. Microarray analysis was used to examine transcriptome changes at three stages of self-pruning, and 1,378 differentially expressed genes were identified. Some genes were related to PCD, while others were associated with cell wall biosynthesis or metabolism. These results strongly suggest that abscission layers activate both catabolic and anabolic wall modification pathways during the self-pruning process. In addition, a strong correlation was observed between self-pruning and the expression of hormone-related genes. Self-pruning plays an important role in citrus floral bud initiation. Therefore, several key flowering homologs of Arabidopsis and tomato shoot apical meristem (SAM) activity genes were investigated in sweet orange by real-time PCR and in situ hybridization, and the results indicated that these genes were preferentially expressed in SAM as well as axillary meristem. Based on these findings, a model for sweet orange spring shoot self-pruning is proposed, which will enable us to better understand the mechanism of self-pruning and abscission.

  15. Systematic Identification of Genes Required for Expression of Androgen Receptor Splice Variants

    DTIC Science & Technology

    2016-08-01

    engineering tool has been developed from bacterial Clustered Regularly Interspaced Short Palindromic Repeats ( CRISPR )/ CRISPR ‐Associated System (Cas...regulation of AR splice variant through CRISPR /Cas screening system. 15. SUBJECT TERMS CRISPR /Cas, Androgen receptor, castration resistance, biomarker 16...control (non-targeting) gRNAs available from Addgene (http://www.addgene.org/ CRISPR /libraries/). Generation of AR3 reporter: We used molecular cloning

  16. Transitions from mono- to co- to tri-culture uniquely affect gene expression in breast cancer, stromal, and immune compartments.

    PubMed

    Regier, Mary C; Maccoux, Lindsey J; Weinberger, Emma M; Regehr, Keil J; Berry, Scott M; Beebe, David J; Alarid, Elaine T

    2016-08-01

    Heterotypic interactions in cancer microenvironments play important roles in disease initiation, progression, and spread. Co-culture is the predominant approach used in dissecting paracrine interactions between tumor and stromal cells, but functional results from simple co-cultures frequently fail to correlate to in vivo conditions. Though complex heterotypic in vitro models have improved functional relevance, there is little systematic knowledge of how multi-culture parameters influence this recapitulation. We therefore have employed a more iterative approach to investigate the influence of increasing model complexity; increased heterotypic complexity specifically. Here we describe how the compartmentalized and microscale elements of our multi-culture device allowed us to obtain gene expression data from one cell type at a time in a heterotypic culture where cells communicated through paracrine interactions. With our device we generated a large dataset comprised of cell type specific gene-expression patterns for cultures of increasing complexity (three cell types in mono-, co-, or tri-culture) not readily accessible in other systems. Principal component analysis indicated that gene expression was changed in co-culture but was often more strongly altered in tri-culture as compared to mono-culture. Our analysis revealed that cell type identity and the complexity around it (mono-, co-, or tri-culture) influence gene regulation. We also observed evidence of complementary regulation between cell types in the same heterotypic culture. Here we demonstrate the utility of our platform in providing insight into how tumor and stromal cells respond to microenvironments of varying complexities highlighting the expanding importance of heterotypic cultures that go beyond conventional co-culture.

  17. Functional Analysis With a Barcoder Yeast Gene Overexpression System

    PubMed Central

    Douglas, Alison C.; Smith, Andrew M.; Sharifpoor, Sara; Yan, Zhun; Durbic, Tanja; Heisler, Lawrence E.; Lee, Anna Y.; Ryan, Owen; Göttert, Hendrikje; Surendra, Anu; van Dyk, Dewald; Giaever, Guri; Boone, Charles; Nislow, Corey; Andrews, Brenda J.

    2012-01-01

    Systematic analysis of gene overexpression phenotypes provides an insight into gene function, enzyme targets, and biological pathways. Here, we describe a novel functional genomics platform that enables a highly parallel and systematic assessment of overexpression phenotypes in pooled cultures. First, we constructed a genome-level collection of ~5100 yeast barcoder strains, each of which carries a unique barcode, enabling pooled fitness assays with a barcode microarray or sequencing readout. Second, we constructed a yeast open reading frame (ORF) galactose-induced overexpression array by generating a genome-wide set of yeast transformants, each of which carries an individual plasmid-born and sequence-verified ORF derived from the Saccharomyces cerevisiae full-length EXpression-ready (FLEX) collection. We combined these collections genetically using synthetic genetic array methodology, generating ~5100 strains, each of which is barcoded and overexpresses a specific ORF, a set we termed “barFLEX.” Additional synthetic genetic array allows the barFLEX collection to be moved into different genetic backgrounds. As a proof-of-principle, we describe the properties of the barFLEX overexpression collection and its application in synthetic dosage lethality studies under different environmental conditions. PMID:23050238

  18. Gene expression profiles of fin regeneration in loach (Paramisgurnus dabryanu).

    PubMed

    Li, Li; He, Jingya; Wang, Linlin; Chen, Weihua; Chang, Zhongjie

    2017-11-01

    Teleost fins can regenerate accurate position-matched structure and function after amputation. However, we still lack systematic transcriptional profiling and methodologies to understand the molecular basis of fin regeneration. After histological analysis, we established a suppression subtraction hybridization library containing 418 distinct sequences expressed differentially during the process of blastema formation and differentiation in caudal fin regeneration. Genome ontology and comparative analysis of differential distribution of our data and the reference zebrafish genome showed notable subcategories, including multi-organism processes, response to stimuli, extracellular matrix, antioxidant activity, and cell junction function. KEGG pathway analysis allowed the effective identification of relevant genes in those pathways involved in tissue morphogenesis and regeneration, including tight junction, cell adhesion molecules, mTOR and Jak-STAT signaling pathway. From relevant function subcategories and signaling pathways, 78 clones were examined for further Southern-blot hybridization. Then, 17 genes were chosen and characterized using semi-quantitative PCR. Then 4 candidate genes were identified, including F11r, Mmp9, Agr2 and one without a match to any database. After real-time quantitative PCR, the results showed obvious expression changes in different periods of caudal fin regeneration. We can assume that the 4 candidates, likely valuable genes associated with fin regeneration, deserve additional attention. Thus, our study demonstrated how to investigate the transcript profiles with an emphasis on bioinformatics intervention and how to identify potential genes related to fin regeneration processes. The results also provide a foundation or knowledge for further research into genes and molecular mechanisms of fin regeneration. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Refined mapping of autoimmune disease associated genetic variants with gene expression suggests an important role for non-coding RNAs.

    PubMed

    Ricaño-Ponce, Isis; Zhernakova, Daria V; Deelen, Patrick; Luo, Oscar; Li, Xingwang; Isaacs, Aaron; Karjalainen, Juha; Di Tommaso, Jennifer; Borek, Zuzanna Agnieszka; Zorro, Maria M; Gutierrez-Achury, Javier; Uitterlinden, Andre G; Hofman, Albert; van Meurs, Joyce; Netea, Mihai G; Jonkers, Iris H; Withoff, Sebo; van Duijn, Cornelia M; Li, Yang; Ruan, Yijun; Franke, Lude; Wijmenga, Cisca; Kumar, Vinod

    2016-04-01

    Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  20. The interaction and integration of auxin signaling components.

    PubMed

    Hayashi, Ken-ichiro

    2012-06-01

    IAA, a naturally occurring auxin, is a simple signaling molecule that regulates many diverse steps of plant development. Auxin essentially coordinates plant development through transcriptional regulation. Auxin binds to TIR1/AFB nuclear receptors, which are F-box subunits of the SCF ubiquitin ligase complex. The auxin signal is then modulated by the quantitative and qualitative responses of the Aux/IAA repressors and the auxin response factor (ARF) transcription factors. The specificity of the auxin-regulated gene expression profile is defined by several factors, such as the expression of these regulatory proteins, their post-transcriptional regulation, their stability and the affinity between these regulatory proteins. Auxin-binding protein 1 (ABP1) is a candidate protein for an auxin receptor that is implicated in non-transcriptional auxin signaling. ABP1 also affects TIR1/AFB-mediated auxin-responsive gene expression, implying that both the ABP1 and TIR1/AFB signaling machineries coordinately control auxin-mediated physiological events. Systematic approaches using the comprehensive mapping of the expression and interaction of signaling modules and computational modeling would be valuable for integrating our knowledge of auxin signals and responses.

  1. Analysis of temporal transcription expression profiles reveal links between protein function and developmental stages of Drosophila melanogaster.

    PubMed

    Wan, Cen; Lees, Jonathan G; Minneci, Federico; Orengo, Christine A; Jones, David T

    2017-10-01

    Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.

  2. Cellular and Tumor Radiosensitivity is Correlated to Epidermal Growth Factor Receptor Protein Expression Level in Tumors Without EGFR Amplification;Epidermal growth factor receptor; Radiotherapy; Squamous cell carcinoma; Biomarker; Local tumor control

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kasten-Pisula, Ulla; Saker, Jarob; Eicheler, Wolfgang

    2011-07-15

    Purpose: There is conflicting evidence for whether the expression of epidermal growth factor receptor in human tumors can be used as a marker of radioresponse. Therefore, this association was studied in a systematic manner using squamous cell carcinoma (SCC) cell lines grown as cell cultures and xenografts. Methods and Materials: The study was performed with 24 tumor cell lines of different tumor types, including 10 SCC lines, which were also investigated as xenografts on nude mice. Egfr gene dose and the length of CA-repeats in intron 1 were determined by polymerase chain reaction, protein expression in vitro by Western blotmore » and in vivo by enzyme-linked immunosorbent assay, and radiosensitivity in vitro by colony formation. Data were correlated with previously published tumor control dose 50% data after fractionated irradiation of xenografts of the 10 SCC. Results: EGFR protein expression varies considerably, with most tumor cell lines showing moderate and only few showing pronounced upregulation. EGFR upregulation could only be attributed to massive gene amplification in the latter. In the case of little or no amplification, in vitro EGFR expression correlated with both cellular and tumor radioresponse. In vivo EGFR expression did not show this correlation. Conclusions: Local tumor control after the fractionated irradiation of tumors with little or no gene amplification seems to be dependent on in vitro EGFR via its effect on cellular radiosensitivity.« less

  3. Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates.

    PubMed

    Bao, Yongbo; Xu, Fei; Shimeld, Sebastian M

    2017-04-01

    The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix-loop-helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56-88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  4. Challenges in projecting clustering results across gene expression-profiling datasets.

    PubMed

    Lusa, Lara; McShane, Lisa M; Reid, James F; De Cecco, Loris; Ambrogi, Federico; Biganzoli, Elia; Gariboldi, Manuela; Pierotti, Marco A

    2007-11-21

    Gene expression microarray studies for several types of cancer have been reported to identify previously unknown subtypes of tumors. For breast cancer, a molecular classification consisting of five subtypes based on gene expression microarray data has been proposed. These subtypes have been reported to exist across several breast cancer microarray studies, and they have demonstrated some association with clinical outcome. A classification rule based on the method of centroids has been proposed for identifying the subtypes in new collections of breast cancer samples; the method is based on the similarity of the new profiles to the mean expression profile of the previously identified subtypes. Previously identified centroids of five breast cancer subtypes were used to assign 99 breast cancer samples, including a subset of 65 estrogen receptor-positive (ER+) samples, to five breast cancer subtypes based on microarray data for the samples. The effect of mean centering the genes (i.e., transforming the expression of each gene so that its mean expression is equal to 0) on subtype assignment by method of centroids was assessed. Further studies of the effect of mean centering and of class prevalence in the test set on the accuracy of method of centroids classifications of ER status were carried out using training and test sets for which ER status had been independently determined by ligand-binding assay and for which the proportion of ER+ and ER- samples were systematically varied. When all 99 samples were considered, mean centering before application of the method of centroids appeared to be helpful for correctly assigning samples to subtypes, as evidenced by the expression of genes that had previously been used as markers to identify the subtypes. However, when only the 65 ER+ samples were considered for classification, many samples appeared to be misclassified, as evidenced by an unexpected distribution of ER+ samples among the resultant subtypes. When genes were mean centered before classification of samples for ER status, the accuracy of the ER subgroup assignments was highly dependent on the proportion of ER+ samples in the test set; this effect of subtype prevalence was not seen when gene expression data were not mean centered. Simple corrections such as mean centering of genes aimed at microarray platform or batch effect correction can have undesirable consequences because patient population effects can easily be confused with these assay-related effects. Careful thought should be given to the comparability of the patient populations before attempting to force data comparability for purposes of assigning subtypes to independent subjects.

  5. Cell differentiation in cardiac myxomas: confocal microscopy and gene expression analysis after laser capture microdissection.

    PubMed

    Pucci, Angela; Mattioli, Claudia; Matteucci, Marco; Lorenzini, Daniele; Panvini, Francesca; Pacini, Simone; Ippolito, Chiara; Celiento, Michele; De Martino, Andrea; Dolfi, Amelio; Belgio, Beatrice; Bortolotti, Uberto; Basolo, Fulvio; Bartoloni, Giovanni

    2018-05-22

    Cardiac myxomas are rare tumors with a heterogeneous cell population including properly neoplastic (lepidic), endothelial and smooth muscle cells. The assessment of neoplastic (lepidic) cell differentiation pattern is rather difficult using conventional light microscopy immunohistochemistry and/or whole tissue extracts for mRNA analyses. In a preliminary study, we investigated 20 formalin-fixed and paraffin-embedded cardiac myxomas by means of conventional immunohistochemistry; in 10/20 cases, cell differentiation was also analyzed by real-time RT-PCR after laser capture microdissection of the neoplastic cells, whereas calretinin and endothelial antigen CD31 immunoreactivity was localized in 4/10 cases by double immunofluorescence confocal microscopy. Gene expression analyses of α-smooth muscle actin, endothelial CD31 antigen, alpha-cardiac actin, matrix metalloprotease-2 (MMP2) and tissue inhibitor of matrix metalloprotease-1 (TIMP1) was performed on cDNA obtained from either microdissected neoplastic cells or whole tumor sections. We found very little or absent CD31 and α-Smooth Muscle Actin expression in the microdissected cells as compared to the whole tumors, whereas TIMP1 and MMP2 genes were highly expressed in both ones, greater levels being found in patients with embolic phenomena. α-Cardiac Actin was not detected. Confocal microscopy disclosed two different signals corresponding to calretinin-positive myxoma cells and to endothelial CD31-positive cells, respectively. In conclusion, the neoplastic (lepidic) cells showed a distinct gene expression pattern and no consistent overlapping with endothelial and smooth muscle cells or cardiac myocytes; the expression of TIMP1 and MMP2 might be related to clinical presentation; larger series studies using also systematic transcriptome analysis might be useful to confirm the present results.

  6. A data mining paradigm for identifying key factors in biological processes using gene expression data.

    PubMed

    Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng

    2018-06-13

    A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.

  7. A genome-wide resource for the analysis of protein localisation in Drosophila

    PubMed Central

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; KJ, Vinay Vikas; Krishnan, RT; Krishnamoorthy, Aishwarya; Ferreira, Irene RS; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-01-01

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts. DOI: http://dx.doi.org/10.7554/eLife.12068.001 PMID:26896675

  8. A transcription factor hierarchy defines an environmental stress response network.

    PubMed

    Song, Liang; Huang, Shao-Shan Carol; Wise, Aaron; Castanon, Rosa; Nery, Joseph R; Chen, Huaming; Watanabe, Marina; Thomas, Jerushah; Bar-Joseph, Ziv; Ecker, Joseph R

    2016-11-04

    Environmental stresses are universally encountered by microbes, plants, and animals. Yet systematic studies of stress-responsive transcription factor (TF) networks in multicellular organisms have been limited. The phytohormone abscisic acid (ABA) influences the expression of thousands of genes, allowing us to characterize complex stress-responsive regulatory networks. Using chromatin immunoprecipitation sequencing, we identified genome-wide targets of 21 ABA-related TFs to construct a comprehensive regulatory network in Arabidopsis thaliana Determinants of dynamic TF binding and a hierarchy among TFs were defined, illuminating the relationship between differential gene expression patterns and ABA pathway feedback regulation. By extrapolating regulatory characteristics of observed canonical ABA pathway components, we identified a new family of transcriptional regulators modulating ABA and salt responsiveness and demonstrated their utility to modulate plant resilience to osmotic stress. Copyright © 2016, American Association for the Advancement of Science.

  9. Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation.

    PubMed

    Babak, Tomas; Garrett-Engele, Philip; Armour, Christopher D; Raymond, Christopher K; Keller, Mark P; Chen, Ronghua; Rohl, Carol A; Johnson, Jason M; Attie, Alan D; Fraser, Hunter B; Schadt, Eric E

    2010-08-13

    Identifying associations between genotypes and gene expression levels using microarrays has enabled systematic interrogation of regulatory variation underlying complex phenotypes. This approach has vast potential for functional characterization of disease states, but its prohibitive cost, given hundreds to thousands of individual samples from populations have to be genotyped and expression profiled, has limited its widespread application. Here we demonstrate that genomic regions with allele-specific expression (ASE) detected by sequencing cDNA are highly enriched for cis-acting expression quantitative trait loci (cis-eQTL) identified by profiling of 500 animals in parallel, with up to 90% agreement on the allele that is preferentially expressed. We also observed widespread noncoding and antisense ASE and identified several allele-specific alternative splicing variants. Monitoring ASE by sequencing cDNA from as little as one sample is a practical alternative to expression genetics for mapping cis-acting variation that regulates RNA transcription and processing.

  10. Super-delta: a new differential gene expression analysis procedure with robust data normalization.

    PubMed

    Liu, Yuhang; Zhang, Jinfeng; Qiu, Xing

    2017-12-21

    Normalization is an important data preparation step in gene expression analyses, designed to remove various systematic noise. Sample variance is greatly reduced after normalization, hence the power of subsequent statistical analyses is likely to increase. On the other hand, variance reduction is made possible by borrowing information across all genes, including differentially expressed genes (DEGs) and outliers, which will inevitably introduce some bias. This bias typically inflates type I error; and can reduce statistical power in certain situations. In this study we propose a new differential expression analysis pipeline, dubbed as super-delta, that consists of a multivariate extension of the global normalization and a modified t-test. A robust procedure is designed to minimize the bias introduced by DEGs in the normalization step. The modified t-test is derived based on asymptotic theory for hypothesis testing that suitably pairs with the proposed robust normalization. We first compared super-delta with four commonly used normalization methods: global, median-IQR, quantile, and cyclic loess normalization in simulation studies. Super-delta was shown to have better statistical power with tighter control of type I error rate than its competitors. In many cases, the performance of super-delta is close to that of an oracle test in which datasets without technical noise were used. We then applied all methods to a collection of gene expression datasets on breast cancer patients who received neoadjuvant chemotherapy. While there is a substantial overlap of the DEGs identified by all of them, super-delta were able to identify comparatively more DEGs than its competitors. Downstream gene set enrichment analysis confirmed that all these methods selected largely consistent pathways. Detailed investigations on the relatively small differences showed that pathways identified by super-delta have better connections to breast cancer than other methods. As a new pipeline, super-delta provides new insights to the area of differential gene expression analysis. Solid theoretical foundation supports its asymptotic unbiasedness and technical noise-free properties. Implementation on real and simulated datasets demonstrates its decent performance compared with state-of-art procedures. It also has the potential of expansion to be incorporated with other data type and/or more general between-group comparison problems.

  11. Characteristic Changes in Decidual Gene Expression Signature in Spontaneous Term Parturition

    PubMed Central

    El-Azzamy, Haidy; Balogh, Andrea; Romero, Roberto; Xu, Yi; LaJeunesse, Christopher; Plazyo, Olesya; Xu, Zhonghui; Price, Theodore G.; Dong, Zhong; Tarca, Adi L.; Papp, Zoltan; Hassan, Sonia S.; Chaiworapongsa, Tinnakorn; Kim, Chong Jai; Gomez-Lopez, Nardhy; Than, Nandor Gabor

    2017-01-01

    Background The decidua has been implicated in the “terminal pathway” of human term parturition, which is characterized by the activation of pro-inflammatory pathways in gestational tissues. However, the transcriptomic changes in the decidua leading to terminal pathway activation have not been systematically explored. This study aimed to compare the decidual expression of developmental signaling and inflammation-related genes before and after spontaneous term labor in order to reveal their involvement in this process. Methods Chorioamniotic membranes were obtained from normal pregnant women who delivered at term with spontaneous labor (TIL, n = 14) or without labor (TNL, n = 15). Decidual cells were isolated from snap-frozen chorioamniotic membranes with laser microdissection. The expression of 46 genes involved in decidual development, sex steroid and prostaglandin signaling, as well as pro- and anti-inflammatory pathways, was analyzed using high-throughput quantitative real-time polymerase chain reaction (qRT-PCR). Chorioamniotic membrane sections were immunostained and then semi-quantified for five proteins, and immunoassays for three chemokines were performed on maternal plasma samples. Results The genes with the highest expression in the decidua at term gestation included insulin-like growth factor-binding protein 1 (IGFBP1), galectin-1 (LGALS1), and progestogen-associated endometrial protein (PAEP); the expression of estrogen receptor 1 (ESR1), homeobox A11 (HOXA11), interleukin 1β (IL1B), IL8, progesterone receptor membrane component 2 (PGRMC2), and prostaglandin E synthase (PTGES) was higher in TIL than in TNL cases; the expression of chemokine C-C motif ligand 2 (CCL2), CCL5, LGALS1, LGALS3, and PAEP was lower in TIL than in TNL cases; immunostaining confirmed qRT-PCR data for IL-8, CCL2, galectin-1, galectin-3, and PAEP; and no correlations between the decidual gene expression and the maternal plasma protein concentrations of CCL2, CCL5, and IL-8 were found. Conclusions Our data suggests that with the initiation of parturition, the decidual expression of anti-inflammatory mediators decreases, while the expression of pro-inflammatory mediators and steroid receptors increases. This shift may affect downstream signaling pathways that can lead to parturition. PMID:28226203

  12. Statistical approach for selection of biologically informative genes.

    PubMed

    Das, Samarendra; Rai, Anil; Mishra, D C; Rai, Shesh N

    2018-05-20

    Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques have been proposed so far are either based on relevancy or redundancy measure. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biological relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R Package, i.e. BootMRMR has been developed and available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes from high dimensional expression data for breeding and system biology studies. Published by Elsevier B.V.

  13. Quality controls in cellular immunotherapies: rapid assessment of clinical grade dendritic cells by gene expression profiling.

    PubMed

    Castiello, Luciano; Sabatino, Marianna; Zhao, Yingdong; Tumaini, Barbara; Ren, Jiaqiang; Ping, Jin; Wang, Ena; Wood, Lauren V; Marincola, Francesco M; Puri, Raj K; Stroncek, David F

    2013-02-01

    Cell-based immunotherapies are among the most promising approaches for developing effective and targeted immune response. However, their clinical usefulness and the evaluation of their efficacy rely heavily on complex quality control assessment. Therefore, rapid systematic methods are urgently needed for the in-depth characterization of relevant factors affecting newly developed cell product consistency and the identification of reliable markers for quality control. Using dendritic cells (DCs) as a model, we present a strategy to comprehensively characterize manufactured cellular products in order to define factors affecting their variability, quality and function. After generating clinical grade human monocyte-derived mature DCs (mDCs), we tested by gene expression profiling the degrees of product consistency related to the manufacturing process and variability due to intra- and interdonor factors, and how each factor affects single gene variation. Then, by calculating for each gene an index of variation we selected candidate markers for identity testing, and defined a set of genes that may be useful comparability and potency markers. Subsequently, we confirmed the observed gene index of variation in a larger clinical data set. In conclusion, using high-throughput technology we developed a method for the characterization of cellular therapies and the discovery of novel candidate quality assurance markers.

  14. Discovery and identification of candidate genes from the chitinase gene family for Verticillium dahliae resistance in cotton

    PubMed Central

    Xu, Jun; Xu, Xiaoyang; Tian, Liangliang; Wang, Guilin; Zhang, Xueying; Wang, Xinyu; Guo, Wangzhen

    2016-01-01

    Verticillium dahliae, a destructive and soil-borne fungal pathogen, causes massive losses in cotton yields. However, the resistance mechanism to V. dahilae in cotton is still poorly understood. Accumulating evidence indicates that chitinases are crucial hydrolytic enzymes, which attack fungal pathogens by catalyzing the fungal cell wall degradation. As a large gene family, to date, the chitinase genes (Chis) have not been systematically analyzed and effectively utilized in cotton. Here, we identified 47, 49, 92, and 116 Chis from four sequenced cotton species, diploid Gossypium raimondii (D5), G. arboreum (A2), tetraploid G. hirsutum acc. TM-1 (AD1), and G. barbadense acc. 3–79 (AD2), respectively. The orthologous genes were not one-to-one correspondence in the diploid and tetraploid cotton species, implying changes in the number of Chis in different cotton species during the evolution of Gossypium. Phylogenetic classification indicated that these Chis could be classified into six groups, with distinguishable structural characteristics. The expression patterns of Chis indicated their various expressions in different organs and tissues, and in the V. dahliae response. Silencing of Chi23, Chi32, or Chi47 in cotton significantly impaired the resistance to V. dahliae, suggesting these genes might act as positive regulators in disease resistance to V. dahliae. PMID:27354165

  15. A systematic analysis of genomic changes in Tg2576 mice.

    PubMed

    Tan, Lu; Wang, Xiong; Ni, Zhong-Fei; Zhu, Xiuming; Wu, Wei; Zhu, Ling-Qiang; Liu, Dan

    2013-06-01

    Alzheimer's disease (AD) is an age-related neurodegenerative disorder characterized by intelligence decline, behavioral disorders and cognitive disability. The purpose of this study was to investigate gene expression in AD, based on published microarray data on Tg2576 mice. Hierarchical Cluster Analysis and Gene Ontology were employed to group genes together on the basis of their product characteristics and annotation data. Genes with prominent alterations were clustered into apoptosis and axon guidance pathways. Based on our findings and those of previous studies, we propose that the mitochondria-mediated apoptotic pathway plays a crucial role in the neuronal loss and synaptic dysfunction associated with AD. Furthermore, based on the findings of Positional Gene Enrichment analysis and Gene Set Enrichment analysis, we propose that the regulation of transcription of AD genes may be an important pathogenic factor in this neurodegenerative disease. Our results highlight the importance of genes that could subsequently be examined for their potential as prognostic markers for AD.

  16. An integrated approach to characterize transcription factor and microRNA regulatory networks involved in Schwann cell response to peripheral nerve injury

    PubMed Central

    2013-01-01

    Background The regenerative response of Schwann cells after peripheral nerve injury is a critical process directly related to the pathophysiology of a number of neurodegenerative diseases. This SC injury response is dependent on an intricate gene regulatory program coordinated by a number of transcription factors and microRNAs, but the interactions among them remain largely unknown. Uncovering the transcriptional and post-transcriptional regulatory networks governing the Schwann cell injury response is a key step towards a better understanding of Schwann cell biology and may help develop novel therapies for related diseases. Performing such comprehensive network analysis requires systematic bioinformatics methods to integrate multiple genomic datasets. Results In this study we present a computational pipeline to infer transcription factor and microRNA regulatory networks. Our approach combined mRNA and microRNA expression profiling data, ChIP-Seq data of transcription factors, and computational transcription factor and microRNA target prediction. Using mRNA and microRNA expression data collected in a Schwann cell injury model, we constructed a regulatory network and studied regulatory pathways involved in Schwann cell response to injury. Furthermore, we analyzed network motifs and obtained insights on cooperative regulation of transcription factors and microRNAs in Schwann cell injury recovery. Conclusions This work demonstrates a systematic method for gene regulatory network inference that may be used to gain new information on gene regulation by transcription factors and microRNAs. PMID:23387820

  17. Expression and activity profiling of the steroidogenic enzymes of glucocorticoid biosynthesis and the fdx1 co-factors in zebrafish.

    PubMed

    Weger, M; Diotel, N; Weger, B D; Beil, T; Zaucker, A; Eachus, H L; Oakes, J A; do Rego, J L; Storbeck, K-H; Gut, P; Strähle, U; Rastegar, S; Müller, F; Krone, N

    2018-04-01

    The spatial and temporal expression of steroidogenic genes in zebrafish has not been fully characterised. Because zebrafish are increasingly employed in endocrine and stress research, a better characterisation of steroidogenic pathways is required to target specific steps in the biosynthetic pathways. In the present study, we have systematically defined the temporal and spatial expression of steroidogenic enzymes involved in glucocorticoid biosynthesis (cyp21a2, cyp11c1, cyp11a1, cyp11a2, cyp17a1, cyp17a2, hsd3b1, hsd3b2), as well as the mitochondrial electron-providing ferredoxin co-factors (fdx1, fdx1b), during zebrafish development. Our studies showed an early expression of all these genes during embryogenesis. In larvae, expression of cyp11a2, cyp11c1, cyp17a2, cyp21a2, hsd3b1 and fdx1b can be detected in the interrenal gland, which is the zebrafish counterpart of the mammalian adrenal gland, whereas the fdx1 transcript is mainly found in the digestive system. Gene expression studies using quantitative reverse transcriptase-PCR and whole-mount in situ hybridisation in the adult zebrafish brain revealed a wide expression of these genes throughout the encephalon, including neurogenic regions. Using ultra-high-performance liquid chromatography tandem mass spectrometry, we were able to demonstrate the presence of the glucocorticoid cortisol in the adult zebrafish brain. Moreover, we demonstrate de novo biosynthesis of cortisol and the neurosteroid tetrahydrodeoxycorticosterone in the adult zebrafish brain from radiolabelled pregnenolone. Taken together, the present study comprises a comprehensive characterisation of the steroidogenic genes and the fdx co-factors facilitating glucocorticoid biosynthesis in zebrafish. Furthermore, we provide additional evidence of de novo neurosteroid biosynthesising in the brain of adult zebrafish facilitated by enzymes involved in glucocorticoid biosynthesis. Our study provides a valuable source for establishing the zebrafish as a translational model with respect to understanding the roles of the genes for glucocorticoid biosynthesis and fdx co-factors during embryonic development and stress, as well as in brain homeostasis and function. © 2018 British Society for Neuroendocrinology.

  18. An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.

    PubMed

    Nidheesh, N; Abdul Nazeer, K A; Ameer, P M

    2017-12-01

    Clustering algorithms with steps involving randomness usually give different results on different executions for the same dataset. This non-deterministic nature of algorithms such as the K-Means clustering algorithm limits their applicability in areas such as cancer subtype prediction using gene expression data. It is hard to sensibly compare the results of such algorithms with those of other algorithms. The non-deterministic nature of K-Means is due to its random selection of data points as initial centroids. We propose an improved, density based version of K-Means, which involves a novel and systematic method for selecting initial centroids. The key idea of the algorithm is to select data points which belong to dense regions and which are adequately separated in feature space as the initial centroids. We compared the proposed algorithm to a set of eleven widely used single clustering algorithms and a prominent ensemble clustering algorithm which is being used for cancer data classification, based on the performances on a set of datasets comprising ten cancer gene expression datasets. The proposed algorithm has shown better overall performance than the others. There is a pressing need in the Biomedical domain for simple, easy-to-use and more accurate Machine Learning tools for cancer subtype prediction. The proposed algorithm is simple, easy-to-use and gives stable results. Moreover, it provides comparatively better predictions of cancer subtypes from gene expression data. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Epigenomics of Total Acute Sleep Deprivation in Relation to Genome-Wide DNA Methylation Profiles and RNA Expression.

    PubMed

    Nilsson, Emil K; Boström, Adrian E; Mwinyi, Jessica; Schiöth, Helgi B

    2016-06-01

    Despite an established link between sleep deprivation and epigenetic processes in humans, it remains unclear to what extent sleep deprivation modulates DNA methylation. We performed a within-subject randomized blinded study with 16 healthy subjects to examine the effect of one night of total sleep deprivation (TSD) on the genome-wide methylation profile in blood compared with that in normal sleep. Genome-wide differences in methylation between both conditions were assessed by applying a paired regression model that corrected for monocyte subpopulations. In addition, the correlations between the methylation of genes detected to be modulated by TSD and gene expression were examined in a separate, publicly available cohort of 10 healthy male donors (E-GEOD-49065). Sleep deprivation significantly affected the DNA methylation profile both independently and in dependency of shifts in monocyte composition. Our study detected differential methylation of 269 probes. Notably, one CpG site was located 69 bp upstream of ING5, which has been shown to be differentially expressed after sleep deprivation. Gene set enrichment analysis detected the Notch and Wnt signaling pathways to be enriched among the differentially methylated genes. These results provide evidence that total acute sleep deprivation alters the methylation profile in healthy human subjects. This is, to our knowledge, the first study that systematically investigated the impact of total acute sleep deprivation on genome-wide DNA methylation profiles in blood and related the epigenomic findings to the expression data.

  20. Networking Senescence-Regulating Pathways by Using Arabidopsis Enhancer Trap Lines1

    PubMed Central

    He, Yuehui; Tang, Weining; Swain, Johnnie D.; Green, Anthony L.; Jack, Thomas P.; Gan, Susheng

    2001-01-01

    The last phase of leaf development, generally referred to as leaf senescence, is an integral part of plant development that involves massive programmed cell death. Due to a sharp decline of photosynthetic capacity in a leaf, senescence limits crop yield and forest plant biomass production. However, the biochemical components and regulatory mechanisms underlying leaf senescence are poorly characterized. Although several approaches such as differential cDNA screening, differential display, and cDNA subtraction have been employed to isolate senescence-associated genes (SAGs), only a limited number of SAGs have been identified, and information regarding the regulation of these genes is fragmentary. Here we report on the utilization of enhancer trap approach toward the identification and analysis of SAGs. We have developed a sensitive large-scale screening method and have screened 1,300 Arabidopsis enhancer trap lines and have identified 147 lines in which the reporter gene GUS (β-glucuronidase) is expressed in senescing leaves but not in non-senescing ones. We have systematically analyzed the regulation of β-glucuronidase expression in 125 lines (genetically, each contains single T-DNA insertion) by six senescence-promoting factors, namely abscisic acid, ethylene, jasmonic acid, brassinosteroid, darkness, and dehydration. This analysis not only reveals the complexity of the regulatory circuitry but also allows us to postulate the existence of a network of senescence-promoting pathways. We have also cloned three SAGs from randomly selected enhancer trap lines, demonstrating that reporter expression pattern reflects the expression pattern of the endogenous gene. PMID:11402199

Top