NASA Technical Reports Server (NTRS)
Mjolsness, Eric; Castano, Rebecca; Mann, Tobias; Wold, Barbara
2000-01-01
We provide preliminary evidence that existing algorithms for inferring small-scale gene regulation networks from gene expression data can be adapted to large-scale gene expression data coming from hybridization microarrays. The essential steps are (I) clustering many genes by their expression time-course data into a minimal set of clusters of co-expressed genes, (2) theoretically modeling the various conditions under which the time-courses are measured using a continuous-time analog recurrent neural network for the cluster mean time-courses, (3) fitting such a regulatory model to the cluster mean time courses by simulated annealing with weight decay, and (4) analysing several such fits for commonalities in the circuit parameter sets including the connection matrices. This procedure can be used to assess the adequacy of existing and future gene expression time-course data sets for determining transcriptional regulatory relationships such as coregulation.
Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex
Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel
2015-01-01
The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262
Optimal Scaling of Digital Transcriptomes
Glusman, Gustavo; Caballero, Juan; Robinson, Max; Kutlu, Burak; Hood, Leroy
2013-01-01
Deep sequencing of transcriptomes has become an indispensable tool for biology, enabling expression levels for thousands of genes to be compared across multiple samples. Since transcript counts scale with sequencing depth, counts from different samples must be normalized to a common scale prior to comparison. We analyzed fifteen existing and novel algorithms for normalizing transcript counts, and evaluated the effectiveness of the resulting normalizations. For this purpose we defined two novel and mutually independent metrics: (1) the number of “uniform” genes (genes whose normalized expression levels have a sufficiently low coefficient of variation), and (2) low Spearman correlation between normalized expression profiles of gene pairs. We also define four novel algorithms, one of which explicitly maximizes the number of uniform genes, and compared the performance of all fifteen algorithms. The two most commonly used methods (scaling to a fixed total value, or equalizing the expression of certain ‘housekeeping’ genes) yielded particularly poor results, surpassed even by normalization based on randomly selected gene sets. Conversely, seven of the algorithms approached what appears to be optimal normalization. Three of these algorithms rely on the identification of “ubiquitous” genes: genes expressed in all the samples studied, but never at very high or very low levels. We demonstrate that these include a “core” of genes expressed in many tissues in a mutually consistent pattern, which is suitable for use as an internal normalization guide. The new methods yield robustly normalized expression values, which is a prerequisite for the identification of differentially expressed and tissue-specific genes as potential biomarkers. PMID:24223126
Multiscale Embedded Gene Co-expression Network Analysis
Song, Won-Min; Zhang, Bin
2015-01-01
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. PMID:26618778
Multiscale Embedded Gene Co-expression Network Analysis.
Song, Won-Min; Zhang, Bin
2015-11-01
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Tanaka, F; Wada, H; Fukui, Y; Fukushima, M
2011-08-01
Previous small-sized studies showed lower thymidylate synthase (TS) expression in adenocarcinoma of the lung, which may explain higher antitumor activity of TS-inhibiting agents such as pemetrexed. To quantitatively measure TS gene expression in a large-scale Japanese population (n = 2621) with primary lung cancer, laser-captured microdissected sections were cut from primary tumors, surrounding normal lung tissues and involved nodes. TS gene expression level in primary tumor was significantly higher than that in normal lung tissue (mean TS/β-actin, 3.4 and 1.0, respectively; P < 0.01), and TS gene expression level was further higher in involved node (mean TS/β-actin, 7.7; P < 0.01). Analyses of TS gene expression levels in primary tumor according to histologic cell type revealed that small-cell carcinoma showed highest TS expression (mean TS/β-actin, 13.8) and that squamous cell carcinoma showed higher TS expression as compared with adenocarcinoma (mean TS/β-actin, 4.3 and 2.3, respectively; P < 0.01); TS gene expression was significantly increased along with a decrease in the grade of tumor cell differentiation. There was no significant difference in TS gene expression according to any other patient characteristics including tumor progression. Lower TS expression in adenocarcinoma of the lung was confirmed in a large-scale study.
Integrative approaches for large-scale transcriptome-wide association studies
Gusev, Alexander; Ko, Arthur; Shi, Huwenbo; Bhatia, Gaurav; Chung, Wonil; Penninx, Brenda W J H; Jansen, Rick; de Geus, Eco JC; Boomsma, Dorret I; Wright, Fred A; Sullivan, Patrick F; Nikkola, Elina; Alvarez, Marcus; Civelek, Mete; Lusis, Aldons J.; Lehtimäki, Terho; Raitoharju, Emma; Kähönen, Mika; Seppälä, Ilkka; Raitakari, Olli T.; Kuusisto, Johanna; Laakso, Markku; Price, Alkes L.; Pajukanta, Päivi; Pasaniuc, Bogdan
2016-01-01
Many genetic variants influence complex traits by modulating gene expression, thus altering the abundance levels of one or multiple proteins. Here, we introduce a powerful strategy that integrates gene expression measurements with summary association statistics from large-scale genome-wide association studies (GWAS) to identify genes whose cis-regulated expression is associated to complex traits. We leverage expression imputation to perform a transcriptome wide association scan (TWAS) to identify significant expression-trait associations. We applied our approaches to expression data from blood and adipose tissue measured in ~3,000 individuals overall. We imputed gene expression into GWAS data from over 900,000 phenotype measurements to identify 69 novel genes significantly associated to obesity-related traits (BMI, lipids, and height). Many of the novel genes are associated with relevant phenotypes in the Hybrid Mouse Diversity Panel. Our results showcase the power of integrating genotype, gene expression and phenotype to gain insights into the genetic basis of complex traits. PMID:26854917
A Review of Feature Extraction Software for Microarray Gene Expression Data
Tan, Ching Siang; Ting, Wai Soon; Mohamad, Mohd Saberi; Chan, Weng Howe; Deris, Safaai; Ali Shah, Zuraini
2014-01-01
When gene expression data are too large to be processed, they are transformed into a reduced representation set of genes. Transforming large-scale gene expression data into a set of genes is called feature extraction. If the genes extracted are carefully chosen, this gene set can extract the relevant information from the large-scale gene expression data, allowing further analysis by using this reduced representation instead of the full size data. In this paper, we review numerous software applications that can be used for feature extraction. The software reviewed is mainly for Principal Component Analysis (PCA), Independent Component Analysis (ICA), Partial Least Squares (PLS), and Local Linear Embedding (LLE). A summary and sources of the software are provided in the last section for each feature extraction method. PMID:25250315
DEXTER: Disease-Expression Relation Extraction from Text.
Gupta, Samir; Dingerdissen, Hayley; Ross, Karen E; Hu, Yu; Wu, Cathy H; Mazumder, Raja; Vijay-Shanker, K
2018-01-01
Gene expression levels affect biological processes and play a key role in many diseases. Characterizing expression profiles is useful for clinical research, and diagnostics and prognostics of diseases. There are currently several high-quality databases that capture gene expression information, obtained mostly from large-scale studies, such as microarray and next-generation sequencing technologies, in the context of disease. The scientific literature is another rich source of information on gene expression-disease relationships that not only have been captured from large-scale studies but have also been observed in thousands of small-scale studies. Expression information obtained from literature through manual curation can extend expression databases. While many of the existing databases include information from literature, they are limited by the time-consuming nature of manual curation and have difficulty keeping up with the explosion of publications in the biomedical field. In this work, we describe an automated text-mining tool, Disease-Expression Relation Extraction from Text (DEXTER) to extract information from literature on gene and microRNA expression in the context of disease. One of the motivations in developing DEXTER was to extend the BioXpress database, a cancer-focused gene expression database that includes data derived from large-scale experiments and manual curation of publications. The literature-based portion of BioXpress lags behind significantly compared to expression information obtained from large-scale studies and can benefit from our text-mined results. We have conducted two different evaluations to measure the accuracy of our text-mining tool and achieved average F-scores of 88.51 and 81.81% for the two evaluations, respectively. Also, to demonstrate the ability to extract rich expression information in different disease-related scenarios, we used DEXTER to extract information on differential expression information for 2024 genes in lung cancer, 115 glycosyltransferases in 62 cancers and 826 microRNA in 171 cancers. All extractions using DEXTER are integrated in the literature-based portion of BioXpress.Database URL: http://biotm.cis.udel.edu/DEXTER.
Asgari, Yazdan; Khosravi, Pegah; Zabihinpour, Zahra; Habibi, Mahnaz
2018-02-19
Genome-scale metabolic models have provided valuable resources for exploring changes in metabolism under normal and cancer conditions. However, metabolism itself is strongly linked to gene expression, so integration of gene expression data into metabolic models might improve the detection of genes involved in the control of tumor progression. Herein, we considered gene expression data as extra constraints to enhance the predictive powers of metabolic models. We reconstructed genome-scale metabolic models for lung and prostate, under normal and cancer conditions to detect the major genes associated with critical subsystems during tumor development. Furthermore, we utilized gene expression data in combination with an information theory-based approach to reconstruct co-expression networks of the human lung and prostate in both cohorts. Our results revealed 19 genes as candidate biomarkers for lung and prostate cancer cells. This study also revealed that the development of a complementary approach (integration of gene expression and metabolic profiles) could lead to proposing novel biomarkers and suggesting renovated cancer treatment strategies which have not been possible to detect using either of the methods alone.
A genome-scale map of expression for a mouse brain section obtained using voxelation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chin, Mark H.; Geng, Alex B.; Khan, Arshad H.
Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological diseases. We have reconstructed 2- dimensional images of gene expression for 20,000 genes in a coronal slice of the mouse brain at the level of the striatum by using microarrays in combination with voxelation at a resolution of 1 mm3. Good reliability of the microarray results were confirmed using multiple replicates, subsequent quantitative RT-PCR voxelation, mass spectrometry voxelation and publicly available in situ hybridization data. Known and novel genes were identified with expression patterns localized to defined substructures within the brain. In addition, genesmore » with unexpected patterns were identified and cluster analysis identified a set of genes with a gradient of dorsal/ventral expression not restricted to known anatomical boundaries. The genome-scale maps of gene expression obtained using voxelation will be a valuable tool for the neuroscience community.« less
2012-01-01
Background DNA cytosine methylation is an epigenetic modification that has been implicated in many biological processes. However, large-scale epigenomic studies have been applied to very few plant species, and variability in methylation among specialized tissues and its relationship to gene expression is poorly understood. Results We surveyed DNA methylation from seven distinct tissue types (vegetative bud, male inflorescence [catkin], female catkin, leaf, root, xylem, phloem) in the reference tree species black cottonwood (Populus trichocarpa). Using 5-methyl-cytosine DNA immunoprecipitation followed by Illumina sequencing (MeDIP-seq), we mapped a total of 129,360,151 36- or 32-mer reads to the P. trichocarpa reference genome. We validated MeDIP-seq results by bisulfite sequencing, and compared methylation and gene expression using published microarray data. Qualitative DNA methylation differences among tissues were obvious on a chromosome scale. Methylated genes had lower expression than unmethylated genes, but genes with methylation in transcribed regions ("gene body methylation") had even lower expression than genes with promoter methylation. Promoter methylation was more frequent than gene body methylation in all tissues except male catkins. Male catkins differed in demethylation of particular transposable element categories, in level of gene body methylation, and in expression range of genes with methylated transcribed regions. Tissue-specific gene expression patterns were correlated with both gene body and promoter methylation. Conclusions We found striking differences among tissues in methylation, which were apparent at the chromosomal scale and when genes and transposable elements were examined. In contrast to other studies in plants, gene body methylation had a more repressive effect on transcription than promoter methylation. PMID:22251412
Pao, Sheng-Ying; Lin, Win-Li; Hwang, Ming-Jing
2006-01-01
Background Screening for differentially expressed genes on the genomic scale and comparative analysis of the expression profiles of orthologous genes between species to study gene function and regulation are becoming increasingly feasible. Expressed sequence tags (ESTs) are an excellent source of data for such studies using bioinformatic approaches because of the rich libraries and tremendous amount of data now available in the public domain. However, any large-scale EST-based bioinformatics analysis must deal with the heterogeneous, and often ambiguous, tissue and organ terms used to describe EST libraries. Results To deal with the issue of tissue source, in this work, we carefully screened and organized more than 8 million human and mouse ESTs into 157 human and 108 mouse tissue/organ categories, to which we applied an established statistic test using different thresholds of the p value to identify genes differentially expressed in different tissues. Further analysis of the tissue distribution and level of expression of human and mouse orthologous genes showed that tissue-specific orthologs tended to have more similar expression patterns than those lacking significant tissue specificity. On the other hand, a number of orthologs were found to have significant disparity in their expression profiles, hinting at novel functions, divergent regulation, or new ortholog relationships. Conclusion Comprehensive statistics on the tissue-specific expression of human and mouse genes were obtained in this very large-scale, EST-based analysis. These statistical results have been organized into a database, freely accessible at our website , for easy searching of human and mouse tissue-specific genes and for investigating gene expression profiles in the context of comparative genomics. Comparative analysis showed that, although highly tissue-specific genes tend to exhibit similar expression profiles in human and mouse, there are significant exceptions, indicating that orthologous genes, while sharing basic genomic properties, could result in distinct phenotypes. PMID:16626500
Modeling gene expression measurement error: a quasi-likelihood approach
Strimmer, Korbinian
2003-01-01
Background Using suitable error models for gene expression measurements is essential in the statistical analysis of microarray data. However, the true probabilistic model underlying gene expression intensity readings is generally not known. Instead, in currently used approaches some simple parametric model is assumed (usually a transformed normal distribution) or the empirical distribution is estimated. However, both these strategies may not be optimal for gene expression data, as the non-parametric approach ignores known structural information whereas the fully parametric models run the risk of misspecification. A further related problem is the choice of a suitable scale for the model (e.g. observed vs. log-scale). Results Here a simple semi-parametric model for gene expression measurement error is presented. In this approach inference is based an approximate likelihood function (the extended quasi-likelihood). Only partial knowledge about the unknown true distribution is required to construct this function. In case of gene expression this information is available in the form of the postulated (e.g. quadratic) variance structure of the data. As the quasi-likelihood behaves (almost) like a proper likelihood, it allows for the estimation of calibration and variance parameters, and it is also straightforward to obtain corresponding approximate confidence intervals. Unlike most other frameworks, it also allows analysis on any preferred scale, i.e. both on the original linear scale as well as on a transformed scale. It can also be employed in regression approaches to model systematic (e.g. array or dye) effects. Conclusions The quasi-likelihood framework provides a simple and versatile approach to analyze gene expression data that does not make any strong distributional assumptions about the underlying error model. For several simulated as well as real data sets it provides a better fit to the data than competing models. In an example it also improved the power of tests to identify differential expression. PMID:12659637
NASA Astrophysics Data System (ADS)
Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M. Y.; Qian, Pei-Yuan; Qiu, Jian-Wen
2017-04-01
Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.).
Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M.Y.; Qian, Pei-Yuan; Qiu, Jian-Wen
2017-01-01
Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.). PMID:28397791
Cronn, Richard; Dolan, Peter C; Jogdeo, Sanjuro; Wegrzyn, Jill L; Neale, David B; St Clair, J Bradley; Denver, Dee R
2017-07-24
Perennial growth in plants is the product of interdependent cycles of daily and annual stimuli that induce cycles of growth and dormancy. In conifers, needles are the key perennial organ that integrates daily and seasonal signals from light, temperature, and water availability. To understand the relationship between seasonal cycles and seasonal gene expression responses in conifers, we examined diurnal and circannual needle mRNA accumulation in Douglas-fir (Pseudotsuga menziesii) needles at diurnal and circannual scales. Using mRNA sequencing, we sampled 6.1 × 10 9 reads from 19 trees and constructed a de novo pan-transcriptome reference that includes 173,882 tree-derived transcripts. Using this reference, we mapped RNA-Seq reads from 179 samples that capture daily and annual variation. We identified 12,042 diurnally-cyclic transcripts, 9299 of which showed homology to annotated genes from other plant genomes, including angiosperm core clock genes. Annual analysis revealed 21,225 circannual transcripts, 17,335 of which showed homology to annotated genes from other plant genomes. The timing of maximum gene expression is associated with light intensity at diurnal scales and photoperiod at annual scales, with approximately half of transcripts reaching maximum expression +/- 2 h from sunrise and sunset, and +/- 20 days from winter and summer solstices. Comparisons with published studies from other conifers shows congruent behavior in clock genes with Japanese cedar (Cryptomeria), and a significant preservation of gene expression patterns for 2278 putative orthologs from Douglas-fir during the summer growing season, and 760 putative orthologs from spruce (Picea) during the transition from fall to winter. Our study highlight the extensive diurnal and circannual transcriptome variability demonstrated in conifer needles. At these temporal scales, 29% of expressed transcripts show a significant diurnal cycle, and 58.7% show a significant circannual cycle. Remarkably, thousands of genes reach their annual peak activity during winter dormancy. Our study establishes the fine-scale timing of daily and annual maximum gene expression for diverse needle genes in Douglas-fir, and it highlights the potential for using this information for evaluating hypotheses concerning the daily or seasonal timing of gene activity in temperate-zone conifers, and for identifying cyclic transcriptome components in other conifer species.
Sequeira, Ana Filipa; Brás, Joana L A; Guerreiro, Catarina I P D; Vincentelli, Renaud; Fontes, Carlos M G A
2016-12-01
Gene synthesis is becoming an important tool in many fields of recombinant DNA technology, including recombinant protein production. De novo gene synthesis is quickly replacing the classical cloning and mutagenesis procedures and allows generating nucleic acids for which no template is available. In addition, when coupled with efficient gene design algorithms that optimize codon usage, it leads to high levels of recombinant protein expression. Here, we describe the development of an optimized gene synthesis platform that was applied to the large scale production of small genes encoding venom peptides. This improved gene synthesis method uses a PCR-based protocol to assemble synthetic DNA from pools of overlapping oligonucleotides and was developed to synthesise multiples genes simultaneously. This technology incorporates an accurate, automated and cost effective ligation independent cloning step to directly integrate the synthetic genes into an effective Escherichia coli expression vector. The robustness of this technology to generate large libraries of dozens to thousands of synthetic nucleic acids was demonstrated through the parallel and simultaneous synthesis of 96 genes encoding animal toxins. An automated platform was developed for the large-scale synthesis of small genes encoding eukaryotic toxins. Large scale recombinant expression of synthetic genes encoding eukaryotic toxins will allow exploring the extraordinary potency and pharmacological diversity of animal venoms, an increasingly valuable but unexplored source of lead molecules for drug discovery.
Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali
2013-01-01
Our goal of this study was to reconstruct a “genome-scale co-expression network” and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named “genome-scale co-expression network”. As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules. PMID:23874428
Bidkhori, Gholamreza; Narimani, Zahra; Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali
2013-01-01
Our goal of this study was to reconstruct a "genome-scale co-expression network" and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named "genome-scale co-expression network". As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules.
Sanzol, Javier
2010-05-14
Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae), a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to gamma-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82% of duplicated genes, including 80% of young paralogs, showed uncorrelated expression profiles, suggesting extensive subfunctionalization and a role of gene duplication in the acquisition of novel patterns of gene expression. This study reports a genome-wide analysis of the mode of gene duplication in the apple, and provides evidence for its role in genome functional diversification by characterising three major processes: selective retention of paralogs, amplification of gene families, and changes in gene expression.
Grath, Sonja; Parsch, John
2012-01-01
Sex-biased gene expression (i.e., the differential expression of genes between males and females) is common among sexually reproducing species. However, genes often differ in their sex-bias classification or degree of sex bias between species. There is also an unequal distribution of sex-biased genes (especially male-biased genes) between the X chromosome and the autosomes. We used whole-genome expression data and evolutionary rate estimates for two different Drosophilid lineages, melanogaster and obscura, spanning an evolutionary time scale of around 50 Myr to investigate the influence of sex-biased gene expression and chromosomal location on the rate of molecular evolution. In both lineages, the rate of protein evolution correlated positively with the male/female expression ratio. Genes with highly male-biased expression, genes expressed specifically in male reproductive tissues, and genes with conserved male-biased expression over long evolutionary time scales showed the fastest rates of evolution. An analysis of sex-biased gene evolution in both lineages revealed evidence for a “fast-X” effect in which the rate of evolution was greater for X-linked than for autosomal genes. This pattern was particularly pronounced for male-biased genes. Genes located on the obscura “neo-X” chromosome, which originated from a recent X-autosome fusion, showed rates of evolution that were intermediate between genes located on the ancestral X-chromosome and the autosomes. This suggests that the shift to X-linkage led to an increase in the rate of molecular evolution. PMID:22321769
paraGSEA: a scalable approach for large-scale gene expression profiling
Peng, Shaoliang; Yang, Shunyun
2017-01-01
Abstract More studies have been conducted using gene expression similarity to identify functional connections among genes, diseases and drugs. Gene Set Enrichment Analysis (GSEA) is a powerful analytical method for interpreting gene expression data. However, due to its enormous computational overhead in the estimation of significance level step and multiple hypothesis testing step, the computation scalability and efficiency are poor on large-scale datasets. We proposed paraGSEA for efficient large-scale transcriptome data analysis. By optimization, the overall time complexity of paraGSEA is reduced from O(mn) to O(m+n), where m is the length of the gene sets and n is the length of the gene expression profiles, which contributes more than 100-fold increase in performance compared with other popular GSEA implementations such as GSEA-P, SAM-GS and GSEA2. By further parallelization, a near-linear speed-up is gained on both workstations and clusters in an efficient manner with high scalability and performance on large-scale datasets. The analysis time of whole LINCS phase I dataset (GSE92742) was reduced to nearly half hour on a 1000 node cluster on Tianhe-2, or within 120 hours on a 96-core workstation. The source code of paraGSEA is licensed under the GPLv3 and available at http://github.com/ysycloud/paraGSEA. PMID:28973463
Shi, Kerong; He, Feng; Yuan, Xuefeng; Zhao, Yaofeng; Deng, Xuemei; Hu, Xiaoxiang; Li, Ning
2013-08-01
The ovarian follicle supplies a unique dynamic system for gametes that ensures the propagation of the species. During folliculogenesis, the vast majority of the germ cells are lost or inactivated because of ovarian follicle atresia, resulting in diminished reproductive potency and potential infertility. Understanding the underlying molecular mechanism of folliculogenesis rules is essential. Primordial (P), preantral (M), and large antral (L) porcine follicles were used to reveal their genome-wide gene expression profiles. Results indicate that primordial follicles (P) process a diverse gene expression pattern compared to growing follicles (M and L). The 5,548 differentially expressed genes display a similar expression mode in M and L, with a correlation coefficient of 0.892. The number of regulated (both up and down) genes in M is more than that in L. Also, their regulation folds in M (2-364-fold) are much more acute than in L (2-75-fold). Differentially expressed gene groups with different regulation patterns in certain follicular stages are identified and presumed to be closely related following follicular developmental rules. Interestingly, functional annotation analysis revealed that these gene groups feature distinct biological processes or molecular functions. Moreover, representative candidate genes from these gene groups have had their RNA or protein expressions within follicles confirmed. Our study emphasized genome-scale gene expression characteristics, which provide novel entry points for understanding the folliculogenesis rules on the molecular level, such as follicular initiation, atresia, and dominance. Transcriptional regulatory circuitries in certain follicular stages are expected to be found among the identified differentially expressed gene groups.
Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H
2017-11-01
Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.
USDA-ARS?s Scientific Manuscript database
Large-scale, gene expression methods allow for high throughput analysis of physiological pathways at a fraction of the cost of individual gene expression analysis. Systems, such as the Fluidigm quantitative PCR array described here, can provide powerful assessments of the effects of diet, environme...
Course 10: Three Lectures on Biological Networks
NASA Astrophysics Data System (ADS)
Magnasco, M. O.
1 Enzymatic networks. Proofreading knots: How DNA topoisomerases disentangle DNA 1.1 Length scales and energy scales 1.2 DNA topology 1.3 Topoisomerases 1.4 Knots and supercoils 1.5 Topological equilibrium 1.6 Can topoisomerases recognize topology? 1.7 Proposal: Kinetic proofreading 1.8 How to do it twice 1.9 The care and proofreading of knots 1.10 Suppression of supercoils 1.11 Problems and outlook 1.12 Disquisition 2 Gene expression networks. Methods for analysis of DNA chip experiments 2.1 The regulation of gene expression 2.2 Gene expression arrays 2.3 Analysis of array data 2.4 Some simplifying assumptions 2.5 Probeset analysis 2.6 Discussion 3 Neural and gene expression networks: Song-induced gene expression in the canary brain 3.1 The study of songbirds 3.2 Canary song 3.3 ZENK 3.4 The blush 3.5 Histological analysis 3.6 Natural vs. artificial 3.7 The Blush II: gAP 3.8 Meditation
PLEXdb: Gene expression resources for plants and plant pathogens
USDA-ARS?s Scientific Manuscript database
PLEXdb (Plant Expression Database), in partnership with community databases, supports comparisons of gene expression across multiple plant and pathogen species, promoting individuals and/or consortia to upload genome-scale data sets to contrast them to previously archived data. These analyses facili...
Gildor, Tsvia; Ben-Tabou de-Leon, Smadar
2015-01-01
Accurate temporal control of gene expression is essential for normal development and must be robust to natural genetic and environmental variation. Studying gene expression variation within and between related species can delineate the level of expression variability that development can tolerate. Here we exploit the comprehensive model of sea urchin gene regulatory networks and generate high-density expression profiles of key regulatory genes of the Mediterranean sea urchin, Paracentrotus lividus (Pl). The high resolution of our studies reveals highly reproducible gene initiation times that have lower variation than those of maximal mRNA levels between different individuals of the same species. This observation supports a threshold behavior of gene activation that is less sensitive to input concentrations. We then compare Mediterranean sea urchin gene expression profiles to those of its Pacific Ocean relative, Strongylocentrotus purpuratus (Sp). These species shared a common ancestor about 40 million years ago and show highly similar embryonic morphologies. Our comparative analyses of five regulatory circuits operating in different embryonic territories reveal a high conservation of the temporal order of gene activation but also some cases of divergence. A linear ratio of 1.3-fold between gene initiation times in Pl and Sp is partially explained by scaling of the developmental rates with temperature. Scaling the developmental rates according to the estimated Sp-Pl ratio and normalizing the expression levels reveals a striking conservation of relative dynamics of gene expression between the species. Overall, our findings demonstrate the ability of biological developmental systems to tightly control the timing of gene activation and relative dynamics and overcome expression noise induced by genetic variation and growth conditions. PMID:26230518
Analysis of blood-based gene expression in idiopathic Parkinson disease.
Shamir, Ron; Klein, Christine; Amar, David; Vollstedt, Eva-Juliane; Bonin, Michael; Usenovic, Marija; Wong, Yvette C; Maver, Ales; Poths, Sven; Safer, Hershel; Corvol, Jean-Christophe; Lesage, Suzanne; Lavi, Ofer; Deuschl, Günther; Kuhlenbaeumer, Gregor; Pawlack, Heike; Ulitsky, Igor; Kasten, Meike; Riess, Olaf; Brice, Alexis; Peterlin, Borut; Krainc, Dimitri
2017-10-17
To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples). Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks. A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E-6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E-4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1 , ATP5A1 , and VDAC3 . We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers. © 2017 American Academy of Neurology.
Hiss, Manuel; Laule, Oliver; Meskauskiene, Rasa M; Arif, Muhammad A; Decker, Eva L; Erxleben, Anika; Frank, Wolfgang; Hanke, Sebastian T; Lang, Daniel; Martin, Anja; Neu, Christina; Reski, Ralf; Richardt, Sandra; Schallenberg-Rüdinger, Mareike; Szövényi, Peter; Tiko, Theodhor; Wiedemann, Gertrud; Wolf, Luise; Zimmermann, Philip; Rensing, Stefan A
2014-08-01
The moss Physcomitrella patens is an important model organism for studying plant evolution, development, physiology and biotechnology. Here we have generated microarray gene expression data covering the principal developmental stages, culture forms and some environmental/stress conditions. Example analyses of developmental stages and growth conditions as well as abiotic stress treatments demonstrate that (i) growth stage is dominant over culture conditions, (ii) liquid culture is not stressful for the plant, (iii) low pH might aid protoplastation by reduced expression of cell wall structure genes, (iv) largely the same gene pool mediates response to dehydration and rehydration, and (v) AP2/EREBP transcription factors play important roles in stress response reactions. With regard to the AP2 gene family, phylogenetic analysis and comparison with Arabidopsis thaliana shows commonalities as well as uniquely expressed family members under drought, light perturbations and protoplastation. Gene expression profiles for P. patens are available for the scientific community via the easy-to-use tool at https://www.genevestigator.com. By providing large-scale expression profiles, the usability of this model organism is further enhanced, for example by enabling selection of control genes for quantitative real-time PCR. Now, gene expression levels across a broad range of conditions can be accessed online for P. patens. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Wu, Jun-Zheng; Liu, Qin; Geng, Xiao-Shan; Li, Kai-Mian; Luo, Li-Juan; Liu, Jin-Ping
2017-03-14
Cassava (Manihot esculenta Crantz) is a major crop extensively cultivated in the tropics as both an important source of calories and a promising source for biofuel production. Although stable gene expression have been used for transgenic breeding and gene function study, a quick, easy and large-scale transformation platform has been in urgent need for gene functional characterization, especially after the cassava full genome was sequenced. Fully expanded leaves from in vitro plantlets of Manihot esculenta were used to optimize the concentrations of cellulase R-10 and macerozyme R-10 for obtaining protoplasts with the highest yield and viability. Then, the optimum conditions (PEG4000 concentration and transfection time) were determined for cassava protoplast transient gene expression. In addition, the reliability of the established protocol was confirmed for subcellular protein localization. In this work we optimized the main influencing factors and developed an efficient mesophyll protoplast isolation and PEG-mediated transient gene expression in cassava. The suitable enzyme digestion system was established with the combination of 1.6% cellulase R-10 and 0.8% macerozyme R-10 for 16 h of digestion in the dark at 25 °C, resulting in the high yield (4.4 × 10 7 protoplasts/g FW) and vitality (92.6%) of mesophyll protoplasts. The maximum transfection efficiency (70.8%) was obtained with the incubation of the protoplasts/vector DNA mixture with 25% PEG4000 for 10 min. We validated the applicability of the system for studying the subcellular localization of MeSTP7 (an H + /monosaccharide cotransporter) with our transient expression protocol and a heterologous Arabidopsis transient gene expression system. We optimized the main influencing factors and developed an efficient mesophyll protoplast isolation and transient gene expression in cassava, which will facilitate large-scale characterization of genes and pathways in cassava.
Annotation of gene function in citrus using gene expression information and co-expression networks
2014-01-01
Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. Conclusions Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provide opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for the gene co-expression analysis in citrus. PMID:25023870
Effects of seawater acidification on gene expression: resolving broader-scale trends in sea urchins.
Evans, Tyler G; Watson-Wynn, Priscilla
2014-06-01
Sea urchins are ecologically and economically important calcifying organisms threatened by acidification of the global ocean caused by anthropogenic CO2 emissions. Propelled by the sequencing of the purple sea urchin (Strongylocentrotus purpuratus) genome, profiling changes in gene expression during exposure to high pCO2 seawater has emerged as a powerful and increasingly common method to infer the response of urchins to ocean change. However, analyses of gene expression are sensitive to experimental methodology, and comparisons between studies of genes regulated by ocean acidification are most often made in the context of major caveats. Here we perform meta-analyses as a means of minimizing experimental discrepancies and resolving broader-scale trends regarding the effects of ocean acidification on gene expression in urchins. Analyses across eight studies and four urchin species largely support prevailing hypotheses about the impact of ocean acidification on marine calcifiers. The predominant expression pattern involved the down-regulation of genes within energy-producing pathways, a clear indication of metabolic depression. Genes with functions in ion transport were significantly over-represented and are most plausibly contributing to intracellular pH regulation. Expression profiles provided extensive evidence for an impact on biomineralization, epitomized by the down-regulation of seven spicule matrix proteins. In contrast, expression profiles provided limited evidence for CO2-mediated developmental delay or induction of a cellular stress response. Congruence between studies of gene expression and the ocean acidification literature in general validates the accuracy of gene expression in predicting the consequences of ocean change and justifies its continued use in future studies. © 2014 Marine Biological Laboratory.
Chi, Baofang; Tao, Shiheng; Liu, Yanlin
2015-01-01
Sampling the solution space of genome-scale models is generally conducted to determine the feasible region for metabolic flux distribution. Because the region for actual metabolic states resides only in a small fraction of the entire space, it is necessary to shrink the solution space to improve the predictive power of a model. A common strategy is to constrain models by integrating extra datasets such as high-throughput datasets and C13-labeled flux datasets. However, studies refining these approaches by performing a meta-analysis of massive experimental metabolic flux measurements, which are closely linked to cellular phenotypes, are limited. In the present study, experimentally identified metabolic flux data from 96 published reports were systematically reviewed. Several strong associations among metabolic flux phenotypes were observed. These phenotype-phenotype associations at the flux level were quantified and integrated into a Saccharomyces cerevisiae genome-scale model as extra physiological constraints. By sampling the shrunken solution space of the model, the metabolic flux fluctuation level, which is an intrinsic trait of metabolic reactions determined by the network, was estimated and utilized to explore its relationship to gene expression noise. Although no correlation was observed in all enzyme-coding genes, a relationship between metabolic flux fluctuation and expression noise of genes associated with enzyme-dosage sensitive reactions was detected, suggesting that the metabolic network plays a role in shaping gene expression noise. Such correlation was mainly attributed to the genes corresponding to non-essential reactions, rather than essential ones. This was at least partially, due to regulations underlying the flux phenotype-phenotype associations. Altogether, this study proposes a new approach in shrinking the solution space of a genome-scale model, of which sampling provides new insights into gene expression noise.
Diffusion and scaling during early embryonic pattern formation.
Gregor, Thomas; Bialek, William; de Ruyter van Steveninck, Rob R; Tank, David W; Wieschaus, Eric F
2005-12-20
Development of spatial patterns in multicellular organisms depends on gradients in the concentration of signaling molecules that control gene expression. In the Drosophila embryo, Bicoid (Bcd) morphogen controls cell fate along 70% of the anteroposterior axis but is translated from mRNA localized at the anterior pole. Gradients of Bcd and other morphogens are thought to arise through diffusion, but this basic assumption has never been rigorously tested in living embryos. Furthermore, because diffusion sets a relationship between length and time scales, it is hard to see how patterns of gene expression established by diffusion would scale proportionately as egg size changes during evolution. Here, we show that the motion of inert molecules through the embryo is well described by the diffusion equation on the relevant length and time scales, and that effective diffusion constants are essentially the same in closely related dipteran species with embryos of very different size. Nonetheless, patterns of gene expression in these different species scale with egg length. We show that this scaling can be traced back to scaling of the Bcd gradient itself. Our results, together with constraints imposed by the time scales of development, suggest that the mechanism for scaling is a species-specific adaptation of the Bcd lifetime.
Sex-Biased Gene Expression and Sexual Conflict throughout Development
Ingleby, Fiona C.; Flis, Ilona; Morrow, Edward H.
2015-01-01
Sex-biased gene expression is likely to account for most sexually dimorphic traits because males and females share much of their genome. When fitness optima differ between sexes for a shared trait, sexual dimorphism can allow each sex to express their optimum trait phenotype, and in this way, the evolution of sex-biased gene expression is one mechanism that could help to resolve intralocus sexual conflict. Genome-wide patterns of sex-biased gene expression have been identified in a number of studies, which we review here. However, very little is known about how sex-biased gene expression relates to sex-specific fitness and about how sex-biased gene expression and conflict vary throughout development or across different genotypes, populations, and environments. We discuss the importance of these neglected areas of research and use data from a small-scale experiment on sex-specific expression of genes throughout development to highlight potentially interesting avenues for future research. PMID:25376837
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kolker, Eugene
Our project focused primarily on analysis of different types of data produced by global high-throughput technologies, data integration of gene annotation, and gene and protein expression information, as well as on getting a better functional annotation of Shewanella genes. Specifically, four of our numerous major activities and achievements include the development of: statistical models for identification and expression proteomics, superior to currently available approaches (including our own earlier ones); approaches to improve gene annotations on the whole-organism scale; standards for annotation, transcriptomics and proteomics approaches; and generalized approaches for data integration of gene annotation, gene and protein expression information.
Oakley, Todd H; Gu, Zhenglong; Abouheif, Ehab; Patel, Nipam H; Li, Wen-Hsiung
2005-01-01
Understanding the evolution of gene function is a primary challenge of modern evolutionary biology. Despite an expanding database from genomic and developmental studies, we are lacking quantitative methods for analyzing the evolution of some important measures of gene function, such as gene-expression patterns. Here, we introduce phylogenetic comparative methods to compare different models of gene-expression evolution in a maximum-likelihood framework. We find that expression of duplicated genes has evolved according to a nonphylogenetic model, where closely related genes are no more likely than more distantly related genes to share common expression patterns. These results are consistent with previous studies that found rapid evolution of gene expression during the history of yeast. The comparative methods presented here are general enough to test a wide range of evolutionary hypotheses using genomic-scale data from any organism.
APPLICATION OF DNA MICROARRAYS TO REPRODUCTIVE TOXICOLOGY AND THE DEVELOPMENT OF A TESTIS ARRAY
With the advent of sequence information for entire mammalian genomes, it is now possible to analyze gene expression and gene polymorphisms on a genomic scale. The primary tool for analysis of gene expression is the DNA microarray. We have used commercially available cDNA micro...
With the advent of sequence information for entire eukaryotic genomes, it is now possible to analyze gene expression on a genomic scale. The primary tool for genomic analysis of gene expression is the gene microarray. We have used commercially available and custom cDNA microarray...
Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei
2018-01-01
Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.
Gene expression profiling in the hippocampus of learned helpless and nonhelpless rats.
Kohen, R; Kirov, S; Navaja, G P; Happe, H Kevin; Hamblin, M W; Snoddy, J R; Neumaier, J F; Petty, F
2005-01-01
In the learned helplessness (LH) animal model of depression, failure to attempt escape from avoidable environmental stress, LH, indicates behavioral despair, whereas nonhelpless (NH) behavior reflects behavioral resilience to the effects of environmental stress. Comparing hippocampal gene expression with large-scale oligonucleotide microarrays, we found that stress-resilient (NH) rats, although behaviorally indistinguishable from controls, showed a distinct gene expression profile compared to LH, sham stressed, and naïve control animals. Genes that were confirmed as differentially expressed in the NH group by quantitative PCR strongly correlated in their levels of expression across all four animal groups. Differential expression could not be confirmed at the protein level. We identified several shared degenerate sequence motifs in the 3' untranslated region (3'UTR) of differentially expressed genes that could be a factor in this tight correlation of expression levels among differentially expressed genes.
USDA-ARS?s Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
Ingestion of bacterially expressed double-stranded RNA inhibits gene expression in planarians.
Newmark, Phillip A; Reddien, Peter W; Cebrià, Francesc; Sánchez Alvarado, Alejandro
2003-09-30
Freshwater planarian flatworms are capable of regenerating complete organisms from tiny fragments of their bodies; the basis for this regenerative prowess is an experimentally accessible stem cell population that is present in the adult planarian. The study of these organisms, classic experimental models for investigating metazoan regeneration, has been revitalized by the application of modern molecular biological approaches. The identification of thousands of unique planarian ESTs, coupled with large-scale whole-mount in situ hybridization screens, and the ability to inhibit planarian gene expression through double-stranded RNA-mediated genetic interference, provide a wealth of tools for studying the molecular mechanisms that regulate tissue regeneration and stem cell biology in these organisms. Here we show that, as in Caenorhabditis elegans, ingestion of bacterially expressed double-stranded RNA can inhibit gene expression in planarians. This inhibition persists throughout the process of regeneration, allowing phenotypes with disrupted regenerative patterning to be identified. These results pave the way for large-scale screens for genes involved in regenerative processes.
TOXICOGENOMICS AND HUMAN DISEASE RISK ASSESSMENT
Toxicogenomics and Human Disease Risk Assessment.
Complete sequencing of human and other genomes, availability of large-scale gene
expression arrays with ever-increasing numbers of genes displayed, and steady
improvements in protein expression technology can hav...
Tsuchiya, Masa; Giuliani, Alessandro; Hashimoto, Midori; Erenpreisa, Jekaterina; Yoshikawa, Kenichi
2015-01-01
Background The underlying mechanism of dynamic control of the genome-wide expression is a fundamental issue in bioscience. We addressed it in terms of phase transition by a systemic approach based on both density analysis and characteristics of temporal fluctuation for the time-course mRNA expression in differentiating MCF-7 breast cancer cells. Methodology In a recent work, we suggested criticality as an essential aspect of dynamic control of genome-wide gene expression. Criticality was evident by a unimodal-bimodal transition through flattened unimodal expression profile. The flatness on the transition suggests the existence of a critical transition at which up- and down-regulated expression is balanced. Mean field (averaging) behavior of mRNAs based on the temporal expression changes reveals a sandpile type of transition in the flattened profile. Furthermore, around the transition, a self-similar unimodal-bimodal transition of the whole expression occurs in the density profile of an ensemble of mRNA expression. These singular and scaling behaviors identify the transition as the expression phase transition driven by self-organized criticality (SOC). Principal Findings Emergent properties of SOC through a mean field approach are revealed: i) SOC, as a form of genomic phase transition, consolidates distinct critical states of expression, ii) Coupling of coherent stochastic oscillations between critical states on different time-scales gives rise to SOC, and iii) Specific gene clusters (barcode genes) ranging in size from kbp to Mbp reveal similar SOC to genome-wide mRNA expression and ON-OFF synchronization to critical states. This suggests that the cooperative gene regulation of topological genome sub-units is mediated by the coherent phase transitions of megadomain-scaled conformations between compact and swollen chromatin states. Conclusion and Significance In summary, our study provides not only a systemic method to demonstrate SOC in whole-genome expression, but also introduces novel, physically grounded concepts for a breakthrough in the study of biological regulation. PMID:26067993
2011-01-01
Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the genes identified are known to be up-regulated in response to osmotic stress in pine and other plant species and encode proteins involved in both signal transduction and stress tolerance. Gene expression levels returned to control values within a 48-hour recovery period in all but 76 transcripts. Correlation network analysis indicates a scale-free network topology for the pine root transcriptome and identifies central nodes that may serve as drivers of drought-responsive transcriptome dynamics in the roots of loblolly pine. PMID:21609476
Diffusion and scaling during early embryonic pattern formation
Gregor, Thomas; Bialek, William; van Steveninck, Rob R. de Ruyter; Tank, David W.; Wieschaus, Eric F.
2005-01-01
Development of spatial patterns in multicellular organisms depends on gradients in the concentration of signaling molecules that control gene expression. In the Drosophila embryo, Bicoid (Bcd) morphogen controls cell fate along 70% of the anteroposterior axis but is translated from mRNA localized at the anterior pole. Gradients of Bcd and other morphogens are thought to arise through diffusion, but this basic assumption has never been rigorously tested in living embryos. Furthermore, because diffusion sets a relationship between length and time scales, it is hard to see how patterns of gene expression established by diffusion would scale proportionately as egg size changes during evolution. Here, we show that the motion of inert molecules through the embryo is well described by the diffusion equation on the relevant length and time scales, and that effective diffusion constants are essentially the same in closely related dipteran species with embryos of very different size. Nonetheless, patterns of gene expression in these different species scale with egg length. We show that this scaling can be traced back to scaling of the Bcd gradient itself. Our results, together with constraints imposed by the time scales of development, suggest that the mechanism for scaling is a species-specific adaptation of the Bcd lifetime. PMID:16352710
Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko
2014-05-01
Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
General statistics of stochastic process of gene expression in eukaryotic cells.
Kuznetsov, V A; Knott, G D; Bonner, R F
2002-01-01
Thousands of genes are expressed at such very low levels (< or =1 copy per cell) that global gene expression analysis of rarer transcripts remains problematic. Ambiguity in identification of rarer transcripts creates considerable uncertainty in fundamental questions such as the total number of genes expressed in an organism and the biological significance of rarer transcripts. Knowing the distribution of the true number of genes expressed at each level and the corresponding gene expression level probability function (GELPF) could help resolve these uncertainties. We found that all observed large-scale gene expression data sets in yeast, mouse, and human cells follow a Pareto-like distribution model skewed by many low-abundance transcripts. A novel stochastic model of the gene expression process predicts the universality of the GELPF both across different cell types within a multicellular organism and across different organisms. This model allows us to predict the frequency distribution of all gene expression levels within a single cell and to estimate the number of expressed genes in a single cell and in a population of cells. A random "basal" transcription mechanism for protein-coding genes in all or almost all eukaryotic cell types is predicted. This fundamental mechanism might enhance the expression of rarely expressed genes and, thus, provide a basic level of phenotypic diversity, adaptability, and random monoallelic expression in cell populations. PMID:12136033
Discovering Functions of Unannotated Genes from a Transcriptome Survey of Wild Fungal Isolates
Ellison, Christopher E.; Kowbel, David; Glass, N. Louise; Taylor, John W.
2014-01-01
ABSTRACT Most fungal genomes are poorly annotated, and many fungal traits of industrial and biomedical relevance are not well suited to classical genetic screens. Assigning genes to phenotypes on a genomic scale thus remains an urgent need in the field. We developed an approach to infer gene function from expression profiles of wild fungal isolates, and we applied our strategy to the filamentous fungus Neurospora crassa. Using transcriptome measurements in 70 strains from two well-defined clades of this microbe, we first identified 2,247 cases in which the expression of an unannotated gene rose and fell across N. crassa strains in parallel with the expression of well-characterized genes. We then used image analysis of hyphal morphologies, quantitative growth assays, and expression profiling to test the functions of four genes predicted from our population analyses. The results revealed two factors that influenced regulation of metabolism of nonpreferred carbon and nitrogen sources, a gene that governed hyphal architecture, and a gene that mediated amino acid starvation resistance. These findings validate the power of our population-transcriptomic approach for inference of novel gene function, and we suggest that this strategy will be of broad utility for genome-scale annotation in many fungal systems. PMID:24692637
Carlsbecker, Annelie; Sundström, Jens F; Englund, Marie; Uddenberg, Daniel; Izquierdo, Liz; Kvarnheden, Anders; Vergara-Silva, Francisco; Engström, Peter
2013-10-01
Reproductive organs in seed plants are morphologically divergent and their evolutionary history is often unclear. The mechanisms controlling their development have been extensively studied in angiosperms but are poorly understood in conifers and other gymnosperms. Here, we address the molecular control of seed cone development in Norway spruce, Picea abies. We present expression analyses of five novel MADS-box genes in comparison with previously identified MADS and LEAFY genes at distinct developmental stages. In addition, we have characterized the homeotic transformation from vegetative shoot to female cone and associated changes in regulatory gene expression patterns occurring in the acrocona mutant. The analyses identified genes active at the onset of ovuliferous and ovule development and identified expression patterns marking distinct domains of the ovuliferous scale. The reproductive transformation in acrocona involves the activation of all tested genes normally active in early cone development, except for an AGAMOUS-LIKE6/SEPALLATA (AGL6/SEP) homologue. This absence may be functionally associated with the nondeterminate development of the acrocona ovule-bearing scales. Our morphological and gene expression analyses give support to the hypothesis that the modern cone is a complex structure, and the ovuliferous scale the result of reductions and compactions of an ovule-bearing axillary short shoot in cones of Paleozoic conifers. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
NASA Astrophysics Data System (ADS)
Tabaka, Marcin; Kalwarczyk, Tomasz; Szymanski, Jedrzej; Hou, Sen; Hołyst, Robert
2014-09-01
We discuss a quantitative influence of macromolecular crowding on biological processes: motion, bimolecular reactions, and gene expression in prokaryotic and eukaryotic cells. We present scaling laws relating diffusion coefficient of an object moving in a cytoplasm of cells to a size of this object and degree of crowding. Such description leads to the notion of the length scale dependent viscosity characteristic for all living cells. We present an application of the length-scale dependent viscosity model to the description of motion in the cytoplasm of both eukaryotic and prokaryotic living cells. We compare the model with all recent data on diffusion of nanoscopic objects in HeLa, and E. coli cells. Additionally a description of the mobility of molecules in cell nucleus is presented. Finally we discuss the influence of crowding on the bimolecular association rates and gene expression in living cells.
Systems Biophysics of Gene Expression
Vilar, Jose M.G.; Saiz, Leonor
2013-01-01
Gene expression is a process central to any form of life. It involves multiple temporal and functional scales that extend from specific protein-DNA interactions to the coordinated regulation of multiple genes in response to intracellular and extracellular changes. This diversity in scales poses fundamental challenges to the use of traditional approaches to fully understand even the simplest gene expression systems. Recent advances in computational systems biophysics have provided promising avenues to reliably integrate the molecular detail of biophysical process into the system behavior. Here, we review recent advances in the description of gene regulation as a system of biophysical processes that extend from specific protein-DNA interactions to the combinatorial assembly of nucleoprotein complexes. There is now basic mechanistic understanding on how promoters controlled by multiple, local and distal, DNA binding sites for transcription factors can actively control transcriptional noise, cell-to-cell variability, and other properties of gene regulation, including precision and flexibility of the transcriptional responses. PMID:23790365
Gene Expression Noise, Fitness Landscapes, and Evolution
NASA Astrophysics Data System (ADS)
Charlebois, Daniel
The stochastic (or noisy) process of gene expression can have fitness consequences for living organisms. For example, gene expression noise facilitates the development of drug resistance by increasing the time scale at which beneficial phenotypic states can be maintained. The present work investigates the relationship between gene expression noise and the fitness landscape. By incorporating the costs and benefits of gene expression, we track how the fluctuation magnitude and timescale of expression noise evolve in simulations of cell populations under stress. We find that properties of expression noise evolve to maximize fitness on the fitness landscape, and that low levels of expression noise emerge when the fitness benefits of gene expression exceed the fitness costs (and that high levels of noise emerge when the costs of expression exceed the benefits). The findings from our theoretical/computational work offer new hypotheses on the development of drug resistance, some of which are now being investigated in evolution experiments in our laboratory using well-characterized synthetic gene regulatory networks in budding yeast. Nserc Postdoctoral Fellowship (Grant No. PDF-453977-2014).
Li, XueYan; Wang, ChunXia; Cheng, JinYun; Zhang, Jing; da Silva, Jaime A Teixeira; Liu, XiaoYu; Duan, Xin; Li, TianLai; Sun, HongMei
2014-12-19
The formation and development of bulblets are crucial to the Lilium genus since these processes are closely related to carbohydrate metabolism, especially to starch and sucrose metabolism. However, little is known about the transcriptional regulation of both processes. To gain insight into carbohydrate-related genes involved in bulblet formation and development, we conducted comparative transcriptome profiling of Lilium davidii var. unicolor bulblets at 0 d, 15 d (bulblets emerged) and 35 d (bulblets formed a basic shape with three or four scales) after scale propagation. Analysis of the transcriptome revealed that a total of 52,901 unigenes with an average sequence size of 630 bp were generated. Based on Clusters of Orthologous Groups (COG) analysis, 8% of the sequences were attributed to carbohydrate transport and metabolism. The results of KEGG pathway enrichment analysis showed that starch and sucrose metabolism constituted the predominant pathway among the three library pairs. The starch content in mother scales and bulblets decreased and increased, respectively, with almost the same trend as sucrose content. Gene expression analysis of the key enzymes in starch and sucrose metabolism suggested that sucrose synthase (SuSy) and invertase (INV), mainly hydrolyzing sucrose, presented higher gene expression in mother scales and bulblets at stages of bulblet appearance and enlargement, while sucrose phosphate synthase (SPS) showed higher expression in bulblets at morphogenesis. The enzymes involved in the starch synthetic direction such as ADPG pyrophosphorylase (AGPase), soluble starch synthase (SSS), starch branching enzyme (SBE) and granule-bound starch synthase (GBSS) showed a decreasing trend in mother scales and higher gene expression in bulblets at bulblet appearance and enlargement stages while the enzyme in the cleavage direction, starch de-branching enzyme (SDBE), showed higher gene expression in mother scales than in bulblets. An extensive transcriptome analysis of three bulblet development stages contributes considerable novel information to our understanding of carbohydrate metabolism-related genes in Lilium at the transcriptional level, and demonstrates the fundamentality of carbohydrate metabolism in bulblet emergence and development at the molecular level. This could facilitate further investigation into the molecular mechanisms underlying these processes in lily and other related species.
An integrated approach to reconstructing genome-scale transcriptional regulatory networks
Imam, Saheed; Noguera, Daniel R.; Donohue, Timothy J.; ...
2015-02-27
Transcriptional regulatory networks (TRNs) program cells to dynamically alter their gene expression in response to changing internal or environmental conditions. In this study, we develop a novel workflow for generating large-scale TRN models that integrates comparative genomics data, global gene expression analyses, and intrinsic properties of transcription factors (TFs). An assessment of this workflow using benchmark datasets for the well-studied γ-proteobacterium Escherichia coli showed that it outperforms expression-based inference approaches, having a significantly larger area under the precision-recall curve. Further analysis indicated that this integrated workflow captures different aspects of the E. coli TRN than expression-based approaches, potentially making themmore » highly complementary. We leveraged this new workflow and observations to build a large-scale TRN model for the α-Proteobacterium Rhodobacter sphaeroides that comprises 120 gene clusters, 1211 genes (including 93 TFs), 1858 predicted protein-DNA interactions and 76 DNA binding motifs. We found that ~67% of the predicted gene clusters in this TRN are enriched for functions ranging from photosynthesis or central carbon metabolism to environmental stress responses. We also found that members of many of the predicted gene clusters were consistent with prior knowledge in R. sphaeroides and/or other bacteria. Experimental validation of predictions from this R. sphaeroides TRN model showed that high precision and recall was also obtained for TFs involved in photosynthesis (PpsR), carbon metabolism (RSP_0489) and iron homeostasis (RSP_3341). In addition, this integrative approach enabled generation of TRNs with increased information content relative to R. sphaeroides TRN models built via other approaches. We also show how this approach can be used to simultaneously produce TRN models for each related organism used in the comparative genomics analysis. Our results highlight the advantages of integrating comparative genomics of closely related organisms with gene expression data to assemble large-scale TRN models with high-quality predictions.« less
Lee, Hyuk Je; Schneider, Ralf F; Manousaki, Tereza; Kang, Ji Hyoun; Lein, Etienne; Franchini, Paolo
2017-01-01
Abstract Lateralized behavior (“handedness”) is unusual, but consistently found across diverse animal lineages, including humans. It is thought to reflect brain anatomical and/or functional asymmetries, but its neuro-molecular mechanisms remain largely unknown. Lake Tanganyika scale-eating cichlid fish, Perissodus microlepis show pronounced asymmetry in their jaw morphology as well as handedness in feeding behavior—biting scales preferentially only from one or the other side of their victims. This makes them an ideal model in which to investigate potential laterality in neuroanatomy and transcription in the brain in relation to behavioral handedness. After determining behavioral handedness in P. microlepis (preferred attack side), we estimated the volume of the hemispheres of brain regions and captured their gene expression profiles. Our analyses revealed that the degree of behavioral handedness is mirrored at the level of neuroanatomical asymmetry, particularly in the tectum opticum. Transcriptome analyses showed that different brain regions (tectum opticum, telencephalon, hypothalamus, and cerebellum) display distinct expression patterns, potentially reflecting their developmental interrelationships. For numerous genes in each brain region, their extent of expression differences between hemispheres was found to be correlated with the degree of behavioral lateralization. Interestingly, the tectum opticum and telencephalon showed divergent biases on the direction of up- or down-regulation of the laterality candidate genes (e.g., grm2) in the hemispheres, highlighting the connection of handedness with gene expression profiles and the different roles of these brain regions. Hence, handedness in predation behavior may be caused by asymmetric size of brain hemispheres and also by lateralized gene expressions in the brain. PMID:29069363
Lee, Hyuk Je; Schneider, Ralf F; Manousaki, Tereza; Kang, Ji Hyoun; Lein, Etienne; Franchini, Paolo; Meyer, Axel
2017-11-01
Lateralized behavior ("handedness") is unusual, but consistently found across diverse animal lineages, including humans. It is thought to reflect brain anatomical and/or functional asymmetries, but its neuro-molecular mechanisms remain largely unknown. Lake Tanganyika scale-eating cichlid fish, Perissodus microlepis show pronounced asymmetry in their jaw morphology as well as handedness in feeding behavior-biting scales preferentially only from one or the other side of their victims. This makes them an ideal model in which to investigate potential laterality in neuroanatomy and transcription in the brain in relation to behavioral handedness. After determining behavioral handedness in P. microlepis (preferred attack side), we estimated the volume of the hemispheres of brain regions and captured their gene expression profiles. Our analyses revealed that the degree of behavioral handedness is mirrored at the level of neuroanatomical asymmetry, particularly in the tectum opticum. Transcriptome analyses showed that different brain regions (tectum opticum, telencephalon, hypothalamus, and cerebellum) display distinct expression patterns, potentially reflecting their developmental interrelationships. For numerous genes in each brain region, their extent of expression differences between hemispheres was found to be correlated with the degree of behavioral lateralization. Interestingly, the tectum opticum and telencephalon showed divergent biases on the direction of up- or down-regulation of the laterality candidate genes (e.g., grm2) in the hemispheres, highlighting the connection of handedness with gene expression profiles and the different roles of these brain regions. Hence, handedness in predation behavior may be caused by asymmetric size of brain hemispheres and also by lateralized gene expressions in the brain. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Han, Rongfei; Huang, Guanqun; Wang, Yejun; Xu, Yafei; Hu, Yueming; Jiang, Wenqi; Wang, Tianfu; Xiao, Tian; Zheng, Duo
2016-11-01
Gene expression in metazoans is delicately organized. As genetic information transmits from DNA to RNA and protein, expression noise is inevitably generated. Recent studies begin to unveil the mechanisms of gene expression noise control, but the changes of gene expression precision in pathologic conditions like cancers are unknown. Here we analyzed the transcriptomic data of human breast, liver, lung and colon cancers, and found that the expression noise of more than 74.9% genes was increased in cancer tissues as compared to adjacent normal tissues. This suggested that gene expression precision controlling collapsed during cancer development. A set of 269 genes with noise increased more than 2-fold were identified across different cancer types. These genes were involved in cell adhesion, catalytic and metabolic functions, implying the vulnerability of deregulation of these processes in cancers. We also observed a tendency of increased expression noise in patients with low p53 and immune activity in breast, liver and lung caners but not in colon cancers, which indicated the contributions of p53 signaling and host immune surveillance to gene expression noise in cancers. Moreover, more than 53.7% genes had increased noise in patients with late stage than early stage cancers, suggesting that gene expression precision was associated with cancer outcome. Together, these results provided genomic scale explorations of gene expression noise control in human cancers.
Large-Scale Analysis of Network Bistability for Human Cancers
Shiraishi, Tetsuya; Matsuyama, Shinako; Kitano, Hiroaki
2010-01-01
Protein–protein interaction and gene regulatory networks are likely to be locked in a state corresponding to a disease by the behavior of one or more bistable circuits exhibiting switch-like behavior. Sets of genes could be over-expressed or repressed when anomalies due to disease appear, and the circuits responsible for this over- or under-expression might persist for as long as the disease state continues. This paper shows how a large-scale analysis of network bistability for various human cancers can identify genes that can potentially serve as drug targets or diagnosis biomarkers. PMID:20628618
Seq-ing answers: uncovering the unexpected in global gene regulation.
Otto, George Maxwell; Brar, Gloria Ann
2018-04-19
The development of techniques for measuring gene expression globally has greatly expanded our understanding of gene regulatory mechanisms in depth and scale. We can now quantify every intermediate and transition in the canonical pathway of gene expression-from DNA to mRNA to protein-genome-wide. Employing such measurements in parallel can produce rich datasets, but extracting the most information requires careful experimental design and analysis. Here, we argue for the value of genome-wide studies that measure multiple outputs of gene expression over many timepoints during the course of a natural developmental process. We discuss our findings from a highly parallel gene expression dataset of meiotic differentiation, and those of others, to illustrate how leveraging these features can provide new and surprising insight into fundamental mechanisms of gene regulation.
Cheaib, Miriam; Dehghani Amirabad, Azim; Nordström, Karl J. V.; Schulz, Marcel H.; Simon, Martin
2015-01-01
Phenotypic variation of a single genotype is achieved by alterations in gene expression patterns. Regulation of such alterations depends on their time scale, where short-time adaptations differ from permanently established gene expression patterns maintained by epigenetic mechanisms. In the ciliate Paramecium, serotypes were described for an epigenetically controlled gene expression pattern of an individual multigene family. Paradoxically, individual serotypes can be triggered in Paramecium by alternating environments but are then stabilized by epigenetic mechanisms, thus raising the question to which extend their expression follows environmental stimuli. To characterize environmental adaptation in the context of epigenetically controlled serotype expression, we used RNA-seq to characterize transcriptomes of serotype pure cultures. The resulting vegetative transcriptome resource is first analysed for genes involved in the adaptive response to the altered environment. Secondly, we identified groups of genes that do not follow the adaptive response but show co-regulation with the epigenetically controlled serotype system, suggesting that their gene expression pattern becomes manifested by similar mechanisms. In our experimental set-up, serotype expression and the entire group of co-regulated genes were stable among environmental changes and only heat-shock genes altered expression of these gene groups. The data suggest that the maintenance of these gene expression patterns in a lineage represents epigenetically controlled robustness counteracting short-time adaptation processes. PMID:26231545
Klingenberg, Jennifer M; McFarland, Kevin L; Friedman, Aaron J; Boyce, Steven T; Aronow, Bruce J; Supp, Dorothy M
2010-02-01
Bioengineered skin substitutes can facilitate wound closure in severely burned patients, but deficiencies limit their outcomes compared with native skin autografts. To identify gene programs associated with their in vivo capabilities and limitations, we extended previous gene expression profile analyses to now compare engineered skin after in vivo grafting with both in vitro maturation and normal human skin. Cultured skin substitutes were grafted on full-thickness wounds in athymic mice, and biopsy samples for microarray analyses were collected at multiple in vitro and in vivo time points. Over 10,000 transcripts exhibited large-scale expression pattern differences during in vitro and in vivo maturation. Using hierarchical clustering, 11 different expression profile clusters were partitioned on the basis of differential sample type and temporal stage-specific activation or repression. Analyses show that the wound environment exerts a massive influence on gene expression in skin substitutes. For example, in vivo-healed skin substitutes gained the expression of many native skin-expressed genes, including those associated with epidermal barrier and multiple categories of cell-cell and cell-basement membrane adhesion. In contrast, immunological, trichogenic, and endothelial gene programs were largely lacking. These analyses suggest important areas for guiding further improvement of engineered skin for both increased homology with native skin and enhanced wound healing.
Lan, Hui; Carson, Rachel; Provart, Nicholas J; Bonner, Anthony J
2007-09-21
Arabidopsis thaliana is the model species of current plant genomic research with a genome size of 125 Mb and approximately 28,000 genes. The function of half of these genes is currently unknown. The purpose of this study is to infer gene function in Arabidopsis using machine-learning algorithms applied to large-scale gene expression data sets, with the goal of identifying genes that are potentially involved in plant response to abiotic stress. Using in house and publicly available data, we assembled a large set of gene expression measurements for A. thaliana. Using those genes of known function, we first evaluated and compared the ability of basic machine-learning algorithms to predict which genes respond to stress. Predictive accuracy was measured using ROC50 and precision curves derived through cross validation. To improve accuracy, we developed a method for combining these classifiers using a weighted-voting scheme. The combined classifier was then trained on genes of known function and applied to genes of unknown function, identifying genes that potentially respond to stress. Visual evidence corroborating the predictions was obtained using electronic Northern analysis. Three of the predicted genes were chosen for biological validation. Gene knockout experiments confirmed that all three are involved in a variety of stress responses. The biological analysis of one of these genes (At1g16850) is presented here, where it is shown to be necessary for the normal response to temperature and NaCl. Supervised learning methods applied to large-scale gene expression measurements can be used to predict gene function. However, the ability of basic learning methods to predict stress response varies widely and depends heavily on how much dimensionality reduction is used. Our method of combining classifiers can improve the accuracy of such predictions - in this case, predictions of genes involved in stress response in plants - and it effectively chooses the appropriate amount of dimensionality reduction automatically. The method provides a useful means of identifying genes in A. thaliana that potentially respond to stress, and we expect it would be useful in other organisms and for other gene functions.
2011-01-01
Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation
Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.
2016-01-01
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038
Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O
2015-08-25
Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
Groten, Karin; Pahari, Nabin T; Xu, Shuqing; Miloradovic van Doorn, Maja; Baldwin, Ian T
2015-01-01
Most land plants live in a symbiotic association with arbuscular mycorrhizal fungi (AMF) that belong to the phylum Glomeromycota. Although a number of plant genes involved in the plant-AMF interactions have been identified by analyzing mutants, the ability to rapidly manipulate gene expression to study the potential functions of new candidate genes remains unrealized. We analyzed changes in gene expression of wild tobacco roots (Nicotiana attenuata) after infection with mycorrhizal fungi (Rhizophagus irregularis) by serial analysis of gene expression (SuperSAGE) combined with next generation sequencing, and established a virus-induced gene-silencing protocol to study the function of candidate genes in the interaction. From 92,434 SuperSAGE Tag sequences, 32,808 (35%) matched with our in-house Nicotiana attenuata transcriptome database and 3,698 (4%) matched to Rhizophagus genes. In total, 11,194 Tags showed a significant change in expression (p<0.05, >2-fold change) after infection. When comparing the functions of highly up-regulated annotated Tags in this study with those of two previous large-scale gene expression studies, 18 gene functions were found to be up-regulated in all three studies mainly playing roles related to phytohormone metabolism, catabolism and defense. To validate the function of identified candidate genes, we used the technique of virus-induced gene silencing (VIGS) to silence the expression of three putative N. attenuata genes: germin-like protein, indole-3-acetic acid-amido synthetase GH3.9 and, as a proof-of-principle, calcium and calmodulin-dependent protein kinase (CCaMK). The silencing of the three plant genes in roots was successful, but only CCaMK silencing had a significant effect on the interaction with R. irregularis. Interestingly, when a highly activated inoculum was used for plant inoculation, the effect of CCaMK silencing on fungal colonization was masked, probably due to trans-complementation. This study demonstrates that large-scale gene expression studies across different species induce of a core set of genes of similar functions. However, additional factors seem to influence the overall pattern of gene expression, resulting in high variability among independent studies with different hosts. We conclude that VIGS is a powerful tool with which to investigate the function of genes involved in plant-AMF interactions but that inoculum strength can strongly influence the outcome of the interaction.
Targeted and genome-scale methylomics reveals gene body signatures in human cell lines
Ball, Madeleine Price; Li, Jin Billy; Gao, Yuan; Lee, Je-Hyuk; LeProust, Emily; Park, In-Hyun; Xie, Bin; Daley, George Q.; Church, George M.
2012-01-01
Cytosine methylation, an epigenetic modification of DNA, is a target of growing interest for developing high throughput profiling technologies. Here we introduce two new, complementary techniques for cytosine methylation profiling utilizing next generation sequencing technology: bisulfite padlock probes (BSPPs) and methyl sensitive cut counting (MSCC). In the first method, we designed a set of ~10,000 BSPPs distributed over the ENCODE pilot project regions to take advantage of existing expression and chromatin immunoprecipitation data. We observed a pattern of low promoter methylation coupled with high gene body methylation in highly expressed genes. Using the second method, MSCC, we gathered genome-scale data for 1.4 million HpaII sites and confirmed that gene body methylation in highly expressed genes is a consistent phenomenon over the entire genome. Our observations highlight the usefulness of techniques which are not inherently or intentionally biased in favor of only profiling particular subsets like CpG islands or promoter regions. PMID:19329998
Jiang, Zhenhong; He, Fei; Zhang, Ziding
2017-07-01
Through large-scale transcriptional data analyses, we highlighted the importance of plant metabolism in plant immunity and identified 26 metabolic pathways that were frequently influenced by the infection of 14 different pathogens. Reprogramming of plant metabolism is a common phenomenon in plant defense responses. Currently, a large number of transcriptional profiles of infected tissues in Arabidopsis (Arabidopsis thaliana) have been deposited in public databases, which provides a great opportunity to understand the expression patterns of metabolic pathways during plant defense responses at the systems level. Here, we performed a large-scale transcriptome analysis based on 135 previously published expression samples, including 14 different pathogens, to explore the expression pattern of Arabidopsis metabolic pathways. Overall, metabolic genes are significantly changed in expression during plant defense responses. Upregulated metabolic genes are enriched on defense responses, and downregulated genes are enriched on photosynthesis, fatty acid and lipid metabolic processes. Gene set enrichment analysis (GSEA) identifies 26 frequently differentially expressed metabolic pathways (FreDE_Paths) that are differentially expressed in more than 60% of infected samples. These pathways are involved in the generation of energy, fatty acid and lipid metabolism as well as secondary metabolite biosynthesis. Clustering analysis based on the expression levels of these 26 metabolic pathways clearly distinguishes infected and control samples, further suggesting the importance of these metabolic pathways in plant defense responses. By comparing with FreDE_Paths from abiotic stresses, we find that the expression patterns of 26 FreDE_Paths from biotic stresses are more consistent across different infected samples. By investigating the expression correlation between transcriptional factors (TFs) and FreDE_Paths, we identify several notable relationships. Collectively, the current study will deepen our understanding of plant metabolism in plant immunity and provide new insights into disease-resistant crop improvement.
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery
Feng, Chunlai; Araki, Michihiro; Kunimoto, Ryo; Tamon, Akiko; Makiguchi, Hiroki; Niijima, Satoshi; Tsujimoto, Gozoh; Okuno, Yasushi
2009-01-01
Background DNA microarray technology provides us with a first step toward the goal of uncovering gene functions on a genomic scale. In recent years, vast amounts of gene expression data have been collected, much of which are available in public databases, such as the Gene Expression Omnibus (GEO). To date, most researchers have been manually retrieving data from databases through web browsers using accession numbers (IDs) or keywords, but gene-expression patterns are not considered when retrieving such data. The Connectivity Map was recently introduced to compare gene expression data by introducing gene-expression signatures (represented by a set of genes with up- or down-regulated labels according to their biological states) and is available as a web tool for detecting similar gene-expression signatures from a limited data set (approximately 7,000 expression profiles representing 1,309 compounds). In order to support researchers to utilize the public gene expression data more effectively, we developed a web tool for finding similar gene expression data and generating its co-expression networks from a publicly available database. Results GEM-TREND, a web tool for searching gene expression data, allows users to search data from GEO using gene-expression signatures or gene expression ratio data as a query and retrieve gene expression data by comparing gene-expression pattern between the query and GEO gene expression data. The comparison methods are based on the nonparametric, rank-based pattern matching approach of Lamb et al. (Science 2006) with the additional calculation of statistical significance. The web tool was tested using gene expression ratio data randomly extracted from the GEO and with in-house microarray data, respectively. The results validated the ability of GEM-TREND to retrieve gene expression entries biologically related to a query from GEO. For further analysis, a network visualization interface is also provided, whereby genes and gene annotations are dynamically linked to external data repositories. Conclusion GEM-TREND was developed to retrieve gene expression data by comparing query gene-expression pattern with those of GEO gene expression data. It could be a very useful resource for finding similar gene expression profiles and constructing its gene co-expression networks from a publicly available database. GEM-TREND was designed to be user-friendly and is expected to support knowledge discovery. GEM-TREND is freely available at . PMID:19728865
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery.
Feng, Chunlai; Araki, Michihiro; Kunimoto, Ryo; Tamon, Akiko; Makiguchi, Hiroki; Niijima, Satoshi; Tsujimoto, Gozoh; Okuno, Yasushi
2009-09-03
DNA microarray technology provides us with a first step toward the goal of uncovering gene functions on a genomic scale. In recent years, vast amounts of gene expression data have been collected, much of which are available in public databases, such as the Gene Expression Omnibus (GEO). To date, most researchers have been manually retrieving data from databases through web browsers using accession numbers (IDs) or keywords, but gene-expression patterns are not considered when retrieving such data. The Connectivity Map was recently introduced to compare gene expression data by introducing gene-expression signatures (represented by a set of genes with up- or down-regulated labels according to their biological states) and is available as a web tool for detecting similar gene-expression signatures from a limited data set (approximately 7,000 expression profiles representing 1,309 compounds). In order to support researchers to utilize the public gene expression data more effectively, we developed a web tool for finding similar gene expression data and generating its co-expression networks from a publicly available database. GEM-TREND, a web tool for searching gene expression data, allows users to search data from GEO using gene-expression signatures or gene expression ratio data as a query and retrieve gene expression data by comparing gene-expression pattern between the query and GEO gene expression data. The comparison methods are based on the nonparametric, rank-based pattern matching approach of Lamb et al. (Science 2006) with the additional calculation of statistical significance. The web tool was tested using gene expression ratio data randomly extracted from the GEO and with in-house microarray data, respectively. The results validated the ability of GEM-TREND to retrieve gene expression entries biologically related to a query from GEO. For further analysis, a network visualization interface is also provided, whereby genes and gene annotations are dynamically linked to external data repositories. GEM-TREND was developed to retrieve gene expression data by comparing query gene-expression pattern with those of GEO gene expression data. It could be a very useful resource for finding similar gene expression profiles and constructing its gene co-expression networks from a publicly available database. GEM-TREND was designed to be user-friendly and is expected to support knowledge discovery. GEM-TREND is freely available at http://cgs.pharm.kyoto-u.ac.jp/services/network.
Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.
Schena, M; Shalon, D; Heller, R; Chai, A; Brown, P O; Davis, R W
1996-01-01
Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm2 DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery. Images Fig. 1 Fig. 2 Fig. 3 PMID:8855227
Zhou, Weichen; Ma, Yanyun; Zhang, Jun; Hu, Jingyi; Zhang, Menghan; Wang, Yi; Li, Yi; Wu, Lijun; Pan, Yida; Zhang, Yitong; Zhang, Xiaonan; Zhang, Xinxin; Zhang, Zhanqing; Zhang, Jiming; Li, Hai; Lu, Lungen; Jin, Li; Wang, Jiucun; Yuan, Zhenghong; Liu, Jie
2017-11-01
Liver biopsy is the gold standard to assess pathological features (eg inflammation grades) for hepatitis B virus-infected patients although it is invasive and traumatic; meanwhile, several gene profiles of chronic hepatitis B (CHB) have been separately described in relatively small hepatitis B virus (HBV)-infected samples. We aimed to analyse correlations among inflammation grades, gene expressions and clinical parameters (serum alanine amino transaminase, aspartate amino transaminase and HBV-DNA) in large-scale CHB samples and to predict inflammation grades by using clinical parameters and/or gene expressions. We analysed gene expressions with three clinical parameters in 122 CHB samples by an improved regression model. Principal component analysis and machine-learning methods including Random Forest, K-nearest neighbour and support vector machine were used for analysis and further diagnosis models. Six normal samples were conducted to validate the predictive model. Significant genes related to clinical parameters were found enriching in the immune system, interferon-stimulated, regulation of cytokine production, anti-apoptosis, and etc. A panel of these genes with clinical parameters can effectively predict binary classifications of inflammation grade (area under the ROC curve [AUC]: 0.88, 95% confidence interval [CI]: 0.77-0.93), validated by normal samples. A panel with only clinical parameters was also valuable (AUC: 0.78, 95% CI: 0.65-0.86), indicating that liquid biopsy method for detecting the pathology of CHB is possible. This is the first study to systematically elucidate the relationships among gene expressions, clinical parameters and pathological inflammation grades in CHB, and to build models predicting inflammation grades by gene expressions and/or clinical parameters as well. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
2009-01-01
Background Sequence identification of ESTs from non-model species offers distinct challenges particularly when these species have duplicated genomes and when they are phylogenetically distant from sequenced model organisms. For the common carp, an environmental model of aquacultural interest, large numbers of ESTs remained unidentified using BLAST sequence alignment. We have used the expression profiles from large-scale microarray experiments to suggest gene identities. Results Expression profiles from ~700 cDNA microarrays describing responses of 7 major tissues to multiple environmental stressors were used to define a co-expression landscape. This was based on the Pearsons correlation coefficient relating each gene with all other genes, from which a network description provided clusters of highly correlated genes as 'mountains'. We show that these contain genes with known identities and genes with unknown identities, and that the correlation constitutes evidence of identity in the latter. This procedure has suggested identities to 522 of 2701 unknown carp ESTs sequences. We also discriminate several common carp genes and gene isoforms that were not discriminated by BLAST sequence alignment alone. Precision in identification was substantially improved by use of data from multiple tissues and treatments. Conclusion The detailed analysis of co-expression landscapes is a sensitive technique for suggesting an identity for the large number of BLAST unidentified cDNAs generated in EST projects. It is capable of detecting even subtle changes in expression profiles, and thereby of distinguishing genes with a common BLAST identity into different identities. It benefits from the use of multiple treatments or contrasts, and from the large-scale microarray data. PMID:19939286
Dhungel, Bidur; Ohno, Yoshikazu; Matayoshi, Rie; Otaki, Joji M
2013-03-25
Candidate genes for color pattern formation in butterfly wings have been known based on gene expression patterns since the 1990s, but their functions remain elusive due to a lack of a functional assay. Several methods of transferring and expressing a foreign gene in butterfly wings have been reported, but they have suffered from low success rates or low expression levels. Here, we developed a simple, practical method to efficiently deliver and express a foreign gene using baculovirus-mediated gene transfer in butterfly wings in vivo. A recombinant baculovirus containing a gene for green fluorescent protein (GFP) was injected into pupae of the blue pansy butterfly Junonia orithya (Nymphalidae). GFP fluorescence was detected in the pupal wings and other body parts of the injected individuals three to five days post-injection at various degrees of fluorescence. We obtained a high GFP expression rate at relatively high virus titers, but it was associated with pupal death before color pattern formation in wings. To reduce the high mortality rate caused by the baculovirus treatment, we administered an anti-gp64 antibody, which was raised against baculovirus coat protein gp64, to infected pupae after the baculovirus injection. This treatment greatly reduced the mortality rate of the infected pupae. GFP fluorescence was observed in pupal and adult wings and other body parts of the antibody-treated individuals at various degrees of fluorescence. Importantly, we obtained completely developed wings with a normal color pattern, in which fluorescent signals originated directly from scales or the basal membrane after the removal of scales. GFP fluorescence in wing tissues spatially coincided with anti-GFP antibody staining, confirming that the fluorescent signals originated from the expressed GFP molecules. Our baculovirus-mediated gene transfer system with an anti-gp64 antibody is reasonably efficient, and it can be an invaluable tool to transfer, express, and functionally examine foreign genes in butterfly wings and also in other non-model insect systems.
Tummala, Seshu B; Junne, Stefan G; Paredes, Carlos J; Papoutsakis, Eleftherios T
2003-12-30
Antisense RNA (asRNA) downregulation alters protein expression without changing the regulation of gene expression. Downregulation of primary metabolic enzymes possibly combined with overexpression of other metabolic enzymes may result in profound changes in product formation, and this may alter the large-scale transcriptional program of the cells. DNA-array based large-scale transcriptional analysis has the potential to elucidate factors that control cellular fluxes even in the absence of proteome data. These themes are explored in the study of large-scale transcriptional analysis programs and the in vivo primary-metabolism fluxes of several related recombinant C. acetobutylicum strains: C. acetobutylicum ATCC 824(pSOS95del) (plasmid control; produces high levels of butanol snd acetone), 824(pCTFB1AS) (expresses antisense RNA against CoA transferase (ctfb1-asRNA); produces very low levels of butanol and acetone), and 824(pAADB1) (expresses ctfb1-asRNA and the alcohol-aldehyde dahydrogenase gene (aad); produce high alcohol and low acetone levels). DNA-array based transcriptional analysis revealed that the large changes in product concentrations (snd notably butanol concentration) due to ctfb1-asRNA expression alone and in combination with aad overexpression resulted in dramatic changes of the cellular transcriptome. Cluster analysis and gene expression patterns of established and putative operons involved in stress response, motility, sporulation, and fatty-acid biosynthesis indicate that these simple genetic changes dramatically alter the cellular programs of C. acetobutylicum. Comparison of gene expression and flux analysis data may point to possible flux-controling steps and suggest unknown regulatory mechanisms. Copyright 2003; Wiley Periodicals, Inc.
Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates
Matsui, Toshiaki; Yamamoto, Toshiyuki; Wyder, Stefan; Zdobnov, Evgeny M; Kadowaki, Tatsuhiko
2009-01-01
Background Large-scale comparison of metazoan genomes has revealed that a significant fraction of genes of the last common ancestor of Bilateria (Urbilateria) is lost in each animal lineage. This event could be one of the underlying mechanisms involved in generating metazoan diversity. However, the present functions of these ancient genes have not been addressed extensively. To understand the functions and evolutionary mechanisms of such ancient Urbilaterian genes, we carried out comprehensive expression profile analysis of genes shared between vertebrates and honey bees but not with the other sequenced ecdysozoan genomes (honey bee-vertebrate specific, HVS genes) as a model. Results We identified 30 honey bee and 55 mouse HVS genes. Many HVS genes exhibited tissue-selective expression patterns; intriguingly, the expression of 60% of honey bee HVS genes was found to be brain enriched, and 24% of mouse HVS genes were highly expressed in either or both the brain and testis. Moreover, a minimum of 38% of mouse HVS genes demonstrated neuron-enriched expression patterns, and 62% of them exhibited expression in selective brain areas, particularly the forebrain and cerebellum. Furthermore, gene ontology (GO) analysis of HVS genes predicted that 35% of genes are associated with DNA transcription and RNA processing. Conclusion These results suggest that HVS genes include genes that are biased towards expression in the brain and gonads. They also demonstrate that at least some of Urbilaterian genes retained in the specific animal lineage may be selectively maintained to support the species-specific phenotypes. PMID:19138430
Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates.
Matsui, Toshiaki; Yamamoto, Toshiyuki; Wyder, Stefan; Zdobnov, Evgeny M; Kadowaki, Tatsuhiko
2009-01-12
Large-scale comparison of metazoan genomes has revealed that a significant fraction of genes of the last common ancestor of Bilateria (Urbilateria) is lost in each animal lineage. This event could be one of the underlying mechanisms involved in generating metazoan diversity. However, the present functions of these ancient genes have not been addressed extensively. To understand the functions and evolutionary mechanisms of such ancient Urbilaterian genes, we carried out comprehensive expression profile analysis of genes shared between vertebrates and honey bees but not with the other sequenced ecdysozoan genomes (honey bee-vertebrate specific, HVS genes) as a model. We identified 30 honey bee and 55 mouse HVS genes. Many HVS genes exhibited tissue-selective expression patterns; intriguingly, the expression of 60% of honey bee HVS genes was found to be brain enriched, and 24% of mouse HVS genes were highly expressed in either or both the brain and testis. Moreover, a minimum of 38% of mouse HVS genes demonstrated neuron-enriched expression patterns, and 62% of them exhibited expression in selective brain areas, particularly the forebrain and cerebellum. Furthermore, gene ontology (GO) analysis of HVS genes predicted that 35% of genes are associated with DNA transcription and RNA processing. These results suggest that HVS genes include genes that are biased towards expression in the brain and gonads. They also demonstrate that at least some of Urbilaterian genes retained in the specific animal lineage may be selectively maintained to support the species-specific phenotypes.
Cheaib, Miriam; Dehghani Amirabad, Azim; Nordström, Karl J V; Schulz, Marcel H; Simon, Martin
2015-08-01
Phenotypic variation of a single genotype is achieved by alterations in gene expression patterns. Regulation of such alterations depends on their time scale, where short-time adaptations differ from permanently established gene expression patterns maintained by epigenetic mechanisms. In the ciliate Paramecium, serotypes were described for an epigenetically controlled gene expression pattern of an individual multigene family. Paradoxically, individual serotypes can be triggered in Paramecium by alternating environments but are then stabilized by epigenetic mechanisms, thus raising the question to which extend their expression follows environmental stimuli. To characterize environmental adaptation in the context of epigenetically controlled serotype expression, we used RNA-seq to characterize transcriptomes of serotype pure cultures. The resulting vegetative transcriptome resource is first analysed for genes involved in the adaptive response to the altered environment. Secondly, we identified groups of genes that do not follow the adaptive response but show co-regulation with the epigenetically controlled serotype system, suggesting that their gene expression pattern becomes manifested by similar mechanisms. In our experimental set-up, serotype expression and the entire group of co-regulated genes were stable among environmental changes and only heat-shock genes altered expression of these gene groups. The data suggest that the maintenance of these gene expression patterns in a lineage represents epigenetically controlled robustness counteracting short-time adaptation processes. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
DR-Integrator: a new analytic tool for integrating DNA copy number and gene expression data.
Salari, Keyan; Tibshirani, Robert; Pollack, Jonathan R
2010-02-01
DNA copy number alterations (CNA) frequently underlie gene expression changes by increasing or decreasing gene dosage. However, only a subset of genes with altered dosage exhibit concordant changes in gene expression. This subset is likely to be enriched for oncogenes and tumor suppressor genes, and can be identified by integrating these two layers of genome-scale data. We introduce DNA/RNA-Integrator (DR-Integrator), a statistical software tool to perform integrative analyses on paired DNA copy number and gene expression data. DR-Integrator identifies genes with significant correlations between DNA copy number and gene expression, and implements a supervised analysis that captures genes with significant alterations in both DNA copy number and gene expression between two sample classes. DR-Integrator is freely available for non-commercial use from the Pollack Lab at http://pollacklab.stanford.edu/ and can be downloaded as a plug-in application to Microsoft Excel and as a package for the R statistical computing environment. The R package is available under the name 'DRI' at http://cran.r-project.org/. An example analysis using DR-Integrator is included as supplemental material. Supplementary data are available at Bioinformatics online.
Tran, Frances; Penniket, Carolyn; Patel, Rohan V; Provart, Nicholas J; Laroche, André; Rowland, Owen; Robert, Laurian S
2013-06-01
Despite their importance, there remains a paucity of large-scale gene expression-based studies of reproductive development in species belonging to the Triticeae. As a first step to address this deficiency, a gene expression atlas of triticale reproductive development was generated using the 55K Affymetrix GeneChip(®) wheat genome array. The global transcriptional profiles of the anther/pollen, ovary and stigma were analyzed at concurrent developmental stages, and co-expressed as well as preferentially expressed genes were identified. Data analysis revealed both novel and conserved regulatory factors underlying Triticeae floral development and function. This comprehensive resource rests upon detailed gene annotations, and the expression profiles are readily accessible via a web browser. © 2013 Her Majesty the Queen in Right of Canada as represented by the Minister of Agriculture and Agri-Food Canada.
2013-01-01
Background METH is an illicit drug of abuse that influences gene expression in the rat striatum. Histone modifications regulate gene transcription. Methods We therefore used microarray analysis and genome-scale approaches to examine potential relationships between the effects of METH on gene expression and on DNA binding of histone H4 acetylated at lysine 4 (H4K5Ac) in the rat dorsal striatum of METH-naïve and METH-pretreated rats. Results Acute and chronic METH administration caused differential changes in striatal gene expression. METH also increased H4K5Ac binding around the transcriptional start sites (TSSs) of genes in the rat striatum. In order to relate gene expression to histone acetylation, we binned genes of similar expression into groups of 100 genes and proceeded to relate gene expression to H4K5Ac binding. We found a positive correlation between gene expression and H4K5Ac binding in the striatum of control rats. Similar correlations were observed in METH-treated rats. Genes that showed acute METH-induced increased expression in saline-pretreated rats also showed METH-induced increased H4K5Ac binding. The acute METH injection caused similar increases in H4K5Ac binding in METH-pretreated rats, without affecting gene expression to the same degree. Finally, genes that showed METH-induced decreased expression exhibited either decreases or no changes in H4K5Ac binding. Conclusion Acute METH injections caused increased gene expression of genes that showed increased H4K5Ac binding near their transcription start sites. PMID:23937714
Cadet, Jean Lud; Jayanthi, Subramaniam; McCoy, Michael T; Ladenheim, Bruce; Saint-Preux, Fabienne; Lehrmann, Elin; De, Supriyo; Becker, Kevin G; Brannock, Christie
2013-08-12
METH is an illicit drug of abuse that influences gene expression in the rat striatum. Histone modifications regulate gene transcription. We therefore used microarray analysis and genome-scale approaches to examine potential relationships between the effects of METH on gene expression and on DNA binding of histone H4 acetylated at lysine 4 (H4K5Ac) in the rat dorsal striatum of METH-naïve and METH-pretreated rats. Acute and chronic METH administration caused differential changes in striatal gene expression. METH also increased H4K5Ac binding around the transcriptional start sites (TSSs) of genes in the rat striatum. In order to relate gene expression to histone acetylation, we binned genes of similar expression into groups of 100 genes and proceeded to relate gene expression to H4K5Ac binding. We found a positive correlation between gene expression and H4K5Ac binding in the striatum of control rats. Similar correlations were observed in METH-treated rats. Genes that showed acute METH-induced increased expression in saline-pretreated rats also showed METH-induced increased H4K5Ac binding. The acute METH injection caused similar increases in H4K5Ac binding in METH-pretreated rats, without affecting gene expression to the same degree. Finally, genes that showed METH-induced decreased expression exhibited either decreases or no changes in H4K5Ac binding. Acute METH injections caused increased gene expression of genes that showed increased H4K5Ac binding near their transcription start sites.
Statistical mechanics of scale-free gene expression networks
NASA Astrophysics Data System (ADS)
Gross, Eitan
2012-12-01
The gene co-expression networks of many organisms including bacteria, mice and man exhibit scale-free distribution. This heterogeneous distribution of connections decreases the vulnerability of the network to random attacks and thus may confer the genetic replication machinery an intrinsic resilience to such attacks, triggered by changing environmental conditions that the organism may be subject to during evolution. This resilience to random attacks comes at an energetic cost, however, reflected by the lower entropy of the scale-free distribution compared to the more homogenous, random network. In this study we found that the cell cycle-regulated gene expression pattern of the yeast Saccharomyces cerevisiae obeys a power-law distribution with an exponent α = 2.1 and an entropy of 1.58. The latter is very close to the maximal value of 1.65 obtained from linear optimization of the entropy function under the constraint of a constant cost function, determined by the average degree connectivity
Lehnert, Sigrid A; Reverter, Antonio; Byrne, Keren A; Wang, Yonghong; Nattrass, Greg S; Hudson, Nicholas J; Greenwood, Paul L
2007-01-01
Background The muscle fiber number and fiber composition of muscle is largely determined during prenatal development. In order to discover genes that are involved in determining adult muscle phenotypes, we studied the gene expression profile of developing fetal bovine longissimus muscle from animals with two different genetic backgrounds using a bovine cDNA microarray. Fetal longissimus muscle was sampled at 4 stages of myogenesis and muscle maturation: primary myogenesis (d 60), secondary myogenesis (d 135), as well as beginning (d 195) and final stages (birth) of functional differentiation of muscle fibers. All fetuses and newborns (total n = 24) were from Hereford dams and crossed with either Wagyu (high intramuscular fat) or Piedmontese (GDF8 mutant) sires, genotypes that vary markedly in muscle and compositional characteristics later in postnatal life. Results We obtained expression profiles of three individuals for each time point and genotype to allow comparisons across time and between sire breeds. Quantitative reverse transcription-PCR analysis of RNA from developing longissimus muscle was able to validate the differential expression patterns observed for a selection of differentially expressed genes, with one exception. We detected large-scale changes in temporal gene expression between the four developmental stages in genes coding for extracellular matrix and for muscle fiber structural and metabolic proteins. FSTL1 and IGFBP5 were two genes implicated in growth and differentiation that showed developmentally regulated expression levels in fetal muscle. An abundantly expressed gene with no functional annotation was found to be developmentally regulated in the same manner as muscle structural proteins. We also observed differences in gene expression profiles between the two different sire breeds. Wagyu-sired calves showed higher expression of fatty acid binding protein 5 (FABP5) RNA at birth. The developing longissimus muscle of fetuses carrying the Piedmontese mutation shows an emphasis on glycolytic muscle biochemistry and a large-scale up-regulation of the translational machinery at birth. We also document evidence for timing differences in differentiation events between the two breeds. Conclusion Taken together, these findings provide a detailed description of molecular events accompanying skeletal muscle differentiation in the bovine, as well as gene expression differences that may underpin the phenotype differences between the two breeds. In addition, this study has highlighted a non-coding RNA, which is abundantly expressed and developmentally regulated in bovine fetal muscle. PMID:17697390
Distal-less regulates eyespot patterns and melanization in Bicyclus butterflies.
Monteiro, Antónia; Chen, Bin; Ramos, Diane M; Oliver, Jeffrey C; Tong, Xiaoling; Guo, Min; Wang, Wen-Kai; Fazzino, Lisa; Kamal, Firdous
2013-07-01
Butterfly eyespots represent novel complex traits that display substantial diversity in number and size within and across species. Correlative gene expression studies have implicated a large suite of transcription factors, including Distal-less (Dll), Engrailed (En), and Spalt (Sal), in eyespot development in butterflies, but direct evidence testing the function of any of these proteins is still missing. Here we show that the characteristic two-eyespot pattern of wildtype Bicyclus anynana forewings is correlated with dynamic progression of Dll, En, and Sal expression in larval wings from four spots to two spots, whereas no such decline in gene expression ensues in a four-eyespot mutant. We then conduct transgenic experiments testing whether over-expression of any of these genes in a wild-type genetic background is sufficient to induce eyespot differentiation in these pre-patterned wing compartments. We also produce a Dll-RNAi transgenic line to test how Dll down-regulation affects eyespot development. Finally we test how ectopic expression of these genes during the pupal stages of development alters adults color patters. We show that over-expressing Dll in larvae is sufficient to induce the differentiation of additional eyespots and increase the size of eyespots, whereas down-regulating Dll leads to a decrease in eyespot size. Furthermore, ectopic expression of Dll in the early pupal wing led to the appearance of ectopic patches of black scales. We conclude that Dll is a positive regulator of focal differentiation and eyespot signaling and that this gene is also a possible selector gene for scale melanization in butterflies. Copyright © 2013 Wiley Periodicals, Inc.
“Real time” genetic manipulation: a new tool for ecological field studies
Schäfer, Martin; Brütting, Christoph; Gase, Klaus; Reichelt, Michael; Baldwin, Ian; Meldau, Stefan
2014-01-01
Summary Field experiments with transgenic plants often reveal the functional significance of genetic traits important for plant performance in their natural environments. Until now, only constitutive overexpression, ectopic expression and gene silencing methods have been used to analyze gene-related phenotypes in natural habitats. These methods do not allow sufficient control over gene expression to study ecological interactions in real-time, genetic traits playing essential roles in development, or dose-dependent effects. We applied the sensitive dexamethasone (DEX)-inducible pOp6/LhGR expression system to the ecological model plant Nicotiana attenuata and established a lanolin-based DEX application method to facilitate ectopic gene expression and RNAi mediated gene silencing in the field and under challenging conditions (e.g. high temperature, wind and UV radiation). Fully established field-grown plants were used to silence phytoene desaturase and thereby cause photobleaching only in specific plant sectors, and to activate expression of the cytokinin (CK) biosynthesis gene isopentenyl transferase (ipt). We used ipt expression to analyze the role of CK’s in both the glasshouse and field to understand resistance to the native herbivore Tupiocoris notatus, which attack plants at small spatial scales. By spatially restricting ipt expression and elevating CK levels in single leaves, T. notatus damage increased, demonstrating CK’s role in this plant-herbivore interaction at a small scale. As the arena of most ecological interactions is highly constrained in time and space, these tools will advance the genetic analysis of dynamic traits that matter for plant performance in nature. PMID:23906159
Molecular Structure-Based Large-Scale Prediction of Chemical-Induced Gene Expression Changes.
Liu, Ruifeng; AbdulHameed, Mohamed Diwan M; Wallqvist, Anders
2017-09-25
The quantitative structure-activity relationship (QSAR) approach has been used to model a wide range of chemical-induced biological responses. However, it had not been utilized to model chemical-induced genomewide gene expression changes until very recently, owing to the complexity of training and evaluating a very large number of models. To address this issue, we examined the performance of a variable nearest neighbor (v-NN) method that uses information on near neighbors conforming to the principle that similar structures have similar activities. Using a data set of gene expression signatures of 13 150 compounds derived from cell-based measurements in the NIH Library of Integrated Network-based Cellular Signatures program, we were able to make predictions for 62% of the compounds in a 10-fold cross validation test, with a correlation coefficient of 0.61 between the predicted and experimentally derived signatures-a reproducibility rivaling that of high-throughput gene expression measurements. To evaluate the utility of the predicted gene expression signatures, we compared the predicted and experimentally derived signatures in their ability to identify drugs known to cause specific liver, kidney, and heart injuries. Overall, the predicted and experimentally derived signatures had similar receiver operating characteristics, whose areas under the curve ranged from 0.71 to 0.77 and 0.70 to 0.73, respectively, across the three organ injury models. However, detailed analyses of enrichment curves indicate that signatures predicted from multiple near neighbors outperformed those derived from experiments, suggesting that averaging information from near neighbors may help improve the signal from gene expression measurements. Our results demonstrate that the v-NN method can serve as a practical approach for modeling large-scale, genomewide, chemical-induced, gene expression changes.
NASA Technical Reports Server (NTRS)
Wenck, A. R.; Quinn, M.; Whetten, R. W.; Pullman, G.; Sederoff, R.; Brown, C. S. (Principal Investigator)
1999-01-01
Agrobacterium-mediated gene transfer is the method of choice for many plant biotechnology laboratories; however, large-scale use of this organism in conifer transformation has been limited by difficult propagation of explant material, selection efficiencies and low transformation frequency. We have analyzed co-cultivation conditions and different disarmed strains of Agrobacterium to improve transformation. Additional copies of virulence genes were added to three common disarmed strains. These extra virulence genes included either a constitutively active virG or extra copies of virG and virB, both from pTiBo542. In experiments with Norway spruce, we increased transformation efficiencies 1000-fold from initial experiments where little or no transient expression was detected. Over 100 transformed lines expressing the marker gene beta-glucuronidase (GUS) were generated from rapidly dividing embryogenic suspension-cultured cells co-cultivated with Agrobacterium. GUS activity was used to monitor transient expression and to further test lines selected on kanamycin-containing medium. In loblolly pine, transient expression increased 10-fold utilizing modified Agrobacterium strains. Agrobacterium-mediated gene transfer is a useful technique for large-scale generation of transgenic Norway spruce and may prove useful for other conifer species.
Hacker, David L; Bertschinger, Martin; Baldi, Lucia; Wurm, Florian M
2004-10-27
Human embryonic kidney 293 (HEK293) cells, a widely used host for large-scale transient expression of recombinant proteins, are transformed with the adenovirus E1A and E1B genes. Because the E1A proteins function as transcriptional activators or repressors, they may have a positive or negative effect on transient transgene expression in this cell line. Suspension cultures of HEK293 EBNA (HEK293E) cells were co-transfected with a reporter plasmid expressing the GFP gene and a plasmid expressing a short hairpin RNA (shRNA) targeting the E1A mRNAs for degradation by RNA interference (RNAi). The presence of the shRNA in HEK293E cells reduced the steady state level of E1A mRNA up to 75% and increased transient GFP expression from either the elongation factor-1alpha (EF-1alpha) promoter or the human cytomegalovirus (HCMV) immediate early promoter up to twofold. E1A mRNA depletion also resulted in a twofold increase in transient expression of a recombinant IgG in both small- and large-scale suspension cultures when the IgG light and heavy chain genes were controlled by the EF-1alpha promoter. Finally, transient IgG expression was enhanced 2.5-fold when the anti-E1A shRNA was expressed from the same vector as the IgG light chain gene. These results demonstrated that E1A has a negative effect on transient gene expression in HEK293E cells, and they established that RNAi can be used to enhance recombinant protein expression in mammalian cells.
Strakova, Eva; Zikova, Alice; Vohradsky, Jiri
2014-01-01
A computational model of gene expression was applied to a novel test set of microarray time series measurements to reveal regulatory interactions between transcriptional regulators represented by 45 sigma factors and the genes expressed during germination of a prokaryote Streptomyces coelicolor. Using microarrays, the first 5.5 h of the process was recorded in 13 time points, which provided a database of gene expression time series on genome-wide scale. The computational modeling of the kinetic relations between the sigma factors, individual genes and genes clustered according to the similarity of their expression kinetics identified kinetically plausible sigma factor-controlled networks. Using genome sequence annotations, functional groups of genes that were predominantly controlled by specific sigma factors were identified. Using external binding data complementing the modeling approach, specific genes involved in the control of the studied process were identified and their function suggested.
Large-scale analysis of gene expression using cDNA microarrays promises the
rapid detection of the mode of toxicity for drugs and other chemicals. cDNA
microarrays were used to examine chemically-induced alterations of gene
expression in HepG2 cells exposed to oxidative ...
Gertz, Jason; Reddy, Timothy E.; Varley, Katherine E.; Garabedian, Michael J.; Myers, Richard M.
2012-01-01
Endogenous estrogens that are synthesized in the body impact gene regulation by activating estrogen receptors in diverse cell types. Exogenous compounds that have estrogenic properties can also be found circulating in the blood in both children and adults. The genome-wide impact of these environmental estrogens on gene regulation is unclear. To obtain an integrated view of gene regulation in response to environmental and endogenous estrogens on a genome-wide scale, we performed ChIP-seq to identify estrogen receptor 1 (ESR1; previously estrogen receptor α) binding sites, and RNA-seq in endometrial cancer cells exposed to bisphenol A (BPA; found in plastics), genistein (GEN; found in soybean), or 17β-estradiol (E2; an endogenous estrogen). GEN and BPA treatment induces thousands of ESR1 binding sites and >50 gene expression changes, representing a subset of E2-induced gene regulation changes. Genes affected by E2 were highly enriched for ribosome-associated proteins; however, GEN and BPA failed to regulate most ribosome-associated proteins and instead enriched for transporters of carboxylic acids. Treatment-dependent changes in gene expression were associated with treatment-dependent ESR1 binding sites, with the exception that many genes up-regulated by E2 harbored a BPA-induced ESR1 binding site but failed to show any expression change after BPA treatment. GEN and BPA exhibited a similar relationship to E2 in the breast cancer line T-47D, where cell type specificity played a much larger role than treatment specificity. Overall, both environmental estrogens clearly regulate gene expression through ESR1 on a genome-wide scale, although with lower potency resulting in less ESR1 binding sites and less gene expression changes compared to the endogenous estrogen, E2. PMID:23019147
Sémon, Marie; Mouchiroud, Dominique; Duret, Laurent
2005-02-01
Mammalian chromosomes are characterized by large-scale variations of DNA base composition (the so-called isochores). In contradiction with previous studies, Lercher et al. (Hum. Mol. Genet., 12, 2411, 2003) recently reported a strong correlation between gene expression breadth and GC-content, suggesting that there might be a selective pressure favoring the concentration of housekeeping genes in GC-rich isochores. We reassessed this issue by examining in human and mouse the correlation between gene expression and GC-content, using different measures of gene expression (EST, SAGE and microarray) and different measures of GC-content. We show that correlations between GC-content and expression are very weak, and may vary according to the method used to measure expression. Such weak correlations have a very low predictive value. The strong correlations reported by Lercher et al. (2003) are because of the fact that they measured variables over neighboring genes windows. We show here that using gene windows artificially enhances the correlation. The assertion that the expression of a given gene depends on the GC-content of the region where it is located is therefore not supported by the data.
Gene expression inference with deep learning.
Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui
2016-06-15
Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. D-GEX is available at https://github.com/uci-cbcl/D-GEX CONTACT: xhx@ics.uci.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Gene expression inference with deep learning
Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui
2016-01-01
Motivation: Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. Results: We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. Availability and implementation: D-GEX is available at https://github.com/uci-cbcl/D-GEX. Contact: xhx@ics.uci.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26873929
Expression Atlas: gene and protein expression across multiple studies and organisms
Tang, Y Amy; Bazant, Wojciech; Burke, Melissa; Fuentes, Alfonso Muñoz-Pomer; George, Nancy; Koskinen, Satu; Mohammed, Suhaib; Geniza, Matthew; Preece, Justin; Jarnuczak, Andrew F; Huber, Wolfgang; Stegle, Oliver; Brazma, Alvis; Petryszak, Robert
2018-01-01
Abstract Expression Atlas (http://www.ebi.ac.uk/gxa) is an added value database that provides information about gene and protein expression in different species and contexts, such as tissue, developmental stage, disease or cell type. The available public and controlled access data sets from different sources are curated and re-analysed using standardized, open source pipelines and made available for queries, download and visualization. As of August 2017, Expression Atlas holds data from 3,126 studies across 33 different species, including 731 from plants. Data from large-scale RNA sequencing studies including Blueprint, PCAWG, ENCODE, GTEx and HipSci can be visualized next to each other. In Expression Atlas, users can query genes or gene-sets of interest and explore their expression across or within species, tissues, developmental stages in a constitutive or differential context, representing the effects of diseases, conditions or experimental interventions. All processed data matrices are available for direct download in tab-delimited format or as R-data. In addition to the web interface, data sets can now be searched and downloaded through the Expression Atlas R package. Novel features and visualizations include the on-the-fly analysis of gene set overlaps and the option to view gene co-expression in experiments investigating constitutive gene expression across tissues or other conditions. PMID:29165655
Tuller, Tamir; Atar, Shimshi; Ruppin, Eytan; Gurevich, Michael; Achiron, Anat
2011-09-15
Multiple sclerosis (MS) is a central nervous system autoimmune inflammatory T-cell-mediated disease with a relapsing-remitting course in the majority of patients. In this study, we performed a high-resolution systems biology analysis of gene expression and physical interactions in MS relapse and remission. To this end, we integrated 164 large-scale measurements of gene expression in peripheral blood mononuclear cells of MS patients in relapse or remission and healthy subjects, with large-scale information about the physical interactions between these genes obtained from public databases. These data were analyzed with a variety of computational methods. We find that there is a clear and significant global network-level signal that is related to the changes in gene expression of MS patients in comparison to healthy subjects. However, despite the clear differences in the clinical symptoms of MS patients in relapse versus remission, the network level signal is weaker when comparing patients in these two stages of the disease. This result suggests that most of the genes have relatively similar expression levels in the two stages of the disease. In accordance with previous studies, we found that the pathways related to regulation of cell death, chemotaxis and inflammatory response are differentially expressed in the disease in comparison to healthy subjects, while pathways related to cell adhesion, cell migration and cell-cell signaling are activated in relapse in comparison to remission. However, the current study includes a detailed report of the exact set of genes involved in these pathways and the interactions between them. For example, we found that the genes TP53 and IL1 are 'network-hub' that interacts with many of the differentially expressed genes in MS patients versus healthy subjects, and the epidermal growth factor receptor is a 'network-hub' in the case of MS patients with relapse versus remission. The statistical approaches employed in this study enabled us to report new sets of genes that according to their gene expression and physical interactions are predicted to be differentially expressed in MS versus healthy subjects, and in MS patients in relapse versus remission. Some of these genes may be useful biomarkers for diagnosing MS and predicting relapses in MS patients.
Discrete domains of gene expression in germinal layers distinguish the development of gyrencephaly
de Juan Romero, Camino; Bruder, Carl; Tomasello, Ugo; Sanz-Anquela, José Miguel; Borrell, Víctor
2015-01-01
Gyrencephalic species develop folds in the cerebral cortex in a stereotypic manner, but the genetic mechanisms underlying this patterning process are unknown. We present a large-scale transcriptomic analysis of individual germinal layers in the developing cortex of the gyrencephalic ferret, comparing between regions prospective of fold and fissure. We find unique transcriptional signatures in each germinal compartment, where thousands of genes are differentially expressed between regions, including ∼80% of genes mutated in human cortical malformations. These regional differences emerge from the existence of discrete domains of gene expression, which occur at multiple locations across the developing cortex of ferret and human, but not the lissencephalic mouse. Complex expression patterns emerge late during development and map the eventual location of folds or fissures. Protomaps of gene expression within germinal layers may contribute to define cortical folds or functional areas, but our findings demonstrate that they distinguish the development of gyrencephalic cortices. PMID:25916825
Sex-Specific Selection and Sex-Biased Gene Expression in Humans and Flies.
Cheng, Changde; Kirkpatrick, Mark
2016-09-01
Sexual dimorphism results from sex-biased gene expression, which evolves when selection acts differently on males and females. While there is an intimate connection between sex-biased gene expression and sex-specific selection, few empirical studies have studied this relationship directly. Here we compare the two on a genome-wide scale in humans and flies. We find a distinctive "Twin Peaks" pattern in humans that relates the strength of sex-specific selection, quantified by genetic divergence between male and female adults at autosomal loci, to the degree of sex-biased expression. Genes with intermediate degrees of sex-biased expression show evidence of ongoing sex-specific selection, while genes with either little or completely sex-biased expression do not. This pattern apparently results from differential viability selection in males and females acting in the current generation. The Twin Peaks pattern is also found in Drosophila using a different measure of sex-specific selection acting on fertility. We develop a simple model that successfully recapitulates the Twin Peaks. Our results suggest that many genes with intermediate sex-biased expression experience ongoing sex-specific selection in humans and flies.
Protein disorder is positively correlated with gene expression in E. coli
Paliy, Oleg; Gargac, Shawn M.; Cheng, Yugong; Uversky, Vladimir N.; Dunker, A. Keith
2009-01-01
We considered on a global scale the relationship between the predicted fraction of protein disorder and RNA and protein expression in E. coli. Fraction of protein disorder correlated positively with both measured RNA expression levels of E. coli genes in three different growth media and with predicted abundance levels of E. coli proteins. Though weak, the correlation was highly significant. Correlation of protein disorder with RNA expression did not depend on the growth rate of E. coli cultures and was not caused by a small subset of genes showing exceptionally high concordance in their disorder and expression levels. Global analysis was complemented by detailed consideration of several groups of proteins. PMID:18465893
Automated Protocol for Large-Scale Modeling of Gene Expression Data.
Hall, Michelle Lynn; Calkins, David; Sherman, Woody
2016-11-28
With the continued rise of phenotypic- and genotypic-based screening projects, computational methods to analyze, process, and ultimately make predictions in this field take on growing importance. Here we show how automated machine learning workflows can produce models that are predictive of differential gene expression as a function of a compound structure using data from A673 cells as a proof of principle. In particular, we present predictive models with an average accuracy of greater than 70% across a highly diverse ∼1000 gene expression profile. In contrast to the usual in silico design paradigm, where one interrogates a particular target-based response, this work opens the opportunity for virtual screening and lead optimization for desired multitarget gene expression profiles.
Mosquera Orgueira, Adrián
2015-01-01
DNA methylation is a frequent epigenetic mechanism that participates in transcriptional repression. Variations in DNA methylation with respect to gene expression are constant, and, for unknown reasons, some genes with highly methylated promoters are sometimes overexpressed. In this study we have analyzed the expression and methylation patterns of thousands of genes in five groups of cancer and normal tissue samples in order to determine local and genome-wide differences. We observed significant changes in global methylation-expression correlation in all the neoplasms, which suggests that differential correlation events are frequent in cancer. A focused analysis in the breast cancer cohort identified 1662 genes whose correlation varies significantly between normal and cancerous breast, but whose DNA methylation and gene expression patterns do not change substantially. These genes were enriched in cancer-related pathways and repressive chromatin features across various model cell lines, such as PRC2 binding and H3K27me3 marks. Substantial changes in methylation-expression correlation indicate that these genes are subject to epigenetic remodeling, where the differential activity of other factors break the expected relationship between both variables. Our findings suggest a complex regulatory landscape where a redistribution of local and large-scale chromatin repressive domains at differentially correlated genes (DCGs) creates epigenetic hotspots that modulate cancer-specific gene expression.
Kudo, Toru; Sasaki, Yohei; Terashima, Shin; Matsuda-Imai, Noriko; Takano, Tomoyuki; Saito, Misa; Kanno, Maasa; Ozaki, Soichi; Suwabe, Keita; Suzuki, Go; Watanabe, Masao; Matsuoka, Makoto; Takayama, Seiji; Yano, Kentaro
2016-10-13
In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
Aberrant Gene Expression in Humans
Yang, Ence; Ji, Guoli; Brinkmeyer-Langford, Candice L.; Cai, James J.
2015-01-01
Gene expression as an intermediate molecular phenotype has been a focus of research interest. In particular, studies of expression quantitative trait loci (eQTL) have offered promise for understanding gene regulation through the discovery of genetic variants that explain variation in gene expression levels. Existing eQTL methods are designed for assessing the effects of common variants, but not rare variants. Here, we address the problem by establishing a novel analytical framework for evaluating the effects of rare or private variants on gene expression. Our method starts from the identification of outlier individuals that show markedly different gene expression from the majority of a population, and then reveals the contributions of private SNPs to the aberrant gene expression in these outliers. Using population-scale mRNA sequencing data, we identify outlier individuals using a multivariate approach. We find that outlier individuals are more readily detected with respect to gene sets that include genes involved in cellular regulation and signal transduction, and less likely to be detected with respect to the gene sets with genes involved in metabolic pathways and other fundamental molecular functions. Analysis of polymorphic data suggests that private SNPs of outlier individuals are enriched in the enhancer and promoter regions of corresponding aberrantly-expressed genes, suggesting a specific regulatory role of private SNPs, while the commonly-occurring regulatory genetic variants (i.e., eQTL SNPs) show little evidence of involvement. Additional data suggest that non-genetic factors may also underlie aberrant gene expression. Taken together, our findings advance a novel viewpoint relevant to situations wherein common eQTLs fail to predict gene expression when heritable, rare inter-individual variation exists. The analytical framework we describe, taking into consideration the reality of differential phenotypic robustness, may be valuable for investigating complex traits and conditions. PMID:25617623
Monzón-Sandoval, Jimena; Castillo-Morales, Atahualpa; Crampton, Sean; McKelvey, Laura; Nolan, Aoife; O'Keeffe, Gerard; Gutierrez, Humberto
2015-01-01
During development, the nervous system (NS) is assembled and sculpted through a concerted series of neurodevelopmental events orchestrated by a complex genetic programme. While neural-specific gene expression plays a critical part in this process, in recent years, a number of immune-related signaling and regulatory components have also been shown to play key physiological roles in the developing and adult NS. While the involvement of individual immune-related signaling components in neural functions may reflect their ubiquitous character, it may also reflect a much wider, as yet undescribed, genetic network of immune-related molecules acting as an intrinsic component of the neural-specific regulatory machinery that ultimately shapes the NS. In order to gain insights into the scale and wider functional organization of immune-related genetic networks in the NS, we examined the large scale pattern of expression of these genes in the brain. Our results show a highly significant correlated expression and transcriptional clustering among immune-related genes in the developing and adult brain, and this correlation was the highest in the brain when compared to muscle, liver, kidney and endothelial cells. We experimentally tested the regulatory clustering of immune system (IS) genes by using microarray expression profiling in cultures of dissociated neurons stimulated with the pro-inflammatory cytokine TNF-alpha, and found a highly significant enrichment of immune system-related genes among the resulting differentially expressed genes. Our findings strongly suggest a coherent recruitment of entire immune-related genetic regulatory modules by the neural-specific genetic programme that shapes the NS.
Cheng, Hsiao-Ying; Masiello, Caroline A; Bennett, George N; Silberg, Jonathan J
2016-08-16
Traditional visual reporters of gene expression have only very limited use in soils because their outputs are challenging to detect through the soil matrix. This severely restricts our ability to study time-dependent microbial gene expression in one of the Earth's largest, most complex habitats. Here we describe an approach to report on dynamic gene expression within a microbial population in a soil under natural water levels (at and below water holding capacity) via production of methyl halides using a methyl halide transferase. As a proof-of-concept application, we couple the expression of this gas reporter to the conjugative transfer of a bacterial plasmid in a soil matrix and show that gas released from the matrix displays a strong correlation with the number of transconjugant bacteria that formed. Gas reporting of gene expression will make possible dynamic studies of natural and engineered microbes within many hard-to-image environmental matrices (soils, sediments, sludge, and biomass) at sample scales exceeding those used for traditional visual reporting.
de Jong, Simone; Boks, Marco P. M.; Fuller, Tova F.; Strengman, Eric; Janson, Esther; de Kovel, Carolien G. F.; Ori, Anil P. S.; Vi, Nancy; Mulder, Flip; Blom, Jan Dirk; Glenthøj, Birte; Schubart, Chris D.; Cahn, Wiepke; Kahn, René S.; Horvath, Steve; Ophoff, Roel A.
2012-01-01
Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1), is located in, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network. PMID:22761806
ROTH, STEPHEN M.; FERRELL, ROBERT E.; PETERS, DAVID G.; METTER, E. JEFFREY; HURLEY, BEN F.; ROGERS, MARC A.
2010-01-01
The purpose of this study was to determine the influence of age, sex, and strength training (ST) on large-scale gene expression patterns in vastus lateralis muscle biopsies using high-density cDNA microarrays and quantitative PCR. Muscle samples from sedentary young (20–30 yr) and older (65–75 yr) men and women (5 per group) were obtained before and after a 9-wk unilateral heavy resistance ST program. RNA was hybridized to cDNA filter microarrays representing ~4,000 known human genes and comparisons were made among arrays to determine differential gene expression as a result of age and sex differences, and/or response to ST. Sex had the strongest influence on muscle gene expression, with differential expression (>1.7-fold) observed for ~200 genes between men and women (~75% with higher expression in men). Age contributed to differential expression as well, as ~50 genes were identified as differentially expressed (>1.7-fold) in relation to age, representing structural, metabolic, and regulatory gene classes. Sixty-nine genes were identified as being differentially expressed (>1.7-fold) in all groups in response to ST, and the majority of these were downregulated. Quantitative PCR was employed to validate expression levels for caldesmon, SWI/SNF (BAF60b), and four-and-a-half LIM domains 1. These significant differences suggest that in the analysis of skeletal muscle gene expression issues of sex, age, and habitual physical activity must be addressed, with sex being the most critical variable. PMID:12209020
Okada, D; Endo, S; Matsuda, H; Ogawa, S; Taniguchi, Y; Katsuta, T; Watanabe, T; Iwaisaki, H
2018-05-12
Genome-wide association studies (GWAS) of quantitative traits have detected numerous genetic associations, but they encounter difficulties in pinpointing prominent candidate genes and inferring gene networks. The present study used a systems genetics approach integrating GWAS results with external RNA-expression data to detect candidate gene networks in feed utilization and growth traits of Japanese Black cattle, which are matters of concern. A SNP co-association network was derived from significant correlations between SNPs with effects estimated by GWAS across seven phenotypic traits. The resulting network genes contained significant numbers of annotations related to the traits. Using bovine transcriptome data from a public database, an RNA co-expression network was inferred based on the similarity of expression patterns across different tissues. An intersection network was then generated by superimposing the SNP and RNA networks and extracting shared interactions. This intersection network contained four tissue-specific modules: nervous system, reproductive system, muscular system, and glands. To characterize the structure (topographical properties) of the three networks, their scale-free properties were evaluated, which revealed that the intersection network was the most scale-free. In the sub-network containing the most connected transcription factors (URI1, ROCK2 and ETV6), most genes were widely expressed across tissues, and genes previously shown to be involved in the traits were found. Results indicated that the current approach might be used to construct a gene network that better reflects biological information, providing encouragement for the genetic dissection of economically important quantitative traits.
A high resolution atlas of gene expression in the domestic sheep (Ovis aries)
Farquhar, Iseabail L.; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G.; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C. Bruce; Freeman, Tom C.; Archibald, Alan L.; Hume, David A.
2017-01-01
Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of ‘guilt by association’ was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages. PMID:28915238
A high resolution atlas of gene expression in the domestic sheep (Ovis aries).
Clark, Emily L; Bush, Stephen J; McCulloch, Mary E B; Farquhar, Iseabail L; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G; Wu, Chunlei; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C Bruce; Freeman, Tom C; Summers, Kim M; Archibald, Alan L; Hume, David A
2017-09-01
Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of 'guilt by association' was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages.
Comparative studies of gene expression and the evolution of gene regulation
Romero, Irene Gallego; Ruvinsky, Ilya; Gilad, Yoav
2014-01-01
The hypothesis that differences in gene regulation play an important role in speciation and adaptation is more than 40 years old. With the advent of new sequencing technologies, we are able to characterize and study gene expression levels and associated regulatory mechanisms in a large number of individuals and species at unprecedented resolution and scale. We have thus gained new insights into the evolutionary pressures that shape gene expression levels, as well as developed an appreciation for the relative importance of evolutionary changes in different regulatory genetic and epigenetic mechanisms. The current challenge is to link gene regulatory changes to adaptive evolution of complex phenotypes. Here we mainly focus on comparative studies in primates, and how they are complemented by studies in model organisms. PMID:22705669
2013-01-01
Background Candidate genes for color pattern formation in butterfly wings have been known based on gene expression patterns since the 1990s, but their functions remain elusive due to a lack of a functional assay. Several methods of transferring and expressing a foreign gene in butterfly wings have been reported, but they have suffered from low success rates or low expression levels. Here, we developed a simple, practical method to efficiently deliver and express a foreign gene using baculovirus-mediated gene transfer in butterfly wings in vivo. Results A recombinant baculovirus containing a gene for green fluorescent protein (GFP) was injected into pupae of the blue pansy butterfly Junonia orithya (Nymphalidae). GFP fluorescence was detected in the pupal wings and other body parts of the injected individuals three to five days post-injection at various degrees of fluorescence. We obtained a high GFP expression rate at relatively high virus titers, but it was associated with pupal death before color pattern formation in wings. To reduce the high mortality rate caused by the baculovirus treatment, we administered an anti-gp64 antibody, which was raised against baculovirus coat protein gp64, to infected pupae after the baculovirus injection. This treatment greatly reduced the mortality rate of the infected pupae. GFP fluorescence was observed in pupal and adult wings and other body parts of the antibody-treated individuals at various degrees of fluorescence. Importantly, we obtained completely developed wings with a normal color pattern, in which fluorescent signals originated directly from scales or the basal membrane after the removal of scales. GFP fluorescence in wing tissues spatially coincided with anti-GFP antibody staining, confirming that the fluorescent signals originated from the expressed GFP molecules. Conclusions Our baculovirus-mediated gene transfer system with an anti-gp64 antibody is reasonably efficient, and it can be an invaluable tool to transfer, express, and functionally examine foreign genes in butterfly wings and also in other non-model insect systems. PMID:23522444
Place, Sean P.; Menge, Bruce A.; Hofmann, Gretchen E.
2011-01-01
Summary The marine intertidal zone is characterized by large variation in temperature, pH, dissolved oxygen and the supply of nutrients and food on seasonal and daily time scales. These oceanic fluctuations drive of ecological processes such as recruitment, competition and consumer-prey interactions largely via physiological mehcanisms. Thus, to understand coastal ecosystem dynamics and responses to climate change, it is crucial to understand these mechanisms. Here we utilize transcriptome analysis of the physiological response of the mussel Mytilus californianus at different spatial scales to gain insight into these mechanisms. We used mussels inhabiting different vertical locations within Strawberry Hill on Cape Perpetua, OR and Boiler Bay on Cape Foulweather, OR to study inter- and intra-site variation of gene expression. The results highlight two distinct gene expression signatures related to the cycling of metabolic activity and perturbations to cellular homeostasis. Intermediate spatial scales show a strong influence of oceanographic differences in food and stress environments between sites separated by ~65 km. Together, these new insights into environmental control of gene expression may allow understanding of important physiological drivers within and across populations. PMID:22563136
Gene expression profiling of single cells on large-scale oligonucleotide arrays
Hartmann, Claudia H.; Klein, Christoph A.
2006-01-01
Over the last decade, important insights into the regulation of cellular responses to various stimuli were gained by global gene expression analyses of cell populations. More recently, specific cell functions and underlying regulatory networks of rare cells isolated from their natural environment moved to the center of attention. However, low cell numbers still hinder gene expression profiling of rare ex vivo material in biomedical research. Therefore, we developed a robust method for gene expression profiling of single cells on high-density oligonucleotide arrays with excellent coverage of low abundance transcripts. The protocol was extensively tested with freshly isolated single cells of very low mRNA content including single epithelial, mature and immature dendritic cells and hematopoietic stem cells. Quantitative PCR confirmed that the PCR-based global amplification method did not change the relative ratios of transcript abundance and unsupervised hierarchical cluster analysis revealed that the histogenetic origin of an individual cell is correctly reflected by the gene expression profile. Moreover, the gene expression data from dendritic cells demonstrate that cellular differentiation and pathway activation can be monitored in individual cells. PMID:17071717
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
2009-07-01
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
Emergence of the self-similar property in gene expression dynamics
NASA Astrophysics Data System (ADS)
Ochiai, T.; Nacher, J. C.; Akutsu, T.
2007-08-01
Many theoretical models have recently been proposed to understand the structure of cellular systems composed of various types of elements (e.g., proteins, metabolites and genes) and their interactions. However, the cell is a highly dynamic system with thousands of functional elements fluctuating across temporal states. Therefore, structural analysis alone is not sufficient to reproduce the cell's observed behavior. In this article, we analyze the gene expression dynamics (i.e., how the amount of mRNA molecules in cell fluctuate in time) by using a new constructive approach, which reveals a symmetry embedded in gene expression fluctuations and characterizes the dynamical equation of gene expression (i.e., a specific stochastic differential equation). First, by using experimental data of human and yeast gene expression time series, we found a symmetry in short-time transition probability from time t to time t+1. We call it self-similarity symmetry (i.e., the gene expression short-time fluctuations contain a repeating pattern of smaller and smaller parts that are like the whole, but different in size). Secondly, we reconstruct the global behavior of the observed distribution of gene expression (i.e., scaling-law) and the local behavior of the power-law tail of this distribution. This approach may represent a step forward toward an integrated image of the basic elements of the whole cell.
Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping
2015-01-27
Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
Upper airway gene expression in smokers: the mouth as a "window to the soul" of lung carcinogenesis?
Spira, Avrum
2010-03-01
This perspective on Boyle et al. (beginning on page 266 in this issue of the journal) explores transcriptomic profiling of upper airway epithelium as a biomarker of host response to tobacco smoke exposure. Boyle et al. have shown a striking relationship between smoking-related gene expression changes in the mouth and bronchus. This relationship suggests that buccal gene expression may serve as a relatively noninvasive surrogate marker of the physiologic response of the lung to tobacco smoke that could be used in large-scale screening and chemoprevention studies for lung cancer.
The Effect of Gestational Age on Angiogenic Gene Expression in the Rat Placenta
Vaswani, Kanchan; Hum, Melissa Wen-Ching; Chan, Hsiu-Wen; Ryan, Jennifer; Wood-Bradley, Ryan J.; Nitert, Marloes Dekker; Mitchell, Murray D.; Armitage, James A.; Rice, Gregory E.
2013-01-01
The placenta plays a central role in determining the outcome of pregnancy. It undergoes changes during gestation as the fetus develops and as demands for energy substrate transfer and gas exchange increase. The molecular mechanisms that coordinate these changes have yet to be fully elucidated. The study performed a large scale screen of the transcriptome of the rat placenta throughout mid-late gestation (E14.25–E20) with emphasis on characterizing gestational age associated changes in the expression of genes invoved in angiogenic pathways. Sprague Dawley dams were sacrificed at E14.25, E15.25, E17.25 and E20 (n = 6 per group) and RNA was isolated from one placenta per dam. Changes in placental gene expression were identifed using Illumina Rat Ref-12 Expression BeadChip Microarrays. Differentially expressed genes (>2-fold change, <1% false discovery rate, FDR) were functionally categorised by gene ontology pathway analysis. A subset of differentially expressed genes identified by microarrays were confirmed using Real-Time qPCR. The expression of thirty one genes involved in the angiogenic pathway was shown to change over time, using microarray analysis (22 genes displayed increased and 9 gene decreased expression). Five genes (4 up regulated: Cd36, Mmp14, Rhob and Angpt4 and 1 down regulated: Foxm1) involved in angiogenesis and blood vessel morphogenesis were subjected to further validation. qPCR confirmed late gestational increased expression of Cd36, Mmp14, Rhob and Angpt4 and a decrease in expression of Foxm1 before labour onset (P<0.0001). The observed acute, pre-labour changes in the expression of the 31 genes during gestation warrant further investigation to elucidate their role in pregnancy. PMID:24391823
Quiapim, Andréa C.; Brito, Michael S.; Bernardes, Luciano A.S.; daSilva, Idalete; Malavazi, Iran; DePaoli, Henrique C.; Molfetta-Machado, Jeanne B.; Giuliatti, Silvana; Goldman, Gustavo H.; Goldman, Maria Helena S.
2009-01-01
The success of plant reproduction depends on pollen-pistil interactions occurring at the stigma/style. These interactions vary depending on the stigma type: wet or dry. Tobacco (Nicotiana tabacum) represents a model of wet stigma, and its stigmas/styles express genes to accomplish the appropriate functions. For a large-scale study of gene expression during tobacco pistil development and preparation for pollination, we generated 11,216 high-quality expressed sequence tags (ESTs) from stigmas/styles and created the TOBEST database. These ESTs were assembled in 6,177 clusters, from which 52.1% are pistil transcripts/genes of unknown function. The 21 clusters with the highest number of ESTs (putative higher expression levels) correspond to genes associated with defense mechanisms or pollen-pistil interactions. The database analysis unraveled tobacco sequences homologous to the Arabidopsis (Arabidopsis thaliana) genes involved in specifying pistil identity or determining normal pistil morphology and function. Additionally, 782 independent clusters were examined by macroarray, revealing 46 stigma/style preferentially expressed genes. Real-time reverse transcription-polymerase chain reaction experiments validated the pistil-preferential expression for nine out of 10 genes tested. A search for these 46 genes in the Arabidopsis pistil data sets demonstrated that only 11 sequences, with putative equivalent molecular functions, are expressed in this dry stigma species. The reverse search for the Arabidopsis pistil genes in the TOBEST exposed a partial overlap between these dry and wet stigma transcriptomes. The TOBEST represents the most extensive survey of gene expression in the stigmas/styles of wet stigma plants, and our results indicate that wet and dry stigmas/styles express common as well as distinct genes in preparation for the pollination process. PMID:19052150
A gene expression resource generated by genome-wide lacZ profiling in the mouse
Tuck, Elizabeth; Estabel, Jeanne; Oellrich, Anika; Maguire, Anna Karin; Adissu, Hibret A.; Souter, Luke; Siragher, Emma; Lillistone, Charlotte; Green, Angela L.; Wardle-Jones, Hannah; Carragher, Damian M.; Karp, Natasha A.; Smedley, Damian; Adams, Niels C.; Bussell, James N.; Adams, David J.; Ramírez-Solis, Ramiro; Steel, Karen P.; Galli, Antonella; White, Jacqueline K.
2015-01-01
ABSTRACT Knowledge of the expression profile of a gene is a critical piece of information required to build an understanding of the normal and essential functions of that gene and any role it may play in the development or progression of disease. High-throughput, large-scale efforts are on-going internationally to characterise reporter-tagged knockout mouse lines. As part of that effort, we report an open access adult mouse expression resource, in which the expression profile of 424 genes has been assessed in up to 47 different organs, tissues and sub-structures using a lacZ reporter gene. Many specific and informative expression patterns were noted. Expression was most commonly observed in the testis and brain and was most restricted in white adipose tissue and mammary gland. Over half of the assessed genes presented with an absent or localised expression pattern (categorised as 0-10 positive structures). A link between complexity of expression profile and viability of homozygous null animals was observed; inactivation of genes expressed in ≥21 structures was more likely to result in reduced viability by postnatal day 14 compared with more restricted expression profiles. For validation purposes, this mouse expression resource was compared with Bgee, a federated composite of RNA-based expression data sets. Strong agreement was observed, indicating a high degree of specificity in our data. Furthermore, there were 1207 observations of expression of a particular gene in an anatomical structure where Bgee had no data, indicating a large amount of novelty in our data set. Examples of expression data corroborating and extending genotype-phenotype associations and supporting disease gene candidacy are presented to demonstrate the potential of this powerful resource. PMID:26398943
Tuller, T; Atar, S; Ruppin, E; Gurevich, M; Achiron, A
2013-03-01
The aim of this study is to understand intracellular regulatory mechanisms in peripheral blood mononuclear cells (PBMCs), which are either common to many autoimmune diseases or specific to some of them. We incorporated large-scale data such as protein-protein interactions, gene expression and demographical information of hundreds of patients and healthy subjects, related to six autoimmune diseases with available large-scale gene expression measurements: multiple sclerosis (MS), systemic lupus erythematosus (SLE), juvenile rheumatoid arthritis (JRA), Crohn's disease (CD), ulcerative colitis (UC) and type 1 diabetes (T1D). These data were analyzed concurrently by statistical and systems biology approaches tailored for this purpose. We found that chemokines such as CXCL1-3, 5, 6 and the interleukin (IL) IL8 tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In addition, the anti-apoptotic gene BCL3, interferon-γ (IFNG), and the vitamin D receptor (VDR) gene physically interact with significantly many genes that tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In general, similar cellular processes tend to be differentially expressed in PBMC in the analyzed autoimmune diseases. Specifically, the cellular processes related to cell proliferation (for example, epidermal growth factor, platelet-derived growth factor, nuclear factor-κB, Wnt/β-catenin signaling, stress-activated protein kinase c-Jun NH2-terminal kinase), inflammatory response (for example, interleukins IL2 and IL6, the cytokine granulocyte-macrophage colony-stimulating factor and the B-cell receptor), general signaling cascades (for example, mitogen-activated protein kinase, extracellular signal-regulated kinase, p38 and TRK) and apoptosis are activated in most of the analyzed autoimmune diseases. However, our results suggest that in each of the analyzed diseases, apoptosis and chemotaxis are activated via different subsignaling pathways. Analyses of the expression levels of dozens of genes and the protein-protein interactions among them demonstrated that CD and UC have relatively similar gene expression signatures, whereas the gene expression signatures of T1D and JRA relatively differ from the signatures of the other autoimmune diseases. These diseases are the only ones activated via the Fcɛ pathway. The relevant genes and pathways reported in this study are discussed at length, and may be helpful in the diagnoses and understanding of autoimmunity and/or specific autoimmune diseases.
USDA-ARS?s Scientific Manuscript database
Functional annotations of large plant genome projects mostly provide information on gene function and gene families based on the presence of protein domains and gene homology, but not necessarily in association with gene expression or metabolic and regulatory networks. These additional annotations a...
Sun, Yang; Huang, Shuijin; Wang, Shuping; Guo, Dianhao; Ge, Chang; Xiao, Huamei; Jie, Wencai; Yang, Qiupu; Teng, Xiaolu; Li, Fei
2017-04-01
Insects undergo metamorphosis, involving an abrupt change in body structure through cell growth and differentiation. Rice stem stripped borer (SSB), Chilo suppressalis, is one of the most destructive rice pests. However, little is known about the regulation mechanism of metamorphosis development in this notorious insect pest. Here, we studied the expression of 22,197 SSB genes at seven time points during pupa development with a customized microarray, identifying 622 differentially expressed genes (DEG) during pupa development. Gene ontology (GO) analysis of these DEGs indicated that the genes related to substance metabolism were highly expressed in the early pupa, which participate in the physiological processes of larval tissue disintegration at these stages. In comparison, highly expressed genes in the late pupal stages were mainly associated with substance biosynthesis, consistent with adult organ formation at these stages. There were 27 solute carrier (SLC) genes that were highly expressed during pupa development. We knocked down SLC22A3 at the prepupal stage, demonstrating that silencing SLC22A3 induced a deficiency in pupa stiffness and pigmentation. The RNAi-treated individuals had white and soft pupa, suggesting that this gene has an essential role in pupal development. Copyright © 2016 Elsevier Ltd. All rights reserved.
Imprinted gene expression in fetal growth and development.
Lambertini, L; Marsit, C J; Sharma, P; Maccani, M; Ma, Y; Hu, J; Chen, J
2012-06-01
Experimental studies showed that genomic imprinting is fundamental in fetoplacental development by timely regulating the expression of the imprinted genes to overlook a set of events determining placenta implantation, growth and embryogenesis. We examined the expression profile of 22 imprinted genes which have been linked to pregnancy abnormalities that may ultimately influence childhood development. The study was conducted in a subset of 106 placenta samples, overrepresented with small and large for gestational age cases, from the Rhode Island Child Health Study. We investigated associations between imprinted gene expression and three fetal development parameters: newborn head circumference, birth weight, and size for gestational age. Results from our investigation show that the maternally imprinted/paternally expressed gene ZNF331 inversely associates with each parameter to drive smaller fetal size, while paternally imprinted/maternally expressed gene SLC22A18 directly associates with the newborn head circumference promoting growth. Multidimensional Scaling analysis revealed two clusters within the 22 imprinted genes which are independently associated with fetoplacental development. Our data suggest that cluster 1 genes work by assuring cell growth and tissue development, while cluster 2 genes act by coordinating these processes. Results from this epidemiologic study offer solid support for the key role of imprinting in fetoplacental development. Copyright © 2012 Elsevier Ltd. All rights reserved.
Cloud-scale genomic signals processing classification analysis for gene expression microarray data.
Harvey, Benjamin; Soo-Yeon Ji
2014-01-01
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.
2016-01-01
Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576
Sex-Specific Selection and Sex-Biased Gene Expression in Humans and Flies
Kirkpatrick, Mark
2016-01-01
Sexual dimorphism results from sex-biased gene expression, which evolves when selection acts differently on males and females. While there is an intimate connection between sex-biased gene expression and sex-specific selection, few empirical studies have studied this relationship directly. Here we compare the two on a genome-wide scale in humans and flies. We find a distinctive “Twin Peaks” pattern in humans that relates the strength of sex-specific selection, quantified by genetic divergence between male and female adults at autosomal loci, to the degree of sex-biased expression. Genes with intermediate degrees of sex-biased expression show evidence of ongoing sex-specific selection, while genes with either little or completely sex-biased expression do not. This pattern apparently results from differential viability selection in males and females acting in the current generation. The Twin Peaks pattern is also found in Drosophila using a different measure of sex-specific selection acting on fertility. We develop a simple model that successfully recapitulates the Twin Peaks. Our results suggest that many genes with intermediate sex-biased expression experience ongoing sex-specific selection in humans and flies. PMID:27658217
DGEM--a microarray gene expression database for primary human disease tissues.
Xia, Yuni; Campen, Andrew; Rigsby, Dan; Guo, Ying; Feng, Xingdong; Su, Eric W; Palakal, Mathew; Li, Shuyu
2007-01-01
Gene expression patterns can reflect gene regulations in human tissues under normal or pathologic conditions. Gene expression profiling data from studies of primary human disease samples are particularly valuable since these studies often span many years in order to collect patient clinical information and achieve a large sample size. Disease-to-Gene Expression Mapper (DGEM) provides a beneficial community resource to access and analyze these data; it currently includes Affymetrix oligonucleotide array datasets for more than 40 human diseases and 1400 samples. The data are normalized to the same scale and stored in a relational database. A statistical-analysis pipeline was implemented to identify genes abnormally expressed in disease tissues or genes whose expressions are associated with clinical parameters such as cancer patient survival. Data-mining results can be queried through a web-based interface at http://dgem.dhcp.iupui.edu/. The query tool enables dynamic generation of graphs and tables that are further linked to major gene and pathway resources that connect the data to relevant biology, including Entrez Gene and Kyoto Encyclopedia of Genes and Genomes (KEGG). In summary, DGEM provides scientists and physicians a valuable tool to study disease mechanisms, to discover potential disease biomarkers for diagnosis and prognosis, and to identify novel gene targets for drug discovery. The source code is freely available for non-profit use, on request to the authors.
Xiao, Xiaolin; Moreno-Moral, Aida; Rotival, Maxime; Bottolo, Leonardo; Petretto, Enrico
2014-01-01
Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states). Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks) that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted) networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based) and humans (mRNA-sequencing-based) and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi-tissue subnetwork of co-expressed heat shock protein (Hsp) and cardiomyopathy genes (Bag3, Cryab, Kras, Emd, Plec), which was significantly replicated using separate failing heart and liver gene expression datasets in humans, thus revealing a conserved functional role for Hsp genes in cardiovascular disease.
Prediction of gene expression in embryonic structures of Drosophila melanogaster.
Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis
2007-07-01
Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms.
Prediction of Gene Expression in Embryonic Structures of Drosophila melanogaster
Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis
2007-01-01
Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms. PMID:17658945
Mosquera Orgueira, Adrián
2015-01-01
DNA methylation is a frequent epigenetic mechanism that participates in transcriptional repression. Variations in DNA methylation with respect to gene expression are constant, and, for unknown reasons, some genes with highly methylated promoters are sometimes overexpressed. In this study we have analyzed the expression and methylation patterns of thousands of genes in five groups of cancer and normal tissue samples in order to determine local and genome-wide differences. We observed significant changes in global methylation-expression correlation in all the neoplasms, which suggests that differential correlation events are frequent in cancer. A focused analysis in the breast cancer cohort identified 1662 genes whose correlation varies significantly between normal and cancerous breast, but whose DNA methylation and gene expression patterns do not change substantially. These genes were enriched in cancer-related pathways and repressive chromatin features across various model cell lines, such as PRC2 binding and H3K27me3 marks. Substantial changes in methylation-expression correlation indicate that these genes are subject to epigenetic remodeling, where the differential activity of other factors break the expected relationship between both variables. Our findings suggest a complex regulatory landscape where a redistribution of local and large-scale chromatin repressive domains at differentially correlated genes (DCGs) creates epigenetic hotspots that modulate cancer-specific gene expression. PMID:26029238
Kaufman, Alon; Dror, Gideon; Meilijson, Isaac; Ruppin, Eytan
2006-12-08
The claim that genetic properties of neurons significantly influence their synaptic network structure is a common notion in neuroscience. The nematode Caenorhabditis elegans provides an exciting opportunity to approach this question in a large-scale quantitative manner. Its synaptic connectivity network has been identified, and, combined with cellular studies, we currently have characteristic connectivity and gene expression signatures for most of its neurons. By using two complementary analysis assays we show that the expression signature of a neuron carries significant information about its synaptic connectivity signature, and identify a list of putative genes predicting neural connectivity. The current study rigorously quantifies the relation between gene expression and synaptic connectivity signatures in the C. elegans nervous system and identifies subsets of neurons where this relation is highly marked. The results presented and the genes identified provide a promising starting point for further, more detailed computational and experimental investigations.
Discovering functions of unannotated genes from a transcriptome survey of wild fungal isolates.
Ellison, Christopher E; Kowbel, David; Glass, N Louise; Taylor, John W; Brem, Rachel B
2014-04-01
Most fungal genomes are poorly annotated, and many fungal traits of industrial and biomedical relevance are not well suited to classical genetic screens. Assigning genes to phenotypes on a genomic scale thus remains an urgent need in the field. We developed an approach to infer gene function from expression profiles of wild fungal isolates, and we applied our strategy to the filamentous fungus Neurospora crassa. Using transcriptome measurements in 70 strains from two well-defined clades of this microbe, we first identified 2,247 cases in which the expression of an unannotated gene rose and fell across N. crassa strains in parallel with the expression of well-characterized genes. We then used image analysis of hyphal morphologies, quantitative growth assays, and expression profiling to test the functions of four genes predicted from our population analyses. The results revealed two factors that influenced regulation of metabolism of nonpreferred carbon and nitrogen sources, a gene that governed hyphal architecture, and a gene that mediated amino acid starvation resistance. These findings validate the power of our population-transcriptomic approach for inference of novel gene function, and we suggest that this strategy will be of broad utility for genome-scale annotation in many fungal systems. IMPORTANCE Some fungal species cause deadly infections in humans or crop plants, and other fungi are workhorses of industrial chemistry, including the production of biofuels. Advances in medical and industrial mycology require an understanding of the genes that control fungal traits. We developed a method to infer functions of uncharacterized genes by observing correlated expression of their mRNAs with those of known genes across wild fungal isolates. We applied this strategy to a filamentous fungus and predicted functions for thousands of unknown genes. In four cases, we experimentally validated the predictions from our method, discovering novel genes involved in the metabolism of nutrient sources relevant for biofuel production, as well as colony morphology and starvation resistance. Our strategy is straightforward, inexpensive, and applicable for predicting gene function in many fungal species.
Query-based biclustering of gene expression data using Probabilistic Relational Models.
Zhao, Hui; Cloots, Lore; Van den Bulcke, Tim; Wu, Yan; De Smet, Riet; Storms, Valerie; Meysman, Pieter; Engelen, Kristof; Marchal, Kathleen
2011-02-15
With the availability of large scale expression compendia it is now possible to view own findings in the light of what is already available and retrieve genes with an expression profile similar to a set of genes of interest (i.e., a query or seed set) for a subset of conditions. To that end, a query-based strategy is needed that maximally exploits the coexpression behaviour of the seed genes to guide the biclustering, but that at the same time is robust against the presence of noisy genes in the seed set as seed genes are often assumed, but not guaranteed to be coexpressed in the queried compendium. Therefore, we developed ProBic, a query-based biclustering strategy based on Probabilistic Relational Models (PRMs) that exploits the use of prior distributions to extract the information contained within the seed set. We applied ProBic on a large scale Escherichia coli compendium to extend partially described regulons with potentially novel members. We compared ProBic's performance with previously published query-based biclustering algorithms, namely ISA and QDB, from the perspective of bicluster expression quality, robustness of the outcome against noisy seed sets and biological relevance.This comparison learns that ProBic is able to retrieve biologically relevant, high quality biclusters that retain their seed genes and that it is particularly strong in handling noisy seeds. ProBic is a query-based biclustering algorithm developed in a flexible framework, designed to detect biologically relevant, high quality biclusters that retain relevant seed genes even in the presence of noise or when dealing with low quality seed sets.
Context-specific metabolic networks are consistent with experiments.
Becker, Scott A; Palsson, Bernhard O
2008-05-16
Reconstructions of cellular metabolism are publicly available for a variety of different microorganisms and some mammalian genomes. To date, these reconstructions are "genome-scale" and strive to include all reactions implied by the genome annotation, as well as those with direct experimental evidence. Clearly, many of the reactions in a genome-scale reconstruction will not be active under particular conditions or in a particular cell type. Methods to tailor these comprehensive genome-scale reconstructions into context-specific networks will aid predictive in silico modeling for a particular situation. We present a method called Gene Inactivity Moderated by Metabolism and Expression (GIMME) to achieve this goal. The GIMME algorithm uses quantitative gene expression data and one or more presupposed metabolic objectives to produce the context-specific reconstruction that is most consistent with the available data. Furthermore, the algorithm provides a quantitative inconsistency score indicating how consistent a set of gene expression data is with a particular metabolic objective. We show that this algorithm produces results consistent with biological experiments and intuition for adaptive evolution of bacteria, rational design of metabolic engineering strains, and human skeletal muscle cells. This work represents progress towards producing constraint-based models of metabolism that are specific to the conditions where the expression profiling data is available.
Construction of two vectors for gene expression in Trichoderma reesei.
Lv, Dandan; Wang, Wei; Wei, Dongzhi
2012-01-01
We report the construction of two filamentous fungi Trichoderma reesei expression vectors, pWEF31 and pWEF32. Both vectors possess the hygromycin phosphotransferase B gene expression cassette and the strong promoter and terminator of the cellobiohydrolase 1 gene (cbh1) from T. reesei. The two newly constructed vectors can be efficiently transformed into T. reesei with Agrobacterium-mediated transformation. The difference between pWEF31 and pWEF32 is that pWEF32 has two longer homologous arms. As a result, pWEF32 easily undergoes homologous recombination. On the other hand, pWEF31 undergoes random recombination. The applicability of both vectors was tested by first generating the expression vectors pWEF31-red and pWEF32-red and then detecting the expression of the DsRed2 gene in T. reesei Rut C30. Additionally, we measured the exo-1,4-β-glucanase activity of the recombinant cells. Our work provides an effective transformation system for homologous and heterologous gene expression and gene knockout in T. reesei. It also provides a method for recombination at a specific chromosomal location. Finally, both vectors will be useful for the large-scale gene expression industry. Copyright © 2011 Elsevier Inc. All rights reserved.
Placinta, Mike; Shen, Meng-Chieh; Achermann, Marc; Karlstrom, Rolf O
2009-12-30
Tissue heating has been employed to study a variety of biological processes, including the study of genes that control embryonic development. Conditional regulation of gene expression is a particularly powerful approach for understanding gene function. One popular method for mis-expressing a gene of interest employs heat-inducible heat shock protein (hsp) promoters. Global heat shock of hsp-promoter-containing transgenic animals induces gene expression throughout all tissues, but does not allow for spatial control. Local heating allows for spatial control of hsp-promoter-driven transgenes, but methods for local heating are cumbersome and variably effective. We describe a simple, highly controllable, and versatile apparatus for heating biological tissue and other materials on the micron-scale. This microheater employs micron-scale fiber optics and uses an inexpensive laser-pointer as a power source. Optical fibers can be pulled on a standard electrode puller to produce tips of varying sizes that can then be used to reliably heat 20-100 mum targets. We demonstrate precise spatiotemporal control of hsp70l:GFP transgene expression in a variety of tissue types in zebrafish embryos and larvae. We also show how this system can be employed as part of a new method for lineage tracing that would greatly facilitate the study of organogenesis and tissue regulation at any time in the life cycle. This versatile and simple local heater has broad utility for the study of gene function and for lineage tracing. This system could be used to control hsp-driven gene expression in any organism simply by bringing the fiber optic tip in contact with the tissue of interest. Beyond these uses for the study of gene function, this device has wide-ranging utility in materials science and could easily be adapted for therapeutic purposes in humans.
Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek
2017-08-01
While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Kim, Minseung; Zorraquino, Violeta; Tagkopoulos, Ilias
2015-03-01
A tantalizing question in cellular physiology is whether the cellular state and environmental conditions can be inferred by the expression signature of an organism. To investigate this relationship, we created an extensive normalized gene expression compendium for the bacterium Escherichia coli that was further enriched with meta-information through an iterative learning procedure. We then constructed an ensemble method to predict environmental and cellular state, including strain, growth phase, medium, oxygen level, antibiotic and carbon source presence. Results show that gene expression is an excellent predictor of environmental structure, with multi-class ensemble models achieving balanced accuracy between 70.0% (±3.5%) to 98.3% (±2.3%) for the various characteristics. Interestingly, this performance can be significantly boosted when environmental and strain characteristics are simultaneously considered, as a composite classifier that captures the inter-dependencies of three characteristics (medium, phase and strain) achieved 10.6% (±1.0%) higher performance than any individual models. Contrary to expectations, only 59% of the top informative genes were also identified as differentially expressed under the respective conditions. Functional analysis of the respective genetic signatures implicates a wide spectrum of Gene Ontology terms and KEGG pathways with condition-specific information content, including iron transport, transferases, and enterobactin synthesis. Further experimental phenotypic-to-genotypic mapping that we conducted for knock-out mutants argues for the information content of top-ranked genes. This work demonstrates the degree at which genome-scale transcriptional information can be predictive of latent, heterogeneous and seemingly disparate phenotypic and environmental characteristics, with far-reaching applications.
Ye, Meixia; Wang, Zhong; Wang, Yaqun; Wu, Rongling
2015-03-01
Dynamic changes of gene expression reflect an intrinsic mechanism of how an organism responds to developmental and environmental signals. With the increasing availability of expression data across a time-space scale by RNA-seq, the classification of genes as per their biological function using RNA-seq data has become one of the most significant challenges in contemporary biology. Here we develop a clustering mixture model to discover distinct groups of genes expressed during a period of organ development. By integrating the density function of multivariate Poisson distribution, the model accommodates the discrete property of read counts characteristic of RNA-seq data. The temporal dependence of gene expression is modeled by the first-order autoregressive process. The model is implemented with the Expectation-Maximization algorithm and model selection to determine the optimal number of gene clusters and obtain the estimates of Poisson parameters that describe the pattern of time-dependent expression of genes from each cluster. The model has been demonstrated by analyzing a real data from an experiment aimed to link the pattern of gene expression to catkin development in white poplar. The usefulness of the model has been validated through computer simulation. The model provides a valuable tool for clustering RNA-seq data, facilitating our global view of expression dynamics and understanding of gene regulation mechanisms. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Caldwell, Rachel; Lin, Yan-Xia; Zhang, Ren
2015-01-01
There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length. PMID:26114098
Superior Cross-Species Reference Genes: A Blueberry Case Study
Die, Jose V.; Rowland, Lisa J.
2013-01-01
The advent of affordable Next Generation Sequencing technologies has had major impact on studies of many crop species, where access to genomic technologies and genome-scale data sets has been extremely limited until now. The recent development of genomic resources in blueberry will enable the application of high throughput gene expression approaches that should relatively quickly increase our understanding of blueberry physiology. These studies, however, require a highly accurate and robust workflow and make necessary the identification of reference genes with high expression stability for correct target gene normalization. To create a set of superior reference genes for blueberry expression analyses, we mined a publicly available transcriptome data set from blueberry for orthologs to a set of Arabidopsis genes that showed the most stable expression in a developmental series. In total, the expression stability of 13 putative reference genes was evaluated by qPCR and a set of new references with high stability values across a developmental series in fruits and floral buds of blueberry were identified. We also demonstrated the need to use at least two, preferably three, reference genes to avoid inconsistencies in results, even when superior reference genes are used. The new references identified here provide a valuable resource for accurate normalization of gene expression in Vaccinium spp. and may be useful for other members of the Ericaceae family as well. PMID:24058469
Application of industrial scale genomics to discovery of therapeutic targets in heart failure.
Mehraban, F; Tomlinson, J E
2001-12-01
In recent years intense activity in both academic and industrial sectors has provided a wealth of information on the human genome with an associated impressive increase in the number of novel gene sequences deposited in sequence data repositories and patent applications. This genomic industrial revolution has transformed the way in which drug target discovery is now approached. In this article we discuss how various differential gene expression (DGE) technologies are being utilized for cardiovascular disease (CVD) drug target discovery. Other approaches such as sequencing cDNA from cardiovascular derived tissues and cells coupled with bioinformatic sequence analysis are used with the aim of identifying novel gene sequences that may be exploited towards target discovery. Additional leverage from gene sequence information is obtained through identification of polymorphisms that may confer disease susceptibility and/or affect drug responsiveness. Pharmacogenomic studies are described wherein gene expression-based techniques are used to evaluate drug response and/or efficacy. Industrial-scale genomics supports and addresses not only novel target gene discovery but also the burgeoning issues in pharmaceutical and clinical cardiovascular medicine relative to polymorphic gene responses.
Shanley, Thomas P; Cvijanovich, Natalie; Lin, Richard; Allen, Geoffrey L; Thomas, Neal J; Doctor, Allan; Kalyanaraman, Meena; Tofil, Nancy M; Penfil, Scott; Monaco, Marie; Odoms, Kelli; Barnes, Michael; Sakthivel, Bhuvaneswari; Aronow, Bruce J; Wong, Hector R
2007-01-01
We have conducted longitudinal studies focused on the expression profiles of signaling pathways and gene networks in children with septic shock. Genome-level expression profiles were generated from whole blood-derived RNA of children with septic shock (n = 30) corresponding to day one and day three of septic shock, respectively. Based on sequential statistical and expression filters, day one and day three of septic shock were characterized by differential regulation of 2,142 and 2,504 gene probes, respectively, relative to controls (n = 15). Venn analysis demonstrated 239 unique genes in the day one dataset, 598 unique genes in the day three dataset, and 1,906 genes common to both datasets. Functional analyses demonstrated time-dependent, differential regulation of genes involved in multiple signaling pathways and gene networks primarily related to immunity and inflammation. Notably, multiple and distinct gene networks involving T cell- and MHC antigen-related biology were persistently downregulated on both day one and day three. Further analyses demonstrated large scale, persistent downregulation of genes corresponding to functional annotations related to zinc homeostasis. These data represent the largest reported cohort of patients with septic shock subjected to longitudinal genome-level expression profiling. The data further advance our genome-level understanding of pediatric septic shock and support novel hypotheses. PMID:17932561
Lam, Max; Trampush, Joey W; Yu, Jin; Knowles, Emma; Davies, Gail; Liewald, David C; Starr, John M; Djurovic, Srdjan; Melle, Ingrid; Sundet, Kjetil; Christoforou, Andrea; Reinvang, Ivar; DeRosse, Pamela; Lundervold, Astri J; Steen, Vidar M; Espeseth, Thomas; Räikkönen, Katri; Widen, Elisabeth; Palotie, Aarno; Eriksson, Johan G; Giegling, Ina; Konte, Bettina; Roussos, Panos; Giakoumaki, Stella; Burdick, Katherine E; Payton, Antony; Ollier, William; Chiba-Falek, Ornit; Attix, Deborah K; Need, Anna C; Cirulli, Elizabeth T; Voineskos, Aristotle N; Stefanis, Nikos C; Avramopoulos, Dimitrios; Hatzimanolis, Alex; Arking, Dan E; Smyrnis, Nikolaos; Bilder, Robert M; Freimer, Nelson A; Cannon, Tyrone D; London, Edythe; Poldrack, Russell A; Sabb, Fred W; Congdon, Eliza; Conley, Emily Drabant; Scult, Matthew A; Dickinson, Dwight; Straub, Richard E; Donohoe, Gary; Morris, Derek; Corvin, Aiden; Gill, Michael; Hariri, Ahmad R; Weinberger, Daniel R; Pendleton, Neil; Bitsios, Panos; Rujescu, Dan; Lahti, Jari; Le Hellard, Stephanie; Keller, Matthew C; Andreassen, Ole A; Deary, Ian J; Glahn, David C; Malhotra, Anil K; Lencz, Todd
2017-11-28
Here, we present a large (n = 107,207) genome-wide association study (GWAS) of general cognitive ability ("g"), further enhanced by combining results with a large-scale GWAS of educational attainment. We identified 70 independent genomic loci associated with general cognitive ability. Results showed significant enrichment for genes causing Mendelian disorders with an intellectual disability phenotype. Competitive pathway analysis implicated the biological processes of neurogenesis and synaptic regulation, as well as the gene targets of two pharmacologic agents: cinnarizine, a T-type calcium channel blocker, and LY97241, a potassium channel inhibitor. Transcriptome-wide and epigenome-wide analysis revealed that the implicated loci were enriched for genes expressed across all brain regions (most strongly in the cerebellum). Enrichment was exclusive to genes expressed in neurons but not oligodendrocytes or astrocytes. Finally, we report genetic correlations between cognitive ability and disparate phenotypes including psychiatric disorders, several autoimmune disorders, longevity, and maternal age at first birth. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Cellular Factors Shape 3D Genome Landscape
Researchers, using novel large-scale imaging technology, have mapped the spatial location of individual genes in the nucleus of human cells and identified 50 cellular factors required for the proper 3D positioning of genes. These spatial locations play important roles in gene expression, DNA repair, genome stability, and other cellular activities.
Piersma, Sjouke; Denham, Emma L; Drulhe, Samuel; Tonk, Rudi H J; Schwikowski, Benno; van Dijl, Jan Maarten
2013-01-01
Gene expression heterogeneity is a key driver for microbial adaptation to fluctuating environmental conditions, cell differentiation and the evolution of species. This phenomenon has therefore enormous implications, not only for life in general, but also for biotechnological applications where unwanted subpopulations of non-producing cells can emerge in large-scale fermentations. Only time-lapse fluorescence microscopy allows real-time measurements of gene expression heterogeneity. A major limitation in the analysis of time-lapse microscopy data is the lack of fast, cost-effective, open, simple and adaptable protocols. Here we describe TLM-Quant, a semi-automatic pipeline for the analysis of time-lapse fluorescence microscopy data that enables the user to visualize and quantify gene expression heterogeneity. Importantly, our pipeline builds on the open-source packages ImageJ and R. To validate TLM-Quant, we selected three possible scenarios, namely homogeneous expression, highly 'noisy' heterogeneous expression, and bistable heterogeneous expression in the Gram-positive bacterium Bacillus subtilis. This bacterium is both a paradigm for systems-level studies on gene expression and a highly appreciated biotechnological 'cell factory'. We conclude that the temporal resolution of such analyses with TLM-Quant is only limited by the numbers of recorded images.
Approximate geodesic distances reveal biologically relevant structures in microarray data.
Nilsson, Jens; Fioretos, Thoas; Höglund, Mattias; Fontes, Magnus
2004-04-12
Genome-wide gene expression measurements, as currently determined by the microarray technology, can be represented mathematically as points in a high-dimensional gene expression space. Genes interact with each other in regulatory networks, restricting the cellular gene expression profiles to a certain manifold, or surface, in gene expression space. To obtain knowledge about this manifold, various dimensionality reduction methods and distance metrics are used. For data points distributed on curved manifolds, a sensible distance measure would be the geodesic distance along the manifold. In this work, we examine whether an approximate geodesic distance measure captures biological similarities better than the traditionally used Euclidean distance. We computed approximate geodesic distances, determined by the Isomap algorithm, for one set of lymphoma and one set of lung cancer microarray samples. Compared with the ordinary Euclidean distance metric, this distance measure produced more instructive, biologically relevant, visualizations when applying multidimensional scaling. This suggests the Isomap algorithm as a promising tool for the interpretation of microarray data. Furthermore, the results demonstrate the benefit and importance of taking nonlinearities in gene expression data into account.
Wang, Yupeng; Ficklin, Stephen P; Wang, Xiyin; Feltus, F Alex; Paterson, Andrew H
2016-01-01
Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.
Wang, Yupeng; Ficklin, Stephen P.; Wang, Xiyin; Feltus, F. Alex; Paterson, Andrew H.
2016-01-01
Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots. PMID:27195960
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Whittle, C. A.; Sun, Y.; Johannesson, H.
2011-01-01
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Bedoukian, Matthew A.; Rodriguez, Sarah M.; Cohen, Matthew B.; Duncan Smith, Stuart V.; Park, Jennifer
2009-01-01
Gene expression in Drosophila melanogaster changes significantly throughout life and some of these changes can be delayed by lowering ambient temperature and also by dietary restriction. These two interventions are known to slow the rate of aging as well as the accumulation of damage. It is unknown, however, whether gene expression changes that occur during development and early adult life make an animal more vulnerable to death. Here we develop a method capable of measuring the rate of programmed genetic changes during young adult life in Drosophila melanogaster and show that these changes can be delayed or accelerated in a manner that is predictive of longevity. We show that temperature shifts and dietary restriction, which slow the rate of aging in Drosophila melanogaster, extend the window of neuronal susceptibility to GRIM over-expression in a way that scales to lifespan. We propose that this susceptibility can be used to test compounds and genetic manipulations that alter the onset of senescence by changing the programmed timing of gene expression that correlates and may be causal to aging. PMID:19428445
The Plant Genome Integrative Explorer Resource: PlantGenIE.org.
Sundell, David; Mannapperuma, Chanaka; Netotea, Sergiu; Delhomme, Nicolas; Lin, Yao-Cheng; Sjödin, Andreas; Van de Peer, Yves; Jansson, Stefan; Hvidsten, Torgeir R; Street, Nathaniel R
2015-12-01
Accessing and exploring large-scale genomics data sets remains a significant challenge to researchers without specialist bioinformatics training. We present the integrated PlantGenIE.org platform for exploration of Populus, conifer and Arabidopsis genomics data, which includes expression networks and associated visualization tools. Standard features of a model organism database are provided, including genome browsers, gene list annotation, Blast homology searches and gene information pages. Community annotation updating is supported via integration of WebApollo. We have produced an RNA-sequencing (RNA-Seq) expression atlas for Populus tremula and have integrated these data within the expression tools. An updated version of the ComPlEx resource for performing comparative plant expression analyses of gene coexpression network conservation between species has also been integrated. The PlantGenIE.org platform provides intuitive access to large-scale and genome-wide genomics data from model forest tree species, facilitating both community contributions to annotation improvement and tools supporting use of the included data resources to inform biological insight. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Gurunathan, Rajalakshmi; Van Emden, Bernard; Panchanathan, Sethuraman; Kumar, Sudhir
2004-01-01
Background Modern developmental biology relies heavily on the analysis of embryonic gene expression patterns. Investigators manually inspect hundreds or thousands of expression patterns to identify those that are spatially similar and to ultimately infer potential gene interactions. However, the rapid accumulation of gene expression pattern data over the last two decades, facilitated by high-throughput techniques, has produced a need for the development of efficient approaches for direct comparison of images, rather than their textual descriptions, to identify spatially similar expression patterns. Results The effectiveness of the Binary Feature Vector (BFV) and Invariant Moment Vector (IMV) based digital representations of the gene expression patterns in finding biologically meaningful patterns was compared for a small (226 images) and a large (1819 images) dataset. For each dataset, an ordered list of images, with respect to a query image, was generated to identify overlapping and similar gene expression patterns, in a manner comparable to what a developmental biologist might do. The results showed that the BFV representation consistently outperforms the IMV representation in finding biologically meaningful matches when spatial overlap of the gene expression pattern and the genes involved are considered. Furthermore, we explored the value of conducting image-content based searches in a dataset where individual expression components (or domains) of multi-domain expression patterns were also included separately. We found that this technique improves performance of both IMV and BFV based searches. Conclusions We conclude that the BFV representation consistently produces a more extensive and better list of biologically useful patterns than the IMV representation. The high quality of results obtained scales well as the search database becomes larger, which encourages efforts to build automated image query and retrieval systems for spatial gene expression patterns. PMID:15603586
González-González, Andrea; Hug, Shaun M; Rodríguez-Verdugo, Alejandra; Patel, Jagdish Suresh; Gaut, Brandon S
2017-11-01
Modifications to transcriptional regulators play a major role in adaptation. Here, we compared the effects of multiple beneficial mutations within and between Escherichia coli rpoB, the gene encoding the RNA polymerase β subunit, and rho, which encodes a transcriptional terminator. These two genes have harbored adaptive mutations in numerous E. coli evolution experiments but particularly in our previous large-scale thermal stress experiment, where the two genes characterized alternative adaptive pathways. To compare the effects of beneficial mutations, we engineered four advantageous mutations into each of the two genes and measured their effects on fitness, growth, gene expression and transcriptional termination at 42.2 °C. Among the eight mutations, two rho mutations had no detectable effect on relative fitness, suggesting they were beneficial only in the context of epistatic interactions. The remaining six mutations had an average relative fitness benefit of ∼20%. The rpoB mutations affected the expression of ∼1,700 genes; rho mutations affected the expression of fewer genes but most (83%) were a subset of those altered by rpoB mutants. Across the eight mutants, relative fitness correlated with the degree to which a mutation restored gene expression back to the unstressed, 37.0 °C state. The beneficial mutations in the two genes did not have identical effects on fitness, growth or gene expression, but they caused parallel phenotypic effects on gene expression and genome-wide transcriptional termination. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Kameshwar, Ayyappa Kumar Sista; Qin, Wensheng
2017-01-01
In literature, extensive studies have been conducted on popular wood degrading white rot fungus, Phanerochaete chrysosporium about its lignin degrading mechanisms compared to the cellulose and hemicellulose degrading abilities. This study delineates cellulose and hemicellulose degrading mechanisms through large scale metadata analysis of P. chrysosporium gene expression data (retrieved from NCBI GEO) to understand the common expression patterns of differentially expressed genes when cultured on different growth substrates. Genes encoding glycoside hydrolase classes commonly expressed during breakdown of cellulose such as GH-5,6,7,9,44,45,48 and hemicellulose are GH-2,8,10,11,26,30,43,47 were found to be highly expressed among varied growth conditions including simple customized and complex natural plant biomass growth mediums. Genes encoding carbohydrate esterase class enzymes CE (1,4,8,9,15,16) polysaccharide lyase class enzymes PL-8 and PL-14, and glycosyl transferases classes GT (1,2,4,8,15,20,35,39,48) were differentially expressed in natural plant biomass growth mediums. Based on these results, P. chrysosporium, on natural plant biomass substrates was found to express lignin and hemicellulose degrading enzymes more than cellulolytic enzymes except GH-61 (LPMO) class enzymes, in early stages. It was observed that the fate of P. chrysosporium transcriptome is significantly affected by the wood substrate provided. We believe, the gene expression findings in this study plays crucial role in developing genetically efficient microbe with effective cellulose and hemicellulose degradation abilities.
Zhang, Qingyang
2018-05-16
Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.
Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob; Gilchrist, Michael J; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J; Havgaard, Jakob H; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans-Henrik; Wernersson, Rasmus; Madsen, Lone B; Thomsen, Bo; Hornshøj, Henrik; Bujie, Zhan; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete
2007-01-01
Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. PMID:17407547
Construction of regulatory networks using expression time-series data of a genotyped population.
Yeung, Ka Yee; Dombek, Kenneth M; Lo, Kenneth; Mittler, John E; Zhu, Jun; Schadt, Eric E; Bumgarner, Roger E; Raftery, Adrian E
2011-11-29
The inference of regulatory and biochemical networks from large-scale genomics data is a basic problem in molecular biology. The goal is to generate testable hypotheses of gene-to-gene influences and subsequently to design bench experiments to confirm these network predictions. Coexpression of genes in large-scale gene-expression data implies coregulation and potential gene-gene interactions, but provide little information about the direction of influences. Here, we use both time-series data and genetics data to infer directionality of edges in regulatory networks: time-series data contain information about the chronological order of regulatory events and genetics data allow us to map DNA variations to variations at the RNA level. We generate microarray data measuring time-dependent gene-expression levels in 95 genotyped yeast segregants subjected to a drug perturbation. We develop a Bayesian model averaging regression algorithm that incorporates external information from diverse data types to infer regulatory networks from the time-series and genetics data. Our algorithm is capable of generating feedback loops. We show that our inferred network recovers existing and novel regulatory relationships. Following network construction, we generate independent microarray data on selected deletion mutants to prospectively test network predictions. We demonstrate the potential of our network to discover de novo transcription-factor binding sites. Applying our construction method to previously published data demonstrates that our method is competitive with leading network construction algorithms in the literature.
Greenwold, Matthew J; Sawyer, Roger H
2013-09-01
The archosauria consist of two living groups, crocodilians, and birds. Here we compare the structure, expression, and phylogeny of the beta (β)-keratins in two crocodilian genomes and two avian genomes to gain a better understanding of the evolutionary origin of the feather β-keratins. Unlike squamates such as the green anole with 40 β-keratins in its genome, the chicken and zebra finch genomes have over 100 β-keratin genes in their genomes, while the American alligator has 20 β-keratin genes, and the saltwater crocodile has 21 β-keratin genes. The crocodilian β-keratins are similar to those of birds and these structural proteins have a central filament domain and N- and C-termini, which contribute to the matrix material between the twisted β-sheets, which form the 2-3 nm filament. Overall the expression of alligator β-keratin genes in the integument increases during development. Phylogenetic analysis demonstrates that a crocodilian β-keratin clade forms a monophyletic group with the avian scale and feather β-keratins, suggesting that avian scale and feather β-keratins along with a subset of crocodilian β-keratins evolved from a common ancestral gene/s. Overall, our analyses support the view that the epidermal appendages of basal archosaurs used a diverse array of β-keratins, which evolved into crocodilian and avian specific clades. In birds, the scale and feather subfamilies appear to have evolved independently in the avian lineage from a subset of archosaurian claw β-keratins. The expansion of the avian specific feather β-keratin genes accompanied the diversification of birds and the evolution of feathers. Copyright © 2013 Wiley Periodicals, Inc.
Researchers use Modified CRISPR Systems to Modulate Gene Expression on a Genomic Scale
Cancer Target Discovery and Development Network (CTD2) researchers at the University of California, San Francisco, developed a CRISPR system that can regulate both gene repression and activation with fewer off-target effects.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Friddle, Carl J; Koga, Teiichiro; Rubin, Edward M.
2000-03-15
While cardiac hypertrophy has been the subject of intensive investigation, regression of hypertrophy has been significantly less studied, precluding large-scale analysis of the relationship between these processes. In the present study, using pharmacological models of hypertrophy in mice, expression profiling was performed with fragments of more than 3,000 genes to characterize and contrast expression changes during induction and regression of hypertrophy. Administration of angiotensin II and isoproterenol by osmotic minipump produced increases in heart weight (15% and 40% respectively) that returned to pre-induction size following drug withdrawal. From multiple expression analyses of left ventricular RNA isolated at daily time-points duringmore » cardiac hypertrophy and regression, we identified sets of genes whose expression was altered at specific stages of this process. While confirming the participation of 25 genes or pathways previously known to be altered by hypertrophy, a larger set of 30 genes was identified whose expression had not previously been associated with cardiac hypertrophy or regression. Of the 55 genes that showed reproducible changes during the time course of induction and regression, 32 genes were altered only during induction and 8 were altered only during regression. This study identified both known and novel genes whose expression is affected at different stages of cardiac hypertrophy and regression and demonstrates that cardiac remodeling during regression utilizes a set of genes that are distinct from those used during induction of hypertrophy.« less
Marbach, Daniel; Roy, Sushmita; Ay, Ferhat; Meyer, Patrick E.; Candeias, Rogerio; Kahveci, Tamer; Bristow, Christopher A.; Kellis, Manolis
2012-01-01
Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning framework and use both supervised and unsupervised methods to predict regulatory edges by integrating transcription factor (TF) binding, evolutionarily conserved sequence motifs, gene expression, and chromatin modification data sets as input features. Applying these methods to Drosophila melanogaster, we predict ∼300,000 regulatory edges in a network of ∼600 TFs and 12,000 target genes. We validate our predictions using known regulatory interactions, gene functional annotations, tissue-specific expression, protein–protein interactions, and three-dimensional maps of chromosome conformation. We use the inferred network to identify putative functions for hundreds of previously uncharacterized genes, including many in nervous system development, which are independently confirmed based on their tissue-specific expression patterns. Last, we use the regulatory network to predict target gene expression levels as a function of TF expression, and find significantly higher predictive power for integrative networks than for motif or ChIP-based networks. Our work reveals the complementarity between physical evidence of regulatory interactions (TF binding, motif conservation) and functional evidence (coordinated expression or chromatin patterns) and demonstrates the power of data integration for network inference and studies of gene regulation at the systems level. PMID:22456606
Modulation of gene expression in heart and liver of hibernating black bears (Ursus americanus)
2011-01-01
Background Hibernation is an adaptive strategy to survive in highly seasonal or unpredictable environments. The molecular and genetic basis of hibernation physiology in mammals has only recently been studied using large scale genomic approaches. We analyzed gene expression in the American black bear, Ursus americanus, using a custom 12,800 cDNA probe microarray to detect differences in expression that occur in heart and liver during winter hibernation in comparison to summer active animals. Results We identified 245 genes in heart and 319 genes in liver that were differentially expressed between winter and summer. The expression of 24 genes was significantly elevated during hibernation in both heart and liver. These genes are mostly involved in lipid catabolism and protein biosynthesis and include RNA binding protein motif 3 (Rbm3), which enhances protein synthesis at mildly hypothermic temperatures. Elevated expression of protein biosynthesis genes suggests induction of translation that may be related to adaptive mechanisms reducing cardiac and muscle atrophies over extended periods of low metabolism and immobility during hibernation in bears. Coordinated reduction of transcription of genes involved in amino acid catabolism suggests redirection of amino acids from catabolic pathways to protein biosynthesis. We identify common for black bears and small mammalian hibernators transcriptional changes in the liver that include induction of genes responsible for fatty acid β oxidation and carbohydrate synthesis and depression of genes involved in lipid biosynthesis, carbohydrate catabolism, cellular respiration and detoxification pathways. Conclusions Our findings show that modulation of gene expression during winter hibernation represents molecular mechanism of adaptation to extreme environments. PMID:21453527
Comparative modular analysis of gene expression in vertebrate organs.
Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc
2012-03-29
The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
Principles of gene microarray data analysis.
Mocellin, Simone; Rossi, Carlo Riccardo
2007-01-01
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
2010-01-01
Background Cytochrome P450 monooxygenases (P450s) catalyze oxidation of various substrates using oxygen and NAD(P)H. Plant P450s are involved in the biosynthesis of primary and secondary metabolites performing diverse biological functions. The recent availability of the soybean genome sequence allows us to identify and analyze soybean putative P450s at a genome scale. Co-expression analysis using an available soybean microarray and Illumina sequencing data provides clues for functional annotation of these enzymes. This approach is based on the assumption that genes that have similar expression patterns across a set of conditions may have a functional relationship. Results We have identified a total number of 332 full-length P450 genes and 378 pseudogenes from the soybean genome. From the full-length sequences, 195 genes belong to A-type, which could be further divided into 20 families. The remaining 137 genes belong to non-A type P450s and are classified into 28 families. A total of 178 probe sets were found to correspond to P450 genes on the Affymetrix soybean array. Out of these probe sets, 108 represented single genes. Using the 28 publicly available microarray libraries that contain organ-specific information, some tissue-specific P450s were identified. Similarly, stress responsive soybean P450s were retrieved from 99 microarray soybean libraries. We also utilized Illumina transcriptome sequencing technology to analyze the expressions of all 332 soybean P450 genes. This dataset contains total RNAs isolated from nodules, roots, root tips, leaves, flowers, green pods, apical meristem, mock-inoculated and Bradyrhizobium japonicum-infected root hair cells. The tissue-specific expression patterns of these P450 genes were analyzed and the expression of a representative set of genes were confirmed by qRT-PCR. We performed the co-expression analysis on many of the 108 P450 genes on the Affymetrix arrays. First we confirmed that CYP93C5 (an isoflavone synthase gene) is co-expressed with several genes encoding isoflavonoid-related metabolic enzymes. We then focused on nodulation-induced P450s and found that CYP728H1 was co-expressed with the genes involved in phenylpropanoid metabolism. Similarly, CYP736A34 was highly co-expressed with lipoxygenase, lectin and CYP83D1, all of which are involved in root and nodule development. Conclusions The genome scale analysis of P450s in soybean reveals many unique features of these important enzymes in this crop although the functions of most of them are largely unknown. Gene co-expression analysis proves to be a useful tool to infer the function of uncharacterized genes. Our work presented here could provide important leads toward functional genomics studies of soybean P450s and their regulatory network through the integration of reverse genetics, biochemistry, and metabolic profiling tools. The identification of nodule-specific P450s and their further exploitation may help us to better understand the intriguing process of soybean and rhizobium interaction. PMID:21062474
Diversification of Root Hair Development Genes in Vascular Plants.
Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John
2017-07-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis ( Arabidopsis thaliana ). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Diversification of Root Hair Development Genes in Vascular Plants1[OPEN
Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui
2017-01-01
The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. PMID:28487476
Ulianov, Sergey V; Galitsyna, Aleksandra A; Flyamer, Ilya M; Golov, Arkadiy K; Khrameeva, Ekaterina E; Imakaev, Maxim V; Abdennur, Nezar A; Gelfand, Mikhail S; Gavrilov, Alexey A; Razin, Sergey V
2017-07-11
In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. We found that, similarly to mammalian genome, the chicken genomes is organized in TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation. Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing topologically associating domain.
Inferring causal genomic alterations in breast cancer using gene expression data
2011-01-01
Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811
Miyamoto, Tadashi; Furusawa, Chikara; Kaneko, Kunihiko
2015-01-01
Embryonic stem cells exhibit pluripotency: they can differentiate into all types of somatic cells. Pluripotent genes such as Oct4 and Nanog are activated in the pluripotent state, and their expression decreases during cell differentiation. Inversely, expression of differentiation genes such as Gata6 and Gata4 is promoted during differentiation. The gene regulatory network controlling the expression of these genes has been described, and slower-scale epigenetic modifications have been uncovered. Although the differentiation of pluripotent stem cells is normally irreversible, reprogramming of cells can be experimentally manipulated to regain pluripotency via overexpression of certain genes. Despite these experimental advances, the dynamics and mechanisms of differentiation and reprogramming are not yet fully understood. Based on recent experimental findings, we constructed a simple gene regulatory network including pluripotent and differentiation genes, and we demonstrated the existence of pluripotent and differentiated states from the resultant dynamical-systems model. Two differentiation mechanisms, interaction-induced switching from an expression oscillatory state and noise-assisted transition between bistable stationary states, were tested in the model. The former was found to be relevant to the differentiation process. We also introduced variables representing epigenetic modifications, which controlled the threshold for gene expression. By assuming positive feedback between expression levels and the epigenetic variables, we observed differentiation in expression dynamics. Additionally, with numerical reprogramming experiments for differentiated cells, we showed that pluripotency was recovered in cells by imposing overexpression of two pluripotent genes and external factors to control expression of differentiation genes. Interestingly, these factors were consistent with the four Yamanaka factors, Oct4, Sox2, Klf4, and Myc, which were necessary for the establishment of induced pluripotent stem cells. These results, based on a gene regulatory network and expression dynamics, contribute to our wider understanding of pluripotency, differentiation, and reprogramming of cells, and they provide a fresh viewpoint on robustness and control during development. PMID:26308610
Lai, Yinglei; Zhang, Fanni; Nayak, Tapan K; Modarres, Reza; Lee, Norman H; McCaffrey, Timothy A
2014-01-01
Gene set enrichment analysis (GSEA) is an important approach to the analysis of coordinate expression changes at a pathway level. Although many statistical and computational methods have been proposed for GSEA, the issue of a concordant integrative GSEA of multiple expression data sets has not been well addressed. Among different related data sets collected for the same or similar study purposes, it is important to identify pathways or gene sets with concordant enrichment. We categorize the underlying true states of differential expression into three representative categories: no change, positive change and negative change. Due to data noise, what we observe from experiments may not indicate the underlying truth. Although these categories are not observed in practice, they can be considered in a mixture model framework. Then, we define the mathematical concept of concordant gene set enrichment and calculate its related probability based on a three-component multivariate normal mixture model. The related false discovery rate can be calculated and used to rank different gene sets. We used three published lung cancer microarray gene expression data sets to illustrate our proposed method. One analysis based on the first two data sets was conducted to compare our result with a previous published result based on a GSEA conducted separately for each individual data set. This comparison illustrates the advantage of our proposed concordant integrative gene set enrichment analysis. Then, with a relatively new and larger pathway collection, we used our method to conduct an integrative analysis of the first two data sets and also all three data sets. Both results showed that many gene sets could be identified with low false discovery rates. A consistency between both results was also observed. A further exploration based on the KEGG cancer pathway collection showed that a majority of these pathways could be identified by our proposed method. This study illustrates that we can improve detection power and discovery consistency through a concordant integrative analysis of multiple large-scale two-sample gene expression data sets.
Sheng, Jian-Hua; Ng, Tze-Pin; Li, Chun-Bo; Lu, Guang-Hua; He, Wei; Qian, Yi-Ping; Wang, Jing-Hua; Yu, Shun-Ying
2012-12-01
To explore the peripheral leucocytic messenger RNA (mRNA) expression of glycogen synthase kinase-3β (GSK-3β) gene in Alzheimer's disease (AD) patients. Using TaqMan relative quantitative real-time polymerase chain reaction, we analyzed leucocytic gene expression of GSK-3β in 48 AD patients and 49 healthy controls. Clinical data of AD patients were also collected. The mRNA expression level of the GSK-3β gene was significantly higher in the AD group (3.13±0.62) than in the normal group (2.77±0.77). Correlational analyses showed that the mRNA expression level of GSK-3β gene in AD patients was associated with the age of onset (P=0.047), age (P=0.055), and Behavioral Pathology in Alzheimer's Disease Rating Scale total score (P=0.062) and subscores: aggressiveness score (P=0.073) and anxieties and phobias score (P=0.067). Through multivariate regression model, older age, higher anxieties and phobias score and aggressiveness score were associated with higher mRNA expression level of GSK-3β gene. In AD patients, the mRNA expression level of the GSK-3β gene is increased and may be related to age and behavioural pathology in AD. © 2012 The Authors. Psychogeriatrics © 2012 Japanese Psychogeriatric Society.
The evolution of duplicate gene expression in mammalian organs
Guschanski, Katerina; Warnefors, Maria; Kaessmann, Henrik
2017-01-01
Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates. PMID:28743766
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family.
Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping
2014-04-01
WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family
Guo, Chunlei; Guo, Rongrong; Wang, Xiping
2014-01-01
WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I–III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments. PMID:24510937
Function and Evolution of DNA Methylation in Nasonia vitripennis
Wang, Xu; Wheeler, David; Avery, Amanda; Rago, Alfredo; Choi, Jeong-Hyeon; Colbourne, John K.; Clark, Andrew G.; Werren, John H.
2013-01-01
The parasitoid wasp Nasonia vitripennis is an emerging genetic model for functional analysis of DNA methylation. Here, we characterize genome-wide methylation at a base-pair resolution, and compare these results to gene expression across five developmental stages and to methylation patterns reported in other insects. An accurate assessment of DNA methylation across the genome is accomplished using bisulfite sequencing of adult females from a highly inbred line. One-third of genes show extensive methylation over the gene body, yet methylated DNA is not found in non-coding regions and rarely in transposons. Methylated genes occur in small clusters across the genome. Methylation demarcates exon-intron boundaries, with elevated levels over exons, primarily in the 5′ regions of genes. It is also elevated near the sites of translational initiation and termination, with reduced levels in 5′ and 3′ UTRs. Methylated genes have higher median expression levels and lower expression variation across development stages than non-methylated genes. There is no difference in frequency of differential splicing between methylated and non-methylated genes, and as yet no established role for methylation in regulating alternative splicing in Nasonia. Phylogenetic comparisons indicate that many genes maintain methylation status across long evolutionary time scales. Nasonia methylated genes are more likely to be conserved in insects, but even those that are not conserved show broader expression across development than comparable non-methylated genes. Finally, examination of duplicated genes shows that those paralogs that have lost methylation in the Nasonia lineage following gene duplication evolve more rapidly, show decreased median expression levels, and increased specialization in expression across development. Methylation of Nasonia genes signals constitutive transcription across developmental stages, whereas non-methylated genes show more dynamic developmental expression patterns. We speculate that loss of methylation may result in increased developmental specialization in evolution and acquisition of methylation may lead to broader constitutive expression. PMID:24130511
Lee, Ann-Ying; Chen, Chun-Yi; Chang, Yao-Chien Alex; Chao, Ya-Ting; Shih, Ming-Che
2013-01-01
Previously we developed genomic resources for orchids, including transcriptomic analyses using next-generation sequencing techniques and construction of a web-based orchid genomic database. Here, we report a modified molecular model of flower development in the Orchidaceae based on functional analysis of gene expression profiles in Phalaenopsis aphrodite (a moth orchid) that revealed novel roles for the transcription factors involved in floral organ pattern formation. Phalaenopsis orchid floral organ-specific genes were identified by microarray analysis. Several critical transcription factors including AP3, PI, AP1 and AGL6, displayed distinct spatial distribution patterns. Phylogenetic analysis of orchid MADS box genes was conducted to infer the evolutionary relationship among floral organ-specific genes. The results suggest that gene duplication MADS box genes in orchid may have resulted in their gaining novel functions during evolution. Based on these analyses, a modified model of orchid flowering was proposed. Comparison of the expression profiles of flowers of a peloric mutant and wild-type Phalaenopsis orchid further identified genes associated with lip morphology and peloric effects. Large scale investigation of gene expression profiles revealed that homeotic genes from the ABCDE model of flower development classes A and B in the Phalaenopsis orchid have novel functions due to evolutionary diversification, and display differential expression patterns. PMID:24265826
Sex-biased transcriptome divergence along a latitudinal gradient.
Allen, Scott L; Bonduriansky, Russell; Sgro, Carla M; Chenoweth, Stephen F
2017-03-01
Sex-dependent gene expression is likely an important genomic mechanism that allows sex-specific adaptation to environmental changes. Among Drosophila species, sex-biased genes display remarkably consistent evolutionary patterns; male-biased genes evolve faster than unbiased genes in both coding sequence and expression level, suggesting sex differences in selection through time. However, comparatively little is known of the evolutionary process shaping sex-biased expression within species. Latitudinal clines offer an opportunity to examine how changes in key ecological parameters also influence sex-specific selection and the evolution of sex-biased gene expression. We assayed male and female gene expression in Drosophila serrata along a latitudinal gradient in eastern Australia spanning most of its endemic distribution. Analysis of 11 631 genes across eight populations revealed strong sex differences in the frequency, mode and strength of divergence. Divergence was far stronger in males than females and while latitudinal clines were evident in both sexes, male divergence was often population specific, suggesting responses to localized selection pressures that do not covary predictably with latitude. While divergence was enriched for male-biased genes, there was no overrepresentation of X-linked genes in males. By contrast, X-linked divergence was elevated in females, especially for female-biased genes. Many genes that diverged in D. serrata have homologs also showing latitudinal divergence in Drosophila simulans and Drosophila melanogaster on other continents, likely indicating parallel adaptation in these distantly related species. Our results suggest that sex differences in selection play an important role in shaping the evolution of gene expression over macro- and micro-ecological spatial scales. © 2017 John Wiley & Sons Ltd.
[Isolation and function of genes regulating aphB expression in Vibrio cholerae].
Chen, Haili; Zhu, Zhaoqin; Zhong, Zengtao; Zhu, Jun; Kan, Biao
2012-02-04
We identified genes that regulate the expression of aphB, the gene encoding a key virulence regulator in Vibrio cholerae O1 E1 Tor C6706(-). We constructed a transposon library in V. cholerae C6706 strain containing a P(aphB)-luxCDABE and P(aphB)-lacZ transcriptional reporter plasmids. Using a chemiluminescence imager system, we rapidly detected aphB promoter expression level at a large scale. We then sequenced the transposon insertion sites by arbitrary PCR and sequencing analysis. We obtained two candidate mutants T1 and T2 which displayed reduced aphB expression from approximately 40,000 transposon insertion mutants. Sequencing analysis shows that Tn inserted in vc1585 reading frame in the T1 mutant and Tn inserted in the end of coding sequence of vc1602 in the T2 mutant. By using a genetic screen, we identified two potential genes that may involve in regulation of the expression of the key virulence regulator AphB. This study sheds light on our further investigation to fully understand V. cholerae virulence gene regulatory cascades.
MicroRNAs shape circadian hepatic gene expression on a transcriptome-wide scale
Du, Ngoc-Hien; Arpat, Alaaddin Bulak; De Matos, Mara; Gatfield, David
2014-01-01
A considerable proportion of mammalian gene expression undergoes circadian oscillations. Post-transcriptional mechanisms likely make important contributions to mRNA abundance rhythms. We have investigated how microRNAs (miRNAs) contribute to core clock and clock-controlled gene expression using mice in which miRNA biogenesis can be inactivated in the liver. While the hepatic core clock was surprisingly resilient to miRNA loss, whole transcriptome sequencing uncovered widespread effects on clock output gene expression. Cyclic transcription paired with miRNA-mediated regulation was thus identified as a frequent phenomenon that affected up to 30% of the rhythmic transcriptome and served to post-transcriptionally adjust the phases and amplitudes of rhythmic mRNA accumulation. However, only few mRNA rhythms were actually generated by miRNAs. Overall, our study suggests that miRNAs function to adapt clock-driven gene expression to tissue-specific requirements. Finally, we pinpoint several miRNAs predicted to act as modulators of rhythmic transcripts, and identify rhythmic pathways particularly prone to miRNA regulation. DOI: http://dx.doi.org/10.7554/eLife.02510.001 PMID:24867642
Piersma, Sjouke; Denham, Emma L.; Drulhe, Samuel; Tonk, Rudi H. J.; Schwikowski, Benno; van Dijl, Jan Maarten
2013-01-01
Gene expression heterogeneity is a key driver for microbial adaptation to fluctuating environmental conditions, cell differentiation and the evolution of species. This phenomenon has therefore enormous implications, not only for life in general, but also for biotechnological applications where unwanted subpopulations of non-producing cells can emerge in large-scale fermentations. Only time-lapse fluorescence microscopy allows real-time measurements of gene expression heterogeneity. A major limitation in the analysis of time-lapse microscopy data is the lack of fast, cost-effective, open, simple and adaptable protocols. Here we describe TLM-Quant, a semi-automatic pipeline for the analysis of time-lapse fluorescence microscopy data that enables the user to visualize and quantify gene expression heterogeneity. Importantly, our pipeline builds on the open-source packages ImageJ and R. To validate TLM-Quant, we selected three possible scenarios, namely homogeneous expression, highly ‘noisy’ heterogeneous expression, and bistable heterogeneous expression in the Gram-positive bacterium Bacillus subtilis. This bacterium is both a paradigm for systems-level studies on gene expression and a highly appreciated biotechnological ‘cell factory’. We conclude that the temporal resolution of such analyses with TLM-Quant is only limited by the numbers of recorded images. PMID:23874729
Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria
2003-01-01
Background Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Results Expression of sex-chromosome genes were used as true internal biological controls to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. Conclusion In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes. Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects. PMID:12962547
Galfalvy, Hanga C; Erraji-Benchekroun, Loubna; Smyrniotopoulos, Peggy; Pavlidis, Paul; Ellis, Steven P; Mann, J John; Sibille, Etienne; Arango, Victoria
2003-09-08
Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Expression of sex-chromosome genes were used as true internal biological controls to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA]), to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes. Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex, supporting the notion of a putative direct role of sex-chromosome genes in differentiation and maintenance of sexual dimorphism of the central nervous system. Importantly, these analytical approaches are applicable to all microarray studies that include male and female human or animal subjects.
Canonical Genetic Signatures of the Adult Human Brain
Hawrylycz, Michael; Miller, Jeremy A.; Menon, Vilas; Feng, David; Dolbeare, Tim; Guillozet-Bongaarts, Angela L.; Jegga, Anil G.; Aronow, Bruce J.; Lee, Chang-Kyu; Bernard, Amy; Glasser, Matthew F.; Dierker, Donna L.; Menche, Jörge; Szafer, Aaron; Collman, Forrest; Grange, Pascal; Berman, Kenneth A.; Mihalas, Stefan; Yao, Zizhen; Stewart, Lance; Barabási, Albert-László; Schulkin, Jay; Phillips, John; Ng, Lydia; Dang, Chinh; Haynor, David R.; Jones, Allan; Van Essen, David C.; Koch, Christof; Lein, Ed
2015-01-01
The structure and function of the human brain are highly stereotyped, implying a conserved molecular program responsible for its development, cellular structure, and function. We applied a correlation-based metric of “differential stability” (DS) to assess reproducibility of gene expression patterning across 132 structures in six individual brains, revealing meso-scale genetic organization. The highest DS genes are highly biologically relevant, with enrichment for brain-related biological annotations, disease associations, drug targets, and literature citations. Using high DS genes we identified 32 anatomically diverse and reproducible gene expression signatures, which represent distinct cell types, intracellular components, and/or associations with neurodevelopmental and neurodegenerative disorders. Genes in neuron-associated compared to non-neuronal networks showed higher preservation between human and mouse; however, many diversely-patterned genes displayed dramatic shifts in regulation between species. Finally, highly consistent transcriptional architecture in neocortex is correlated with resting state functional connectivity, suggesting a link between conserved gene expression and functionally relevant circuitry. PMID:26571460
Zhou, Qingyuan; Jia, Junting; Huang, Xing; Yan, Xueqing; Cheng, Liqin; Chen, Shuangyan; Li, Xiaoxia; Peng, Xianjun; Liu, Gongshe
2014-05-26
Many Poaceae species show a gametophytic self-incompatibility (GSI) system, which is controlled by at least two independent and multiallelic loci, S and Z. Until currently, the gene products for S and Z were unknown. Grass SI plant stigmas discriminate between pollen grains that land on its surface and support compatible pollen tube growth and penetration into the stigma, whereas recognizing incompatible pollen and thus inhibiting pollination behaviors. Leymus chinensis (Trin.) Tzvel. (sheepgrass) is a Poaceae SI species. A comprehensive analysis of sheepgrass stigma transcriptome may provide valuable information for understanding the mechanism of pollen-stigma interactions and grass SI. The transcript abundance profiles of mature stigmas, mature ovaries and leaves were examined using high-throughput next generation sequencing technology. A comparative transcriptomic analysis of these tissues identified 1,025 specifically or preferentially expressed genes in sheepgrass stigmas. These genes contained a significant proportion of genes predicted to function in cell-cell communication and signal transduction. We identified 111 putative transcription factors (TFs) genes and the most abundant groups were MYB, C2H2, C3H, FAR1, MADS. Comparative analysis of the sheepgrass, rice and Arabidopsis stigma-specific or preferential datasets showed broad similarities and some differences in the proportion of genes in the Gene Ontology (GO) functional categories. Potential SI candidate genes identified in other grasses were also detected in the sheepgrass stigma-specific or preferential dataset. Quantitative real-time PCR experiments validated the expression pattern of stigma preferential genes including homologous grass SI candidate genes. This study represents the first large-scale investigation of gene expression in the stigmas of an SI grass species. We uncovered many notable genes that are potentially involved in pollen-stigma interactions and SI mechanisms, including genes encoding receptor-like protein kinases (RLK), CBL (calcineurin B-like proteins) interacting protein kinases, calcium-dependent protein kinase, expansins, pectinesterase, peroxidases and various transcription factors. The availability of a pool of stigma-specific or preferential genes for L. chinensis offers an opportunity to elucidate the mechanisms of SI in Poaceae.
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.
Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J
2018-06-27
The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Ding, Zhong-Tao; Zhang, Zhi; Luo, Di; Zhou, Jin-Yan; Zhong, Juan; Yang, Jie; Xiao, Liang; Shu, Dan; Tan, Hong
2015-01-01
The phytopathogenic ascomycete Botrytis cinerea produces several secondary metabolites that have biotechnical significance and has been particularly used for S-(+)-abscisic acid production at the industrial scale. To manipulate the expression levels of specific secondary metabolite biosynthetic genes of B. cinerea with Agrobacterium tumefaciens-mediated transformation system, two expression vectors (pCBh1 and pCBg1 with different selection markers) and one RNA silencing vector, pCBSilent1, were developed with the In-Fusion assembly method. Both expression vectors were highly effective in constitutively expressing eGFP, and pCBSilent1 effectively silenced the eGFP gene in B. cinerea. Bcaba4, a gene suggested to participate in ABA biosynthesis in B. cinerea, was then targeted for gene overexpression and RNA silencing with these reverse genetic tools. The overexpression of bcaba4 dramatically induced ABA formation in the B. cinerea wild type strain Bc-6, and the gene silencing of bcaba4 significantly reduced ABA-production in an ABA-producing B. cinerea strain. PMID:25955649
Aberrant RNA splicing in cancer; expression changes and driver mutations of splicing factor genes.
Sveen, A; Kilpinen, S; Ruusulehto, A; Lothe, R A; Skotheim, R I
2016-05-12
Alternative splicing is a widespread process contributing to structural transcript variation and proteome diversity. In cancer, the splicing process is commonly disrupted, resulting in both functional and non-functional end-products. Cancer-specific splicing events are known to contribute to disease progression; however, the dysregulated splicing patterns found on a genome-wide scale have until recently been less well-studied. In this review, we provide an overview of aberrant RNA splicing and its regulation in cancer. We then focus on the executors of the splicing process. Based on a comprehensive catalog of splicing factor encoding genes and analyses of available gene expression and somatic mutation data, we identify cancer-associated patterns of dysregulation. Splicing factor genes are shown to be significantly differentially expressed between cancer and corresponding normal samples, and to have reduced inter-individual expression variation in cancer. Furthermore, we identify enrichment of predicted cancer-critical genes among the splicing factors. In addition to previously described oncogenic splicing factor genes, we propose 24 novel cancer-critical splicing factors predicted from somatic mutations.
Gram-scale production of a basidiomycetous laccase in Aspergillus niger.
Mekmouche, Yasmina; Zhou, Simeng; Cusano, Angela M; Record, Eric; Lomascolo, Anne; Robert, Viviane; Simaan, A Jalila; Rousselot-Pailley, Pierre; Ullah, Sana; Chaspoul, Florence; Tron, Thierry
2014-01-01
We report on the expression in Aspergillus niger of a laccase gene we used to produce variants in Saccharomyces cerevisiae. Grams of recombinant enzyme can be easily obtained. This highlights the potential of combining this generic laccase sequence to the yeast and fungal expression systems for large-scale productions of variants. Copyright © 2013 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Ramsden, Helen L; Sürmeli, Gülşen; McDonagh, Steven G; Nolan, Matthew F
2015-01-01
Neural circuits in the medial entorhinal cortex (MEC) encode an animal's position and orientation in space. Within the MEC spatial representations, including grid and directional firing fields, have a laminar and dorsoventral organization that corresponds to a similar topography of neuronal connectivity and cellular properties. Yet, in part due to the challenges of integrating anatomical data at the resolution of cortical layers and borders, we know little about the molecular components underlying this organization. To address this we develop a new computational pipeline for high-throughput analysis and comparison of in situ hybridization (ISH) images at laminar resolution. We apply this pipeline to ISH data for over 16,000 genes in the Allen Brain Atlas and validate our analysis with RNA sequencing of MEC tissue from adult mice. We find that differential gene expression delineates the borders of the MEC with neighboring brain structures and reveals its laminar and dorsoventral organization. We propose a new molecular basis for distinguishing the deep layers of the MEC and show that their similarity to corresponding layers of neocortex is greater than that of superficial layers. Our analysis identifies ion channel-, cell adhesion- and synapse-related genes as candidates for functional differentiation of MEC layers and for encoding of spatial information at different scales along the dorsoventral axis of the MEC. We also reveal laminar organization of genes related to disease pathology and suggest that a high metabolic demand predisposes layer II to neurodegenerative pathology. In principle, our computational pipeline can be applied to high-throughput analysis of many forms of neuroanatomical data. Our results support the hypothesis that differences in gene expression contribute to functional specialization of superficial layers of the MEC and dorsoventral organization of the scale of spatial representations.
Ramsden, Helen L.; Sürmeli, Gülşen; McDonagh, Steven G.; Nolan, Matthew F.
2015-01-01
Neural circuits in the medial entorhinal cortex (MEC) encode an animal’s position and orientation in space. Within the MEC spatial representations, including grid and directional firing fields, have a laminar and dorsoventral organization that corresponds to a similar topography of neuronal connectivity and cellular properties. Yet, in part due to the challenges of integrating anatomical data at the resolution of cortical layers and borders, we know little about the molecular components underlying this organization. To address this we develop a new computational pipeline for high-throughput analysis and comparison of in situ hybridization (ISH) images at laminar resolution. We apply this pipeline to ISH data for over 16,000 genes in the Allen Brain Atlas and validate our analysis with RNA sequencing of MEC tissue from adult mice. We find that differential gene expression delineates the borders of the MEC with neighboring brain structures and reveals its laminar and dorsoventral organization. We propose a new molecular basis for distinguishing the deep layers of the MEC and show that their similarity to corresponding layers of neocortex is greater than that of superficial layers. Our analysis identifies ion channel-, cell adhesion- and synapse-related genes as candidates for functional differentiation of MEC layers and for encoding of spatial information at different scales along the dorsoventral axis of the MEC. We also reveal laminar organization of genes related to disease pathology and suggest that a high metabolic demand predisposes layer II to neurodegenerative pathology. In principle, our computational pipeline can be applied to high-throughput analysis of many forms of neuroanatomical data. Our results support the hypothesis that differences in gene expression contribute to functional specialization of superficial layers of the MEC and dorsoventral organization of the scale of spatial representations. PMID:25615592
Mechanisms Underlying Adaptation to Life in Hydrogen Sulfide–Rich Environments
Kelley, Joanna L.; Arias-Rodriguez, Lenin; Patacsil Martin, Dorrelyn; Yee, Muh-Ching; Bustamante, Carlos D.; Tobler, Michael
2016-01-01
Hydrogen sulfide (H2S) is a potent toxicant interfering with oxidative phosphorylation in mitochondria and creating extreme environmental conditions in aquatic ecosystems. The mechanistic basis of adaptation to perpetual exposure to H2S remains poorly understood. We investigated evolutionarily independent lineages of livebearing fishes that have colonized and adapted to springs rich in H2S and compared their genome-wide gene expression patterns with closely related lineages from adjacent, nonsulfidic streams. Significant differences in gene expression were uncovered between all sulfidic and nonsulfidic population pairs. Variation in the number of differentially expressed genes among population pairs corresponded to differences in divergence times and rates of gene flow, which is consistent with neutral drift driving a substantial portion of gene expression variation among populations. Accordingly, there was little evidence for convergent evolution shaping large-scale gene expression patterns among independent sulfide spring populations. Nonetheless, we identified a small number of genes that was consistently differentially expressed in the same direction in all sulfidic and nonsulfidic population pairs. Functional annotation of shared differentially expressed genes indicated upregulation of genes associated with enzymatic H2S detoxification and transport of oxidized sulfur species, oxidative phosphorylation, energy metabolism, and pathways involved in responses to oxidative stress. Overall, our results suggest that modification of processes associated with H2S detoxification and toxicity likely complement each other to mediate elevated H2S tolerance in sulfide spring fishes. Our analyses allow for the development of novel hypotheses about biochemical and physiological mechanisms of adaptation to extreme environments. PMID:26861137
Ma, Chuang; Xin, Mingming; Feldmann, Kenneth A.; Wang, Xiangfeng
2014-01-01
Machine learning (ML) is an intelligent data mining technique that builds a prediction model based on the learning of prior knowledge to recognize patterns in large-scale data sets. We present an ML-based methodology for transcriptome analysis via comparison of gene coexpression networks, implemented as an R package called machine learning–based differential network analysis (mlDNA) and apply this method to reanalyze a set of abiotic stress expression data in Arabidopsis thaliana. The mlDNA first used a ML-based filtering process to remove nonexpressed, constitutively expressed, or non-stress-responsive “noninformative” genes prior to network construction, through learning the patterns of 32 expression characteristics of known stress-related genes. The retained “informative” genes were subsequently analyzed by ML-based network comparison to predict candidate stress-related genes showing expression and network differences between control and stress networks, based on 33 network topological characteristics. Comparative evaluation of the network-centric and gene-centric analytic methods showed that mlDNA substantially outperformed traditional statistical testing–based differential expression analysis at identifying stress-related genes, with markedly improved prediction accuracy. To experimentally validate the mlDNA predictions, we selected 89 candidates out of the 1784 predicted salt stress–related genes with available SALK T-DNA mutagenesis lines for phenotypic screening and identified two previously unreported genes, mutants of which showed salt-sensitive phenotypes. PMID:24520154
Palumbo, Maria Concetta; Zenoni, Sara; Fasoli, Marianna; Massonnet, Mélanie; Farina, Lorenzo; Castiglione, Filippo; Pezzotti, Mario; Paci, Paola
2014-12-01
We developed an approach that integrates different network-based methods to analyze the correlation network arising from large-scale gene expression data. By studying grapevine (Vitis vinifera) and tomato (Solanum lycopersicum) gene expression atlases and a grapevine berry transcriptomic data set during the transition from immature to mature growth, we identified a category named "fight-club hubs" characterized by a marked negative correlation with the expression profiles of neighboring genes in the network. A special subset named "switch genes" was identified, with the additional property of many significant negative correlations outside their own group in the network. Switch genes are involved in multiple processes and include transcription factors that may be considered master regulators of the previously reported transcriptome remodeling that marks the developmental shift from immature to mature growth. All switch genes, expressed at low levels in vegetative/green tissues, showed a significant increase in mature/woody organs, suggesting a potential regulatory role during the developmental transition. Finally, our analysis of tomato gene expression data sets showed that wild-type switch genes are downregulated in ripening-deficient mutants. The identification of known master regulators of tomato fruit maturation suggests our method is suitable for the detection of key regulators of organ development in different fleshy fruit crops. © 2014 American Society of Plant Biologists. All rights reserved.
Probabilistic representation of gene regulatory networks.
Mao, Linyong; Resat, Haluk
2004-09-22
Recent experiments have established unambiguously that biological systems can have significant cell-to-cell variations in gene expression levels even in isogenic populations. Computational approaches to studying gene expression in cellular systems should capture such biological variations for a more realistic representation. In this paper, we present a new fully probabilistic approach to the modeling of gene regulatory networks that allows for fluctuations in the gene expression levels. The new algorithm uses a very simple representation for the genes, and accounts for the repression or induction of the genes and for the biological variations among isogenic populations simultaneously. Because of its simplicity, introduced algorithm is a very promising approach to model large-scale gene regulatory networks. We have tested the new algorithm on the synthetic gene network library bioengineered recently. The good agreement between the computed and the experimental results for this library of networks, and additional tests, demonstrate that the new algorithm is robust and very successful in explaining the experimental data. The simulation software is available upon request. Supplementary material will be made available on the OUP server.
DeSalvo, M K; Voolstra, C R; Sunagawa, S; Schwarz, J A; Stillman, J H; Coffroth, M A; Szmant, A M; Medina, M
2008-09-01
The declining health of coral reefs worldwide is likely to intensify in response to continued anthropogenic disturbance from coastal development, pollution, and climate change. In response to these stresses, reef-building corals may exhibit bleaching, which marks the breakdown in symbiosis between coral and zooxanthellae. Mass coral bleaching due to elevated water temperature can devastate coral reefs on a large geographical scale. In order to understand the molecular and cellular basis of bleaching in corals, we have measured gene expression changes associated with thermal stress and bleaching using a complementary DNA microarray containing 1310 genes of the Caribbean coral Montastraea faveolata. In a first experiment, we identified differentially expressed genes by comparing experimentally bleached M. faveolata fragments to control non-heat-stressed fragments. In a second experiment, we identified differentially expressed genes during a time course experiment with four time points across 9 days. Results suggest that thermal stress and bleaching in M. faveolata affect the following processes: oxidative stress, Ca(2+) homeostasis, cytoskeletal organization, cell death, calcification, metabolism, protein synthesis, heat shock protein activity, and transposon activity. These results represent the first medium-scale transcriptomic study focused on revealing the cellular foundation of thermal stress-induced coral bleaching. We postulate that oxidative stress in thermal-stressed corals causes a disruption of Ca(2+) homeostasis, which in turn leads to cytoskeletal and cell adhesion changes, decreased calcification, and the initiation of cell death via apoptosis and necrosis.
Biomarker discovery for colon cancer using a 761 gene RT-PCR assay.
Clark-Langone, Kim M; Wu, Jenny Y; Sangli, Chithra; Chen, Angela; Snable, James L; Nguyen, Anhthu; Hackett, James R; Baker, Joffre; Yothers, Greg; Kim, Chungyeul; Cronin, Maureen T
2007-08-15
Reverse transcription PCR (RT-PCR) is widely recognized to be the gold standard method for quantifying gene expression. Studies using RT-PCR technology as a discovery tool have historically been limited to relatively small gene sets compared to other gene expression platforms such as microarrays. We have recently shown that TaqMan RT-PCR can be scaled up to profile expression for 192 genes in fixed paraffin-embedded (FPE) clinical study tumor specimens. This technology has also been used to develop and commercialize a widely used clinical test for breast cancer prognosis and prediction, the Onco typeDX assay. A similar need exists in colon cancer for a test that provides information on the likelihood of disease recurrence in colon cancer (prognosis) and the likelihood of tumor response to standard chemotherapy regimens (prediction). We have now scaled our RT-PCR assay to efficiently screen 761 biomarkers across hundreds of patient samples and applied this process to biomarker discovery in colon cancer. This screening strategy remains attractive due to the inherent advantages of maintaining platform consistency from discovery through clinical application. RNA was extracted from formalin fixed paraffin embedded (FPE) tissue, as old as 28 years, from 354 patients enrolled in NSABP C-01 and C-02 colon cancer studies. Multiplexed reverse transcription reactions were performed using a gene specific primer pool containing 761 unique primers. PCR was performed as independent TaqMan reactions for each candidate gene. Hierarchal clustering demonstrates that genes expected to co-express form obvious, distinct and in certain cases very tightly correlated clusters, validating the reliability of this technical approach to biomarker discovery. We have developed a high throughput, quantitatively precise multi-analyte gene expression platform for biomarker discovery that approaches low density DNA arrays in numbers of genes analyzed while maintaining the high specificity, sensitivity and reproducibility that are characteristics of RT-PCR. Biomarkers discovered using this approach can be transferred to a clinical reference laboratory setting without having to re-validate the assay on a second technology platform.
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE
Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.
2009-01-01
Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.
Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A
2006-06-01
To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
A regulatory toolbox of MiniPromoters to drive selective expression in the brain.
Portales-Casamar, Elodie; Swanson, Douglas J; Liu, Li; de Leeuw, Charles N; Banks, Kathleen G; Ho Sui, Shannan J; Fulton, Debra L; Ali, Johar; Amirabbasi, Mahsa; Arenillas, David J; Babyak, Nazar; Black, Sonia F; Bonaguro, Russell J; Brauer, Erich; Candido, Tara R; Castellarin, Mauro; Chen, Jing; Chen, Ying; Cheng, Jason C Y; Chopra, Vik; Docking, T Roderick; Dreolini, Lisa; D'Souza, Cletus A; Flynn, Erin K; Glenn, Randy; Hatakka, Kristi; Hearty, Taryn G; Imanian, Behzad; Jiang, Steven; Khorasan-zadeh, Shadi; Komljenovic, Ivana; Laprise, Stéphanie; Liao, Nancy Y; Lim, Jonathan S; Lithwick, Stuart; Liu, Flora; Liu, Jun; Lu, Meifen; McConechy, Melissa; McLeod, Andrea J; Milisavljevic, Marko; Mis, Jacek; O'Connor, Katie; Palma, Betty; Palmquist, Diana L; Schmouth, Jean-François; Swanson, Magdalena I; Tam, Bonny; Ticoll, Amy; Turner, Jenna L; Varhol, Richard; Vermeulen, Jenny; Watkins, Russell F; Wilson, Gary; Wong, Bibiana K Y; Wong, Siaw H; Wong, Tony Y T; Yang, George S; Ypsilanti, Athena R; Jones, Steven J M; Holt, Robert A; Goldowitz, Daniel; Wasserman, Wyeth W; Simpson, Elizabeth M
2010-09-21
The Pleiades Promoter Project integrates genomewide bioinformatics with large-scale knockin mouse production and histological examination of expression patterns to develop MiniPromoters and related tools designed to study and treat the brain by directed gene expression. Genes with brain expression patterns of interest are subjected to bioinformatic analysis to delineate candidate regulatory regions, which are then incorporated into a panel of compact human MiniPromoters to drive expression to brain regions and cell types of interest. Using single-copy, homologous-recombination "knockins" in embryonic stem cells, each MiniPromoter reporter is integrated immediately 5' of the Hprt locus in the mouse genome. MiniPromoter expression profiles are characterized in differentiation assays of the transgenic cells or in mouse brains following transgenic mouse production. Histological examination of adult brains, eyes, and spinal cords for reporter gene activity is coupled to costaining with cell-type-specific markers to define expression. The publicly available Pleiades MiniPromoter Project is a key resource to facilitate research on brain development and therapies.
Wang, Jian-Hua; Chen, Shi-Shu
2002-07-01
To clone gastric adenocarcinoma metastasis related genes, RF-1 cell line (primary tumor of a gastric adenocarcinoma patient ) and RF-48 cell line (its metastatic counterpart) were used as a model for studying the molecular mechanism of tumor metastasis. Two fluorescent cDNA probes, labeled with Cy3 and Cy5 dyes, were prepared from RF-1 and RF-48 mRNA samples by reverse transcription method. The two color probes were then mixed and hybridized to the cDNA chip constructed by double-dots of 4 096 human genes, and scanned at two wavelengths. The experiment was repeated for 2 times. Differential expression genes from the above two cells were analyzed using the computer. 138 in all genes (3.4%) revealed differential expression in RF-48 cells compared with RF-1 cells: 81(2.1%) genes revealed apparent up-regulation, and 56(1.3%) genes revealed down-regulation. 45 genes involved in gastric adenocarcinoma metastasis were cloned using fluorescent differential display-PCR (FDD-PCR), including 3 novel genes. There were 7 differential expression genes that agreed with each other in two detection methods. The possible roles of some differential expressed genes, which maybe involved in the mechanism of tumor metastasis, were discussed. cDNA chip was used to analyze gene expression in a high-throughput and large scale manner, in combination with FDD-PCR for cloning unknown novel genes. In conclusion, some genes related to metastasis were preliminarily scanned, which would contribute to disclose the molecular mechanism of gastric adenocarcinoma metastasis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reddy, Anireddy; Ben-Hur, Asa
Abiotic stresses including drought are major limiting factors of crop yields and cause significant crop losses. Acquisition of stress tolerance to abiotic stresses requires coordinated regulation of a multitude of biochemical and physiological changes, and most of these changes depend on alterations in gene expression. The goal of this work is to perform global analysis of differential regulation of gene expression and alternative splicing, and their relationship with chromatin landscape in drought sensitive and tolerant cultivars. our Iso-Seq study revealed transcriptome-wide full-length isoforms at an unprecedented scale with over 11000 novel splice isoforms. Additionally, we uncovered alternative polyadenylation sites ofmore » ~11000 expressed genes and many novel genes. Overall, Iso-Seq results greatly enhanced sorghum gene annotations that are not only useful in analyzing all our RNA-seq, ChIP-seq and ATAC-seq data but also serve as a great resource to the plant biology community. Our studies identified differentially expressed genes and splicing events that are correlated with the drought-resistant phenotype. An association between alternative splicing and chromatin accessibility was also revealed. Several computational tools developed here (TAPIS and iDiffIR) have been made freely available to the research community in analyzing alternative splicing and differential alternative splicing.« less
Lee, Moon Young; Park, Chanjae; Berent, Robyn M.; Park, Paul J.; Fuchs, Robert; Syn, Hannah; Chin, Albert; Townsend, Jared; Benson, Craig C.; Redelman, Doug; Shen, Tsai-wei; Park, Jong Kun; Miano, Joseph M.; Sanders, Kenton M.; Ro, Seungil
2015-01-01
Genome-scale expression data on the absolute numbers of gene isoforms offers essential clues in cellular functions and biological processes. Smooth muscle cells (SMCs) perform a unique contractile function through expression of specific genes controlled by serum response factor (SRF), a transcription factor that binds to DNA sites known as the CArG boxes. To identify SRF-regulated genes specifically expressed in SMCs, we isolated SMC populations from mouse small intestine and colon, obtained their transcriptomes, and constructed an interactive SMC genome and CArGome browser. To our knowledge, this is the first online resource that provides a comprehensive library of all genetic transcripts expressed in primary SMCs. The browser also serves as the first genome-wide map of SRF binding sites. The browser analysis revealed novel SMC-specific transcriptional variants and SRF target genes, which provided new and unique insights into the cellular and biological functions of the cells in gastrointestinal (GI) physiology. The SRF target genes in SMCs, which were discovered in silico, were confirmed by proteomic analysis of SMC-specific Srf knockout mice. Our genome browser offers a new perspective into the alternative expression of genes in the context of SRF binding sites in SMCs and provides a valuable reference for future functional studies. PMID:26241044
Duan, Qiaonan; Flynn, Corey; Niepel, Mario; Hafner, Marc; Muhlich, Jeremy L; Fernandez, Nicolas F; Rouillard, Andrew D; Tan, Christopher M; Chen, Edward Y; Golub, Todd R; Sorger, Peter K; Subramanian, Aravind; Ma'ayan, Avi
2014-07-01
For the Library of Integrated Network-based Cellular Signatures (LINCS) project many gene expression signatures using the L1000 technology have been produced. The L1000 technology is a cost-effective method to profile gene expression in large scale. LINCS Canvas Browser (LCB) is an interactive HTML5 web-based software application that facilitates querying, browsing and interrogating many of the currently available LINCS L1000 data. LCB implements two compacted layered canvases, one to visualize clustered L1000 expression data, and the other to display enrichment analysis results using 30 different gene set libraries. Clicking on an experimental condition highlights gene-sets enriched for the differentially expressed genes from the selected experiment. A search interface allows users to input gene lists and query them against over 100 000 conditions to find the top matching experiments. The tool integrates many resources for an unprecedented potential for new discoveries in systems biology and systems pharmacology. The LCB application is available at http://www.maayanlab.net/LINCS/LCB. Customized versions will be made part of the http://lincscloud.org and http://lincs.hms.harvard.edu websites. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Rapid Y degeneration and dosage compensation in plant sex chromosomes
Papadopulos, Alexander S. T.; Chester, Michael; Ridout, Kate; Filatov, Dmitry A.
2015-01-01
The nonrecombining regions of animal Y chromosomes are known to undergo genetic degeneration, but previous work has failed to reveal large-scale gene degeneration on plant Y chromosomes. Here, we uncover rapid and extensive degeneration of Y-linked genes in a plant species, Silene latifolia, that evolved sex chromosomes de novo in the last 10 million years. Previous transcriptome-based studies of this species missed unexpressed, degenerate Y-linked genes. To identify sex-linked genes, regardless of their expression, we sequenced male and female genomes of S. latifolia and integrated the genomic contigs with a high-density genetic map. This revealed that 45% of Y-linked genes are not expressed, and 23% are interrupted by premature stop codons. This contrasts with X-linked genes, in which only 1.3% of genes contained stop codons and 4.3% of genes were not expressed in males. Loss of functional Y-linked genes is partly compensated for by gene-specific up-regulation of X-linked genes. Our results demonstrate that the rate of genetic degeneration of Y-linked genes in S. latifolia is as fast as in animals, and that the evolutionary trajectories of sex chromosomes are similar in the two kingdoms. PMID:26438872
Crombach, Anton; Cicin-Sain, Damjan; Wotton, Karl R; Jaeger, Johannes
2012-01-01
Understanding the function and evolution of developmental regulatory networks requires the characterisation and quantification of spatio-temporal gene expression patterns across a range of systems and species. However, most high-throughput methods to measure the dynamics of gene expression do not preserve the detailed spatial information needed in this context. For this reason, quantification methods based on image bioinformatics have become increasingly important over the past few years. Most available approaches in this field either focus on the detailed and accurate quantification of a small set of gene expression patterns, or attempt high-throughput analysis of spatial expression through binary pattern extraction and large-scale analysis of the resulting datasets. Here we present a robust, "medium-throughput" pipeline to process in situ hybridisation patterns from embryos of different species of flies. It bridges the gap between high-resolution, and high-throughput image processing methods, enabling us to quantify graded expression patterns along the antero-posterior axis of the embryo in an efficient and straightforward manner. Our method is based on a robust enzymatic (colorimetric) in situ hybridisation protocol and rapid data acquisition through wide-field microscopy. Data processing consists of image segmentation, profile extraction, and determination of expression domain boundary positions using a spline approximation. It results in sets of measured boundaries sorted by gene and developmental time point, which are analysed in terms of expression variability or spatio-temporal dynamics. Our method yields integrated time series of spatial gene expression, which can be used to reverse-engineer developmental gene regulatory networks across species. It is easily adaptable to other processes and species, enabling the in silico reconstitution of gene regulatory networks in a wide range of developmental contexts.
Transcriptional Analysis of Resistance to Low Temperatures in Bermudagrass Crown Tissues
Melmaiee, Kalpalatha; Anderson, Michael; Elavarthi, Sathya; Guenzi, Arron; Canaan, Patricia
2015-01-01
Bermudagrass (Cynodon dactylon L pers.) is one of the most geographically adapted and utilized of the warm-season grasses. However, bermudagrass adaptation to the Northern USA is limited by freeze damage and winterkill. Our study provides the first large-scale analyses of gene expression in bermudagrass regenerative crown tissues during cold acclimation. We compared gene expression patterns in crown tissues from highly cold tolerant “MSU” and susceptible “Zebra” genotypes exposed to near-freezing temperatures. Suppressive subtractive hybridization was used to isolate putative cold responsive genes Approximately, 3845 transcript sequences enriched for cold acclimation were deposited in the GenBank. A total of 4589 ESTs (3184 unigenes) including 744 ESTs associated with the bermudagrass disease spring dead spot were printed on microarrays and hybridized with cold acclimated complementary Deoxyribonucleic acid (cDNA). A total of 587 differentially expressed unigenes were identified in this study. Of these only 97 (17%) showed significant NCBI matches. The overall expression pattern revealed 40% more down- than up-regulated genes, which was particularly enhanced in MSU compared to Zebra. Among the up-regulated genes 68% were uniquely expressed in MSU (36%) or Zebra (32%). Among the down-regulated genes 40% were unique to MSU, while only 15% to Zebra. Overall expression intensity was significantly higher in MSU than in Zebra (p value ≤ 0.001) and the overall number of genes expressed at 28 days was 2.7 fold greater than at 2 days. These changes in expression patterns reflect the strong genotypic and temporal response to cold temperatures. Additionally, differentially expressed genes from this study can be utilized for developing molecular markers in bermudagrass and other warm season grasses for enhancing cold hardiness. PMID:26348040
Reed, Robert D; McMillan, W Owen; Nagy, Lisa M
2008-01-07
Geographical variation in the mimetic wing patterns of the butterfly Heliconius erato is a textbook example of adaptive polymorphism; however, little is known about how this variation is controlled developmentally. Using microarrays and qPCR, we identified and compared expression of candidate genes potentially involved with a red/yellow forewing band polymorphism in H. erato. We found that transcripts encoding the pigment synthesis enzymes cinnabar and vermilion showed pattern- and polymorphism-related expression patterns, respectively. cinnabar expression was associated with the forewing band regardless of pigment colour, providing the first gene expression pattern known to be correlated with a major Heliconius colour pattern. In contrast, vermilion expression changed spatially over time in red-banded butterflies, but was not expressed at detectable levels in yellow-banded butterflies, suggesting that regulation of this gene may be involved with the red/yellow polymorphism. Furthermore, we found that the yellow pigment, 3-hydroxykynurenine, is incorporated into wing scales from the haemolymph rather than being synthesized in situ. We propose that some aspects of Heliconius colour patterns are determined by spatio-temporal overlap of pigment gene transcription prepatterns and speculate that evolutionary changes in vermilion regulation may in part underlie an adaptive colour pattern polymorphism.
Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying
2016-07-14
Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis.
Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying
2016-01-01
Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis. PMID:27411928
Gene coexpression measures in large heterogeneous samples using count statistics.
Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan
2014-11-18
With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.
Sealable femtoliter chamber arrays for cell-free biology
DOE Office of Scientific and Technical Information (OSTI.GOV)
Retterer, Scott T.; Fowlkes, Jason Davidson; Collier, Charles Patrick
Cell-free systems provide a flexible platform for probing specific networks of biological reactions isolated from the complex resource sharing (e.g. global gene expression, cell division) encountered within living cells. However, such systems, used in conventional macro-scale bulk reactors, often fail to exhibit the dynamic behaviors and efficiencies characteristic of their living micro-scale counterparts. Understanding the impact of internal cell structure and scale on reaction dynamics is crucial to understanding complex gene networks. Here we report a microfabricated device that confines cell-free reactions in cellular scale volumes while allowing flexible characterization of the enclosed molecular system. This multilayered poly(dimethylsiloxane) (PDMS) devicemore » contains femtoliter-scale reaction chambers on an elastomeric membrane which can be actuated (open and closed). When actuated, the chambers confine Cell-Free Protein Synthesis (CFPS) reactions expressing a fluorescent protein, allowing for the visualization of the reaction kinetics over time using time-lapse fluorescent microscopy. Lastly, we demonstrate how this device may be used to measure the noise structure of CFPS reactions in a manner that is directly analogous to those used to characterize cellular systems, thereby enabling the use of noise biology techniques to characterize CFPS gene circuits and their interactions with the cell-free environment.« less
Sealable femtoliter chamber arrays for cell-free biology
Retterer, Scott T.; Fowlkes, Jason Davidson; Collier, Charles Patrick; ...
2015-03-11
Cell-free systems provide a flexible platform for probing specific networks of biological reactions isolated from the complex resource sharing (e.g. global gene expression, cell division) encountered within living cells. However, such systems, used in conventional macro-scale bulk reactors, often fail to exhibit the dynamic behaviors and efficiencies characteristic of their living micro-scale counterparts. Understanding the impact of internal cell structure and scale on reaction dynamics is crucial to understanding complex gene networks. Here we report a microfabricated device that confines cell-free reactions in cellular scale volumes while allowing flexible characterization of the enclosed molecular system. This multilayered poly(dimethylsiloxane) (PDMS) devicemore » contains femtoliter-scale reaction chambers on an elastomeric membrane which can be actuated (open and closed). When actuated, the chambers confine Cell-Free Protein Synthesis (CFPS) reactions expressing a fluorescent protein, allowing for the visualization of the reaction kinetics over time using time-lapse fluorescent microscopy. Lastly, we demonstrate how this device may be used to measure the noise structure of CFPS reactions in a manner that is directly analogous to those used to characterize cellular systems, thereby enabling the use of noise biology techniques to characterize CFPS gene circuits and their interactions with the cell-free environment.« less
Beane, Joal D; Lee, Gary; Zheng, Zhili; Mendel, Matthew; Abate-Daga, Daniel; Bharathan, Mini; Black, Mary; Gandhi, Nimisha; Yu, Zhiya; Chandran, Smita; Giedlin, Martin; Ando, Dale; Miller, Jeff; Paschon, David; Guschin, Dmitry; Rebar, Edward J; Reik, Andreas; Holmes, Michael C; Gregory, Philip D; Restifo, Nicholas P; Rosenberg, Steven A; Morgan, Richard A; Feldman, Steven A
2015-01-01
Programmed cell death-1 (PD-1) is expressed on activated T cells and represents an attractive target for gene-editing of tumor targeted T cells prior to adoptive cell transfer (ACT). We used zinc finger nucleases (ZFNs) directed against the gene encoding human PD-1 (PDCD-1) to gene-edit melanoma tumor infiltrating lymphocytes (TIL). We show that our clinical scale TIL production process yielded efficient modification of the PD-1 gene locus, with an average modification frequency of 74.8% (n = 3, range 69.9–84.1%) of the alleles in a bulk TIL population, which resulted in a 76% reduction in PD-1 surface-expression. Forty to 48% of PD-1 gene-edited cells had biallelic PD-1 modification. Importantly, the PD-1 gene-edited TIL product showed improved in vitro effector function and a significantly increased polyfunctional cytokine profile (TNFα, GM-CSF, and IFNγ) compared to unmodified TIL in two of the three donors tested. In addition, all donor cells displayed an effector memory phenotype and expanded approximately 500–2,000-fold in vitro. Thus, further study to determine the efficiency and safety of adoptive cell transfer using PD-1 gene-edited TIL for the treatment of metastatic melanoma is warranted. PMID:25939491
Expression induction of P450 genes by imidacloprid in Nilaparvata lugens: A genome-scale analysis.
Zhang, Jianhua; Zhang, Yixi; Wang, Yunchao; Yang, Yuanxue; Cang, Xinzhu; Liu, Zewen
2016-09-01
The overexpression of P450 monooxygenase genes is a main mechanism for the resistance to imidacloprid, a representative neonicotinoid insecticide, in Nilaparvata lugens (brown planthopper, BPH). However, only two P450 genes (CYP6AY1 and CYP6ER1), among fifty-four P450 genes identified from BPH genome database, have been reported to play important roles in imidacloprid resistance until now. In this study, after the confirmation of important roles of P450s in imidacloprid resistance by the synergism analysis, the expression induction by imidacloprid was determined for all P450 genes. In the susceptible (Sus) strain, eight P450 genes in Clade4, eight in Clade3 and two in Clade2 were up-regulated by imidacloprid, among which three genes (CYP6CS1, CYP6CW1 and CYP6ER1, all in Clade3) were increased to above 4.0-fold and eight genes to above 2.0-fold. In contrast, no P450 genes were induced in Mito clade. Eight genes induced to above 2.0-fold were selected to determine their expression and induced levels in Huzhou population, in which piperonyl butoxide showed the biggest effects on imidacloprid toxicity among eight field populations. The expression levels of seven P450 genes were higher in Huzhou population than that in Sus strain, with the biggest differences for CYP6CS1 (9.8-fold), CYP6ER1 (7.7-fold) and CYP6AY1 (5.1-fold). The induction levels for all tested genes were bigger in Sus strain than that in Huzhou population except CYP425B1. Screening the induction of P450 genes by imidacloprid in the genome-scale will provide an overall view on the possible metabolic factors in the resistance to neonicotinoid insecticides. The further work, such as the functional study of recombinant proteins, will be performed to validate the roles of these P450s in imidacloprid resistance. Copyright © 2015 Elsevier B.V. All rights reserved.
Joly, Jean-Stephane; Bourrat, Franck; Nguyen, Van; Chourrout, Daniel
1997-01-01
Large-scale genetic screens for mutations affecting early neurogenesis of vertebrates have recently been performed with an aquarium fish, the zebrafish. Later stages of neural morphogenesis have attracted less attention in small fish species, partly because of the lack of molecular markers of developing structures that may facilitate the detection of discrete structural alterations. In this context, we report the characterization of Ol-Prx 3 (Oryzias latipes-Prx 3). This gene was isolated in the course of a large-scale screen for brain cDNAs containing a highly conserved DNA binding region, the homeobox helix-three. Sequence analysis revealed that this gene belongs to another class of homeobox genes, together with a previously isolated mouse ortholog, called OG-12 [Rovescalli, A. C., Asoh, S. & Nirenberg, M. (1996) Proc. Natl. Acad. Sci. USA 93, 10691–10696] and with the human SHOX gene [Rao, E., Weiss, B., Fukami, M., Rump, A., Niesler, B., et al. (1997) Nat. Genet. 16, 54–62], thought to be involved in the short-stature phenotype of Turner syndrome patients. These three genes exhibit a moderate level of identity in the homeobox with the other genes of the paired-related (PRX) gene family. Ol-Prx 3, as well as the PRX genes, are expressed in various cartilaginous structures of head and limbs. These genes might thus be involved in common regulatory pathways during the morphogenesis of these structures. Moreover, this paper reports a complex and monophasic pattern of Ol-Prx 3 expression in the central nervous system, which differs markedly from the patterns reported for the PRX genes, Prx 3 excluded: this gene begins to be expressed in a variety of central nervous system territories at late neurula stage. Strikingly, it remains turned on in some of the derivatives of each territory during the entire life of the fish. We hope this work will thus help identify common features for the PRX 3 family of homeobox genes. PMID:9371787
Biochemical Diversification through Foreign Gene Expression in Bdelloid Rotifers
Eyres, Isobel; Wang-Koh, Yuan; Lubzens, Esther; Barraclough, Timothy G.; Micklem, Gos; Tunnacliffe, Alan
2012-01-01
Bdelloid rotifers are microinvertebrates with unique characteristics: they have survived tens of millions of years without sexual reproduction; they withstand extreme desiccation by undergoing anhydrobiosis; and they tolerate very high levels of ionizing radiation. Recent evidence suggests that subtelomeric regions of the bdelloid genome contain sequences originating from other organisms by horizontal gene transfer (HGT), of which some are known to be transcribed. However, the extent to which foreign gene expression plays a role in bdelloid physiology is unknown. We address this in the first large scale analysis of the transcriptome of the bdelloid Adineta ricciae: cDNA libraries from hydrated and desiccated bdelloids were subjected to massively parallel sequencing and assembled transcripts compared against the UniProtKB database by blastx to identify their putative products. Of ∼29,000 matched transcripts, ∼10% were inferred from blastx matches to be horizontally acquired, mainly from eubacteria but also from fungi, protists, and algae. After allowing for possible sources of error, the rate of HGT is at least 8%–9%, a level significantly higher than other invertebrates. We verified their foreign nature by phylogenetic analysis and by demonstrating linkage of foreign genes with metazoan genes in the bdelloid genome. Approximately 80% of horizontally acquired genes expressed in bdelloids code for enzymes, and these represent 39% of enzymes in identified pathways. Many enzymes encoded by foreign genes enhance biochemistry in bdelloids compared to other metazoans, for example, by potentiating toxin degradation or generation of antioxidants and key metabolites. They also supplement, and occasionally potentially replace, existing metazoan functions. Bdelloid rotifers therefore express horizontally acquired genes on a scale unprecedented in animals, and foreign genes make a profound contribution to their metabolism. This represents a potential mechanism for ancient asexuals to adapt rapidly to changing environments and thereby persist over long evolutionary time periods in the absence of sex. PMID:23166508
Koda, Satoru; Onda, Yoshihiko; Matsui, Hidetoshi; Takahagi, Kotaro; Yamaguchi-Uehara, Yukiko; Shimizu, Minami; Inoue, Komaki; Yoshida, Takuhiro; Sakurai, Tetsuya; Honda, Hiroshi; Eguchi, Shinto; Nishii, Ryuei; Mochida, Keiichi
2017-01-01
We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX) model with a group smoothly clipped absolute deviation (SCAD) method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon . To reveal the diurnal changes in the transcriptome in B. distachyon , we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon . On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon , aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.
Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro
2015-01-01
Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. PMID:25505034
Palumbo, Maria Concetta; Zenoni, Sara; Fasoli, Marianna; Massonnet, Mélanie; Farina, Lorenzo; Castiglione, Filippo; Pezzotti, Mario; Paci, Paola
2014-01-01
We developed an approach that integrates different network-based methods to analyze the correlation network arising from large-scale gene expression data. By studying grapevine (Vitis vinifera) and tomato (Solanum lycopersicum) gene expression atlases and a grapevine berry transcriptomic data set during the transition from immature to mature growth, we identified a category named “fight-club hubs” characterized by a marked negative correlation with the expression profiles of neighboring genes in the network. A special subset named “switch genes” was identified, with the additional property of many significant negative correlations outside their own group in the network. Switch genes are involved in multiple processes and include transcription factors that may be considered master regulators of the previously reported transcriptome remodeling that marks the developmental shift from immature to mature growth. All switch genes, expressed at low levels in vegetative/green tissues, showed a significant increase in mature/woody organs, suggesting a potential regulatory role during the developmental transition. Finally, our analysis of tomato gene expression data sets showed that wild-type switch genes are downregulated in ripening-deficient mutants. The identification of known master regulators of tomato fruit maturation suggests our method is suitable for the detection of key regulators of organ development in different fleshy fruit crops. PMID:25490918
Transcriptome sequencing and annotation of the halophytic microalga Dunaliella salina * #
Hong, Ling; Liu, Jun-li; Midoun, Samira Z.; Miller, Philip C.
2017-01-01
The unicellular green alga Dunaliella salina is well adapted to salt stress and contains compounds (including β-carotene and vitamins) with potential commercial value. A large transcriptome database of D. salina during the adjustment, exponential and stationary growth phases was generated using a high throughput sequencing platform. We characterized the metabolic processes in D. salina with a focus on valuable metabolites, with the aim of manipulating D. salina to achieve greater economic value in large-scale production through a bioengineering strategy. Gene expression profiles under salt stress verified using quantitative polymerase chain reaction (qPCR) implied that salt can regulate the expression of key genes. This study generated a substantial fraction of D. salina transcriptional sequences for the entire growth cycle, providing a basis for the discovery of novel genes. This first full-scale transcriptome study of D. salina establishes a foundation for further comparative genomic studies. PMID:28990374
Kong, Ling-An; Wu, Du-Qing; Huang, Wen-Kun; Peng, Huan; Wang, Gao-Feng; Cui, Jiang-Kuan; Liu, Shi-Ming; Li, Zhi-Gang; Yang, Jun; Peng, De-Liang
2015-10-16
Cereal cyst nematode Heterodera avenae, an important soil-borne pathogen in wheat, causes numerous annual yield losses worldwide, and use of resistant cultivars is the best strategy for control. However, target genes are not readily available for breeding resistant cultivars. Therefore, comparative transcriptomic analyses were performed to identify more applicable resistance genes for cultivar breeding. The developing nematodes within roots were stained with acid fuchsin solution. Transcriptome assemblies and redundancy filteration were obtained by Trinity, TGI Clustering Tool and BLASTN, respectively. Gene Ontology annotation was yielded by Blast2GO program, and metabolic pathways of transcripts were analyzed by Path_finder. The ROS levels were determined by luminol-chemiluminescence assay. The transcriptional gene expression profiles were obtained by quantitative RT-PCR. The RNA-sequencing was performed using an incompatible wheat cultivar VP1620 and a compatible control cultivar WEN19 infected with H. avenae at 24 h, 3 d and 8 d. Infection assays showed that VP1620 failed to block penetration of H. avenae but disturbed the transition of developmental stages, leading to a significant reduction in cyst formation. Two types of expression profiles were established to predict candidate resistance genes after developing a novel strategy to generate clean RNA-seq data by removing the transcripts of H. avenae within the raw data before assembly. Using the uncoordinated expression profiles with transcript abundance as a standard, 424 candidate resistance genes were identified, including 302 overlapping genes and 122 VP1620-specific genes. Genes with similar expression patterns were further classified according to the scales of changed transcript abundances, and 182 genes were rescued as supplementary candidate resistance genes. Functional characterizations revealed that diverse defense-related pathways were responsible for wheat resistance against H. avenae. Moreover, phospholipase was involved in many defense-related pathways and localized in the connection position. Furthermore, strong bursts of reactive oxygen species (ROS) within VP1620 roots infected with H. avenae were induced at 24 h and 3 d, and eight ROS-producing genes were significantly upregulated, including three class III peroxidase and five lipoxygenase genes. Large-scale identification of wheat resistance genes were processed by comparative transcriptomic analysis. Functional characterization showed that phospholipases associated with ROS production played vital roles in early defense responses to H. avenae via involvement in diverse defense-related pathways as a hub switch. This study is the first to investigate the early defense responses of wheat against H. avenae, not only provides applicable candidate resistance genes for breeding novel wheat cultivars, but also enables a better understanding of the defense mechanisms of wheat against H. avenae.
Lineage-specific evolution of cnidarian Wnt ligands.
Hensel, Katrin; Lotan, Tamar; Sanders, Steve M; Cartwright, Paulyn; Frank, Uri
2014-09-01
We have studied the evolution of Wnt genes in cnidarians and the expression pattern of all Wnt ligands in the hydrozoan Hydractinia echinata. Current views favor a scenario in which 12 Wnt sub-families were jointly inherited by cnidarians and bilaterians from their last common ancestor. Our phylogenetic analyses clustered all medusozoan genes in distinct, well-supported clades, but many orthologous relationships between medusozoan Wnts and anthozoan and bilaterian Wnt genes were poorly supported. Only seven anthozoan genes, Wnt2, Wnt4, Wnt5, Wnt6, Wnt 10, Wnt11, and Wnt16 were recovered with strong support with bilaterian genes and of those, only the Wnt2, Wnt5, Wnt11, and Wnt16 clades also included medusozoan genes. Although medusozoan Wnt8 genes clustered with anthozoan and bilaterian genes, this was not well supported. In situ hybridization studies revealed poor conservation of expression patterns of putative Wnt orthologs within Cnidaria. In polyps, only Wnt1, Wnt3, and Wnt7 were expressed at the same position in the studied cnidarian models Hydra, Hydractinia, and Nematostella. Different expression patterns are consistent with divergent functions. Our data do not fully support previous assertions regarding Wnt gene homology, and suggest a more complex history of Wnt family genes than previously suggested. This includes high rates of sequence divergence and lineage-specific duplications of Wnt genes within medusozoans, followed by functional divergence over evolutionary time scales. © 2014 Wiley Periodicals, Inc.
Meng, Jia; Kanzaki, Gregory; Meas, Diane; Lam, Christopher K.; Crummer, Heather; Tain, Justina; Xu, H. Howard
2013-01-01
Regulated antisense RNA (asRNA) expression has been employed successfully in Gram-positive bacteria for genome-wide essential gene identification and drug target determination. However, there have been no published reports describing the application of asRNA gene silencing for comprehensive analyses of essential genes in Gram-negative bacteria. In this study, we report the first genome-wide identification of asRNA constructs for essential genes in Escherichia coli. We screened 250,000 library transformants for conditional growth-inhibitory recombinant clones from two shot-gun genomic libraries of E. coli using a paired-termini expression vector (pHN678). After sequencing plasmid inserts of 675 confirmed inducer-sensitive cell clones, we identified 152 separate asRNA constructs of which 134 inserts came from essential genes while 18 originated from non-essential genes (but share operons with essential genes). Among the 79 individual essential genes silenced by these asRNA constructs, 61 genes (77%) engage in processes related to protein synthesis. The cell-based assays of an asRNA clone targeting fusA (encoding elongation factor G) showed that the induced cells were sensitized 12 fold to fusidic acid, a known specific inhibitor. Our results demonstrate the utility of the paired-termini expression vector and feasibility of large-scale gene silencing in E. coli using regulated asRNA expression. PMID:22268863
Design and construction of functional AAV vectors.
Gray, John T; Zolotukhin, Serge
2011-01-01
Using the basic principles of molecular biology and laboratory techniques presented in this chapter, researchers should be able to create a wide variety of AAV vectors for both clinical and basic research applications. Basic vector design concepts are covered for both protein coding gene expression and small non-coding RNA gene expression cassettes. AAV plasmid vector backbones (available via AddGene) are described, along with critical sequence details for a variety of modular expression components that can be inserted as needed for specific applications. Protocols are provided for assembling the various DNA components into AAV vector plasmids in Escherichia coli, as well as for transferring these vector sequences into baculovirus genomes for large-scale production of AAV in the insect cell production system.
Barat, Ana; Ruskin, Heather J; Byrne, Annette T; Prehn, Jochen H M
2015-11-23
Recently, considerable attention has been paid to gene expression-based classifications of colorectal cancers (CRC) and their association with patient prognosis. In addition to changes in gene expression, abnormal DNA-methylation is known to play an important role in cancer onset and development, and colon cancer is no exception to this rule. Large-scale technologies, such as methylation microarray assays and specific sequencing of methylated DNA, have been used to determine whole genome profiles of CpG island methylation in tissue samples. In this article, publicly available microarray-based gene expression and methylation data sets are used to characterize expression subtypes with respect to locus-specific methylation. A major objective was to determine whether integration of these data types improves previously characterized subtypes, or provides evidence for additional subtypes. We used unsupervised clustering techniques to determine methylation-based subgroups, which are subsequently annotated with three published expression-based classifications, comprising from three to six subtypes. Our results showed that, while methylation profiles provide a further basis for segregation of certain (Inflammatory and Goblet-like) finer-grained expression-based subtypes, they also suggest that other finer-grained subtypes are not distinctive and can be considered as a single subtype.
Barat, Ana; Ruskin, Heather J.; Byrne, Annette T.; Prehn, Jochen H. M.
2015-01-01
Recently, considerable attention has been paid to gene expression-based classifications of colorectal cancers (CRC) and their association with patient prognosis. In addition to changes in gene expression, abnormal DNA-methylation is known to play an important role in cancer onset and development, and colon cancer is no exception to this rule. Large-scale technologies, such as methylation microarray assays and specific sequencing of methylated DNA, have been used to determine whole genome profiles of CpG island methylation in tissue samples. In this article, publicly available microarray-based gene expression and methylation data sets are used to characterize expression subtypes with respect to locus-specific methylation. A major objective was to determine whether integration of these data types improves previously characterized subtypes, or provides evidence for additional subtypes. We used unsupervised clustering techniques to determine methylation-based subgroups, which are subsequently annotated with three published expression-based classifications, comprising from three to six subtypes. Our results showed that, while methylation profiles provide a further basis for segregation of certain (Inflammatory and Goblet-like) finer-grained expression-based subtypes, they also suggest that other finer-grained subtypes are not distinctive and can be considered as a single subtype. PMID:27600244
Irizarry, Kristopher J L; Downs, Eileen; Bryden, Randall; Clark, Jory; Griggs, Lisa; Kopulos, Renee; Boettger, Cynthia M; Carr, Thomas J; Keeler, Calvin L; Collisson, Ellen; Drechsler, Yvonne
2017-01-01
Discovering genetic biomarkers associated with disease resistance and enhanced immunity is critical to developing advanced strategies for controlling viral and bacterial infections in different species. Macrophages, important cells of innate immunity, are directly involved in cellular interactions with pathogens, the release of cytokines activating other immune cells and antigen presentation to cells of the adaptive immune response. IFNγ is a potent activator of macrophages and increased production has been associated with disease resistance in several species. This study characterizes the molecular basis for dramatically different nitric oxide production and immune function between the B2 and the B19 haplotype chicken macrophages.A large-scale RNA sequencing approach was employed to sequence the RNA of purified macrophages from each haplotype group (B2 vs. B19) during differentiation and after stimulation. Our results demonstrate that a large number of genes exhibit divergent expression between B2 and B19 haplotype cells both prior and after stimulation. These differences in gene expression appear to be regulated by complex epigenetic mechanisms that need further investigation.
Murray, John Isaac
2018-05-01
The convergence of developmental biology and modern genomics tools brings the potential for a comprehensive understanding of developmental systems. This is especially true for the Caenorhabditis elegans embryo because its small size, invariant developmental lineage, and powerful genetic and genomic tools provide the prospect of a cellular resolution understanding of messenger RNA (mRNA) expression and regulation across the organism. We describe here how a systems biology framework might allow large-scale determination of the embryonic regulatory relationships encoded in the C. elegans genome. This framework consists of two broad steps: (a) defining the "parts list"-all genes expressed in all cells at each time during development and (b) iterative steps of computational modeling and refinement of these models by experimental perturbation. Substantial progress has been made towards defining the parts list through imaging methods such as large-scale green fluorescent protein (GFP) reporter analysis. Imaging results are now being augmented by high-resolution transcriptome methods such as single-cell RNA sequencing, and it is likely the complete expression patterns of all genes across the embryo will be known within the next few years. In contrast, the modeling and perturbation experiments performed so far have focused largely on individual cell types or genes, and improved methods will be needed to expand them to the full genome and organism. This emerging comprehensive map of embryonic expression and regulatory function will provide a powerful resource for developmental biologists, and would also allow scientists to ask questions not accessible without a comprehensive picture. This article is categorized under: Invertebrate Organogenesis > Worms Technologies > Analysis of the Transcriptome Gene Expression and Transcriptional Hierarchies > Gene Networks and Genomics. © 2018 Wiley Periodicals, Inc.
Generation of Envelope-Modified Baculoviruses for Gene Delivery into Mammalian Cells.
Hofmann, Christian
2016-01-01
Genetically modified baculoviruses can efficiently deliver and express genes in mammalian cells. The major prerequisite for the expression of a gene transferred by baculovirus is its control by a promoter that is active in mammalian cells. This chapter describes methods for producing second generation baculovirus vectors through modification of their envelope. Envelope modified baculoviruses offer additional new applications of the system, such as their use in in vivo gene delivery, targeting, and vaccination. Methods of generating a recombinant baculovirus vector with a modified envelope and its amplification and purification, including technical scale production, are discussed. A variety of notes give clues regarding specific technical procedures. Finally, methods to analyze the virus and transduction procedures are presented.
Primiani, Christopher T.; Ryan, Veronica H.; Rao, Jagadeesh S.; Cam, Margaret C.; Ahn, Kwangmi; Modi, Hiren R.; Rapoport, Stanley I.
2014-01-01
Background Age changes in expression of inflammatory, synaptic, and neurotrophic genes are not well characterized during human brain development and senescence. Knowing these changes may elucidate structural, metabolic, and functional brain processes over the lifespan, as well vulnerability to neurodevelopmental or neurodegenerative diseases. Hypothesis Expression levels of inflammatory, synaptic, and neurotrophic genes in the human brain are coordinated over the lifespan and underlie changes in phenotypic networks or cascades. Methods We used a large-scale microarray dataset from human prefrontal cortex, BrainCloud, to quantify age changes over the lifespan, divided into Development (0 to 21 years, 87 brains) and Aging (22 to 78 years, 144 brains) intervals, in transcription levels of 39 genes. Results Gene expression levels followed different trajectories over the lifespan. Many changes were intercorrelated within three similar groups or clusters of genes during both Development and Aging, despite different roles of the gene products in the two intervals. During Development, changes were related to reported neuronal loss, dendritic growth and pruning, and microglial events; TLR4, IL1R1, NFKB1, MOBP, PLA2G4A, and PTGS2 expression increased in the first years of life, while expression of synaptic genes GAP43 and DBN1 decreased, before reaching plateaus. During Aging, expression was upregulated for potentially pro-inflammatory genes such as NFKB1, TRAF6, TLR4, IL1R1, TSPO, and GFAP, but downregulated for neurotrophic and synaptic integrity genes such as BDNF, NGF, PDGFA, SYN, and DBN1. Conclusions Coordinated changes in gene transcription cascades underlie changes in synaptic, neurotrophic, and inflammatory phenotypic networks during brain Development and Aging. Early postnatal expression changes relate to neuronal, glial, and myelin growth and synaptic pruning events, while late Aging is associated with pro-inflammatory and synaptic loss changes. Thus, comparable transcriptional regulatory networks that operate throughout the lifespan underlie different phenotypic processes during Aging compared to Development. PMID:25329999
Primiani, Christopher T; Ryan, Veronica H; Rao, Jagadeesh S; Cam, Margaret C; Ahn, Kwangmi; Modi, Hiren R; Rapoport, Stanley I
2014-01-01
Age changes in expression of inflammatory, synaptic, and neurotrophic genes are not well characterized during human brain development and senescence. Knowing these changes may elucidate structural, metabolic, and functional brain processes over the lifespan, as well vulnerability to neurodevelopmental or neurodegenerative diseases. Expression levels of inflammatory, synaptic, and neurotrophic genes in the human brain are coordinated over the lifespan and underlie changes in phenotypic networks or cascades. We used a large-scale microarray dataset from human prefrontal cortex, BrainCloud, to quantify age changes over the lifespan, divided into Development (0 to 21 years, 87 brains) and Aging (22 to 78 years, 144 brains) intervals, in transcription levels of 39 genes. Gene expression levels followed different trajectories over the lifespan. Many changes were intercorrelated within three similar groups or clusters of genes during both Development and Aging, despite different roles of the gene products in the two intervals. During Development, changes were related to reported neuronal loss, dendritic growth and pruning, and microglial events; TLR4, IL1R1, NFKB1, MOBP, PLA2G4A, and PTGS2 expression increased in the first years of life, while expression of synaptic genes GAP43 and DBN1 decreased, before reaching plateaus. During Aging, expression was upregulated for potentially pro-inflammatory genes such as NFKB1, TRAF6, TLR4, IL1R1, TSPO, and GFAP, but downregulated for neurotrophic and synaptic integrity genes such as BDNF, NGF, PDGFA, SYN, and DBN1. Coordinated changes in gene transcription cascades underlie changes in synaptic, neurotrophic, and inflammatory phenotypic networks during brain Development and Aging. Early postnatal expression changes relate to neuronal, glial, and myelin growth and synaptic pruning events, while late Aging is associated with pro-inflammatory and synaptic loss changes. Thus, comparable transcriptional regulatory networks that operate throughout the lifespan underlie different phenotypic processes during Aging compared to Development.
Saas, J; Haag, J; Rueger, D; Chubinskaya, S; Sohler, F; Zimmer, R; Bartnik, E; Aigner, T
2006-10-01
Anabolic and catabolic cytokines and growth factors such as BMP-7 and IL-1beta play a central role in controlling the balance between degradation and repair of normal and (osteo)arthritic articular cartilage matrix. In this report, we investigated the response of articular chondrocytes to these factors IL-1beta and BMP-7 in terms of changes in gene expression levels. Large scale analysis was performed on primary human adult articular chondrocytes isolated from two human, independent donors cultured in alginate beads (non-stimulated and stimulated with IL-1beta and BMP-7 for 48 h) using Affymetrix gene chips (oligo-arrays). Biostatistical and bioinformatic evaluation of gene expression pattern was performed using the Resolver software (Rosetta). Part of the results were confirmed using real-time PCR. IL-1beta modulated significantly 909 out of 3459 genes detectable, whereas BMP-7 influenced only 36 out of 3440. BMP-7 induced mainly anabolic activation of chondrocytes including classical target genes such as collagen type II and aggrecan, while IL-1beta, both, significantly modulated the gene expression levels of numerous genes; namely, IL-1beta down-regulated the expression of anabolic genes and induced catabolic genes and mediators. Our data indicate that BMP-7 has only a limited effect on differentiated cells, whereas IL-1beta causes a dramatic change in gene expression pattern, i.e. induced or repressed much more genes. This presumably reflects the fact that BMP-7 signaling is effected via one pathway only (i.e. Smad-pathway) whereas IL-1beta is able to signal via a broad variety of intracellular signaling cascades involving the JNK, p38, NFkB and Erk pathways and even influencing BMP signaling.
Genomic Organization, Phylogenetic and Expression Analysis of the B-BOX Gene Family in Tomato
Chu, Zhuannan; Wang, Xin; Li, Ying; Yu, Huiyang; Li, Jinhua; Lu, Yongen; Li, Hanxia; Ouyang, Bo
2016-01-01
The B-BOX (BBX) proteins encode a class of zinc-finger transcription factors possessing one or two B-BOX domains and in some cases an additional CCT (CO, CO-like and TOC1) motif, which play important roles in regulating plant growth, development and stress response. Nevertheless, no systematic study of BBX genes has undertaken in tomato (Solanum lycopersicum). Here we present the results of a genome-wide analysis of the 29 BBX genes in this important vegetable species. Their structures, conserved domains, phylogenetic relationships, subcellular localizations, and promoter cis-regulatory elements were analyzed; their tissue expression profiles and expression patterns under various hormones and stress treatments were also investigated in detail. Tomato BBX genes can be divided into five subfamilies, and twelve of them were found to be segmentally duplicated. Real-time quantitative PCR analysis showed that most BBX genes exhibited different temporal and spatial expression patterns. The expression of most BBX genes can be induced by drought, polyethylene glycol-6000 or heat stress. Some BBX genes were induced strongly by phytohormones such as abscisic acid, gibberellic acid, or ethephon. The majority of tomato BBX proteins was predicted to be located in nuclei, and the transient expression assay using Arabidopsis mesophyll protoplasts demonstrated that all the seven BBX members tested (SlBBX5, 7, 15, 17, 20, 22, and 24) were localized in nucleus. Our analysis of tomato BBX genes on the genome scale would provide valuable information for future functional characterization of specific genes in this family. PMID:27807440
Gibson, Scott M; Ficklin, Stephen P; Isaacson, Sven; Luo, Feng; Feltus, Frank A; Smith, Melissa C
2013-01-01
The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.
Hanno-Iijima, Yoko; Tanaka, Masami; Iijima, Takatoshi
2015-01-01
Homeostatic synaptic plasticity, or synaptic scaling, is a mechanism that tunes neuronal transmission to compensate for prolonged, excessive changes in neuronal activity. Both excitatory and inhibitory neurons undergo homeostatic changes based on synaptic transmission strength, which could effectively contribute to a fine-tuning of circuit activity. However, gene regulation that underlies homeostatic synaptic plasticity in GABAergic (GABA, gamma aminobutyric) neurons is still poorly understood. The present study demonstrated activity-dependent dynamic scaling in which NMDA-R (N-methyl-D-aspartic acid receptor) activity regulated the expression of GABA synthetic enzymes: glutamic acid decarboxylase 65 and 67 (GAD65 and GAD67). Results revealed that activity-regulated BDNF (brain-derived neurotrophic factor) release is necessary, but not sufficient, for activity-dependent up-scaling of these GAD isoforms. Bidirectional forms of activity-dependent GAD expression require both BDNF-dependent and BDNF-independent pathways, both triggered by NMDA-R activity. Additional results indicated that these two GAD genes differ in their responsiveness to chronic changes in neuronal activity, which could be partially caused by differential dependence on BDNF. In parallel to activity-dependent bidirectional scaling in GAD expression, the present study further observed that a chronic change in neuronal activity leads to an alteration in neurotransmitter release from GABAergic neurons in a homeostatic, bidirectional fashion. Therefore, the differential expression of GAD65 and 67 during prolonged changes in neuronal activity may be implicated in some aspects of bidirectional homeostatic plasticity within mature GABAergic presynapses. PMID:26241953
Hanno-Iijima, Yoko; Tanaka, Masami; Iijima, Takatoshi
2015-01-01
Homeostatic synaptic plasticity, or synaptic scaling, is a mechanism that tunes neuronal transmission to compensate for prolonged, excessive changes in neuronal activity. Both excitatory and inhibitory neurons undergo homeostatic changes based on synaptic transmission strength, which could effectively contribute to a fine-tuning of circuit activity. However, gene regulation that underlies homeostatic synaptic plasticity in GABAergic (GABA, gamma aminobutyric) neurons is still poorly understood. The present study demonstrated activity-dependent dynamic scaling in which NMDA-R (N-methyl-D-aspartic acid receptor) activity regulated the expression of GABA synthetic enzymes: glutamic acid decarboxylase 65 and 67 (GAD65 and GAD67). Results revealed that activity-regulated BDNF (brain-derived neurotrophic factor) release is necessary, but not sufficient, for activity-dependent up-scaling of these GAD isoforms. Bidirectional forms of activity-dependent GAD expression require both BDNF-dependent and BDNF-independent pathways, both triggered by NMDA-R activity. Additional results indicated that these two GAD genes differ in their responsiveness to chronic changes in neuronal activity, which could be partially caused by differential dependence on BDNF. In parallel to activity-dependent bidirectional scaling in GAD expression, the present study further observed that a chronic change in neuronal activity leads to an alteration in neurotransmitter release from GABAergic neurons in a homeostatic, bidirectional fashion. Therefore, the differential expression of GAD65 and 67 during prolonged changes in neuronal activity may be implicated in some aspects of bidirectional homeostatic plasticity within mature GABAergic presynapses.
Gilchrist, Christopher L.; Ruch, David S.; Little, Dianne; Guilak, Farshid
2014-01-01
Tissue and biomaterial microenvironments provide architectural cues that direct important cell behaviors including cell shape, alignment, migration, and resulting tissue formation. These architectural features may be presented to cells across multiple length scales, from nanometers to millimeters in size. In this study, we examined how architectural cues at two distinctly different length scales, “micro-scale” cues on the order of ~1–2 μm, and “meso-scale” cues several orders of magnitude larger (>100 μm), interact to direct aligned neo-tissue formation. Utilizing a micro-photopatterning (μPP) model system to precisely arrange cell-adhesive patterns, we examined the effects of substrate architecture at these length scales on human mesenchymal stem cell (hMSC) organization, gene expression, and fibrillar collagen deposition. Both micro- and meso-scale architectures directed cell alignment and resulting tissue organization, and when combined, meso cues could enhance or compete against micro-scale cues. As meso boundary aspect ratios were increased, meso-scale cues overrode micro-scale cues and controlled tissue alignment, with a characteristic critical width (~500 μm) similar to boundary dimensions that exist in vivo in highly aligned tissues. Meso-scale cues acted via both lateral confinement (in a cell-density-dependent manner) and by permitting end-to-end cell arrangements that yielded greater fibrillar collagen deposition. Despite large differences in fibrillar collagen content and organization between μPP architectural conditions, these changes did not correspond with changes in gene expression of key matrix or tendon-related genes. These findings highlight the complex interplay between geometric cues at multiple length scales and may have implications for tissue engineering strategies, where scaffold designs that incorporate cues at multiple length scales could improve neo-tissue organization and resulting functional outcomes. PMID:25263687
Gene expression profiling--Opening the black box of plant ecosystem responses to global change
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leakey, A.D.B.; Ainsworth, E.A.; Bernard, S.M.
The use of genomic techniques to address ecological questions is emerging as the field of genomic ecology. Experimentation under environmentally realistic conditions to investigate the molecular response of plants to meaningful changes in growth conditions and ecological interactions is the defining feature of genomic ecology. Since the impact of global change factors on plant performance are mediated by direct effects at the molecular, biochemical and physiological scales, gene expression analysis promises important advances in understanding factors that have previously been consigned to the 'black box' of unknown mechanism. Various tools and approaches are available for assessing gene expression in modelmore » and non-model species as part of global change biology studies. Each approach has its own unique advantages and constraints. A first generation of genomic ecology studies in managed ecosystems and mesocosms have provided a testbed for the approach and have begun to reveal how the experimental design and data analysis of gene expression studies can be tailored for use in an ecological context.« less
Global gene expression analysis by combinatorial optimization.
Ameur, Adam; Aurell, Erik; Carlsson, Mats; Westholm, Jakub Orzechowski
2004-01-01
Generally, there is a trade-off between methods of gene expression analysis that are precise but labor-intensive, e.g. RT-PCR, and methods that scale up to global coverage but are not quite as quantitative, e.g. microarrays. In the present paper, we show how how a known method of gene expression profiling (K. Kato, Nucleic Acids Res. 23, 3685-3690 (1995)), which relies on a fairly small number of steps, can be turned into a global gene expression measurement by advanced data post-processing, with potentially little loss of accuracy. Post-processing here entails solving an ancillary combinatorial optimization problem. Validation is performed on in silico experiments generated from the FANTOM data base of full-length mouse cDNA. We present two variants of the method. One uses state-of-the-art commercial software for solving problems of this kind, the other a code developed by us specifically for this purpose, released in the public domain under GPL license.
Jia, Chen
2017-09-01
Here we develop an effective approach to simplify two-time-scale Markov chains with infinite state spaces by removal of states with fast leaving rates, which improves the simplification method of finite Markov chains. We introduce the concept of fast transition paths and show that the effective transitions of the reduced chain can be represented as the superposition of the direct transitions and the indirect transitions via all the fast transition paths. Furthermore, we apply our simplification approach to the standard Markov model of single-cell stochastic gene expression and provide a mathematical theory of random gene expression bursts. We give the precise mathematical conditions for the bursting kinetics of both mRNAs and proteins. It turns out that random bursts exactly correspond to the fast transition paths of the Markov model. This helps us gain a better understanding of the physics behind the bursting kinetics as an emergent behavior from the fundamental multiscale biochemical reaction kinetics of stochastic gene expression.
NASA Astrophysics Data System (ADS)
Jia, Chen
2017-09-01
Here we develop an effective approach to simplify two-time-scale Markov chains with infinite state spaces by removal of states with fast leaving rates, which improves the simplification method of finite Markov chains. We introduce the concept of fast transition paths and show that the effective transitions of the reduced chain can be represented as the superposition of the direct transitions and the indirect transitions via all the fast transition paths. Furthermore, we apply our simplification approach to the standard Markov model of single-cell stochastic gene expression and provide a mathematical theory of random gene expression bursts. We give the precise mathematical conditions for the bursting kinetics of both mRNAs and proteins. It turns out that random bursts exactly correspond to the fast transition paths of the Markov model. This helps us gain a better understanding of the physics behind the bursting kinetics as an emergent behavior from the fundamental multiscale biochemical reaction kinetics of stochastic gene expression.
Genome-scale expression studies and comprehensive loss-of-function genetic screens have focused almost exclusively on the highest confidence candidate genes. Here, we describe a strategy for characterizing the lower confidence candidates identified by such approaches.
Tran, Dinh Thi Minh; Phan, Trang Thi Phuong; Huynh, Thanh Kieu; Dang, Ngan Thi Kim; Huynh, Phuong Thi Kim; Nguyen, Tri Minh; Truong, Tuom Thi Tinh; Tran, Thuoc Linh; Schumann, Wolfgang; Nguyen, Hoang Duc
2017-07-25
Besides Escherichia coli, Bacillus subtilis is an important bacterial species for the production of recombinant proteins. Recombinant genes are inserted into shuttle expression vectors which replicate in both E. coli and in B. subtilis. The ligation products are first transformed into E. coli cells, analyzed for correct insertions, and the correct recombinant plasmids are then transformed into B. subtilis. A major problem using E. coli cells can be the strong basal level of expression of the recombinant protein which may interfere with the stability of the cells. To minimize this problem, we developed strong expression vectors being repressed in E. coli and inducer-free in B. subtilis. In general, induction of IPTG-inducible expression vectors is determined by the regulatory lacI gene encoding the LacI repressor in combination with the lacO operator on the promoter. To investigate the inducer-free properties of the vectors, we constructed inducer-free expression plasmids by removing the lacI gene and characterized their properties. First, we examined the ability to repress a reporter gene in E. coli, which is a prominent property facilitating the construction of the expression vectors carrying a target gene. The β-galactosidase (bgaB gene) basal levels expressed from Pgrac01-bgaB could be repressed at least twice in the E. coli cloning strain. Second, the inducer-free production of BgaB from four different plasmids with the Pgrac01 promoter in B. subtilis was investigated. As expected, BgaB expression levels of inducer-free constructs are at least 37 times higher than that of the inducible constructs in the absence of IPTG, and comparable to those in the presence of the inducer. Third, using efficient IPTG-inducible expression vectors containing the strong promoter Pgrac100, we could convert them into inducer-free expression plasmids. The BgaB production levels from the inducer-free plasmid in the absence of the inducer were at least 4.5 times higher than that of the inducible vector using the same promoter. Finally, we used gfp as a reporter gene in combination with the two promoters Pgrac01 and Pgrac100 to test the new vector types. The GFP expression levels could be repressed at least 1.5 times for the Pgrac01-gfp+ inducer-free construct in E. coli. The inducer-free constructs Pgrac01-gfp+ and Pgrac100-gfp+ allowed GFP expression at high levels from 23 × 10 4 to 32 × 10 4 RFU units and 9-13% of total intracellular proteins. We could reconfirm the two major advantages of the new inducer-free expression plasmids: (1) Strong repression of the target gene expression in the E. coli cloning strain, and (2) production of the target protein at high levels in B. subtilis in the absence of the inducer. We propose a general strategy to generate inducer-free expression vector by using IPTG-inducible vectors, and more specifically we developed inducer-free expression plasmids using IPTG-inducible promoters in the absence of the LacI repressor. These plasmids could be an excellent choice for high-level production of recombinant proteins in B. subtilis without the addition of inducer and at the same time maintaining a low basal level of the recombinant proteins in E. coli. The repression of the recombinant gene expression would facilitate cloning of genes that potentially inhibit the growth of E. coli cloning strains. The inducer-free expression plasmids will be extended versions of the current available IPTG-inducible expression vectors for B. subtilis, in which all these vectors use the same cognate promoters. These inducer-free and previously developed IPTG-inducible expression plasmids will be a useful cassette to study gene expression at a small scale up to a larger scale up for the production of recombinant proteins.
Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.
2011-01-01
GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647
Albert, Elise; Gricourt, Justine; Bertin, Nadia; Bonnefoi, Julien; Pateyron, Stéphanie; Tamby, Jean-Philippe; Bitton, Frédérique; Causse, Mathilde
2016-02-01
In tomato, genotype by watering interaction resulted from genotype re-ranking more than scale changes. Interactive QTLs according to watering regime were detected. Differentially expressed genes were identified in some intervals. As a result of climate change, drought will increasingly limit crop production in the future. Studying genotype by watering regime interactions is necessary to improve plant adaptation to low water availability. In cultivated tomato (Solanum lycopersicum L.), extensively grown in dry areas, well-mastered water deficits can stimulate metabolite production, increasing plant defenses and concentration of compounds involved in fruit quality, at the same time. However, few tomato Quantitative Trait Loci (QTLs) and genes involved in response to drought are identified or only in wild species. In this study, we phenotyped a population of 119 recombinant inbred lines derived from a cross between a cherry tomato and a large fruit tomato, grown in greenhouse under two watering regimes, in two locations. A large genetic variability was measured for 19 plant and fruit traits, under the two watering treatments. Highly significant genotype by watering regime interactions were detected and resulted from re-ranking more than scale changes. The population was genotyped for 679 SNP markers to develop a genetic map. In total, 56 QTLs were identified among which 11 were interactive between watering regimes. These later mainly exhibited antagonist effects according to watering treatment. Variation in gene expression in leaves of parental accessions revealed 2259 differentially expressed genes, among which candidate genes presenting sequence polymorphisms were identified under two main interactive QTLs. Our results provide knowledge about the genetic control of genotype by watering regime interactions in cultivated tomato and the possible use of deficit irrigation to improve tomato quality.
Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering
Sun, Peng; Speicher, Nora K.; Röttger, Richard; Guo, Jiong; Baumbach, Jan
2014-01-01
Abstract The explosion of the biological data has dramatically reformed today's biological research. The need to integrate and analyze high-dimensional biological data on a large scale is driving the development of novel bioinformatics approaches. Biclustering, also known as ‘simultaneous clustering’ or ‘co-clustering’, has been successfully utilized to discover local patterns in gene expression data and similar biomedical data types. Here, we contribute a new heuristic: ‘Bi-Force’. It is based on the weighted bicluster editing model, to perform biclustering on arbitrary sets of biological entities, given any kind of pairwise similarities. We first evaluated the power of Bi-Force to solve dedicated bicluster editing problems by comparing Bi-Force with two existing algorithms in the BiCluE software package. We then followed a biclustering evaluation protocol in a recent review paper from Eren et al. (2013) (A comparative analysis of biclustering algorithms for gene expressiondata. Brief. Bioinform., 14:279–292.) and compared Bi-Force against eight existing tools: FABIA, QUBIC, Cheng and Church, Plaid, BiMax, Spectral, xMOTIFs and ISA. To this end, a suite of synthetic datasets as well as nine large gene expression datasets from Gene Expression Omnibus were analyzed. All resulting biclusters were subsequently investigated by Gene Ontology enrichment analysis to evaluate their biological relevance. The distinct theoretical foundation of Bi-Force (bicluster editing) is more powerful than strict biclustering. We thus outperformed existing tools with Bi-Force at least when following the evaluation protocols from Eren et al. Bi-Force is implemented in Java and integrated into the open source software package of BiCluE. The software as well as all used datasets are publicly available at http://biclue.mpi-inf.mpg.de. PMID:24682815
Pan, Feng; Wang, Yue; Liu, Huanglong; Wu, Min; Chu, Wenyuan; Chen, Danmei; Xiang, Yan
2017-06-27
The SQUAMOSA promoter binding protein-like (SPL) proteins are plant-specific transcription factors (TFs) that function in a variety of developmental processes including growth, flower development, and signal transduction. SPL proteins are encoded by a gene family, and these genes have been characterized in two model grass species, Zea mays and Oryza sativa. The SPL gene family has not been well studied in moso bamboo (Phyllostachys edulis), a woody grass species. We identified 32 putative PeSPL genes in the P. edulis genome. Phylogenetic analysis arranged the PeSPL protein sequences in eight groups. Similarly, phylogenetic analysis of the SBP-like and SBP proteins from rice and maize clustered them into eight groups analogous to those from P. edulis. Furthermore, the deduced PeSPL proteins in each group contained very similar conserved sequence motifs. Our analyses indicate that the PeSPL genes experienced a large-scale duplication event ~15 million years ago (MYA), and that divergence between the PeSPL and OsSPL genes occurred 34 MYA. The stress-response expression profiles and tissue-specificity of the putative PeSPL gene promoter regions showed that SPL genes in moso bamboo have potential biological functions in stress resistance as well as in growth and development. We therefore examined PeSPL gene expression in response to different plant hormone and drought (polyethylene glycol-6000; PEG) treatments to mimic biotic and abiotic stresses. Expression of three (PeSPL10, -12, -17), six (PeSPL1, -10, -12, -17, -20, -31), and nine (PeSPL5, -8, -9, -14, -15, -19, -20, -31, -32) genes remained relatively stable after treating with salicylic acid (SA), gibberellic acid (GA), and PEG, respectively, while the expression patterns of other genes changed. In addition, analysis of tissue-specific expression of the moso bamboo SPL genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in flowers and leaves. The PeSPL genes play important roles in plant growth and development, including responses to stresses, and most of the genes are expressed in different tissues. Our study provides a comprehensive understanding of the PeSPL gene family and may enable future studies on the function and evolution of SPL genes in moso bamboo.
Gene expression in lung adenocarcinomas of smokers and nonsmokers.
Powell, Charles A; Spira, Avrum; Derti, Adnan; DeLisi, Charles; Liu, Gang; Borczuk, Alain; Busch, Steve; Sahasrabudhe, Sudhir; Chen, Yangde; Sugarbaker, David; Bueno, Raphael; Richards, William G; Brody, Jerome S
2003-08-01
Adenocarcinoma (AC) has become the most frequent type of lung cancer in men and women, and is the major form of lung cancer in nonsmokers. Our goal in this paper was to determine if AC in smokers and nonsmokers represents the same genetic disease. We compared gene expression profiles in resected samples of nonmalignant lung tissue and tumor tissue in six never-smokers with AC and in six smokers with AC, who were matched for clinical staging and histologic criteria of cell differentiation. Results were analyzed using a variety of bioinformatic tools. Four times as many genes changed expression in the transition from noninvolved lung to tumor in nonsmokers as in smokers, suggesting that AC in nonsmokers evolves locally, whereas AC in smokers evolves in a field of genetically altered tissue. There were some similarities in gene expression in smokers and nonsmokers, but many differences, suggesting different pathways of cell transformation and tumor formation. Gene expression in the noninvolved lungs of smokers differed from that of nonsmokers, and multidimensional scaling showed that noninvolved lungs of smokers groups with tumors rather than noninvolved lungs of nonsmokers. In addition, expression of a number of genes correlated with smoking intensity. Our findings, although limited by small sample size, suggest that additional studies comparing noninvolved to tumor tissue may identify pathogenetic mechanisms and therapeutic targets that differ in AC of smokers and nonsmokers.
Multiple Regulatory Modules Are Required for Scale-to-Feather Conversion.
Wu, Ping; Yan, Jie; Lai, Yung-Chih; Ng, Chen Siang; Li, Ang; Jiang, Xueyuan; Elsey, Ruth M; Widelitz, Randall; Bajpai, Ruchi; Li, Wen-Hsiung; Chuong, Cheng-Ming
2018-02-01
The origin of feathers is an important question in Evo-Devo studies, with the eventual evolution of vaned feathers which are aerodynamic, allowing feathered dinosaurs and early birds to fly and venture into new ecological niches. Studying how feathers and scales are developmentally specified provides insight into how a new organ may evolve. We identified feather-associated genes using genomic analyses. The candidate genes were tested by expressing them in chicken and alligator scale forming regions. Ectopic expression of these genes induced intermediate morphotypes between scales and feathers which revealed several major morphogenetic events along this path: Localized growth zone formation, follicle invagination, epithelial branching, feather keratin differentiation, and dermal papilla formation. In addition to molecules known to induce feathers on scales (retinoic acid, β-catenin), we identified novel scale-feather converters (Sox2, Zic1, Grem1, Spry2, Sox18) which induce one or more regulatory modules guiding these morphogenetic events. Some morphotypes resemble filamentous appendages found in feathered dinosaur fossils, whereas others exhibit characteristics of modern avian feathers. We propose these morpho-regulatory modules were used to diversify archosaur scales and to initiate feather evolution. The regulatory combination and hierarchical integration may have led to the formation of extant feather forms. Our study highlights the importance of integrating discoveries between developmental biology and paleontology. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Zhou, Yanrong; Lin, Yanli; Wu, Xiaojie; Xiong, Fuyin; Lv, Yuemeng; Zheng, Tao; Huang, Peitang; Chen, Hongxing
2012-02-01
Transgene expression for the mammary gland bioreactor aimed at producing recombinant proteins requires optimized expression vector construction. Previously we presented a hybrid gene locus strategy, which was originally tested with human lactoferrin (hLF) as target transgene, and an extremely high-level expression of rhLF ever been achieved as to 29.8 g/l in mice milk. Here to demonstrate the broad application of this strategy, another 38.4 kb mWAP-htPA hybrid gene locus was constructed, in which the 3-kb genomic coding sequence in the 24-kb mouse whey acidic protein (mWAP) gene locus was substituted by the 17.4-kb genomic coding sequence of human tissue plasminogen activator (htPA), exactly from the start codon to the end codon. Corresponding five transgenic mice lines were generated and the highest expression level of rhtPA in the milk attained as to 3.3 g/l. Our strategy will provide a universal way for the large-scale production of pharmaceutical proteins in the mammary gland of transgenic animals.
The opportunities and challenges of large-scale molecular approaches to songbird neurobiology
Mello, C.V.; Clayton, D.F.
2014-01-01
High-through put methods for analyzing genome structure and function are having a large impact in song-bird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curations, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Carbonell, Alberto; Fahlgren, Noah; Mitchell, Skyler; ...
2015-05-20
Artificial microRNAs (amiRNAs) are used for selective gene silencing in plants. However, current methods to produce amiRNA constructs for silencing transcripts in monocot species are not suitable for simple, cost-effective and large-scale synthesis. Here, a series of expression vectors based on Oryza sativa MIR390 (OsMIR390) precursor was developed for high-throughput cloning and high expression of amiRNAs in monocots. Four different amiRNA sequences designed to target specifically endogenous genes and expressed from OsMIR390-based vectors were validated in transgenic Brachypodium distachyon plants. Surprisingly, amiRNAs accumulated to higher levels and were processed more accurately when expressed from chimeric OsMIR390-based precursors that include distalmore » stem-loop sequences from Arabidopsis thaliana MIR390a (AtMIR390a). In all cases, transgenic plants displayed the predicted phenotypes induced by target gene repression, and accumulated high levels of amiRNAs and low levels of the corresponding target transcripts. Genome-wide transcriptome profiling combined with 5-RLM-RACE analysis in transgenic plants confirmed that amiRNAs were highly specific. Finally, significance Statement A series of amiRNA vectors based on Oryza sativa MIR390 (OsMIR390) precursor were developed for simple, cost-effective and large-scale synthesis of amiRNA constructs to silence genes in monocots. Unexpectedly, amiRNAs produced from chimeric OsMIR390-based precursors including Arabidopsis thaliana MIR390a distal stem-loop sequences accumulated elevated levels of highly effective and specific amiRNAs in transgenic Brachypodium distachyon plants.« less
Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.
2015-01-01
The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392
Biomarkers identified for prostate cancer patients through genome-scale screening.
Wang, Lei-Yun; Cui, Jia-Jia; Zhu, Tao; Shao, Wei-Hua; Zhao, Yi; Wang, Sai; Zhang, Yu-Peng; Wu, Ji-Chu; Zhang, Le
2017-11-03
Prostate cancer is a threat to men and usually occurs in aged males. Though prostate specific antigen level and Gleason score are utilized for evaluation of the prostate cancer in clinic, the biomarkers for this malignancy have not been widely recognized. Furthermore, the outcome varies across individuals receiving comparable treatment regimens and the underlying mechanism is still unclear. We supposed that genetic feature may be responsible for, at least in part, this process and conducted a two-cohort study to compare the genetic difference in tumorous and normal tissues of prostate cancer patients. The Gene Expression Omnibus dataset were used and a total of 41 genes were found significantly differently expressed in tumor tissues as compared with normal prostate tissues. Four genes (SPOCK3, SPON1, PTN and TGFB3) were selected for further evaluation after Gene Ontology analysis, Kyoto Encyclopedia of Genes and Genomes pathway analysis and clinical association analysis. MIR1908 was also found decreased expression level in prostate cancer whose target genes were found expressing in both prostate tumor and normal tissues. These results indicated that these potential biomarkers deserve attention in prostate cancer patients and the underlying mechanism should be further investigated.
Selection Shapes Transcriptional Logic and Regulatory Specialization in Genetic Networks.
Fogelmark, Karl; Peterson, Carsten; Troein, Carl
2016-01-01
Living organisms need to regulate their gene expression in response to environmental signals and internal cues. This is a computational task where genes act as logic gates that connect to form transcriptional networks, which are shaped at all scales by evolution. Large-scale mutations such as gene duplications and deletions add and remove network components, whereas smaller mutations alter the connections between them. Selection determines what mutations are accepted, but its importance for shaping the resulting networks has been debated. To investigate the effects of selection in the shaping of transcriptional networks, we derive transcriptional logic from a combinatorially powerful yet tractable model of the binding between DNA and transcription factors. By evolving the resulting networks based on their ability to function as either a simple decision system or a circadian clock, we obtain information on the regulation and logic rules encoded in functional transcriptional networks. Comparisons are made between networks evolved for different functions, as well as with structurally equivalent but non-functional (neutrally evolved) networks, and predictions are validated against the transcriptional network of E. coli. We find that the logic rules governing gene expression depend on the function performed by the network. Unlike the decision systems, the circadian clocks show strong cooperative binding and negative regulation, which achieves tight temporal control of gene expression. Furthermore, we find that transcription factors act preferentially as either activators or repressors, both when binding multiple sites for a single target gene and globally in the transcriptional networks. This separation into positive and negative regulators requires gene duplications, which highlights the interplay between mutation and selection in shaping the transcriptional networks.
Morabito, Giuseppe; Giannelli, Serena G; Ordazzo, Gabriele; Bido, Simone; Castoldi, Valerio; Indrigo, Marzia; Cabassi, Tommaso; Cattaneo, Stefano; Luoni, Mirko; Cancellieri, Cinzia; Sessa, Alessandro; Bacigaluppi, Marco; Taverna, Stefano; Leocani, Letizia; Lanciego, José L; Broccoli, Vania
2017-12-06
The lack of technology for direct global-scale targeting of the adult mouse nervous system has hindered research on brain processing and dysfunctions. Currently, gene transfer is normally achieved by intraparenchymal viral injections, but these injections target a restricted brain area. Herein, we demonstrated that intravenous delivery of adeno-associated virus (AAV)-PHP.B viral particles permeated and diffused throughout the neural parenchyma, targeting both the central and the peripheral nervous system in a global pattern. We then established multiple procedures of viral transduction to control gene expression or inactivate gene function exclusively in the adult nervous system and assessed the underlying behavioral effects. Building on these results, we established an effective gene therapy strategy to counteract the widespread accumulation of α-synuclein deposits throughout the forebrain in a mouse model of synucleinopathy. Transduction of A53T-SCNA transgenic mice with AAV-PHP.B-GBA1 restored physiological levels of the enzyme, reduced α-synuclein pathology, and produced significant behavioral recovery. Finally, we provided evidence that AAV-PHP.B brain penetration does not lead to evident dysfunctions in blood-brain barrier integrity or permeability. Altogether, the AAV-PHP.B viral platform enables non-invasive, widespread, and long-lasting global neural expression of therapeutic genes, such as GBA1, providing an invaluable approach to treat neurodegenerative diseases with diffuse brain pathology such as synucleinopathies. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.
Microarray analysis identifies candidate genes for key roles in coral development
Grasso, Lauretta C; Maindonald, John; Rudd, Stephen; Hayward, David C; Saint, Robert; Miller, David J; Ball, Eldon E
2008-01-01
Background Anthozoan cnidarians are amongst the simplest animals at the tissue level of organization, but are surprisingly complex and vertebrate-like in terms of gene repertoire. As major components of tropical reef ecosystems, the stony corals are anthozoans of particular ecological significance. To better understand the molecular bases of both cnidarian development in general and coral-specific processes such as skeletogenesis and symbiont acquisition, microarray analysis was carried out through the period of early development – when skeletogenesis is initiated, and symbionts are first acquired. Results Of 5081 unique peptide coding genes, 1084 were differentially expressed (P ≤ 0.05) in comparisons between four different stages of coral development, spanning key developmental transitions. Genes of likely relevance to the processes of settlement, metamorphosis, calcification and interaction with symbionts were characterised further and their spatial expression patterns investigated using whole-mount in situ hybridization. Conclusion This study is the first large-scale investigation of developmental gene expression for any cnidarian, and has provided candidate genes for key roles in many aspects of coral biology, including calcification, metamorphosis and symbiont uptake. One surprising finding is that some of these genes have clear counterparts in higher animals but are not present in the closely-related sea anemone Nematostella. Secondly, coral-specific processes (i.e. traits which distinguish corals from their close relatives) may be analogous to similar processes in distantly related organisms. This first large-scale application of microarray analysis demonstrates the potential of this approach for investigating many aspects of coral biology, including the effects of stress and disease. PMID:19014561
Coram, Tristan E; Pang, Edwin C K
2006-11-01
Using microarray technology and a set of chickpea (Cicer arietinum L.) unigenes, grasspea (Lathyrus sativus L.) expressed sequence tags (ESTs) and lentil (Lens culinaris Med.) resistance gene analogues, the ascochyta blight (Ascochyta rabiei (Pass.) L.) resistance response was studied in four chickpea genotypes, including resistant, moderately resistant, susceptible and wild relative (Cicer echinospermum L.) genotypes. The experimental system minimized environmental effects and was conducted in reference design, in which samples from mock-inoculated controls acted as reference against post-inoculation samples. Robust data quality was achieved through the use of three biological replicates (including a dye swap), the inclusion of negative controls and strict selection criteria for differentially expressed genes, including a fold change cut-off determined by self-self hybridizations, Student's t-test and multiple testing correction (P < 0.05). Microarray observations were also validated by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). The time course expression patterns of 756 microarray features resulted in the differential expression of 97 genes in at least one genotype at one time point. k-means clustering grouped the genes into clusters of similar observations for each genotype, and comparisons between A. rabiei-resistant and A. rabiei-susceptible genotypes revealed potential gene 'signatures' predictive of effective A. rabiei resistance. These genes included several pathogenesis-related proteins, SNAKIN2 antimicrobial peptide, proline-rich protein, disease resistance response protein DRRG49-C, environmental stress-inducible protein, leucine-zipper protein, polymorphic antigen membrane protein, Ca-binding protein and several unknown proteins. The potential involvement of these genes and their pathways of induction are discussed. This study represents the first large-scale gene expression profiling in chickpea, and future work will focus on the functional validation of the genes of interest.
Sen Sarma, Moushumi; Rodriguez-Zas, Sandra L.; Hong, Feng; Zhong, Sheng; Robinson, Gene E.
2009-01-01
Background We conducted a large-scale transcriptomic profiling of selected regions of the central nervous system (CNS) across three species of honey bees, in foragers that were performing dance behavior to communicate to their nestmates the location, direction and profitability of an attractive floral resource. We used microarrays to measure gene expression in bees from Apis mellifera, dorsata and florea, species that share major traits unique to the genus and also show striking differences in biology and dance communication. The goals of this study were to determine the extent of regional specialization in gene expression and to explore the molecular basis of dance communication. Principal Findings This “snapshot” of the honey bee CNS during dance behavior provides strong evidence for both species-consistent and species-specific differences in gene expression. Gene expression profiles in the mushroom bodies consistently showed the biggest differences relative to the other CNS regions. There were strong similarities in gene expression between the central brain and the second thoracic ganglion across all three species; many of the genes were related to metabolism and energy production. We also obtained gene expression differences between CNS regions that varied by species: A. mellifera differed the most, while dorsata and florea tended to be more similar. Significance Species differences in gene expression perhaps mirror known differences in nesting habit, ecology and dance behavior between mellifera, florea and dorsata. Species-specific differences in gene expression in selected CNS regions that relate to synaptic activity and motor control provide particularly attractive candidate genes to explain the differences in dance behavior exhibited by these three honey bee species. Similarities between central brain and thoracic ganglion provide a unique perspective on the potential coupling of these two motor-related regions during dance behavior and perhaps provide a snapshot of the energy intensive process of dance output generation. Mushroom body results reflect known roles for this region in the regulation of learning, memory and rhythmic behavior. PMID:19641619
Sen Sarma, Moushumi; Rodriguez-Zas, Sandra L; Hong, Feng; Zhong, Sheng; Robinson, Gene E
2009-07-29
We conducted a large-scale transcriptomic profiling of selected regions of the central nervous system (CNS) across three species of honey bees, in foragers that were performing dance behavior to communicate to their nestmates the location, direction and profitability of an attractive floral resource. We used microarrays to measure gene expression in bees from Apis mellifera, dorsata and florea, species that share major traits unique to the genus and also show striking differences in biology and dance communication. The goals of this study were to determine the extent of regional specialization in gene expression and to explore the molecular basis of dance communication. This "snapshot" of the honey bee CNS during dance behavior provides strong evidence for both species-consistent and species-specific differences in gene expression. Gene expression profiles in the mushroom bodies consistently showed the biggest differences relative to the other CNS regions. There were strong similarities in gene expression between the central brain and the second thoracic ganglion across all three species; many of the genes were related to metabolism and energy production. We also obtained gene expression differences between CNS regions that varied by species: A. mellifera differed the most, while dorsata and florea tended to be more similar. Species differences in gene expression perhaps mirror known differences in nesting habit, ecology and dance behavior between mellifera, florea and dorsata. Species-specific differences in gene expression in selected CNS regions that relate to synaptic activity and motor control provide particularly attractive candidate genes to explain the differences in dance behavior exhibited by these three honey bee species. Similarities between central brain and thoracic ganglion provide a unique perspective on the potential coupling of these two motor-related regions during dance behavior and perhaps provide a snapshot of the energy intensive process of dance output generation. Mushroom body results reflect known roles for this region in the regulation of learning, memory and rhythmic behavior.
Combined strategies for improving expression of Citrobacter amalonaticus phytase in Pichia pastoris.
Li, Cheng; Lin, Ying; Zheng, Xueyun; Pang, Nuo; Liao, Xihao; Liu, Xiaoxiao; Huang, Yuanyuan; Liang, Shuli
2015-09-26
Phytase is used as an animal feed additive that degrades phytic acid and reduces feeding costs and pollution caused by fecal excretion of phosphorus. Some phytases have been expressed in Pichia pastoris, among which the phytase from Citrobacter amalonaticus CGMCC 1696 had high specific activity (3548 U/mg). Improvement of the phytase expression level will contribute to facilitate its industrial applications. To improve the phytase expression, we use modification of P AOX1 and the α-factor signal peptide, increasing the gene copy number, and overexpressing HAC1 (i) to enhance folding and secretion of the protein in the endoplasmic reticulum. The genetic stability and fermentation in 10-L scaled-up fed-batch fermenter was performed to prepare for the industrial production. The phytase gene from C. amalonaticus CGMCC 1696 was cloned under the control of the AOX1 promoter (P AOX1 ) and expressed in P. pastoris. The phytase activity achieved was 414 U/mL. Modifications of P AOX1 and the α-factor signal peptide increased the phytase yield by 35 and 12%, respectively. Next, on increasing the copy number of the Phy gene to six, the phytase yield was 141% higher than in the strain containing only a single gene copy. Furthermore, on overexpression of HAC1 (i) (i indicating induced), a gene encoding Hac1p that regulates the unfolded protein response, the phytase yield achieved was 0.75 g/L with an activity of 2119 U/mL, 412% higher than for the original strain. The plasmids in this high-phytase expression strain were stable during incubation at 30 °C in Yeast Extract Peptone Dextrose (YPD) Medium. In a 10-L scaled-up fed-batch fermenter, the phytase yield achieved was 9.58 g/L with an activity of 35,032 U/mL. The production of a secreted protein will reach its limit at a specific gene copy number where further increases in transcription and translation due to the higher abundance of gene copies will not enhance the secretion process any further. Enhancement of protein folding in the ER can alleviate bottlenecks in the folding and secretion pathways during the overexpression of heterologous proteins in P. pastoris. Using modification of P AOX1 and the α-factor signal peptide, increasing the gene copy number, and overexpressing HAC1 (i) to enhance folding and secretion of the protein in the endoplasmic reticulum, we have successfully increased the phytase yield 412% relative to the original strain. In a 10-L fed-batch fermenter, the phytase yield achieved was 9.58 g/L with an activity of 35,032 U/mL. Large-scale production of phytase can be applied towards different biocatalytic and feed additive applications.
Gould, S J; Hong, S T; Carney, J R
1998-01-01
The genes for most of the biosynthesis of the kinamycin antibiotics have been cloned and heterologously expressed. Genomic DNA of Streptomyces murayamaensis was partially digested with MboI and a library of approximately 40 kb fragments in E. coli XL1-BlueMR was prepared using the cosmid vector pOJ446. Hybridization with the actI probe from the actinorhodin polyketide synthase genes identified two clusters of polyketide genes. After transferal of these clusters to S. lividans ZX7, expression of one cluster was established by HPLC with photodiode array detection. Peaks were identified from the kin cluster for dehydrorabelomycin, kinobscurinone, and stealthin C, which are known intermediates in kinamycin biosynthesis. Two shunt metabolites, kinafluorenone and seongomycin were also identified. The structure of the latter was determined from a quantity obtained from large-scale fermentation of one of the clones.
Effect of storage time on gene expression data acquired from unfrozen archived newborn blood spots.
Ho, Nhan T; Busik, Julia V; Resau, James H; Paneth, Nigel; Khoo, Sok Kean
2016-11-01
Unfrozen archived newborn blood spots (NBS) have been shown to retain sufficient messenger RNA (mRNA) for gene expression profiling. However, the effect of storage time at ambient temperature for NBS samples in relation to the quality of gene expression data is relatively unknown. Here, we evaluated mRNA expression from quantitative real-time PCR (qRT-PCR) and microarray data obtained from NBS samples stored at ambient temperature to determine the effect of storage time on the quality of gene expression. These data were generated in a previous case-control study examining NBS in 53 children with cerebral palsy (CP) and 53 matched controls. NBS sample storage period ranged from 3 to 16years at ambient temperature. We found persistently low RNA integrity numbers (RIN=2.3±0.71) and 28S/18S rRNA ratios (~0) across NBS samples for all storage periods. In both qRT-PCR and microarray data, the expression of three common housekeeping genes-beta cytoskeletal actin (ACTB), glyceraldehyde 3-phosphate dehydrogenase (GAPDH), and peptidylprolyl isomerase A (PPIA)-decreased with increased storage time. Median values of each microarray probe intensity at log 2 scale also decreased over time. After eight years of storage, probe intensity values were largely reduced to background intensity levels. Of 21,500 genes tested, 89% significantly decreased in signal intensity, with 13,551, 10,730, and 9925 genes detected within 5years, > 5 to <10years, and >10years of storage, respectively. We also examined the expression of two gender-specific genes (X inactivation-specific transcript, XIST and lysine-specific demethylase 5D, KDM5D) and seven gene sets representing the inflammatory, hypoxic, coagulative, and thyroidal pathways hypothesized to be related to CP risk to determine the effect of storage time on the detection of these biologically relevant genes. We found the gender-specific genes and CP-related gene sets detectable in all storage periods, but exhibited differential expression (between male vs. female or CP vs. control) only within the first six years of storage. We concluded that gene expression data quality deteriorates in unfrozen archived NBS over time and that differential gene expression profiling and analysis is recommended for those NBS samples collected and stored within six years at ambient temperature. Copyright © 2016 Elsevier Inc. All rights reserved.
2011-01-01
Background For efficient and large scale production of recombinant proteins in plants transient expression by agroinfection has a number of advantages over stable transformation. Simple manipulation, rapid analysis and high expression efficiency are possible. In pea, Pisum sativum, a Virus Induced Gene Silencing System using the pea early browning virus has been converted into an efficient agroinfection system by converting the two RNA genomes of the virus into binary expression vectors for Agrobacterium transformation. Results By vacuum infiltration (0.08 Mpa, 1 min) of germinating pea seeds with 2-3 cm roots with Agrobacteria carrying the binary vectors, expression of the gene for Green Fluorescent Protein as marker and the gene for the human acidic fibroblast growth factor (aFGF) was obtained in 80% of the infiltrated developing seedlings. Maximal production of the recombinant proteins was achieved 12-15 days after infiltration. Conclusions Compared to the leaf injection method vacuum infiltration of germinated seeds is highly efficient allowing large scale production of plants transiently expressing recombinant proteins. The production cycle of plants for harvesting the recombinant protein was shortened from 30 days for leaf injection to 15 days by applying vacuum infiltration. The synthesized aFGF was purified by heparin-affinity chromatography and its mitogenic activity on NIH 3T3 cells confirmed to be similar to a commercial product. PMID:21548923
Kim, Gwang Hoon; Jeong, Hae Jin; Yoo, Yeong Du; Kim, Sunju; Han, Ji Hee; Han, Jong Won; Zuccarello, Giuseppe C
2013-01-01
The loss of photosynthetic function should lead to the cessation of expression and finally loss of photosynthetic genes in the new heterotroph. Dinoflagellates are known to have lost their photosynthetic ability several times. Dinoflagellates have also acquired photosynthesis from other organisms, either on a long-term basis or as "kleptoplastids" multiple times. The fate of photosynthetic gene expression in heterotrophs can be informative into evolution of gene expression patterns after functional loss, and the dinoflagellates ability to acquire new photosynthetic function through additional endosymbiosis. To explore this we analyzed a large-scale EST database consisting of 151,091 unique sequences (29,170 contigs, 120,921 singletons) obtained from 454 pyrosequencing of the heterotrophic dinoflagellate Pfiesteria piscicida. About 597 contigs from P. piscicida showed significant homology (E-value
Lam, L T; Pickeral, O K; Peng, A C; Rosenwald, A; Hurt, E M; Giltnane, J M; Averett, L M; Zhao, H; Davis, R E; Sathyamoorthy, M; Wahl, L M; Harris, E D; Mikovits, J A; Monks, A P; Hollingshead, M G; Sausville, E A; Staudt, L M
2001-01-01
Flavopiridol, a flavonoid currently in cancer clinical trials, inhibits cyclin-dependent kinases (CDKs) by competitively blocking their ATP-binding pocket. However, the mechanism of action of flavopiridol as an anti-cancer agent has not been fully elucidated. Using DNA microarrays, we found that flavopiridol inhibited gene expression broadly, in contrast to two other CDK inhibitors, roscovitine and 9-nitropaullone. The gene expression profile of flavopiridol closely resembled the profiles of two transcription inhibitors, actinomycin D and 5,6-dichloro-1-beta-D-ribofuranosyl-benzimidazole (DRB), suggesting that flavopiridol inhibits transcription globally. We were therefore able to use flavopiridol to measure mRNA turnover rates comprehensively and we found that different functional classes of genes had distinct distributions of mRNA turnover rates. In particular, genes encoding apoptosis regulators frequently had very short half-lives, as did several genes encoding key cell-cycle regulators. Strikingly, genes that were transcriptionally inducible were disproportionately represented in the class of genes with rapid mRNA turnover. The present genomic-scale measurement of mRNA turnover uncovered a regulatory logic that links gene function with mRNA half-life. The observation that transcriptionally inducible genes often have short mRNA half-lives demonstrates that cells have a coordinated strategy to rapidly modulate the mRNA levels of these genes. In addition, the present results suggest that flavopiridol may be more effective against types of cancer that are highly dependent on genes with unstable mRNAs.
Hwang, Sun-Goo; Kim, Dong Sub; Hwang, Jung Eun; Han, A-Reum; Jang, Cheol Seong
2014-05-15
In order to better understand the biological systems that are affected in response to cosmic ray (CR), we conducted weighted gene co-expression network analysis using the module detection method. By using the Pearson's correlation coefficient (PCC) value, we evaluated complex gene-gene functional interactions between 680 CR-responsive probes from integrated microarray data sets, which included large-scale transcriptional profiling of 1000 microarray samples. These probes were divided into 6 distinct modules that contained 20 enriched gene ontology (GO) functions, such as oxidoreductase activity, hydrolase activity, and response to stimulus and stress. In particular, modules 1 and 2 commonly showed enriched annotation categories such as oxidoreductase activity, including enriched cis-regulatory elements known as ROS-specific regulators. These results suggest that the ROS-mediated irradiation response pathway is affected by CR in modules 1 and 2. We found 243 ionizing radiation (IR)-responsive probes that exhibited similarities in expression patterns in various irradiation microarray data sets. The expression patterns of 6 randomly selected IR-responsive genes were evaluated by quantitative reverse transcription polymerase chain reaction following treatment with CR, gamma rays (GR), and ion beam (IB); similar patterns were observed among these genes under these 3 treatments. Moreover, we constructed subnetworks of IR-responsive genes and evaluated the expression levels of their neighboring genes following GR treatment; similar patterns were observed among them. These results of network-based analyses might provide a clue to understanding the complex biological system related to the CR response in plants. Copyright © 2014 Elsevier B.V. All rights reserved.
Elling, Axel A; Mitreva, Makedonka; Recknor, Justin; Gai, Xiaowu; Martin, John; Maier, Thomas R; McDermott, Jeffrey P; Hewezi, Tarek; McK Bird, David; Davis, Eric L; Hussey, Richard S; Nettleton, Dan; McCarter, James P; Baum, Thomas J
2007-01-01
Background The soybean cyst nematode Heterodera glycines is the most important parasite in soybean production worldwide. A comprehensive analysis of large-scale gene expression changes throughout the development of plant-parasitic nematodes has been lacking to date. Results We report an extensive genomic analysis of H. glycines, beginning with the generation of 20,100 expressed sequence tags (ESTs). In-depth analysis of these ESTs plus approximately 1,900 previously published sequences predicted 6,860 unique H. glycines genes and allowed a classification by function using InterProScan. Expression profiling of all 6,860 genes throughout the H. glycines life cycle was undertaken using the Affymetrix Soybean Genome Array GeneChip. Our data sets and results represent a comprehensive resource for molecular studies of H. glycines. Demonstrating the power of this resource, we were able to address whether arrested development in the Caenorhabditis elegans dauer larva and the H. glycines infective second-stage juvenile (J2) exhibits shared gene expression profiles. We determined that the gene expression profiles associated with the C. elegans dauer pathway are not uniformly conserved in H. glycines and that the expression profiles of genes for metabolic enzymes of C. elegans dauer larvae and H. glycines infective J2 are dissimilar. Conclusion Our results indicate that hallmark gene expression patterns and metabolism features are not shared in the developmentally arrested life stages of C. elegans and H. glycines, suggesting that developmental arrest in these two nematode species has undergone more divergent evolution than previously thought and pointing to the need for detailed genomic analyses of individual parasite species. PMID:17919324
Otto, Benjamin; Gruner, Katharina; Heinlein, Christina; Wegwitz, Florian; Nollau, Peter; Ylstra, Bauke; Pantel, Klaus; Schumacher, Udo; Baumbusch, Lars O; Martin-Subero, José Ignacio; Siebert, Reiner; Wagener, Christoph; Streichert, Thomas; Deppert, Wolfgang; Tolstonog, Genrich V
2013-03-15
Mammary carcinomas developing in SV40 transgenic WAP-T mice arise in two distinct histological phenotypes: as differentiated low-grade and undifferentiated high-grade tumors. We integrated different types of information such as histological grading, analysis of aCGH-based gene copy number and gene expression profiling to provide a comprehensive molecular description of mammary tumors in WAP-T mice. Applying a novel procedure for the correlation of gene copy number with gene expression on a global scale, we observed in tumor samples a global coherence between genotype and transcription. This coherence can be interpreted as a matched transcriptional regulation inherited from the cells of tumor origin and determined by the activity of cancer driver genes. Despite common recurrent genomic aberrations, e.g. gain of chr. 15 in most WAP-T tumors, loss of chr. 19 frequently occurs only in low-grade tumors. These tumors show features of "basal-like" epithelial differentiation, particularly expression of keratin 14. The high-grade tumors are clearly separated from the low-grade tumors by strong expression of the Met gene and by coexpression of epithelial (e.g. keratin 18) and mesenchymal (e.g. vimentin) markers. In high-grade tumors, the expression of the nonmutated Met protein is associated with Met-locus amplification and Met activity. The role of Met as a cancer driver gene is supported by the contribution of active Met signaling to motility and growth of mammary tumor-derived cells. Finally, we discuss the independent origin of low- and high-grade tumors from distinct cells of tumor origin, possibly luminal progenitors, distinguished by Met gene expression and Met signaling. Copyright © 2012 UICC.
Englund, Marie; Carlsbecker, Annelie; Engström, Peter; Vergara-Silva, Francisco
2011-01-01
The morphological variation among reproductive organs of extant gymnosperms is remarkable, especially among conifers. Several hypotheses concerning morphological homology between various conifer reproductive organs have been put forward, in particular in relation to the pine ovuliferous scale. Here, we use the expression patterns of orthologs of the ABC-model MADS-box gene AGAMOUS (AG) for testing morphological homology hypotheses related to organs of the conifer female cone. To this end, we first developed a tailored 3'RACE procedure that allows reliable amplification of partial sequences highly similar to gymnosperm-derived members of the AG-subfamily of MADS-box genes. Expression patterns of two novel conifer AG orthologs cloned with this procedure-namely PodAG and TgAG, obtained from the podocarp Podocarpus reichei and the yew Taxus globosa, respectively-are then further characterized in the morphologically divergent female cones of these species. The expression patterns of PodAG and TgAG are compared with those of DAL2, a previously discovered Picea abies (Pinaceae) AG ortholog. By treating the expression patterns of DAL2, PodAG, and TgAG as character states mapped onto currently accepted cladogram topologies, we suggest that the epimatium-that is, the podocarp female cone organ previously postulated as a "modified" ovuliferous scale-and the canonical Pinaceae ovuliferous scale can be legitimally conceptualized as "primary homologs." Character state mapping for TgAG suggests in turn that the aril of Taxaceae should be considered as a different type of organ. This work demonstrates how the interaction between developmental-genetic data and formal cladistic theory could fruitfully contribute to gymnosperm systematics. © 2011 Wiley Periodicals, Inc.
A regulatory toolbox of MiniPromoters to drive selective expression in the brain
Portales-Casamar, Elodie; Swanson, Douglas J.; Liu, Li; de Leeuw, Charles N.; Banks, Kathleen G.; Ho Sui, Shannan J.; Fulton, Debra L.; Ali, Johar; Amirabbasi, Mahsa; Arenillas, David J.; Babyak, Nazar; Black, Sonia F.; Bonaguro, Russell J.; Brauer, Erich; Candido, Tara R.; Castellarin, Mauro; Chen, Jing; Chen, Ying; Cheng, Jason C. Y.; Chopra, Vik; Docking, T. Roderick; Dreolini, Lisa; D'Souza, Cletus A.; Flynn, Erin K.; Glenn, Randy; Hatakka, Kristi; Hearty, Taryn G.; Imanian, Behzad; Jiang, Steven; Khorasan-zadeh, Shadi; Komljenovic, Ivana; Laprise, Stéphanie; Liao, Nancy Y.; Lim, Jonathan S.; Lithwick, Stuart; Liu, Flora; Liu, Jun; Lu, Meifen; McConechy, Melissa; McLeod, Andrea J.; Milisavljevic, Marko; Mis, Jacek; O'Connor, Katie; Palma, Betty; Palmquist, Diana L.; Schmouth, Jean-François; Swanson, Magdalena I.; Tam, Bonny; Ticoll, Amy; Turner, Jenna L.; Varhol, Richard; Vermeulen, Jenny; Watkins, Russell F.; Wilson, Gary; Wong, Bibiana K. Y.; Wong, Siaw H.; Wong, Tony Y. T.; Yang, George S.; Ypsilanti, Athena R.; Jones, Steven J. M.; Holt, Robert A.; Goldowitz, Daniel; Wasserman, Wyeth W.; Simpson, Elizabeth M.
2010-01-01
The Pleiades Promoter Project integrates genomewide bioinformatics with large-scale knockin mouse production and histological examination of expression patterns to develop MiniPromoters and related tools designed to study and treat the brain by directed gene expression. Genes with brain expression patterns of interest are subjected to bioinformatic analysis to delineate candidate regulatory regions, which are then incorporated into a panel of compact human MiniPromoters to drive expression to brain regions and cell types of interest. Using single-copy, homologous-recombination “knockins” in embryonic stem cells, each MiniPromoter reporter is integrated immediately 5′ of the Hprt locus in the mouse genome. MiniPromoter expression profiles are characterized in differentiation assays of the transgenic cells or in mouse brains following transgenic mouse production. Histological examination of adult brains, eyes, and spinal cords for reporter gene activity is coupled to costaining with cell-type–specific markers to define expression. The publicly available Pleiades MiniPromoter Project is a key resource to facilitate research on brain development and therapies. PMID:20807748
Tucker, James D; Grever, William E; Joiner, Michael C; Konski, Andre A; Thomas, Robert A; Smolinski, Joseph M; Divine, George W; Auner, Gregory W
2012-02-01
In a large-scale nuclear incident, many thousands of people may be exposed to a wide range of radiation doses. Rapid biological dosimetry will be required on an individualized basis to estimate the exposures and to make treatment decisions. To ameliorate the adverse effects of exposure, victims may be treated with one or more cytokine growth factors, including granulocyte colony-stimulating factor (G-CSF), which has therapeutic efficacy for treating radiation-induced bone marrow ablation by stimulating granulopoiesis. The existence of infections and the administration of G-CSF each may confound the ability to achieve reliable dosimetry by gene expression analysis. In this study, C57BL/6 mice were used to determine the extent to which G-CSF and lipopolysaccharide (LPS, which simulates infection by gram-negative bacteria) alter the expression of genes that are either radiation-responsive or non-responsive, i.e., show potential for use as endogenous controls. Mice were acutely exposed to (60)Co γ rays at either 0 Gy or 6 Gy. Two hours later the animals were injected with either 0.1 mg/kg of G-CSF or 0.3 mg/kg of LPS. Expression levels of 96 different gene targets were evaluated in peripheral blood after an additional 4 or 24 h using real-time quantitative PCR. The results indicate that the expression levels of some genes are altered by LPS, but altered expression after G-CSF treatment was generally not observed. The expression levels of many genes therefore retain utility for biological dosimetry or as endogenous controls. These data suggest that PCR-based quantitative gene expression analyses may have utility in radiation biodosimetry in humans even in the presence of an infection or after treatment with G-CSF.
Bilichak, Andriy; Ilnystkyy, Yaroslav; Hollunder, Jens; Kovalchuk, Igor
2012-01-01
Plants are able to acclimate to new growth conditions on a relatively short time-scale. Recently, we showed that the progeny of plants exposed to various abiotic stresses exhibited changes in genome stability, methylation patterns and stress tolerance. Here, we performed a more detailed analysis of methylation patterns in the progeny of Arabidopsis thaliana (Arabidopsis) plants exposed to 25 and 75 mM sodium chloride. We found that the majority of gene promoters exhibiting changes in methylation were hypermethylated, and this group was overrepresented by regulators of the chromatin structure. The analysis of DNA methylation at gene bodies showed that hypermethylation in the progeny of stressed plants was primarily due to changes in the 5′ and 3′ ends as well as in exons rather than introns. All but one hypermethylated gene tested had lower gene expression. The analysis of histone modifications in the promoters and coding sequences showed that hypermethylation and lower gene expression correlated with the enrichment of H3K9me2 and depletion of H3K9ac histones. Thus, our work demonstrated a high degree of correlation between changes in DNA methylation, histone modifications and gene expression in the progeny of salt-stressed plants. PMID:22291972
Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro
2015-01-01
Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.
Carlsbecker, Annelie; Sundström, Jens; Tandre, Karolina; Englund, Marie; Kvarnheden, Anders; Johanson, Urban; Engström, Peter
2003-01-01
Transcription factors encoded by different members of the MADS-box gene family have evolved central roles in the regulation of reproductive organ development in the flowering plants, the angiosperms. Development of the stamens and carpels, the pollen- and seed-bearing organs, involves the B- and C-organ-identity MADS-box genes. B- and C-type gene orthologs with activities specifically in developing pollen- and seed-bearing organs are also present in the distantly related gymnosperms: the conifers and the gnetophytes. We now report on the characterization of DAL10, a novel MADS-box gene from the conifer Norway spruce, which unlike the B- and C-type conifer genes shows no distinct orthology relationship to any angiosperm gene or clade in phylogenetic analyses. Like the B- and C-type genes, it is active specifically in developing pollen cones and seed cones. In situ RNA localization experiments show DAL10 to be expressed in the cone axis, which carry the microsporophylls of the young pollen cone. In contrast, in the seed cone it is expressed both in the cone axis and in the bracts, which subtend the ovuliferous scales. Expression data and the phenotype of transgenic Arabidopsis plants expressing DAL10 suggest that the gene may act upstream to or in concert with the B- and C-type genes in the establishment of reproductive identity of developing cones.
Meng, Jia; Kanzaki, Gregory; Meas, Diane; Lam, Christopher K; Crummer, Heather; Tain, Justina; Xu, H Howard
2012-04-01
Regulated antisense RNA (asRNA) expression has been employed successfully in Gram-positive bacteria for genome-wide essential gene identification and drug target determination. However, there have been no published reports describing the application of asRNA gene silencing for comprehensive analyses of essential genes in Gram-negative bacteria. In this study, we report the first genome-wide identification of asRNA constructs for essential genes in Escherichia coli. We screened 250 000 library transformants for conditional growth inhibitory recombinant clones from two shotgun genomic libraries of E. coli using a paired-termini expression vector (pHN678). After sequencing plasmid inserts of 675 confirmed inducer sensitive cell clones, we identified 152 separate asRNA constructs of which 134 inserts came from essential genes, while 18 originated from nonessential genes (but share operons with essential genes). Among the 79 individual essential genes silenced by these asRNA constructs, 61 genes (77%) engage in processes related to protein synthesis. The cell-based assays of an asRNA clone targeting fusA (encoding elongation factor G) showed that the induced cells were sensitized 12-fold to fusidic acid, a known specific inhibitor. Our results demonstrate the utility of the paired-termini expression vector and feasibility of large-scale gene silencing in E. coli using regulated asRNA expression. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Cui, Peng; Zhong, Tingyan; Wang, Zhuo; Wang, Tao; Zhao, Hongyu; Liu, Chenglin; Lu, Hui
2018-06-01
Circadian genes express periodically in an approximate 24-h period and the identification and study of these genes can provide deep understanding of the circadian control which plays significant roles in human health. Although many circadian gene identification algorithms have been developed, large numbers of false positives and low coverage are still major problems in this field. In this study we constructed a novel computational framework for circadian gene identification using deep neural networks (DNN) - a deep learning algorithm which can represent the raw form of data patterns without imposing assumptions on the expression distribution. Firstly, we transformed time-course gene expression data into categorical-state data to denote the changing trend of gene expression. Two distinct expression patterns emerged after clustering of the state data for circadian genes from our manually created learning dataset. DNN was then applied to discriminate the aperiodic genes and the two subtypes of periodic genes. In order to assess the performance of DNN, four commonly used machine learning methods including k-nearest neighbors, logistic regression, naïve Bayes, and support vector machines were used for comparison. The results show that the DNN model achieves the best balanced precision and recall. Next, we conducted large scale circadian gene detection using the trained DNN model for the remaining transcription profiles. Comparing with JTK_CYCLE and a study performed by Möller-Levet et al. (doi: https://doi.org/10.1073/pnas.1217154110), we identified 1132 novel periodic genes. Through the functional analysis of these novel circadian genes, we found that the GTPase superfamily exhibits distinct circadian expression patterns and may provide a molecular switch of circadian control of the functioning of the immune system in human blood. Our study provides novel insights into both the circadian gene identification field and the study of complex circadian-driven biological control. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. Copyright © 2017. Published by Elsevier B.V.
USDA-ARS?s Scientific Manuscript database
Natural antisense transcripts (NATs) are transcripts of the opposite DNA strand to the sense-strand either at the same locus (cis-encoded) or a different locus (trans-encoded). They can affect gene expression at multiple stages including transcription, RNA processing and transport, and translation....
Inference of scale-free networks from gene expression time series.
Daisuke, Tominaga; Horton, Paul
2006-04-01
Quantitative time-series observation of gene expression is becoming possible, for example by cell array technology. However, there are no practical methods with which to infer network structures using only observed time-series data. As most computational models of biological networks for continuous time-series data have a high degree of freedom, it is almost impossible to infer the correct structures. On the other hand, it has been reported that some kinds of biological networks, such as gene networks and metabolic pathways, may have scale-free properties. We hypothesize that the architecture of inferred biological network models can be restricted to scale-free networks. We developed an inference algorithm for biological networks using only time-series data by introducing such a restriction. We adopt the S-system as the network model, and a distributed genetic algorithm to optimize models to fit its simulated results to observed time series data. We have tested our algorithm on a case study (simulated data). We compared optimization under no restriction, which allows for a fully connected network, and under the restriction that the total number of links must equal that expected from a scale free network. The restriction reduced both false positive and false negative estimation of the links and also the differences between model simulation and the given time-series data.
Jackson, Kasey L.; Dayton, Robert D.; Deverman, Benjamin E.; Klein, Ronald L.
2016-01-01
Widespread genetic modification of cells in the central nervous system (CNS) with a viral vector has become possible and increasingly more efficient. We previously applied an AAV9 vector with the cytomegalovirus/chicken beta-actin (CBA) hybrid promoter and achieved wide-scale CNS transduction in neonatal and adult rats. However, this method transduces a variety of tissues in addition to the CNS. Thus we studied intravenous AAV9 gene transfer with a synapsin promoter to better target the neurons. We noted in systematic comparisons that the synapsin promoter drives lower level expression than does the CBA promoter. The engineered adeno-associated virus (AAV)-PHP.B serotype was compared with AAV9, and AAV-PHP.B did enhance the efficiency of expression. Combining the synapsin promoter with AAV-PHP.B could therefore be advantageous in terms of combining two refinements of targeting and efficiency. Wide-scale expression was used to model a disease with widespread pathology. Vectors encoding the amyotrophic lateral sclerosis (ALS)-related protein transactive response DNA-binding protein, 43 kDa (TDP-43) with the synapsin promoter and AAV-PHP.B were used for efficient CNS-targeted TDP-43 expression. Intracerebroventricular injections were also explored to limit TDP-43 expression to the CNS. The neuron-selective promoter and the AAV-PHP.B enhanced gene transfer and ALS disease modeling in adult rats. PMID:27867348
Jackson, Kasey L; Dayton, Robert D; Deverman, Benjamin E; Klein, Ronald L
2016-01-01
Widespread genetic modification of cells in the central nervous system (CNS) with a viral vector has become possible and increasingly more efficient. We previously applied an AAV9 vector with the cytomegalovirus/chicken beta-actin (CBA) hybrid promoter and achieved wide-scale CNS transduction in neonatal and adult rats. However, this method transduces a variety of tissues in addition to the CNS. Thus we studied intravenous AAV9 gene transfer with a synapsin promoter to better target the neurons. We noted in systematic comparisons that the synapsin promoter drives lower level expression than does the CBA promoter. The engineered adeno-associated virus (AAV)-PHP.B serotype was compared with AAV9, and AAV-PHP.B did enhance the efficiency of expression. Combining the synapsin promoter with AAV-PHP.B could therefore be advantageous in terms of combining two refinements of targeting and efficiency. Wide-scale expression was used to model a disease with widespread pathology. Vectors encoding the amyotrophic lateral sclerosis (ALS)-related protein transactive response DNA-binding protein, 43 kDa (TDP-43) with the synapsin promoter and AAV-PHP.B were used for efficient CNS-targeted TDP-43 expression. Intracerebroventricular injections were also explored to limit TDP-43 expression to the CNS. The neuron-selective promoter and the AAV-PHP.B enhanced gene transfer and ALS disease modeling in adult rats.
To discover novel PPI signaling hubs for lung cancer, CTD2 Center at Emory utilized large-scale genomics datasets and literature to compile a set of lung cancer-associated genes. A library of expression vectors were generated for these genes and utilized for detecting pairwise PPIs with cell lysate-based TR-FRET assays in high-throughput screening format. Read the abstract.
Li, Wenli; Turner, Amy; Aggarwal, Praful; Matter, Andrea; Storvick, Erin; Arnett, Donna K; Broeckel, Ulrich
2015-12-16
Whole transcriptome sequencing (RNA-seq) represents a powerful approach for whole transcriptome gene expression analysis. However, RNA-seq carries a few limitations, e.g., the requirement of a significant amount of input RNA and complications led by non-specific mapping of short reads. The Ion AmpliSeq Transcriptome Human Gene Expression Kit (AmpliSeq) was recently introduced by Life Technologies as a whole-transcriptome, targeted gene quantification kit to overcome these limitations of RNA-seq. To assess the performance of this new methodology, we performed a comprehensive comparison of AmpliSeq with RNA-seq using two well-established next-generation sequencing platforms (Illumina HiSeq and Ion Torrent Proton). We analyzed standard reference RNA samples and RNA samples obtained from human induced pluripotent stem cell derived cardiomyocytes (hiPSC-CMs). Using published data from two standard RNA reference samples, we observed a strong concordance of log2 fold change for all genes when comparing AmpliSeq to Illumina HiSeq (Pearson's r = 0.92) and Ion Torrent Proton (Pearson's r = 0.92). We used ROC, Matthew's correlation coefficient and RMSD to determine the overall performance characteristics. All three statistical methods demonstrate AmpliSeq as a highly accurate method for differential gene expression analysis. Additionally, for genes with high abundance, AmpliSeq outperforms the two RNA-seq methods. When analyzing four closely related hiPSC-CM lines, we show that both AmpliSeq and RNA-seq capture similar global gene expression patterns consistent with known sources of variations. Our study indicates that AmpliSeq excels in the limiting areas of RNA-seq for gene expression quantification analysis. Thus, AmpliSeq stands as a very sensitive and cost-effective approach for very large scale gene expression analysis and mRNA marker screening with high accuracy.
Xu, Jing; Huang, Wei; Zhong, Chengrong; Luo, Daji; Li, Shuangfei; Zhu, Zuoyan; Hu, Wei
2011-01-01
Background The hypothalamic-pituitary-gonadal (HPG) axis is critical in the development and regulation of reproduction in fish. The inhibition of neuropeptide gonadotropin-releasing hormone (GnRH) expression may diminish or severely hamper gonadal development due to it being the key regulator of the axis, and then provide a model for the comprehensive study of the expression patterns of genes with respect to the fish reproductive system. Methodology/Principal Findings In a previous study we injected 342 fertilized eggs from the common carp (Cyprinus carpio) with a gene construct that expressed antisense sGnRH. Four years later, we found a total of 38 transgenic fish with abnormal or missing gonads. From this group we selected the 12 sterile females with abnormal ovaries in which we combined suppression subtractive hybridization (SSH) and cDNA microarray analysis to define changes in gene expression of the HPG axis in the present study. As a result, nine, 28, and 212 genes were separately identified as being differentially expressed in hypothalamus, pituitary, and ovary, of which 87 genes were novel. The number of down- and up-regulated genes was five and four (hypothalamus), 16 and 12 (pituitary), 119 and 93 (ovary), respectively. Functional analyses showed that these genes involved in several biological processes, such as biosynthesis, organogenesis, metabolism pathways, immune systems, transport links, and apoptosis. Within these categories, significant genes for neuropeptides, gonadotropins, metabolic, oogenesis and inflammatory factors were identified. Conclusions/Significance This study indicated the progressive scaling-up effect of hypothalamic sGnRH antisense on the pituitary and ovary receptors of female carp and provided comprehensive data with respect to global changes in gene expression throughout the HPG signaling pathway, contributing towards improving our understanding of the molecular mechanisms and regulative pathways in the reproductive system of teleost fish. PMID:21695218
Huang, Pengyun; Lin, Fucheng
2014-01-01
Because of great challenges and workload in deleting genes on a large scale, the functions of most genes in pathogenic fungi are still unclear. In this study, we developed a high-throughput gene knockout system using a novel yeast-Escherichia-Agrobacterium shuttle vector, pKO1B, in the rice blast fungus Magnaporthe oryzae. Using this method, we deleted 104 fungal-specific Zn2Cys6 transcription factor (TF) genes in M. oryzae. We then analyzed the phenotypes of these mutants with regard to growth, asexual and infection-related development, pathogenesis, and 9 abiotic stresses. The resulting data provide new insights into how this rice pathogen of global significance regulates important traits in the infection cycle through Zn2Cys6TF genes. A large variation in biological functions of Zn2Cys6TF genes was observed under the conditions tested. Sixty-one of 104 Zn2Cys6 TF genes were found to be required for fungal development. In-depth analysis of TF genes revealed that TF genes involved in pathogenicity frequently tend to function in multiple development stages, and disclosed many highly conserved but unidentified functional TF genes of importance in the fungal kingdom. We further found that the virulence-required TF genes GPF1 and CNF2 have similar regulation mechanisms in the gene expression involved in pathogenicity. These experimental validations clearly demonstrated the value of a high-throughput gene knockout system in understanding the biological functions of genes on a genome scale in fungi, and provided a solid foundation for elucidating the gene expression network that regulates the development and pathogenicity of M. oryzae. PMID:25299517
Adiabatic reduction of a model of stochastic gene expression with jump Markov process.
Yvinec, Romain; Zhuge, Changjing; Lei, Jinzhi; Mackey, Michael C
2014-04-01
This paper considers adiabatic reduction in a model of stochastic gene expression with bursting transcription considered as a jump Markov process. In this model, the process of gene expression with auto-regulation is described by fast/slow dynamics. The production of mRNA is assumed to follow a compound Poisson process occurring at a rate depending on protein levels (the phenomena called bursting in molecular biology) and the production of protein is a linear function of mRNA numbers. When the dynamics of mRNA is assumed to be a fast process (due to faster mRNA degradation than that of protein) we prove that, with appropriate scalings in the burst rate, jump size or translational rate, the bursting phenomena can be transmitted to the slow variable. We show that, depending on the scaling, the reduced equation is either a stochastic differential equation with a jump Poisson process or a deterministic ordinary differential equation. These results are significant because adiabatic reduction techniques seem to have not been rigorously justified for a stochastic differential system containing a jump Markov process. We expect that the results can be generalized to adiabatic methods in more general stochastic hybrid systems.
Isaacson, Sven; Luo, Feng; Feltus, Frank A.; Smith, Melissa C.
2013-01-01
The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust. PMID:23409071
Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco
2005-01-01
A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
Evans, Tyler G.; Hofmann, Gretchen E.
2012-01-01
Anthropogenic stressors, such as climate change, are driving fundamental shifts in the abiotic characteristics of marine ecosystems. As the environmental aspects of our world's oceans deviate from evolved norms, of major concern is whether extant marine species possess the capacity to cope with such rapid change. In what many scientists consider the post-genomic era, tools that exploit the availability of DNA sequence information are being increasingly recognized as relevant to questions surrounding ocean change and marine conservation. In this review, we highlight the application of high-throughput gene-expression profiling, primarily transcriptomics, to the field of marine conservation physiology. Through the use of case studies, we illustrate how gene expression can be used to standardize metrics of sub-lethal stress, track organism condition in natural environments and bypass phylogenetic barriers that hinder the application of other physiological techniques to conservation. When coupled with fine-scale monitoring of environmental variables, gene-expression profiling provides a powerful approach to conservation capable of informing diverse issues related to ocean change, from coral bleaching to the spread of invasive species. Integrating novel approaches capable of improving existing conservation strategies, including gene-expression profiling, will be critical to ensuring the ecological and economic health of the global ocean. PMID:22566679
Comparison of gene expression changes induced by biguanides in db/db mice liver.
Heishi, Masayuki; Hayashi, Koji; Ichihara, Junji; Ishikawa, Hironori; Kawamura, Takao; Kanaoka, Masaharu; Taiji, Mutsuo; Kimura, Toru
2008-08-01
Large-scale clinical studies have shown that the biguanide drug metformin, widely used for type 2 diabetes, to be very safe. By contrast, another biguanide, phenformin, has been withdrawn from major markets because of a high incidence of serious adverse effects. The difference in mode of action between the two biguanides remains unclear. To gain insight into the different modes of action of the two drugs, we performed global gene expression profiling using the livers of obese diabetic db/db mice after a single administration of phenformin or metformin at levels sufficient to cause a significant reduction in blood glucose level. Metformin induced modest expression changes, including G6pc in the liver as previously reported. By contrast, phenformin caused changes in expression level of many additional genes. We used a knowledge-based bioinformatic analysis to study the effects of phenformin. Differentially expressed genes identified in this study constitute a large gene network, which may be related to cell death, inflammation or wound response. Our results suggest that the two biguanides show a similar hypoglycemic effect in db/db mice, but phenformin induces a greater stress on the liver even a short time after a single administration. These findings provide a novel insight into the cause of the relatively high occurrence of serious adverse effect after phenformin treatment.
Predicting features of breast cancer with gene expression patterns.
Lu, Xuesong; Lu, Xin; Wang, Zhigang C; Iglehart, J Dirk; Zhang, Xuegong; Richardson, Andrea L
2008-03-01
Data from gene expression arrays hold an enormous amount of biological information. We sought to determine if global gene expression in primary breast cancers contained information about biologic, histologic, and anatomic features of the disease in individual patients. Microarray data from the tumors of 129 patients were analyzed for the ability to predict biomarkers [estrogen receptor (ER) and HER2], histologic features [grade and lymphatic-vascular invasion (LVI)], and stage parameters (tumor size and lymph node metastasis). Multiple statistical predictors were used and the prediction accuracy was determined by cross-validation error rate; multidimensional scaling (MDS) allowed visualization of the predicted states under study. Models built from gene expression data accurately predict ER and HER2 status, and divide tumor grade into high-grade and low-grade clusters; intermediate-grade tumors are not a unique group. In contrast, gene expression data is inaccurate at predicting tumor size, lymph node status or LVI. The best model for prediction of nodal status included tumor size, LVI status and pathologically defined tumor subtype (based on combinations of ER, HER2, and grade); the addition of microarray-based prediction to this model failed to improve the prediction accuracy. Global gene expression supports a binary division of ER, HER2, and grade, clearly separating tumors into two categories; intermediate values for these bio-indicators do not define intermediate tumor subsets. Results are consistent with a model of regional metastasis that depends on inherent biologic differences in metastatic propensity between breast cancer subtypes, upon which time and chance then operate.
Hepatic gene expression patterns following trauma-hemorrhage: effect of posttreatment with estrogen.
Yu, Huang-Ping; Pang, See-Tong; Chaudry, Irshad H
2013-01-01
The aim of this study was to examine the role of estrogen on hepatic gene expression profiles at an early time point following trauma-hemorrhage in rats. Groups of injured and sham controls receiving estrogen or vehicle were killed 2 h after injury and resuscitation, and liver tissue was harvested. Complementary RNA was synthesized from each RNA sample and hybridized to microarrays. A large number of genes were differentially expressed at the 2-h time point in injured animals with or without estrogen treatment. The upregulation or downregulation of a cohort of 14 of these genes was validated by reverse transcription-polymerase chain reaction. This large-scale microarray analysis shows that at the 2-h time point, there is marked alteration in hepatic gene expression following trauma-hemorrhage. However, estrogen treatment attenuated these changes in injured animals. Pathway analysis demonstrated predominant changes in the expression of genes involved in metabolism, immunity, and apoptosis. Upregulation of low-density lipoprotein receptor, protein phosphatase 1, regulatory subunit 3C, ring-finger protein 11, pyroglutamyl-peptidase I, bactericidal/permeability-increasing protein, integrin, αD, BCL2-like 11, leukemia inhibitory factor receptor, ATPase, Cu transporting, α polypeptide, and Mk1 protein was found in estrogen-treated trauma-hemorrhaged animals. Thus, estrogen produces hepatoprotection following trauma-hemorrhage likely via antiapoptosis and improving/restoring metabolism and immunity pathways.
Gene expression analysis of flax seed development
2011-01-01
Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise even low-expressed genes such as those encoding transcription factors. This has allowed us to delineate the spatio-temporal aspects of gene expression underlying the biosynthesis of a number of important seed constituents in flax. Flax belongs to a taxonomic group of diverse plants and the large sequence database will allow for evolutionary studies as well. PMID:21529361
2011-01-01
Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005
Andrew, Audra L; Card, Daren C; Ruggiero, Robert P; Schield, Drew R; Adams, Richard H; Pollock, David D; Secor, Stephen M; Castoe, Todd A
2015-05-01
Snakes provide a unique and valuable model system for studying the extremes of physiological remodeling because of the ability of some species to rapidly upregulate organ form and function upon feeding. The predominant model species used to study such extreme responses has been the Burmese python because of the extreme nature of postfeeding response in this species. We analyzed the Burmese python intestine across a time series, before, during, and after feeding to understand the patterns and timing of changes in gene expression and their relationship to changes in intestinal form and function upon feeding. Our results indicate that >2,000 genes show significant changes in expression in the small intestine following feeding, including genes involved in intestinal morphology and function (e.g., hydrolases, microvillus proteins, trafficking and transport proteins), as well as genes involved in cell division and apoptosis. Extensive changes in gene expression occur surprisingly rapidly, within the first 6 h of feeding, coincide with changes in intestinal morphology, and effectively return to prefeeding levels within 10 days. Collectively, our results provide an unprecedented portrait of parallel changes in gene expression and intestinal morphology and physiology on a scale that is extreme both in the magnitude of changes, as well as in the incredibly short time frame of these changes, with up- and downregulation of expression and function occurring in the span of 10 days. Our results also identify conserved vertebrate signaling pathways that modulate these responses, which may suggest pathways for therapeutic modulation of intestinal function in humans. Copyright © 2015 the American Physiological Society.
Naxerova, Kamila; Bult, Carol J; Peaston, Anne; Fancher, Karen; Knowles, Barbara B; Kasif, Simon; Kohane, Isaac S
2008-01-01
Background In recent years, the molecular underpinnings of the long-observed resemblance between neoplastic and immature tissue have begun to emerge. Genome-wide transcriptional profiling has revealed similar gene expression signatures in several tumor types and early developmental stages of their tissue of origin. However, it remains unclear whether such a relationship is a universal feature of malignancy, whether heterogeneities exist in the developmental component of different tumor types and to which degree the resemblance between cancer and development is a tissue-specific phenomenon. Results We defined a developmental landscape by summarizing the main features of ten developmental time courses and projected gene expression from a variety of human tumor types onto this landscape. This comparison demonstrates a clear imprint of developmental gene expression in a wide range of tumors and with respect to different, even non-cognate developmental backgrounds. Our analysis reveals three classes of cancers with developmentally distinct transcriptional patterns. We characterize the biological processes dominating these classes and validate the class distinction with respect to a new time series of murine embryonic lung development. Finally, we identify a set of genes that are upregulated in most cancers and we show that this signature is active in early development. Conclusion This systematic and quantitative overview of the relationship between the neoplastic and developmental transcriptome spanning dozens of tissues provides a reliable outline of global trends in cancer gene expression, reveals potentially clinically relevant differences in the gene expression of different cancer types and represents a reference framework for interpretation of smaller-scale functional studies. PMID:18611264
Stornaiuolo, Anna; Piovani, Bianca Maria; Bossi, Sergio; Zucchelli, Eleonora; Corna, Stefano; Salvatori, Francesca; Mavilio, Fulvio; Bordignon, Claudio; Rizzardi, Gian Paolo; Bovolenta, Chiara
2013-08-01
Over the last two decades, several attempts to generate packaging cells for lentiviral vectors (LV) have been made. Despite different technologies, no packaging clone is currently employed in clinical trials. We developed a new strategy for LV stable production based on the HEK-293T progenitor cells; the sequential insertion of the viral genes by integrating vectors; the constitutive expression of the viral components; and the RD114-TR envelope pseudotyping. We generated the intermediate clone PK-7 expressing constitutively gag/pol and rev genes and, by adding tat and rd114-tr genes, the stable packaging cell line RD2-MolPack, which can produce LV carrying any transfer vector (TV). Finally, we obtained the RD2-MolPack-Chim3 producer clone by transducing RD2-MolPack cells with the TV expressing the anti-HIV transgene Chim3. Remarkably, RD114-TR pseudovirions have much higher potency when produced by stable compared with transient technology. Most importantly, comparable transduction efficiency in hematopoietic stem cells (HSC) is obtained with 2-logs less physical particles respect to VSV-G pseudovirions produced by transient transfection. Altogether, RD2-MolPack technology should be considered a valid option for large-scale production of LV to be used in gene therapy protocols employing HSC, resulting in the possibility of downsizing the manufacturing scale by about 10-fold in respect to transient technology.
Reprogramming cell fate with a genome-scale library of artificial transcription factors.
Eguchi, Asuka; Wleklinski, Matthew J; Spurgat, Mackenzie C; Heiderscheit, Evan A; Kropornicka, Anna S; Vu, Catherine K; Bhimsaria, Devesh; Swanson, Scott A; Stewart, Ron; Ramanathan, Parameswaran; Kamp, Timothy J; Slukvin, Igor; Thomson, James A; Dutton, James R; Ansari, Aseem Z
2016-12-20
Artificial transcription factors (ATFs) are precision-tailored molecules designed to bind DNA and regulate transcription in a preprogrammed manner. Libraries of ATFs enable the high-throughput screening of gene networks that trigger cell fate decisions or phenotypic changes. We developed a genome-scale library of ATFs that display an engineered interaction domain (ID) to enable cooperative assembly and synergistic gene expression at targeted sites. We used this ATF library to screen for key regulators of the pluripotency network and discovered three combinations of ATFs capable of inducing pluripotency without exogenous expression of Oct4 (POU domain, class 5, TF 1). Cognate site identification, global transcriptional profiling, and identification of ATF binding sites reveal that the ATFs do not directly target Oct4; instead, they target distinct nodes that converge to stimulate the endogenous pluripotency network. This forward genetic approach enables cell type conversions without a priori knowledge of potential key regulators and reveals unanticipated gene network dynamics that drive cell fate choices.
Arkas: Rapid reproducible RNAseq analysis
Colombo, Anthony R.; J. Triche Jr, Timothy; Ramsingh, Giridharan
2017-01-01
The recently introduced Kallisto pseudoaligner has radically simplified the quantification of transcripts in RNA-sequencing experiments. We offer cloud-scale RNAseq pipelines Arkas-Quantification, and Arkas-Analysis available within Illumina’s BaseSpace cloud application platform which expedites Kallisto preparatory routines, reliably calculates differential expression, and performs gene-set enrichment of REACTOME pathways . Due to inherit inefficiencies of scale, Illumina's BaseSpace computing platform offers a massively parallel distributive environment improving data management services and data importing. Arkas-Quantification deploys Kallisto for parallel cloud computations and is conveniently integrated downstream from the BaseSpace Sequence Read Archive (SRA) import/conversion application titled SRA Import. Arkas-Analysis annotates the Kallisto results by extracting structured information directly from source FASTA files with per-contig metadata, calculates the differential expression and gene-set enrichment analysis on both coding genes and transcripts. The Arkas cloud pipeline supports ENSEMBL transcriptomes and can be used downstream from the SRA Import facilitating raw sequencing importing, SRA FASTQ conversion, RNA quantification and analysis steps. PMID:28868134
Reprogramming cell fate with a genome-scale library of artificial transcription factors
Eguchi, Asuka; Wleklinski, Matthew J.; Spurgat, Mackenzie C.; Heiderscheit, Evan A.; Kropornicka, Anna S.; Vu, Catherine K.; Bhimsaria, Devesh; Swanson, Scott A.; Stewart, Ron; Ramanathan, Parameswaran; Kamp, Timothy J.; Slukvin, Igor; Thomson, James A.; Dutton, James R.; Ansari, Aseem Z.
2016-01-01
Artificial transcription factors (ATFs) are precision-tailored molecules designed to bind DNA and regulate transcription in a preprogrammed manner. Libraries of ATFs enable the high-throughput screening of gene networks that trigger cell fate decisions or phenotypic changes. We developed a genome-scale library of ATFs that display an engineered interaction domain (ID) to enable cooperative assembly and synergistic gene expression at targeted sites. We used this ATF library to screen for key regulators of the pluripotency network and discovered three combinations of ATFs capable of inducing pluripotency without exogenous expression of Oct4 (POU domain, class 5, TF 1). Cognate site identification, global transcriptional profiling, and identification of ATF binding sites reveal that the ATFs do not directly target Oct4; instead, they target distinct nodes that converge to stimulate the endogenous pluripotency network. This forward genetic approach enables cell type conversions without a priori knowledge of potential key regulators and reveals unanticipated gene network dynamics that drive cell fate choices. PMID:27930301
Sawada, H; Nakagoshi, M; Reinhardt, R K; Ziegler, I; Koch, P B
2002-06-01
Color patterns of butterfly wings are composed of single color points represented by each scale. In the case of Precis coenia, at the end of pupal development, different types of pigments are synthesized sequentially in the differently colored scales beginning with white (pterins) followed by red (ommatins) and then black (melanin). In order to explain how formation of these different colors is regulated, we examined the expression of an mRNA-encoding guanosine triphosphate-cyclohydrolase I (GTP-CH I; EC 3.5.4.16), the first key enzyme in the biosynthesis of pteridines, during pigment formation in the wings of P. coenia. The strongest positive signal was recognized around pigment formation one day before butterfly emergence. This GTP-CH I gene expression is paralleled by GTP-CH I enzyme activity measured in wing extracts. We also investigated the effect of 20-hydroxyecdysone on the expression of GTP-CH I mRNA and the enzyme activity during color formation. The results strongly suggest that the onset and duration of the expression of a GTP-CH I mRNA is triggered by a declining ecdysteroid hormone titer during late pupal development.
Identification of a core set of rhizobial infection genes using data from single cell-types.
Chen, Da-Song; Liu, Cheng-Wu; Roy, Sonali; Cousins, Donna; Stacey, Nicola; Murray, Jeremy D
2015-01-01
Genome-wide expression studies on nodulation have varied in their scale from entire root systems to dissected nodules or root sections containing nodule primordia (NP). More recently efforts have focused on developing methods for isolation of root hairs from infected plants and the application of laser-capture microdissection technology to nodules. Here we analyze two published data sets to identify a core set of infection genes that are expressed in the nodule and in root hairs during infection. Among the genes identified were those encoding phenylpropanoid biosynthesis enzymes including Chalcone-O-Methyltransferase which is required for the production of the potent Nod gene inducer 4',4-dihydroxy-2-methoxychalcone. A promoter-GUS analysis in transgenic hairy roots for two genes encoding Chalcone-O-Methyltransferase isoforms revealed their expression in rhizobially infected root hairs and the nodule infection zone but not in the nitrogen fixation zone. We also describe a group of Rhizobially Induced Peroxidases whose expression overlaps with the production of superoxide in rhizobially infected root hairs and in nodules and roots. Finally, we identify a cohort of co-regulated transcription factors as candidate regulators of these processes.
Selection Shapes Transcriptional Logic and Regulatory Specialization in Genetic Networks
Fogelmark, Karl; Peterson, Carsten; Troein, Carl
2016-01-01
Background Living organisms need to regulate their gene expression in response to environmental signals and internal cues. This is a computational task where genes act as logic gates that connect to form transcriptional networks, which are shaped at all scales by evolution. Large-scale mutations such as gene duplications and deletions add and remove network components, whereas smaller mutations alter the connections between them. Selection determines what mutations are accepted, but its importance for shaping the resulting networks has been debated. Methodology To investigate the effects of selection in the shaping of transcriptional networks, we derive transcriptional logic from a combinatorially powerful yet tractable model of the binding between DNA and transcription factors. By evolving the resulting networks based on their ability to function as either a simple decision system or a circadian clock, we obtain information on the regulation and logic rules encoded in functional transcriptional networks. Comparisons are made between networks evolved for different functions, as well as with structurally equivalent but non-functional (neutrally evolved) networks, and predictions are validated against the transcriptional network of E. coli. Principal Findings We find that the logic rules governing gene expression depend on the function performed by the network. Unlike the decision systems, the circadian clocks show strong cooperative binding and negative regulation, which achieves tight temporal control of gene expression. Furthermore, we find that transcription factors act preferentially as either activators or repressors, both when binding multiple sites for a single target gene and globally in the transcriptional networks. This separation into positive and negative regulators requires gene duplications, which highlights the interplay between mutation and selection in shaping the transcriptional networks. PMID:26927540
Ferree, Patrick M.; Fang, Christopher; Mastrodimos, Mariah; Hay, Bruce A.; Amrhein, Henry; Akbari, Omar S.
2015-01-01
The jewel wasp Nasonia vitripennis is a rising model organism for the study of haplo-diploid reproduction characteristic of hymenopteran insects, which include all wasps, bees, and ants. We performed transcriptional profiling of the ovary, the female soma, and the male soma of N. vitripennis to complement a previously existing transcriptome of the wasp testis. These data were deposited into an open-access genome browser for visualization of transcripts relative to their gene models. We used these data to identify the assemblies of genes uniquely expressed in the germ-line tissues. We found that 156 protein-coding genes are expressed exclusively in the wasp testis compared with only 22 in the ovary. Of the testis-specific genes, eight are candidates for male-specific DNA packaging proteins known as protamines. We found very similar expression patterns of centrosome associated genes in the testis and ovary, arguing that de novo centrosome formation, a key process for development of unfertilized eggs into males, likely does not rely on large-scale transcriptional differences between these tissues. In contrast, a number of meiosis-related genes show a bias toward testis-specific expression, despite the lack of true meiosis in N. vitripennis males. These patterns may reflect an unexpected complexity of male gamete production in the haploid males of this organism. Broadly, these data add to the growing number of genomic and genetic tools available in N. vitripennis for addressing important biological questions in this rising insect model organism. PMID:26464360
Laffaire, Julien; Rivals, Isabelle; Dauphinot, Luce; Pasteau, Fabien; Wehrle, Rosine; Larrat, Benoit; Vitalis, Tania; Moldrich, Randal X; Rossier, Jean; Sinkus, Ralph; Herault, Yann; Dusart, Isabelle; Potier, Marie-Claude
2009-01-01
Background Down syndrome is a chromosomal disorder caused by the presence of three copies of chromosome 21. The mechanisms by which this aneuploidy produces the complex and variable phenotype observed in people with Down syndrome are still under discussion. Recent studies have demonstrated an increased transcript level of the three-copy genes with some dosage compensation or amplification for a subset of them. The impact of this gene dosage effect on the whole transcriptome is still debated and longitudinal studies assessing the variability among samples, tissues and developmental stages are needed. Results We thus designed a large scale gene expression study in mice (the Ts1Cje Down syndrome mouse model) in which we could measure the effects of trisomy 21 on a large number of samples (74 in total) in a tissue that is affected in Down syndrome (the cerebellum) and where we could quantify the defect during postnatal development in order to correlate gene expression changes to the phenotype observed. Statistical analysis of microarray data revealed a major gene dosage effect: for the three-copy genes as well as for a 2 Mb segment from mouse chromosome 12 that we show for the first time as being deleted in the Ts1Cje mice. This gene dosage effect impacts moderately on the expression of euploid genes (2.4 to 7.5% differentially expressed). Only 13 genes were significantly dysregulated in Ts1Cje mice at all four postnatal development stages studied from birth to 10 days after birth, and among them are 6 three-copy genes. The decrease in granule cell proliferation demonstrated in newborn Ts1Cje cerebellum was correlated with a major gene dosage effect on the transcriptome in dissected cerebellar external granule cell layer. Conclusion High throughput gene expression analysis in the cerebellum of a large number of samples of Ts1Cje and euploid mice has revealed a prevailing gene dosage effect on triplicated genes. Moreover using an enriched cell population that is thought responsible for the cerebellar hypoplasia in Down syndrome, a global destabilization of gene expression was not detected. Altogether these results strongly suggest that the three-copy genes are directly responsible for the phenotype present in cerebellum. We provide here a short list of candidate genes. PMID:19331679
Finding gene regulatory network candidates using the gene expression knowledge base.
Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin
2014-12-10
Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.
Cellular dissection of psoriasis for transcriptome analyses and the post-GWAS era
2014-01-01
Background Genome-scale studies of psoriasis have been used to identify genes of potential relevance to disease mechanisms. For many identified genes, however, the cell type mediating disease activity is uncertain, which has limited our ability to design gene functional studies based on genomic findings. Methods We identified differentially expressed genes (DEGs) with altered expression in psoriasis lesions (n = 216 patients), as well as candidate genes near susceptibility loci from psoriasis GWAS studies. These gene sets were characterized based upon their expression across 10 cell types present in psoriasis lesions. Susceptibility-associated variation at intergenic (non-coding) loci was evaluated to identify sites of allele-specific transcription factor binding. Results Half of DEGs showed highest expression in skin cells, although the dominant cell type differed between psoriasis-increased DEGs (keratinocytes, 35%) and psoriasis-decreased DEGs (fibroblasts, 33%). In contrast, psoriasis GWAS candidates tended to have highest expression in immune cells (71%), with a significant fraction showing maximal expression in neutrophils (24%, P < 0.001). By identifying candidate cell types for genes near susceptibility loci, we could identify and prioritize SNPs at which susceptibility variants are predicted to influence transcription factor binding. This led to the identification of potentially causal (non-coding) SNPs for which susceptibility variants influence binding of AP-1, NF-κB, IRF1, STAT3 and STAT4. Conclusions These findings underscore the role of innate immunity in psoriasis and highlight neutrophils as a cell type linked with pathogenetic mechanisms. Assignment of candidate cell types to genes emerging from GWAS studies provides a first step towards functional analysis, and we have proposed an approach for generating hypotheses to explain GWAS hits at intergenic loci. PMID:24885462
Novel Genomic and Evolutionary Insight of WRKY Transcription Factors in Plant Lineage
Mohanta, Tapan Kumar; Park, Yong-Hwan; Bae, Hanhong
2016-01-01
The evolutionarily conserved WRKY transcription factor (TF) regulates different aspects of gene expression in plants, and modulates growth, development, as well as biotic and abiotic stress responses. Therefore, understanding the details regarding WRKY TFs is very important. In this study, large-scale genomic analyses of the WRKY TF gene family from 43 plant species were conducted. The results of our study revealed that WRKY TFs could be grouped and specifically classified as those belonging to the monocot or dicot plant lineage. In this study, we identified several novel WRKY TFs. To our knowledge, this is the first report on a revised grouping system of the WRKY TF gene family in plants. The different forms of novel chimeric forms of WRKY TFs in the plant genome might play a crucial role in their evolution. Tissue-specific gene expression analyses in Glycine max and Phaseolus vulgaris showed that WRKY11-1, WRKY11-2 and WRKY11-3 were ubiquitously expressed in all tissue types, and WRKY15-2 was highly expressed in the stem, root, nodule and pod tissues in G. max and P. vulgaris. PMID:27853303
Novel Genomic and Evolutionary Insight of WRKY Transcription Factors in Plant Lineage.
Mohanta, Tapan Kumar; Park, Yong-Hwan; Bae, Hanhong
2016-11-17
The evolutionarily conserved WRKY transcription factor (TF) regulates different aspects of gene expression in plants, and modulates growth, development, as well as biotic and abiotic stress responses. Therefore, understanding the details regarding WRKY TFs is very important. In this study, large-scale genomic analyses of the WRKY TF gene family from 43 plant species were conducted. The results of our study revealed that WRKY TFs could be grouped and specifically classified as those belonging to the monocot or dicot plant lineage. In this study, we identified several novel WRKY TFs. To our knowledge, this is the first report on a revised grouping system of the WRKY TF gene family in plants. The different forms of novel chimeric forms of WRKY TFs in the plant genome might play a crucial role in their evolution. Tissue-specific gene expression analyses in Glycine max and Phaseolus vulgaris showed that WRKY11-1, WRKY11-2 and WRKY11-3 were ubiquitously expressed in all tissue types, and WRKY15-2 was highly expressed in the stem, root, nodule and pod tissues in G. max and P. vulgaris.
Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.
Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun
2018-02-28
Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.
Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A
2015-06-08
The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Soybean defense responses to the soybean aphid.
Li, Yan; Zou, Jijun; Li, Min; Bilgin, Damla D; Vodkin, Lila O; Hartman, Glen L; Clough, Steven J
2008-01-01
Transcript profiles in aphid (Aphis glycines)-resistant (cv. Dowling) and -susceptible (cv. Williams 82) soybean (Glycine max) cultivars using soybean cDNA microarrays were investigated. Large-scale soybean cDNA microarrays representing approx. 18 000 genes or c. 30% of the soybean genome were compared at 6 and 12 h post-application of aphids. In a separate experiment utilizing clip cages, expression of three defense-related genes were examined at 6, 12, 24, 48, and 72 h in both cultivars by quantitative real-time PCR. One hundred and forty genes showed specific responses for resistance; these included genes related to cell wall, defense, DNA/RNA, secondary metabolism, signaling and other processes. When an extended time period of sampling was investigated, earlier and greater induction of three defense-related genes was observed in the resistant cultivar; however, the induction declined after 24 or 48 h in the resistant cultivar but continued to increase in the susceptible cultivar after 24 h. Aphid-challenged resistant plants showed rapid differential gene expression patterns similar to the incompatible response induced by avirulent Pseudomonas syringae. Five genes were identified as differentially expressed between the two genotypes in the absence of aphids.
Colak, Recep; Moser, Flavia; Chu, Jeffrey Shih-Chieh; Schönhuth, Alexander; Chen, Nansheng; Ester, Martin
2010-10-25
Computational prediction of functionally related groups of genes (functional modules) from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense) regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented. We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB), by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples. We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely available large-scale datasets. Software and data sets are available at http://www.sfu.ca/~ester/software/DECOB.zip.
APETALA2 like genes from Picea abies show functional similarities to their Arabidopsis homologues.
Nilsson, Lars; Carlsbecker, Annelie; Sundås-Larsson, Annika; Vahala, Tiina
2007-02-01
In angiosperm flower development the identity of the floral organs is determined by the A, B and C factors. Here we present the characterisation of three homologues of the A class gene APETALA2 (AP2) from the conifer Picea abies (Norway spruce), Picea abies APETALA2 LIKE1 (PaAP2L1), PaAP2L2 and PaAP2L3. Similar to AP2 these genes contain sequence motifs complementary to miRNA172 that has been shown to regulate AP2 in Arabidopsis. The genes display distinct expression patterns during plant development; in the female-cone bud PaAP2L1 and PaAP2L3 are expressed in the seed-bearing ovuliferous scale in a pattern complementary to each other, and overlapping with the expression of the C class-related gene DAL2. To study the function of PaAP2L1 and PaAP2L2 the genes were expressed in Arabidopsis. The transgenic PaAP2L2 plants were stunted and flowered later than control plants. Flowers were indeterminate and produced an excess of floral organs most severely in the two inner whorls, associated with an ectopic expression of the meristem-regulating gene WUSCHEL. No homeotic changes in floral-organ identities occurred, but in the ap2-1 mutant background PaAP2L2 was able to promote petal identity, indicating that the spruce AP2 gene has the capacity to substitute for an A class gene in Arabidopsis. In spite of the long evolutionary distance between angiosperms and gymnosperms and the fact that gymnosperms lack structures homologous to sepals and petals our data supports a functional conservation of AP2 genes among the seed plants.
Structure and vascular tissue expression of duplicated TERMINAL EAR1-like paralogues in poplar.
Charon, Céline; Vivancos, Julien; Mazubert, Christelle; Paquet, Nicolas; Pilate, Gilles; Dron, Michel
2010-02-01
TERMINAL EAR1-like (TEL) genes encode putative RNA-binding proteins only found in land plants. Previous studies suggested that they may regulate tissue and organ initiation in Poaceae. Two TEL genes were identified in both Populus trichocarpa and the hybrid aspen Populus tremula x P. alba, named, respectively, PoptrTEL1-2 and PtaTEL1-2. The analysis of the organisation around the PoptrTEL genes in the P. trichocarpa genome and the estimation of the synonymous substitution rate for PtaTEL1-2 genes indicate that the paralogous link between these two Populus TEL genes probably results from the Salicoid large-scale gene-duplication event. Phylogenetic analyses confirmed their orthology link with the other TEL genes. The expression pattern of both PtaTEL genes appeared to be restricted to the mother cells of the plant body: leaf founder cells, leaf primordia, axillary buds and root differentiating tissues, as well as to mother cells of vascular tissues. Most interestingly, PtaTEL1-2 transcripts were found in differentiating cells of secondary xylem and phloem, but probably not in the cambium itself. Taken together, these results indicate specific expression of the TEL genes in differentiating cells controlling tissue and organ development in Populus (and other Angiosperm species).
Hamilton, John P.; Vaillancourt, Brieanne; Buell, C. Robin; Day, Brad
2012-01-01
Pseudoperonospora cubensis, an oomycete, is the causal agent of cucurbit downy mildew, and is responsible for significant losses on cucurbit crops worldwide. While other oomycete plant pathogens have been extensively studied at the molecular level, Ps. cubensis and the molecular basis of its interaction with cucurbit hosts has not been well examined. Here, we present the first large-scale global gene expression analysis of Ps. cubensis infection of a susceptible Cucumis sativus cultivar, ‘Vlaspik’, and identification of genes with putative roles in infection, growth, and pathogenicity. Using high throughput whole transcriptome sequencing, we captured differential expression of 2383 Ps. cubensis genes in sporangia and at 1, 2, 3, 4, 6, and 8 days post-inoculation (dpi). Additionally, comparison of Ps. cubensis expression profiles with expression profiles from an infection time course of the oomycete pathogen Phytophthora infestans on Solanum tuberosum revealed similarities in expression patterns of 1,576–6,806 orthologous genes suggesting a substantial degree of overlap in molecular events in virulence between the biotrophic Ps. cubensis and the hemi-biotrophic P. infestans. Co-expression analyses identified distinct modules of Ps. cubensis genes that were representative of early, intermediate, and late infection stages. Collectively, these expression data have advanced our understanding of key molecular and genetic events in the virulence of Ps. cubensis and thus, provides a foundation for identifying mechanism(s) by which to engineer or effect resistance in the host. PMID:22545137
Gossmann, Toni I; Schmid, Marc W; Grossniklaus, Ueli; Schmid, Karl J
2014-03-01
Sex-biased genes are genes with a preferential or specific expression in one sex and tend to show an accelerated rate of evolution in animals. Various hypotheses--which are not mutually exclusive--have been put forth to explain observed patterns of rapid evolution. One possible explanation is positive selection, but this has been shown only in few animal species and mostly for male-specific genes. Here, we present a large-scale study that investigates evolutionary patterns of sex-biased genes in the predominantly self-fertilizing plant Arabidopsis thaliana. Unlike most animal species, A. thaliana does not possess sex chromosomes, its flowers develop both male and female sexual organs, and it is characterized by low outcrossing rates. Using cell-specific gene expression data, we identified genes whose expression is enriched in comparison with all other tissues in the male and female gametes (sperm, egg, and central cell), as well as in synergids, pollen, and pollen tubes, which also play an important role in reproduction. Genes specifically expressed in gametes and synergids show higher rates of protein evolution compared with the genome-wide average and no evidence for positive selection. In contrast, pollen- and pollen tube-specific genes not only have lower rates of protein evolution but also exhibit a higher proportion of adaptive amino acid substitutions. We show that this is the result of increased levels of purifying and positive selection among genes with pollen- and pollen tube-specific expression. The increased proportion of adaptive substitutions cannot be explained by the fact that pollen- and pollen tube-expressed genes are enriched in segmental duplications, are on average older, or have a larger effective population size. Our observations are consistent with prezygotic sexual selection as a result of interactions during pollination and pollen tube growth such as pollen tube competition.
Swindell, William R.; Johnston, Andrew; Sun, Liou; Xing, Xianying; Fisher, Gary J.; Bulyk, Martha L.; Elder, James T.; Gudjonsson, Johann E.
2012-01-01
Background Skin aging is associated with intrinsic processes that compromise the structure of the extracellular matrix while promoting loss of functional and regenerative capacity. These processes are accompanied by a large-scale shift in gene expression, but underlying mechanisms are not understood and conservation of these mechanisms between humans and mice is uncertain. Results We used genome-wide expression profiling to investigate the aging skin transcriptome. In humans, age-related shifts in gene expression were sex-specific. In females, aging increased expression of transcripts associated with T-cells, B-cells and dendritic cells, and decreased expression of genes in regions with elevated Zeb1, AP-2 and YY1 motif density. In males, however, these effects were contrasting or absent. When age-associated gene expression patterns in human skin were compared to those in tail skin from CB6F1 mice, overall human-mouse correspondence was weak. Moreover, inflammatory gene expression patterns were not induced with aging of mouse tail skin, and well-known aging biomarkers were in fact decreased (e.g., Clec7a, Lyz1 and Lyz2). These unexpected patterns and weak human-mouse correspondence may be due to decreased abundance of antigen presenting cells in mouse tail skin with age. Conclusions Aging is generally associated with a pro-inflammatory state, but we have identified an exception to this pattern with aging of CB6F1 mouse tail skin. Aging therefore does not uniformly heighten inflammatory status across all mouse tissues. Furthermore, we identified both intercellular and intracellular mechanisms of transcriptome aging, including those that are sex- and species-specific. PMID:22413003
Xie, Xin-Ping; Xie, Yu-Feng; Wang, Hong-Qiang
2017-08-23
Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity.
Transcriptional analysis of the Arabidopsis ovule by massively parallel signature sequencing
Sánchez-León, Nidia; Arteaga-Vázquez, Mario; Alvarez-Mejía, César; Mendiola-Soto, Javier; Durán-Figueroa, Noé; Rodríguez-Leal, Daniel; Rodríguez-Arévalo, Isaac; García-Campayo, Vicenta; García-Aguilar, Marcelina; Olmedo-Monfil, Vianey; Arteaga-Sánchez, Mario; Martínez de la Vega, Octavio; Nobuta, Kan; Vemaraju, Kalyan; Meyers, Blake C.; Vielle-Calzada, Jean-Philippe
2012-01-01
The life cycle of flowering plants alternates between a predominant sporophytic (diploid) and an ephemeral gametophytic (haploid) generation that only occurs in reproductive organs. In Arabidopsis thaliana, the female gametophyte is deeply embedded within the ovule, complicating the study of the genetic and molecular interactions involved in the sporophytic to gametophytic transition. Massively parallel signature sequencing (MPSS) was used to conduct a quantitative large-scale transcriptional analysis of the fully differentiated Arabidopsis ovule prior to fertilization. The expression of 9775 genes was quantified in wild-type ovules, additionally detecting >2200 new transcripts mapping to antisense or intergenic regions. A quantitative comparison of global expression in wild-type and sporocyteless (spl) individuals resulted in 1301 genes showing 25-fold reduced or null activity in ovules lacking a female gametophyte, including those encoding 92 signalling proteins, 75 transcription factors, and 72 RNA-binding proteins not reported in previous studies based on microarray profiling. A combination of independent genetic and molecular strategies confirmed the differential expression of 28 of them, showing that they are either preferentially active in the female gametophyte, or dependent on the presence of a female gametophyte to be expressed in sporophytic cells of the ovule. Among 18 genes encoding pentatricopeptide-repeat proteins (PPRs) that show transcriptional activity in wild-type but not spl ovules, CIHUATEOTL (At4g38150) is specifically expressed in the female gametophyte and necessary for female gametogenesis. These results expand the nature of the transcriptional universe present in the ovule of Arabidopsis, and offer a large-scale quantitative reference of global expression for future genomic and developmental studies. PMID:22442422
Transcriptional analysis of the Arabidopsis ovule by massively parallel signature sequencing.
Sánchez-León, Nidia; Arteaga-Vázquez, Mario; Alvarez-Mejía, César; Mendiola-Soto, Javier; Durán-Figueroa, Noé; Rodríguez-Leal, Daniel; Rodríguez-Arévalo, Isaac; García-Campayo, Vicenta; García-Aguilar, Marcelina; Olmedo-Monfil, Vianey; Arteaga-Sánchez, Mario; de la Vega, Octavio Martínez; Nobuta, Kan; Vemaraju, Kalyan; Meyers, Blake C; Vielle-Calzada, Jean-Philippe
2012-06-01
The life cycle of flowering plants alternates between a predominant sporophytic (diploid) and an ephemeral gametophytic (haploid) generation that only occurs in reproductive organs. In Arabidopsis thaliana, the female gametophyte is deeply embedded within the ovule, complicating the study of the genetic and molecular interactions involved in the sporophytic to gametophytic transition. Massively parallel signature sequencing (MPSS) was used to conduct a quantitative large-scale transcriptional analysis of the fully differentiated Arabidopsis ovule prior to fertilization. The expression of 9775 genes was quantified in wild-type ovules, additionally detecting >2200 new transcripts mapping to antisense or intergenic regions. A quantitative comparison of global expression in wild-type and sporocyteless (spl) individuals resulted in 1301 genes showing 25-fold reduced or null activity in ovules lacking a female gametophyte, including those encoding 92 signalling proteins, 75 transcription factors, and 72 RNA-binding proteins not reported in previous studies based on microarray profiling. A combination of independent genetic and molecular strategies confirmed the differential expression of 28 of them, showing that they are either preferentially active in the female gametophyte, or dependent on the presence of a female gametophyte to be expressed in sporophytic cells of the ovule. Among 18 genes encoding pentatricopeptide-repeat proteins (PPRs) that show transcriptional activity in wild-type but not spl ovules, CIHUATEOTL (At4g38150) is specifically expressed in the female gametophyte and necessary for female gametogenesis. These results expand the nature of the transcriptional universe present in the ovule of Arabidopsis, and offer a large-scale quantitative reference of global expression for future genomic and developmental studies.
Circadian Enhancers Coordinate Multiple Phases of Rhythmic Gene Transcription In Vivo
Fang, Bin; Everett, Logan J.; Jager, Jennifer; Briggs, Erika; Armour, Sean M.; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A.
2014-01-01
SUMMARY Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of eRNAs that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed novel mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed new light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ. PMID:25416951
Circadian enhancers coordinate multiple phases of rhythmic gene transcription in vivo.
Fang, Bin; Everett, Logan J; Jager, Jennifer; Briggs, Erika; Armour, Sean M; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A
2014-11-20
Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of enhancer RNAs (eRNAs) that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ.
Wang, Jianhua; Chen, Shishu
2002-10-01
To identify certain gastric adenocarcinoma metastasis-related genes, an RF-1 cell line (primary tumor from a gastric adenocarcinoma patient) and an RF-48 cell line (its metastatic counterpart) were used as a model for studying the molecular mechanism of tumor metastasis. Two fluorescent cDNA probes, labeled with Cy3 and Cy5 dyes, were prepared from RF-1 and RF-48 mRNA samples by the reverse transcription method. The two color probes were then mixed and hybridized to a cDNA chip constructed with double-dots from 4,096 human genes, and scanned at two wavelengths. The experiment was repeated twice. Differentially expressedn genes from the above two cells were analyzed by use of computer. Of the total genes, 138 (3.4%) revealed differential expression in RF-48 cells compared with RF-1 cells: 81 (2.1%) genes revealed apparent up-regulation, and 56 (1.3%) genes revealed down-regulation. Forty-five genes involved in gastric adenocarcinoma metastasis were cloned using fluorescent differential display-PCR (FDD-PCR), including three novel genes. There were seven differentially expressed genes that presented the same behaviour under both detection methods. The possible roles of some differentially expressed genes, which may be involved in the mechanism of tumor metastasis, were discussed. cDNA chip was used to analyze gene expression in a high-throughput and large-scale manner in combination with FDD-PCR for cloning unknown novel genes. Some genes related to metastasis were preliminarily scanned, which would contribute to disclose the molecular mechanism of gastric adenocarcinoma metastasis and provide new targets for therapeutic intervention.
A transcriptional dynamic network during Arabidopsis thaliana pollen development.
Wang, Jigang; Qiu, Xiaojie; Li, Yuhua; Deng, Youping; Shi, Tieliu
2011-01-01
To understand transcriptional regulatory networks (TRNs), especially the coordinated dynamic regulation between transcription factors (TFs) and their corresponding target genes during development, computational approaches would represent significant advances in the genome-wide expression analysis. The major challenges for the experiments include monitoring the time-specific TFs' activities and identifying the dynamic regulatory relationships between TFs and their target genes, both of which are currently not yet available at the large scale. However, various methods have been proposed to computationally estimate those activities and regulations. During the past decade, significant progresses have been made towards understanding pollen development at each development stage under the molecular level, yet the regulatory mechanisms that control the dynamic pollen development processes remain largely unknown. Here, we adopt Networks Component Analysis (NCA) to identify TF activities over time course, and infer their regulatory relationships based on the coexpression of TFs and their target genes during pollen development. We carried out meta-analysis by integrating several sets of gene expression data related to Arabidopsis thaliana pollen development (stages range from UNM, BCP, TCP, HP to 0.5 hr pollen tube and 4 hr pollen tube). We constructed a regulatory network, including 19 TFs, 101 target genes and 319 regulatory interactions. The computationally estimated TF activities were well correlated to their coordinated genes' expressions during the development process. We clustered the expression of their target genes in the context of regulatory influences, and inferred new regulatory relationships between those TFs and their target genes, such as transcription factor WRKY34, which was identified that specifically expressed in pollen, and regulated several new target genes. Our finding facilitates the interpretation of the expression patterns with more biological relevancy, since the clusters corresponding to the activity of specific TF or the combination of TFs suggest the coordinated regulation of TFs to their target genes. Through integrating different resources, we constructed a dynamic regulatory network of Arabidopsis thaliana during pollen development with gene coexpression and NCA. The network illustrated the relationships between the TFs' activities and their target genes' expression, as well as the interactions between TFs, which provide new insight into the molecular mechanisms that control the pollen development.
Differential gene expression in Schistosoma japonicum schistosomula from Wistar rats and BALB/c mice
2011-01-01
Background More than 46 species of mammals can be naturally infected with Schistosoma japonicum in the mainland of China. Mice are permissive and may act as the definitive host of the life cycle. In contrast, rats are less susceptible to S. japonicum infection, and are considered to provide an unsuitable micro-environment for parasite growth and development. Since little is known of what effects this micro-environment has on the parasite itself, we have in the present study utilised a S. japonicum oligonucleotide microarray to compare the gene expression differences of 10-day-old schistosomula maintained in Wistar rats with those maintained in BALB/c mice. Results In total 3,468 schistosome genes were found to be differentially expressed, of which the majority (3,335) were down-regulated (≤ 2 fold) and 133 were up-regulated (≥ 2 fold) in schistosomula from Wistar rats compared with those from BALB/c mice. Gene ontology (GO) analysis revealed that of the differentially expressed genes with already established functions or close homology to well characterized genes in another organisms, many are related to important biological functions or molecular processes. Among the genes that were down-regulated in schistosomula from Wistar rats, some were associated with metabolism, signal transduction and development. Of these genes related to metabolic processes, areas including translation, protein and amino acid phosphorylation, proteolysis, oxidoreductase activities, catalytic activities and hydrolase activities, were represented. KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analysis of differential expressed genes indicated that of the 328 genes that had a specific KEGG pathway annotation, 324 were down-regulated and were mainly associated with metabolism, growth, redox pathway, oxidative phosphorylation, the cell cycle, ubiquitin-mediated proteolysis, protein export and the MAPK (mitogen-activated protein kinases) signaling pathway. Conclusions This work presents the first large scale gene expression study identifying the differences between schistosomula maintained in mice and those maintained in rats, and specifically highlights differential expression that may impact on the survival and development of the parasite within the definitive host. The research presented here provides valuable information for the better understanding of schistosome development and host-parasite interactions. PMID:21819550
Fast and robust group-wise eQTL mapping using sparse graphical models.
Cheng, Wei; Shi, Yu; Zhang, Xiang; Wang, Wei
2015-01-16
Genome-wide expression quantitative trait loci (eQTL) studies have emerged as a powerful tool to understand the genetic basis of gene expression and complex traits. The traditional eQTL methods focus on testing the associations between individual single-nucleotide polymorphisms (SNPs) and gene expression traits. A major drawback of this approach is that it cannot model the joint effect of a set of SNPs on a set of genes, which may correspond to hidden biological pathways. We introduce a new approach to identify novel group-wise associations between sets of SNPs and sets of genes. Such associations are captured by hidden variables connecting SNPs and genes. Our model is a linear-Gaussian model and uses two types of hidden variables. One captures the set associations between SNPs and genes, and the other captures confounders. We develop an efficient optimization procedure which makes this approach suitable for large scale studies. Extensive experimental evaluations on both simulated and real datasets demonstrate that the proposed methods can effectively capture both individual and group-wise signals that cannot be identified by the state-of-the-art eQTL mapping methods. Considering group-wise associations significantly improves the accuracy of eQTL mapping, and the successful multi-layer regression model opens a new approach to understand how multiple SNPs interact with each other to jointly affect the expression level of a group of genes.
Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.; ...
2015-03-27
Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Song, Hyun-Seob; McClure, Ryan S.; Bernstein, Hans C.
Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as ‘topologically important.’ Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termedmore » as ‘functionally important’ genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.« less
Optimal consistency in microRNA expression analysis using reference-gene-based normalization.
Wang, Xi; Gardiner, Erin J; Cairns, Murray J
2015-05-01
Normalization of high-throughput molecular expression profiles secures differential expression analysis between samples of different phenotypes or biological conditions, and facilitates comparison between experimental batches. While the same general principles apply to microRNA (miRNA) normalization, there is mounting evidence that global shifts in their expression patterns occur in specific circumstances, which pose a challenge for normalizing miRNA expression data. As an alternative to global normalization, which has the propensity to flatten large trends, normalization against constitutively expressed reference genes presents an advantage through their relative independence. Here we investigated the performance of reference-gene-based (RGB) normalization for differential miRNA expression analysis of microarray expression data, and compared the results with other normalization methods, including: quantile, variance stabilization, robust spline, simple scaling, rank invariant, and Loess regression. The comparative analyses were executed using miRNA expression in tissue samples derived from subjects with schizophrenia and non-psychiatric controls. We proposed a consistency criterion for evaluating methods by examining the overlapping of differentially expressed miRNAs detected using different partitions of the whole data. Based on this criterion, we found that RGB normalization generally outperformed global normalization methods. Thus we recommend the application of RGB normalization for miRNA expression data sets, and believe that this will yield a more consistent and useful readout of differentially expressed miRNAs, particularly in biological conditions characterized by large shifts in miRNA expression.
The transcriptional response of Escherichia coli to recombinant protein insolubility.
Smith, Harold E
2007-03-01
Bacterial production of recombinant proteins offers several advantages over alternative expression methods and remains the system of choice for many structural genomics projects. However, a large percentage of targets accumulate as insoluble inclusion bodies rather than soluble protein, creating a significant bottleneck in the protein production pipeline. Numerous strategies have been reported that can improve in vivo protein solubility, but most do not scale easily for high-throughput expression screening. To understand better the host cell response to the accumulation of insoluble protein, we determined genome-wide changes in bacterial gene expression upon induction of either soluble or insoluble target proteins. By comparing transcriptional profiles for multiple examples from the soluble or insoluble class, we identified a pattern of gene expression that correlates strongly with protein solubility. Direct targets of the sigma32 heat shock sigma factor, which includes genes involved in protein folding and degradation, were highly expressed in response to induction of insoluble protein. This same group of genes was also upregulated by insoluble protein accumulation under a different growth regime, indicating that sigma32-mediated gene expression is a general response to protein insolubility. This knowledge provides a starting point for the rational design of growth parameters and host strains with improved protein solubility characteristics. Summary Problems with protein solubility are frequently encountered when recombinant proteins are expressed in E. coli. The bacterial host responds to this problem by increasing expression of the protein folding machinery via the heat shock sigma factor sigma32. Manipulation of the sigma32 regulon might provide a general mechanism for improving recombinant protein solubility.
Harnessing Diversity towards the Reconstructing of Large Scale Gene Regulatory Networks
Yamanaka, Ryota; Kitano, Hiroaki
2013-01-01
Elucidating gene regulatory network (GRN) from large scale experimental data remains a central challenge in systems biology. Recently, numerous techniques, particularly consensus driven approaches combining different algorithms, have become a potentially promising strategy to infer accurate GRNs. Here, we develop a novel consensus inference algorithm, TopkNet that can integrate multiple algorithms to infer GRNs. Comprehensive performance benchmarking on a cloud computing framework demonstrated that (i) a simple strategy to combine many algorithms does not always lead to performance improvement compared to the cost of consensus and (ii) TopkNet integrating only high-performance algorithms provide significant performance improvement compared to the best individual algorithms and community prediction. These results suggest that a priori determination of high-performance algorithms is a key to reconstruct an unknown regulatory network. Similarity among gene-expression datasets can be useful to determine potential optimal algorithms for reconstruction of unknown regulatory networks, i.e., if expression-data associated with known regulatory network is similar to that with unknown regulatory network, optimal algorithms determined for the known regulatory network can be repurposed to infer the unknown regulatory network. Based on this observation, we developed a quantitative measure of similarity among gene-expression datasets and demonstrated that, if similarity between the two expression datasets is high, TopkNet integrating algorithms that are optimal for known dataset perform well on the unknown dataset. The consensus framework, TopkNet, together with the similarity measure proposed in this study provides a powerful strategy towards harnessing the wisdom of the crowds in reconstruction of unknown regulatory networks. PMID:24278007
2011-01-01
Background The bi-directional communication between the oocyte and its companion cumulus cells (CCs) is crucial for development and functions of both cell types. Transcripts that are exclusively expressed either in oocytes or CCs and molecular mechanisms affected due to removal of the communication axis between the two cell types is not investigated at a larger scale. The main objectives of this study were: 1. To identify transcripts exclusively expressed either in oocyte or CCs and 2. To identify those which are differentially expressed when the oocyte is cultured with or without its companion CCs and vice versa. Results We analyzed transcriptome profile of different oocyte and CC samples using Affymetrix GeneChip Bovine Genome array containing 23000 transcripts. Out of 13162 genes detected in germinal vesicle (GV) oocytes and their companion CCs, 1516 and 2727 are exclusively expressed in oocytes and CCs, respectively, while 8919 are expressed in both. Similarly, of 13602 genes detected in metaphase II (MII) oocytes and CCs, 1423 and 3100 are exclusively expressed in oocytes and CCs, respectively, while 9079 are expressed in both. A total of 265 transcripts are differentially expressed between oocytes cultured with (OO + CCs) and without (OO - CCs) CCs, of which 217 and 48 are over expressed in the former and the later groups, respectively. Similarly, 566 transcripts are differentially expressed when CCs mature with (CCs + OO) or without (CCs - OO) their enclosed oocytes. Of these, 320 and 246 are over expressed in CCs + OO and CCs - OO, respectively. While oocyte specific transcripts include those involved in transcription (IRF6, POU5F1, MYF5, MED18), translation (EIF2AK1, EIF4ENIF1) and CCs specific ones include those involved in carbohydrate metabolism (HYAL1, PFKL, PYGL, MPI), protein metabolic processes (IHH, APOA1, PLOD1), steroid biosynthetic process (APOA1, CYP11A1, HSD3B1, HSD3B7). Similarly, while transcripts over expressed in OO + CCs are involved in carbohydrate metabolism (ACO1, 2), molecular transport (GAPDH, GFPT1) and nucleic acid metabolism (CBS, NOS2), those over expressed in CCs + OO are involved in cellular growth and proliferation (FOS, GADD45A), cell cycle (HAS2, VEGFA), cellular development (AMD1, AURKA, DPP4) and gene expression (FOSB, TGFB2). Conclusion In conclusion, this study has generated large scale gene expression data from different oocyte and CCs samples that would provide insights into gene functions and interactions within and across different pathways that are involved in the maturation of bovine oocytes. Moreover, the presence or absence of oocyte and CC factors during bovine oocyte maturation can have a profound effect on transcript abundance of each cell types, thereby showing the prevailing molecular cross-talk between oocytes and their corresponding CCs. PMID:21261964
Yap, Karen; Makeyev, Eugene V
2013-09-01
Eukaryotic gene expression is orchestrated on a genome-wide scale through several post-transcriptional mechanisms. Of these, alternative pre-mRNA splicing expands the proteome diversity and modulates mRNA stability through downstream RNA quality control (QC) pathways including nonsense-mediated decay (NMD) of mRNAs containing premature termination codons and nuclear retention and elimination (NRE) of intron-containing transcripts. Although originally identified as mechanisms for eliminating aberrant transcripts, a growing body of evidence suggests that NMD and NRE coupled with deliberate changes in pre-mRNA splicing patterns are also used in a number of biological contexts for deterministic control of gene expression. Here we review recent studies elucidating molecular mechanisms and biological significance of these gene regulation strategies with a specific focus on their roles in nervous system development and physiology. This article is part of a Special Issue entitled 'RNA and splicing regulation in neurodegeneration'. Copyright © 2013 Elsevier Inc. All rights reserved.
Partial least squares based identification of Duchenne muscular dystrophy specific genes.
An, Hui-bo; Zheng, Hua-cheng; Zhang, Li; Ma, Lin; Liu, Zheng-yan
2013-11-01
Large-scale parallel gene expression analysis has provided a greater ease for investigating the underlying mechanisms of Duchenne muscular dystrophy (DMD). Previous studies typically implemented variance/regression analysis, which would be fundamentally flawed when unaccounted sources of variability in the arrays existed. Here we aim to identify genes that contribute to the pathology of DMD using partial least squares (PLS) based analysis. We carried out PLS-based analysis with two datasets downloaded from the Gene Expression Omnibus (GEO) database to identify genes contributing to the pathology of DMD. Except for the genes related to inflammation, muscle regeneration and extracellular matrix (ECM) modeling, we found some genes with high fold change, which have not been identified by previous studies, such as SRPX, GPNMB, SAT1, and LYZ. In addition, downregulation of the fatty acid metabolism pathway was found, which may be related to the progressive muscle wasting process. Our results provide a better understanding for the downstream mechanisms of DMD.
Evolution and Expression Patterns of TCP Genes in Asparagales
Madrigal, Yesenia; Alzate, Juan F.; Pabón-Mora, Natalia
2017-01-01
CYCLOIDEA-like genes are involved in the symmetry gene network, limiting cell proliferation in the dorsal regions of bilateral flowers in core eudicots. CYC-like and closely related TCP genes (acronym for TEOSINTE BRANCHED1, CYCLOIDEA, and PROLIFERATION CELL FACTOR) have been poorly studied in Asparagales, the largest order of monocots that includes both bilateral flowers in Orchidaceae (ca. 25.000 spp) and radially symmetrical flowers in Hypoxidaceae (ca. 200 spp). With the aim of assessing TCP gene evolution in the Asparagales, we isolated TCP-like genes from publicly available databases and our own transcriptomes of Cattleya trianae (Orchidaceae) and Hypoxis decumbens (Hypoxidaceae). Our matrix contains 452 sequences representing the three major clades of TCP genes. Besides the previously identified CYC specific core eudicot duplications, our ML phylogenetic analyses recovered an early CIN-like duplication predating all angiosperms, two CIN-like Asparagales-specific duplications and a duplication prior to the diversification of Orchidoideae and Epidendroideae. In addition, we provide evidence of at least three duplications of PCF-like genes in Asparagales. While CIN-like and PCF-like genes have multiplied in Asparagales, likely enhancing the genetic network for cell proliferation, CYC-like genes remain as single, shorter copies with low expression. Homogeneous expression of CYC-like genes in the labellum as well as the lateral petals suggests little contribution to the bilateral perianth in C. trianae. CIN-like and PCF-like gene expression suggests conserved roles in cell proliferation in leaves, sepals and petals, carpels, ovules and fruits in Asparagales by comparison with previously reported functions in core eudicots and monocots. This is the first large scale analysis of TCP-like genes in Asparagales that will serve as a platform for in-depth functional studies in emerging model monocots. PMID:28144250
Li, Jingtao; Sun, Xinhua; Liu, Yanzhi; Wang, Xueliang; Zhang, Hao; Pan, Hongyu
2017-01-01
Plant productivity is limited by salinity stress, both in natural and agricultural systems. Identification of salt stress-related genes from halophyte can provide insights into mechanisms of salt stress tolerance in plants. Atriplex canescens is a xero-halophyte that exhibits optimum growth in the presence of 400 mM NaCl. A cDNA library derived from highly salt-treated A. canescens plants was constructed based on a yeast expression system. A total of 53 transgenic yeast clones expressing enhanced salt tolerance were selected from 105 transformants. Their plasmids were sequenced and the gene characteristics were annotated using a BLASTX search. Retransformation of yeast cells with the selected plasmids conferred salt tolerance to the resulting transformants. The expression patterns of 28 of these stress-related genes were further investigated in A. canescens leaves by quantitative reverse transcription-PCR. In this study, we provided a rapid and robust assay system for large-scale screening of genes for varied abiotic stress tolerance with high efficiency in A. canescens. PMID:29149055
Lum, Thomas E.; Merritt, Thomas J. S.
2011-01-01
Regulation of transcription can be a complex process in which many cis- and trans-interactions determine the final pattern of expression. Among these interactions are trans-interactions mediated by the pairing of homologous chromosomes. These trans-effects are wide ranging, affecting gene regulation in many species and creating complex possibilities in gene regulation. Here we describe a novel case of trans-interaction between alleles of the Malic enzyme (Men) locus in Drosophila melanogaster that results in allele-specific, non-additive gene expression. Using both empirical biochemical and predictive bioinformatic approaches, we show that the regulatory elements of one allele are capable of interacting in trans with, and modifying the expression of, the second allele. Furthermore, we show that nonlocal factors—different genetic backgrounds—are capable of significant interactions with individual Men alleles, suggesting that these trans-effects can be modified by both locally and distantly acting elements. In sum, these results emphasize the complexity of gene regulation and the need to understand both small- and large-scale interactions as more complete models of the role of trans-interactions in gene regulation are developed. PMID:21900270
Scholz, Birger; Doidge, Amie N.; Barnes, Philip; Hall, Jeremy; Wilkinson, Lawrence S.; Thomas, Kerrie L.
2016-01-01
We investigated the distinctiveness of gene regulatory networks in CA1 associated with the extinction of contextual fear memory (CFM) after recall using Affymetrix GeneChip Rat Genome 230 2.0 Arrays. These data were compared to previously published retrieval and reconsolidation-attributed, and consolidation datasets. A stringent dual normalization and pareto-scaled orthogonal partial least-square discriminant multivariate analysis together with a jack-knifing-based cross-validation approach was used on all datasets to reduce false positives. Consolidation, retrieval and extinction were correlated with distinct patterns of gene expression 2 hours later. Extinction-related gene expression was most distinct from the profile accompanying consolidation. A highly specific feature was the discrete regulation of neuroimmunological gene expression associated with retrieval and extinction. Immunity–associated genes of the tyrosine kinase receptor TGFβ and PDGF, and TNF families’ characterized extinction. Cytokines and proinflammatory interleukins of the IL-1 and IL-6 families were enriched with the no-extinction retrieval condition. We used comparative genomics to predict transcription factor binding sites in proximal promoter regions of the retrieval-regulated genes. Retrieval that does not lead to extinction was associated with NF-κB-mediated gene expression. We confirmed differential NF-κBp65 expression, and activity in all of a representative sample of our candidate genes in the no-extinction condition. The differential regulation of cytokine networks after the acquisition and retrieval of CFM identifies the important contribution that neuroimmune signalling plays in normal hippocampal function. Further, targeting cytokine signalling upon retrieval offers a therapeutic strategy to promote extinction mechanisms in human disorders characterised by dysregulation of associative memory. PMID:27224427
2011-01-01
Background Streptomyces species are a major source of antibiotics. They usually grow slowly at their optimal temperature and fermentation of industrial strains in a large scale often takes a long time, consuming more energy and materials than some other bacterial industrial strains (e.g., E. coli and Bacillus). Most thermophilic Streptomyces species grow fast, but no gene cloning systems have been developed in such strains. Results We report here the isolation of 41 fast-growing (about twice the rate of S. coelicolor), moderately thermophilic (growing at both 30°C and 50°C) Streptomyces strains, detection of one linear and three circular plasmids in them, and sequencing of a 6996-bp plasmid, pTSC1, from one of them. pTSC1-derived pCWH1 could replicate in both thermophilic and mesophilic Streptomyces strains. On the other hand, several Streptomyces replicons function in thermophilic Streptomyces species. By examining ten well-sporulating strains, we found two promising cloning hosts, 2C and 4F. A gene cloning system was established by using the two strains. The actinorhodin and anthramycin biosynthetic gene clusters from mesophilic S. coelicolor A3(2) and thermophilic S. refuineus were heterologously expressed in one of the hosts. Conclusions We have developed a gene cloning and expression system in a fast-growing and moderately thermophilic Streptomyces species. Although just a few plasmids and one antibiotic biosynthetic gene cluster from mesophilic Streptomyces were successfully expressed in thermophilic Streptomyces species, we expect that by utilizing thermophilic Streptomyces-specific promoters, more genes and especially antibiotic genes clusters of mesophilic Streptomyces should be heterologously expressed. PMID:22032628
Gildor, Tsvia; Hinman, Veronica; Ben-Tabou-De-Leon, Smadar
2017-01-01
It has long been argued that heterochrony, a change in relative timing of a developmental process, is a major source of evolutionary innovation. Heterochronic changes of regulatory gene activation could be the underlying molecular mechanism driving heterochronic changes through evolution. Here, we compare the temporal expression profiles of key regulatory circuits between sea urchin and sea star, representative of two classes of Echinoderms that shared a common ancestor about 500 million years ago. The morphologies of the sea urchin and sea star embryos are largely comparable, yet, differences in certain mesodermal cell types and ectodermal patterning result in distinct larval body plans. We generated high resolution temporal profiles of 17 mesodermally-, endodermally- and ectodermally-expressed regulatory genes in the sea star, Patiria miniata, and compared these to their orthologs in the Mediterranean sea urchin, Paracentrotus lividus. We found that the maternal to zygotic transition is delayed in the sea star compared to the sea urchin, in agreement with the longer cleavage stage in the sea star. Interestingly, the order of gene activation shows the highest variation in the relatively diverged mesodermal circuit, while the correlations of expression dynamics are the highest in the strongly conserved endodermal circuit. We detected loose scaling of the developmental rates of these species and observed interspecies heterochronies within all studied regulatory circuits. Thus, after 500 million years of parallel evolution, mild heterochronies between the species are frequently observed and the tight temporal scaling observed for closely related species no longer holds.
Integrative analysis of RUNX1 downstream pathways and target genes
Michaud, Joëlle; Simpson, Ken M; Escher, Robert; Buchet-Poyau, Karine; Beissbarth, Tim; Carmichael, Catherine; Ritchie, Matthew E; Schütz, Frédéric; Cannon, Ping; Liu, Marjorie; Shen, Xiaofeng; Ito, Yoshiaki; Raskind, Wendy H; Horwitz, Marshall S; Osato, Motomi; Turner, David R; Speed, Terence P; Kavallaris, Maria; Smyth, Gordon K; Scott, Hamish S
2008-01-01
Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease progression in both familial and sporadic leukemia as well as therapeutic implications. PMID:18671852
Zhou, Xionghui; Liu, Juan
2014-01-01
Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for phenotypic change.
EgoNet: identification of human disease ego-network modules
2014-01-01
Background Mining novel biomarkers from gene expression profiles for accurate disease classification is challenging due to small sample size and high noise in gene expression measurements. Several studies have proposed integrated analyses of microarray data and protein-protein interaction (PPI) networks to find diagnostic subnetwork markers. However, the neighborhood relationship among network member genes has not been fully considered by those methods, leaving many potential gene markers unidentified. The main idea of this study is to take full advantage of the biological observation that genes associated with the same or similar diseases commonly reside in the same neighborhood of molecular networks. Results We present EgoNet, a novel method based on egocentric network-analysis techniques, to exhaustively search and prioritize disease subnetworks and gene markers from a large-scale biological network. When applied to a triple-negative breast cancer (TNBC) microarray dataset, the top selected modules contain both known gene markers in TNBC and novel candidates, such as RAD51 and DOK1, which play a central role in their respective ego-networks by connecting many differentially expressed genes. Conclusions Our results suggest that EgoNet, which is based on the ego network concept, allows the identification of novel biomarkers and provides a deeper understanding of their roles in complex diseases. PMID:24773628
Genomic analysis of expressed sequence tags in American black bear Ursus americanus
2010-01-01
Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Govender, Nisha; Senan, Siju; Mohamed-Hussein, Zeti-Azura; Wickneswari, Ratnam
2018-06-15
The plant shoot system consists of reproductive organs such as inflorescences, buds and fruits, and the vegetative leaves and stems. In this study, the reproductive part of the Jatropha curcas shoot system, which includes the aerial shoots, shoots bearing the inflorescence and inflorescence were investigated in regard to gene-to-gene interactions underpinning yield-related biological processes. An RNA-seq based sequencing of shoot tissues performed on an Illumina HiSeq. 2500 platform generated 18 transcriptomes. Using the reference genome-based mapping approach, a total of 64 361 genes was identified in all samples and the data was annotated against the non-redundant database by the BLAST2GO Pro. Suite. After removing the outlier genes and samples, a total of 12 734 genes across 17 samples were subjected to gene co-expression network construction using petal, an R library. A gene co-expression network model built with scale-free and small-world properties extracted four vicinity networks (VNs) with putative involvement in yield-related biological processes as follow; heat stress tolerance, floral and shoot meristem differentiation, biosynthesis of chlorophyll molecules and laticifers, cell wall metabolism and epigenetic regulations. Our VNs revealed putative key players that could be adapted in breeding strategies for J. curcas shoot system improvements.
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.
Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun
2010-03-26
Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering.
Sun, Peng; Speicher, Nora K; Röttger, Richard; Guo, Jiong; Baumbach, Jan
2014-05-01
The explosion of the biological data has dramatically reformed today's biological research. The need to integrate and analyze high-dimensional biological data on a large scale is driving the development of novel bioinformatics approaches. Biclustering, also known as 'simultaneous clustering' or 'co-clustering', has been successfully utilized to discover local patterns in gene expression data and similar biomedical data types. Here, we contribute a new heuristic: 'Bi-Force'. It is based on the weighted bicluster editing model, to perform biclustering on arbitrary sets of biological entities, given any kind of pairwise similarities. We first evaluated the power of Bi-Force to solve dedicated bicluster editing problems by comparing Bi-Force with two existing algorithms in the BiCluE software package. We then followed a biclustering evaluation protocol in a recent review paper from Eren et al. (2013) (A comparative analysis of biclustering algorithms for gene expressiondata. Brief. Bioinform., 14:279-292.) and compared Bi-Force against eight existing tools: FABIA, QUBIC, Cheng and Church, Plaid, BiMax, Spectral, xMOTIFs and ISA. To this end, a suite of synthetic datasets as well as nine large gene expression datasets from Gene Expression Omnibus were analyzed. All resulting biclusters were subsequently investigated by Gene Ontology enrichment analysis to evaluate their biological relevance. The distinct theoretical foundation of Bi-Force (bicluster editing) is more powerful than strict biclustering. We thus outperformed existing tools with Bi-Force at least when following the evaluation protocols from Eren et al. Bi-Force is implemented in Java and integrated into the open source software package of BiCluE. The software as well as all used datasets are publicly available at http://biclue.mpi-inf.mpg.de. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases
Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David
2012-01-01
Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10−10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10−4, 1.8×10−4, and 2.2×10−4 respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391
Gu, Deqing; Jian, Xingxing; Zhang, Cheng; Hua, Qiang
2017-01-01
Genome-scale metabolic network models (GEMs) have played important roles in the design of genetically engineered strains and helped biologists to decipher metabolism. However, due to the complex gene-reaction relationships that exist in model systems, most algorithms have limited capabilities with respect to directly predicting accurate genetic design for metabolic engineering. In particular, methods that predict reaction knockout strategies leading to overproduction are often impractical in terms of gene manipulations. Recently, we proposed a method named logical transformation of model (LTM) to simplify the gene-reaction associations by introducing intermediate pseudo reactions, which makes it possible to generate genetic design. Here, we propose an alternative method to relieve researchers from deciphering complex gene-reactions by adding pseudo gene controlling reactions. In comparison to LTM, this new method introduces fewer pseudo reactions and generates a much smaller model system named as gModel. We showed that gModel allows two seldom reported applications: identification of minimal genomes and design of minimal cell factories within a modified OptKnock framework. In addition, gModel could be used to integrate expression data directly and improve the performance of the E-Fmin method for predicting fluxes. In conclusion, the model transformation procedure will facilitate genetic research based on GEMs, extending their applications.
Guilloux, Jean-Philippe; Bassi, Sabrina; Ding, Ying; Walsh, Chris; Turecki, Gustavo; Tseng, George; Cyranowski, Jill M; Sibille, Etienne
2015-02-01
Major depressive disorder (MDD) in general, and anxious-depression in particular, are characterized by poor rates of remission with first-line treatments, contributing to the chronic illness burden suffered by many patients. Prospective research is needed to identify the biomarkers predicting nonremission prior to treatment initiation. We collected blood samples from a discovery cohort of 34 adult MDD patients with co-occurring anxiety and 33 matched, nondepressed controls at baseline and after 12 weeks (of citalopram plus psychotherapy treatment for the depressed cohort). Samples were processed on gene arrays and group differences in gene expression were investigated. Exploratory analyses suggest that at pretreatment baseline, nonremitting patients differ from controls with gene function and transcription factor analyses potentially related to elevated inflammation and immune activation. In a second phase, we applied an unbiased machine learning prediction model and corrected for model-selection bias. Results show that baseline gene expression predicted nonremission with 79.4% corrected accuracy with a 13-gene model. The same gene-only model predicted nonremission after 8 weeks of citalopram treatment with 76% corrected accuracy in an independent validation cohort of 63 MDD patients treated with citalopram at another institution. Together, these results demonstrate the potential, but also the limitations, of baseline peripheral blood-based gene expression to predict nonremission after citalopram treatment. These results not only support their use in future prediction tools but also suggest that increased accuracy may be obtained with the inclusion of additional predictors (eg, genetics and clinical scales).
Massive activation of archaeal defense genes during viral infection.
Quax, Tessa E F; Voet, Marleen; Sismeiro, Odile; Dillies, Marie-Agnes; Jagla, Bernd; Coppée, Jean-Yves; Sezonov, Guennadi; Forterre, Patrick; van der Oost, John; Lavigne, Rob; Prangishvili, David
2013-08-01
Archaeal viruses display unusually high genetic and morphological diversity. Studies of these viruses proved to be instrumental for the expansion of knowledge on viral diversity and evolution. The Sulfolobus islandicus rod-shaped virus 2 (SIRV2) is a model to study virus-host interactions in Archaea. It is a lytic virus that exploits a unique egress mechanism based on the formation of remarkable pyramidal structures on the host cell envelope. Using whole-transcriptome sequencing, we present here a global map defining host and viral gene expression during the infection cycle of SIRV2 in its hyperthermophilic host S. islandicus LAL14/1. This information was used, in combination with a yeast two-hybrid analysis of SIRV2 protein interactions, to advance current understanding of viral gene functions. As a consequence of SIRV2 infection, transcription of more than one-third of S. islandicus genes was differentially regulated. While expression of genes involved in cell division decreased, those genes playing a role in antiviral defense were activated on a large scale. Expression of genes belonging to toxin-antitoxin and clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems was specifically pronounced. The observed different degree of activation of various CRISPR-Cas systems highlights the specialized functions they perform. The information on individual gene expression and activation of antiviral defense systems is expected to aid future studies aimed at detailed understanding of the functions and interplay of these systems in vivo.
Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis.
dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana
2015-01-01
Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis.
Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis
dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana
2015-01-01
Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928
Kameshwar, Ayyappa Kumar Sista; Qin, Wensheng
2017-10-01
Lignin, most complex and abundant biopolymer on the earth's surface, attains its stability from intricate polyphenolic units and non-phenolic bonds, making it difficult to depolymerize or separate from other units of biomass. Eccentric lignin degrading ability and availability of annotated genome make Phanerochaete chrysosporium ideal for studying lignin degrading mechanisms. Decoding and understanding the molecular mechanisms underlying the process of lignin degradation will significantly aid the progressing biofuel industries and lead to the production of commercially vital platform chemicals. In this study, we have performed a large-scale metadata analysis to understand the common gene expression patterns of P. chrysosporium during lignin degradation. Gene expression datasets were retrieved from NCBI GEO database and analyzed using GEO2R and Bioconductor packages. Commonly expressed statistically significant genes among different datasets were further considered to understand their involvement in lignin degradation and detoxification mechanisms. We have observed three sets of enzymes commonly expressed during ligninolytic conditions which were later classified into primary ligninolytic, aromatic compound-degrading and other necessary enzymes. Similarly, we have observed three sets of genes coding for detoxification and stress-responsive, phase I and phase II metabolic enzymes. Results obtained in this study indicate the coordinated action of enzymes involved in lignin depolymerization and detoxification-stress responses under ligninolytic conditions. We have developed tentative network of genes and enzymes involved in lignin degradation and detoxification mechanisms by P. chrysosporium based on the literature and results obtained in this study. However, ambiguity raised due to higher expression of several uncharacterized proteins necessitates for further proteomic studies in P. chrysosporium.
Genome-wide identification and characterisation of F-box family in maize.
Jia, Fengjuan; Wu, Bingjiang; Li, Hui; Huang, Jinguang; Zheng, Chengchao
2013-11-01
F-box-containing proteins, as the key components of the protein degradation machinery, are widely distributed in higher plants and are considered as one of the largest known families of regulatory proteins. The F-box protein family plays a crucial role in plant growth and development and in response to biotic and abiotic stresses. However, systematic analysis of the F-box family in maize (Zea mays) has not been reported yet. In this paper, we identified and characterised the maize F-box genes in a genome-wide scale, including phylogenetic analysis, chromosome distribution, gene structure, promoter analysis and gene expression profiles. A total of 359 F-box genes were identified and divided into 15 subgroups by phylogenetic analysis. The F-box domain was relatively conserved, whereas additional motifs outside the F-box domain may indicate the functional diversification of maize F-box genes. These genes were unevenly distributed in ten maize chromosomes, suggesting that they expanded in the maize genome because of tandem and segmental duplication events. The expression profiles suggested that the maize F-box genes had temporal and spatial expression patterns. Putative cis-acting regulatory DNA elements involved in abiotic stresses were observed in maize F-box gene promoters. The gene expression profiles under abiotic stresses also suggested that some genes participated in stress responsive pathways. Furthermore, ten genes were chosen for quantitative real-time PCR analysis under drought stress and the results were consistent with the microarray data. This study has produced a comparative genomics analysis of the maize ZmFBX gene family that can be used in further studies to uncover their roles in maize growth and development.
Carbohydrate utilization and the lager yeast transcriptome during brewery fermentation.
Gibson, Brian R; Boulton, Chris A; Box, Wendy G; Graham, Neil S; Lawrence, Stephen J; Linforth, Robert S T; Smart, Katherine A
2008-08-01
The fermentable carbohydrate composition of wort and the manner in which it is utilized by yeast during brewery fermentation have a direct influence on fermentation efficiency and quality of the final product. In this study the response of a brewing yeast strain to changes in wort fermentable carbohydrate concentration and composition during full-scale (3275 hl) brewery fermentation was investigated by measuring transcriptome changes with the aid of oligonucleotide-based DNA arrays. Up to 74% of the detectable genes showed a significant (p=0.01) differential expression pattern during fermentation and the majority of these genes showed transient or prolonged peaks in expression following the exhaustion of the monosaccharides from the wort. Transcriptional activity of many genes was consistent with their known responses to glucose de/repression under laboratory conditions, despite the presence of di- and trisaccharide sugars in the wort. In a number of cases the transcriptional response of genes was not consistent with their known responses to glucose, suggesting a degree of complexity during brewery fermentation which cannot be replicated in small-scale wort fermentations or in laboratory experiments involving defined media. Copyright 2008 John Wiley & Sons, Ltd.
Spliced synthetic genes as internal controls in RNA sequencing experiments.
Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R
2016-09-01
RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.
NASA Astrophysics Data System (ADS)
Noble, Misty L.; Song, Shuxian; Sun, Ryan R.; Fan, Luping; DiBlasi, Robert M.; O'Kelly-Priddy, Colleen; Loeb, Keith R.; Miao, Carol H.
2012-11-01
Ultrasound (US) targeted microbubble (MB) destruction (UTMD) has been shown to be an effective method in delivering drugs and plasmid DNA (pDNA) into cells. We previously reported successful gene transfection of a reporter luciferase gene, pGL4, into livers of mice and rats using UTMD. The challenge is to translate and achieve similar gene expression in large animals, like swine, where the treated tissue volume is substantially larger. The scale-up study requires proportionally increased amount of pDNA/MBs delivered to tissues and an equivalent increase in US energy. We use different MBs and surgical strategies to retain most of pDNA/MB locally during US application in order to maximize the effect of UTMD in gene transfection. Our results show significant increase in luciferase expression in swine injected with MBs and exposed to 2.7 MPa US. We obtained up to 1800-fold enhancement in the pig experiment using Definity® MBs, and 2000-fold and 6300-fold enhancement in two pig studies using RN18 MBs compared to sham. These results represent an important developmental step towards US mediated gene delivery in large animals and clinical trials.
Ray, Sumanta; Hossain, Sk Md Mosaddek; Khatun, Lutfunnesa; Mukhopadhyay, Anirban
2017-12-20
Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.
Barad, Shiri; Sela, Noa; Kumar, Dilip; Kumar-Dubey, Amit; Glam-Matana, Nofar; Sherman, Amir; Prusky, Dov
2016-05-04
Penicillium expansum is a destructive phytopathogen that causes decay in deciduous fruits during postharvest handling and storage. During colonization the fungus secretes D-gluconic acid (GLA), which modulates environmental pH and regulates mycotoxin accumulation in colonized tissue. Till now no transcriptomic analysis has addressed the specific contribution of the pathogen's pH regulation to the P. expansum colonization process. For this purpose total RNA from the leading edge of P. expansum-colonized apple tissue of cv. 'Golden Delicious' and from fungal cultures grown under pH 4 or 7 were sequenced and their gene expression patterns were compared. We present a large-scale analysis of the transcriptome data of P. expansum and apple response to fungal colonization. The fungal analysis revealed nine different clusters of gene expression patterns that were divided among three major groups in which the colonized tissue showed, respectively: (i) differing transcript expression patterns between mycelial growth at pH 4 and pH 7; (ii) similar transcript expression patterns of mycelial growth at pH 4; and (iii) similar transcript expression patterns of mycelial growth at pH 7. Each group was functionally characterized in order to decipher genes that are important for pH regulation and also for colonization of apple fruits by Penicillium. Furthermore, comparison of gene expression of healthy apple tissue with that of colonized tissue showed that differentially expressed genes revealed up-regulation of the jasmonic acid and mevalonate pathways, and also down-regulation of the glycogen and starch biosynthesis pathways. Overall, we identified important genes and functionalities of P. expansum that were controlled by the environmental pH. Differential expression patterns of genes belonging to the same gene family suggest that genes were selectively activated according to their optimal environmental conditions (pH, in vitro or in vivo) to enable the fungus to cope with varying conditions and to make optimal use of available enzymes. Comparison between the activation of the colonized host's gene responses by alkalizing Colletotrichum gloeosporioides and acidifying P. expansum pathogens indicated similar gene response patterns, but stronger responses to P. expansum, suggesting the importance of acidification by P. expansum as a factor in its increased aggressiveness.
At what scale should microarray data be analyzed?
Huang, Shuguang; Yeo, Adeline A; Gelbert, Lawrence; Lin, Xi; Nisenbaum, Laura; Bemis, Kerry G
2004-01-01
The hybridization intensities derived from microarray experiments, for example Affymetrix's MAS5 signals, are very often transformed in one way or another before statistical models are fitted. The motivation for performing transformation is usually to satisfy the model assumptions such as normality and homogeneity in variance. Generally speaking, two types of strategies are often applied to microarray data depending on the analysis need: correlation analysis where all the gene intensities on the array are considered simultaneously, and gene-by-gene ANOVA where each gene is analyzed individually. We investigate the distributional properties of the Affymetrix GeneChip signal data under the two scenarios, focusing on the impact of analyzing the data at an inappropriate scale. The Box-Cox type of transformation is first investigated for the strategy of pooling genes. The commonly used log-transformation is particularly applied for comparison purposes. For the scenario where analysis is on a gene-by-gene basis, the model assumptions such as normality are explored. The impact of using a wrong scale is illustrated by log-transformation and quartic-root transformation. When all the genes on the array are considered together, the dependent relationship between the expression and its variation level can be satisfactorily removed by Box-Cox transformation. When genes are analyzed individually, the distributional properties of the intensities are shown to be gene dependent. Derivation and simulation show that some loss of power is incurred when a wrong scale is used, but due to the robustness of the t-test, the loss is acceptable when the fold-change is not very large.
Mechanisms of macroevolution: polyphagous plasticity in butterfly larvae revealed by RNA-Seq.
de la Paz Celorio-Mancera, Maria; Wheat, Christopher W; Vogel, Heiko; Söderlind, Lina; Janz, Niklas; Nylin, Sören
2013-10-01
Transcriptome studies of insect herbivory are still rare, yet studies in model systems have uncovered patterns of transcript regulation that appear to provide insights into how insect herbivores attain polyphagy, such as a general increase in expression breadth and regulation of ribosomal, digestion- and detoxification-related genes. We investigated the potential generality of these emerging patterns, in the Swedish comma, Polygonia c-album, which is a polyphagous, widely-distributed butterfly. Urtica dioica and Ribes uva-crispa are hosts of P. c-album, but Ribes represents a recent evolutionary shift onto a very divergent host. Utilizing the assembled transcriptome for read mapping, we assessed gene expression finding that caterpillar life-history (i.e. 2nd vs. 4th-instar regulation) had a limited influence on gene expression plasticity. In contrast, differential expression in response to host-plant identified genes encoding serine-type endopeptidases, membrane-associated proteins and transporters. Differential regulation of genes involved in nucleic acid binding was also observed suggesting that polyphagy involves large scale transcriptional changes. Additionally, transcripts coding for structural constituents of the cuticle were differentially expressed in caterpillars in response to their diet indicating that the insect cuticle may be a target for plant defence. Our results state that emerging patterns of transcript regulation from model species appear relevant in species when placed in an evolutionary context. © 2013 John Wiley & Sons Ltd.
Dynamic Visualization of Co-expression in Systems Genetics Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
New, Joshua Ryan; Huang, Jian; Chesler, Elissa J
2008-01-01
Biologists hope to address grand scientific challenges by exploring the abundance of data made available through modern microarray technology and other high-throughput techniques. The impact of this data, however, is limited unless researchers can effectively assimilate such complex information and integrate it into their daily research; interactive visualization tools are called for to support the effort. Specifically, typical studies of gene co-expression require novel visualization tools that enable the dynamic formulation and fine-tuning of hypotheses to aid the process of evaluating sensitivity of key parameters. These tools should allow biologists to develop an intuitive understanding of the structure of biologicalmore » networks and discover genes which reside in critical positions in networks and pathways. By using a graph as a universal data representation of correlation in gene expression data, our novel visualization tool employs several techniques that when used in an integrated manner provide innovative analytical capabilities. Our tool for interacting with gene co-expression data integrates techniques such as: graph layout, qualitative subgraph extraction through a novel 2D user interface, quantitative subgraph extraction using graph-theoretic algorithms or by querying an optimized b-tree, dynamic level-of-detail graph abstraction, and template-based fuzzy classification using neural networks. We demonstrate our system using a real-world workflow from a large-scale, systems genetics study of mammalian gene co-expression.« less
Prader-Willi syndrome: intellectual abilities and behavioural features by genetic subtype.
Milner, Katja M; Craig, Ellen E; Thompson, Russell J; Veltman, Marijcke W M; Thomas, N Simon; Roberts, Sian; Bellamy, Margaret; Curran, Sarah R; Sporikou, Caroline M J; Bolton, Patrick F
2005-10-01
Studies of chromosome 15 abnormality have implicated over-expression of paternally imprinted genes in the 15q11-13 region in the aetiology of autism. To test this hypothesis we compared individuals with Prader-Willi syndrome (PWS) due to uniparental disomy (UPD--where paternally imprinted genes are over-expressed) to individuals with the 15q11-13 deletion form of the syndrome (where paternally imprinted genes are not over-expressed). We also tested reports that PWS cases due to the larger type I (TI) form of deletion show differences to cases with the smaller type II (TII) deletion. Ninety-six individuals with PWS were recruited from genetic centres and the PWS association. Forty-nine individuals were confirmed as having maternal UPD of chromosome 15 and were age and sex matched to 47 individuals with a deletion involving 15q11-13 (32 had the shorter (T II) deletion, and 14 had the longer (TI) deletion). Behavioural assessments were carried out blind to genetic status, using the Autism Diagnostic Observation Schedule (ADOS), the Autism Diagnostic Interview (ADI), the Autism Screening Questionnaire (ASQ), the Children's Yale-Brown Obsessive-Compulsive Scale (CY-BOCS), the Vineland Adaptive Behaviour Scales (VABS), and measurements of intellectual ability, including the Wechsler and Mullen Scales and Raven's Matrices. UPD cases exhibited significantly more autistic-like impairments in reciprocal social interaction on questionnaire, interview and standardised observational measures. Comparison of TI and TII deletion cases revealed few differences, but ability levels tended to be lower in the TI deletion cases. Findings from a large study comparing deletion and UPD forms of Prader-Willi syndrome were consistent with other evidence in indicating that paternally imprinted genes in the 15q11-13 region constitute a genetic risk factor for aspects of autistic symptomatology. These genes may therefore play a role in the aetiology of autism. By contrast with another report, there was no clear-cut relationship between the size of the deletion and the form of cognitive and behavioural phenotype.
Female Behaviour Drives Expression and Evolution of Gustatory Receptors in Butterflies
Briscoe, Adriana D.; Macias-Muñoz, Aide; Kozak, Krzysztof M.; Walters, James R.; Yuan, Furong; Jamie, Gabriel A.; Martin, Simon H.; Dasmahapatra, Kanchon K.; Ferguson, Laura C.; Mallet, James; Jacquin-Joly, Emmanuelle; Jiggins, Chris D.
2013-01-01
Secondary plant compounds are strong deterrents of insect oviposition and feeding, but may also be attractants for specialist herbivores. These insect-plant interactions are mediated by insect gustatory receptors (Grs) and olfactory receptors (Ors). An analysis of the reference genome of the butterfly Heliconius melpomene, which feeds on passion-flower vines (Passiflora spp.), together with whole-genome sequencing within the species and across the Heliconius phylogeny has permitted an unprecedented opportunity to study the patterns of gene duplication and copy-number variation (CNV) among these key sensory genes. We report in silico gene predictions of 73 Gr genes in the H. melpomene reference genome, including putative CO2, sugar, sugar alcohol, fructose, and bitter receptors. The majority of these Grs are the result of gene duplications since Heliconius shared a common ancestor with the monarch butterfly or the silkmoth. Among Grs but not Ors, CNVs are more common within species in those gene lineages that have also duplicated over this evolutionary time-scale, suggesting ongoing rapid gene family evolution. Deep sequencing (∼1 billion reads) of transcriptomes from proboscis and labial palps, antennae, and legs of adult H. melpomene males and females indicates that 67 of the predicted 73 Gr genes and 67 of the 70 predicted Or genes are expressed in these three tissues. Intriguingly, we find that one-third of all Grs show female-biased gene expression (n = 26) and nearly all of these (n = 21) are Heliconius-specific Grs. In fact, a significant excess of Grs that are expressed in female legs but not male legs are the result of recent gene duplication. This difference in Gr gene expression diversity between the sexes is accompanied by a striking sexual dimorphism in the abundance of gustatory sensilla on the forelegs of H. melpomene, suggesting that female oviposition behaviour drives the evolution of new gustatory receptors in butterfly genomes. PMID:23950722
Rajpurohit, Subhash; Oliveira, Cássia C; Etges, William J; Gibbs, Allen G
2013-05-01
We used whole-transcriptome microarrays to assess changes in gene expression and monitored mortality rates and epicuticular hydrocarbons (CHCs) in response to desiccation stress in four natural populations of Drosophila mojavensis from Baja California and mainland Mexico. Desiccation had the greatest effect on gene expression, followed by biogeographical variation at regional and population levels. Genes involved in environmental sensing and cuticular structure were up-regulated in dry conditions, while genes involved in transcription itself were down-regulated. Flies from Baja California had higher expression of reproductive and mitochondrial genes, suggesting that these populations have greater fecundity and higher metabolic rates. Host plant differences had a surprisingly minor effect on the transcriptome. In most cases, desiccation-caused mortality was greater in flies reared on fermenting cactus tissues than that on laboratory media. Water content of adult females and males was significantly different and was lower in Baja California males. Different groups of CHCs simultaneously increased and decreased in amounts due to desiccation exposure of 9 and 18 h and were population-specific and dependent on larval rearing substrates. Overall, we observed that changes in gene expression involved a coordinated response of behavioural, cuticular and metabolic genes. Together with differential expression of cuticular hydrocarbons, this study revealed some of the mechanisms that have allowed D. mojavensis to exploit its harsh desert conditions. Certainly, for D. mojavensis that uses different host plants, population-level understanding of responses to stressors associated with future climate change in desert regions must be evaluated across geographical and local ecological scales. © 2013 Blackwell Publishing Ltd.
Gene Expression in Parp1 Deficient Mice Exposed to a Median Lethal Dose of Gamma Rays.
Kumar, M A Suresh; Laiakis, Evagelia C; Ghandhi, Shanaz A; Morton, Shad R; Fornace, Albert J; Amundson, Sally A
2018-05-10
There is a current interest in the development of biodosimetric methods for rapidly assessing radiation exposure in the wake of a large-scale radiological event. This work was initially focused on determining the exposure dose to an individual using biological indicators. Gene expression signatures show promise for biodosimetric application, but little is known about how these signatures might translate for the assessment of radiological injury in radiosensitive individuals, who comprise a significant proportion of the general population, and who would likely require treatment after exposure to lower doses. Using Parp1 -/- mice as a model radiation-sensitive genotype, we have investigated the effect of this DNA repair deficiency on the gene expression response to radiation. Although Parp1 is known to play general roles in regulating transcription, the pattern of gene expression changes observed in Parp1 -/- mice 24 h postirradiation to a LD 50/30 was remarkably similar to that in wild-type mice after exposure to LD 50/30 . Similar levels of activation of both the p53 and NFκB radiation response pathways were indicated in both strains. In contrast, exposure of wild-type mice to a sublethal dose that was equal to the Parp1 -/- LD 50/30 , which resulted in a lower magnitude gene expression response. Thus, Parp1 -/- mice displayed a heightened gene expression response to radiation, which was more similar to the wild-type response to an equitoxic dose than to an equal absorbed dose. Gene expression classifiers trained on the wild-type data correctly identified all wild-type samples as unexposed, exposed to a sublethal dose or exposed to an LD 50/30 . All unexposed samples from Parp1 -/- mice were also correctly classified with the same gene set, and 80% of irradiated Parp1 -/- samples were identified as exposed to an LD 50/30 . The results of this study suggest that, at least for some pathways that may influence radiosensitivity in humans, specific gene expression signatures have the potential to accurately detect the extent of radiological injury, rather than serving only as a surrogate of physical radiation dose.
A novel X-linked disorder with developmental delay and autistic features.
Kaya, Namik; Colak, Dilek; Albakheet, Albandary; Al-Owain, Mohammad; Abu-Dheim, Nada; Al-Younes, Banan; Al-Zahrani, Jawaher; Mukaddes, Nahit M; Dervent, Aysin; Al-Dosari, Naji; Al-Odaib, Ali; Kayaalp, Inci V; Al-Sayed, Moeenaladin; Al-Hassnan, Zuhair; Nester, Michael J; Al-Dosari, Mohammad; Al-Dhalaan, Hesham; Chedrawi, Aziza; Gunoz, Hulya; Karakas, Bedri; Sakati, Nadia; Alkuraya, Fowzan S; Gascon, Generaso G; Ozand, Pinar T
2012-04-01
Genomic duplications that lead to autism and other human diseases are interesting pathological lesions since the underlying mechanism almost certainly involves dosage sensitive genes. We aim to understand a novel genomic disorder with profound phenotypic consequences, most notably global developmental delay, autism, psychosis, and anorexia nervosa. We evaluated the affected individuals, all maternally related, using childhood autism rating scale (CARS) and Vineland Adaptive scales, magnetic resonance imaging (MRI) and magnetic resonance spectroscopy (MRS) brain, electroencephalography (EEG), electromyography (EMG), muscle biopsy, high-resolution molecular karyotype arrays, Giemsa banding (G-banding) and fluorescent in situ hybridization (FISH) experiments, mitochondrial DNA (mtDNA) sequencing, X-chromosome inactivation study, global gene expression analysis on Epstein-Barr virus (EBV)-transformed lymphoblasts, and quantitative reverse-transcription polymerase chain reaction (qRT-PCR). We have identified a novel Xq12-q13.3 duplication in an extended family. Clinically normal mothers were completely skewed in favor of the normal chromosome X. Global transcriptional profiling of affected individuals and controls revealed significant alterations of genes and pathways in a pattern consistent with previous microarray studies of autism spectrum disorder patients. Moreover, expression analysis revealed copy number-dependent increased messenger RNA (mRNA) levels in affected patients compared to control individuals. A subset of differentially expressed genes was validated using qRT-PCR. Xq12-q13.3 duplication is a novel global developmental delay and autism-predisposing chromosomal aberration; pathogenesis of which may be mediated by increased dosage of genes contained in the duplication, including NLGN3, OPHN1, AR, EFNB1, TAF1, GJB1, and MED12. Copyright © 2011 American Neurological Association.
The Use of Mouse Models to Study Epigenetics
Blewitt, Marnie; Whitelaw, Emma
2013-01-01
Much of what we know about the role of epigenetics in the determination of phenotype has come from studies of inbred mice. Some unusual expression patterns arising from endogenous and transgenic murine alleles, such as the Agouti coat color alleles, have allowed the study of variegation, variable expressivity, transgenerational epigenetic inheritance, parent-of-origin effects, and position effects. These phenomena have taught us much about gene silencing and the probabilistic nature of epigenetic processes. Based on some of these alleles, large-scale mutagenesis screens have broadened our knowledge of epigenetic control by identifying and characterizing novel genes involved in these processes. PMID:24186070
Sign: large-scale gene network estimation environment for high performance computing.
Tamada, Yoshinori; Shimamura, Teppei; Yamaguchi, Rui; Imoto, Seiya; Nagasaki, Masao; Miyano, Satoru
2011-01-01
Our research group is currently developing software for estimating large-scale gene networks from gene expression data. The software, called SiGN, is specifically designed for the Japanese flagship supercomputer "K computer" which is planned to achieve 10 petaflops in 2012, and other high performance computing environments including Human Genome Center (HGC) supercomputer system. SiGN is a collection of gene network estimation software with three different sub-programs: SiGN-BN, SiGN-SSM and SiGN-L1. In these three programs, five different models are available: static and dynamic nonparametric Bayesian networks, state space models, graphical Gaussian models, and vector autoregressive models. All these models require a huge amount of computational resources for estimating large-scale gene networks and therefore are designed to be able to exploit the speed of 10 petaflops. The software will be available freely for "K computer" and HGC supercomputer system users. The estimated networks can be viewed and analyzed by Cell Illustrator Online and SBiP (Systems Biology integrative Pipeline). The software project web site is available at http://sign.hgc.jp/ .
The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection.
Tang, Zaixiang; Shen, Yueping; Zhang, Xinyan; Yi, Nengjun
2017-01-01
Large-scale "omics" data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, there are considerable challenges in analyzing high-dimensional molecular data, including the large number of potential molecular predictors, limited number of samples, and small effect of each predictor. We propose new Bayesian hierarchical generalized linear models, called spike-and-slab lasso GLMs, for prognostic prediction and detection of associated genes using large-scale molecular data. The proposed model employs a spike-and-slab mixture double-exponential prior for coefficients that can induce weak shrinkage on large coefficients, and strong shrinkage on irrelevant coefficients. We have developed a fast and stable algorithm to fit large-scale hierarchal GLMs by incorporating expectation-maximization (EM) steps into the fast cyclic coordinate descent algorithm. The proposed approach integrates nice features of two popular methods, i.e., penalized lasso and Bayesian spike-and-slab variable selection. The performance of the proposed method is assessed via extensive simulation studies. The results show that the proposed approach can provide not only more accurate estimates of the parameters, but also better prediction. We demonstrate the proposed procedure on two cancer data sets: a well-known breast cancer data set consisting of 295 tumors, and expression data of 4919 genes; and the ovarian cancer data set from TCGA with 362 tumors, and expression data of 5336 genes. Our analyses show that the proposed procedure can generate powerful models for predicting outcomes and detecting associated genes. The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). Copyright © 2017 by the Genetics Society of America.
Genome-wide Mapping Reveals Conservation of Promoter DNA Methylation Following Chicken Domestication
Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning
2015-01-01
It is well-known that environment influences DNA methylation, however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergo genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter region between the wild and domestic chicken breeds. Our data suggests a global preservation of DNA methylation between the wild and domestic chicken breeds in either a genome-wide or locus-specific scale in chick muscle tissues. PMID:25735894
Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming
2013-01-01
Background Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. Results In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. Conclusions This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance in J. curcas. PMID:24349370
Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming
2013-01-01
Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance in J. curcas.
Reverse engineering and analysis of large genome-scale gene networks
Aluru, Maneesha; Zola, Jaroslaw; Nettleton, Dan; Aluru, Srinivas
2013-01-01
Reverse engineering the whole-genome networks of complex multicellular organisms continues to remain a challenge. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. To enable fast and accurate reconstruction of large networks, we developed Tool for Inferring Network of Genes (TINGe), a parallel mutual information (MI)-based program. The novel features of our approach include: (i) B-spline-based formulation for linear-time computation of MI, (ii) a novel algorithm for direct permutation testing and (iii) development of parallel algorithms to reduce run-time and facilitate construction of large networks. We assess the quality of our method by comparison with ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks) and GeneNet and demonstrate its unique capability by reverse engineering the whole-genome network of Arabidopsis thaliana from 3137 Affymetrix ATH1 GeneChips in just 9 min on a 1024-core cluster. We further report on the development of a new software Gene Network Analyzer (GeNA) for extracting context-specific subnetworks from a given set of seed genes. Using TINGe and GeNA, we performed analysis of 241 Arabidopsis AraCyc 8.0 pathways, and the results are made available through the web. PMID:23042249
Liu, Guiyou; Zhang, Fang; Jiang, Yongshuai; Hu, Yang; Gong, Zhongying; Liu, Shoufeng; Chen, Xiuju; Jiang, Qinghua; Hao, Junwei
2017-02-01
Much effort has been expended on identifying the genetic determinants of multiple sclerosis (MS). Existing large-scale genome-wide association study (GWAS) datasets provide strong support for using pathway and network-based analysis methods to investigate the mechanisms underlying MS. However, no shared genetic pathways have been identified to date. We hypothesize that shared genetic pathways may indeed exist in different MS-GWAS datasets. Here, we report results from a three-stage analysis of GWAS and expression datasets. In stage 1, we conducted multiple pathway analyses of two MS-GWAS datasets. In stage 2, we performed a candidate pathway analysis of the large-scale MS-GWAS dataset. In stage 3, we performed a pathway analysis using the dysregulated MS gene list from seven human MS case-control expression datasets. In stage 1, we identified 15 shared pathways. In stage 2, we successfully replicated 14 of these 15 significant pathways. In stage 3, we found that dysregulated MS genes were significantly enriched in 10 of 15 MS risk pathways identified in stages 1 and 2. We report shared genetic pathways in different MS-GWAS datasets and highlight some new MS risk pathways. Our findings provide new insights on the genetic determinants of MS.
Mackeh, Rafah; Boughorbel, Sabri; Chaussabel, Damien; Kino, Tomoshige
2017-01-01
The collection of large-scale datasets available in public repositories is rapidly growing and providing opportunities to identify and fill gaps in different fields of biomedical research. However, users of these datasets should be able to selectively browse datasets related to their field of interest. Here we made available a collection of transcriptome datasets related to human follicular cells from normal individuals or patients with polycystic ovary syndrome, in the process of their development, during in vitro fertilization. After RNA-seq dataset exclusion and careful selection based on study description and sample information, 12 datasets, encompassing a total of 85 unique transcriptome profiles, were identified in NCBI Gene Expression Omnibus and uploaded to the Gene Expression Browser (GXB), a web application specifically designed for interactive query and visualization of integrated large-scale data. Once annotated in GXB, multiple sample grouping has been made in order to create rank lists to allow easy data interpretation and comparison. The GXB tool also allows the users to browse a single gene across multiple projects to evaluate its expression profiles in multiple biological systems/conditions in a web-based customized graphical views. The curated dataset is accessible at the following link: http://ivf.gxbsidra.org/dm3/landing.gsp.
Mackeh, Rafah; Boughorbel, Sabri; Chaussabel, Damien; Kino, Tomoshige
2017-01-01
The collection of large-scale datasets available in public repositories is rapidly growing and providing opportunities to identify and fill gaps in different fields of biomedical research. However, users of these datasets should be able to selectively browse datasets related to their field of interest. Here we made available a collection of transcriptome datasets related to human follicular cells from normal individuals or patients with polycystic ovary syndrome, in the process of their development, during in vitro fertilization. After RNA-seq dataset exclusion and careful selection based on study description and sample information, 12 datasets, encompassing a total of 85 unique transcriptome profiles, were identified in NCBI Gene Expression Omnibus and uploaded to the Gene Expression Browser (GXB), a web application specifically designed for interactive query and visualization of integrated large-scale data. Once annotated in GXB, multiple sample grouping has been made in order to create rank lists to allow easy data interpretation and comparison. The GXB tool also allows the users to browse a single gene across multiple projects to evaluate its expression profiles in multiple biological systems/conditions in a web-based customized graphical views. The curated dataset is accessible at the following link: http://ivf.gxbsidra.org/dm3/landing.gsp. PMID:28413616
Li, Huan-Jun; Zhang, De-Huai; Yue, Tong-Hui; Jiang, Lu-Xi; Yu, Xuya; Zhao, Peng; Li, Tao; Xu, Jun-Wei
2016-01-10
Expression of Vitreoscilla hemoglobin (VHb) gene was used to improve polysaccharide production in Ganoderma lucidum. The VHb gene, vgb, under the control of the constitutive glyceraldehyde-3-phosphate dehydrogenase gene promoter was introduced into G. lucidum. The activity of expressed VHb was confirmed by the observation of VHb specific CO-difference spectrum with a maximal absorption at 419 nm for the transformant. The effects of VHb expression on intracellular polysaccharide (IPS) content, extracellular polysaccharide (EPS) production and transcription levels of three genes encoding the enzymes involved in polysaccharide biosynthesis, including phosphoglucomutase (PGM), uridine diphosphate glucose pyrophosphorylase (UGP), and β-1,3-glucan synthase (GLS), were investigated. The maximum IPS content and EPS production in the vgb-bearing G. lucidum were 26.4 mg/100mg dry weight and 0.83 g/L, respectively, which were higher by 30.5% and 88.2% than those of the wild-type strain. The transcription levels of PGM, UGP and GLS were up-regulated by 1.51-, 1.55- and 3.83-fold, respectively, in the vgb-bearing G. lucidum. This work highlights the potential of VHb to enhance G. lucidum polysaccharide production by large scale fermentation. Copyright © 2015 Elsevier B.V. All rights reserved.
Genome-scale approaches to the epigenetics of common human disease
2011-01-01
Traditionally, the pathology of human disease has been focused on microscopic examination of affected tissues, chemical and biochemical analysis of biopsy samples, other available samples of convenience, such as blood, and noninvasive or invasive imaging of varying complexity, in order to classify disease and illuminate its mechanistic basis. The molecular age has complemented this armamentarium with gene expression arrays and selective analysis of individual genes. However, we are entering a new era of epigenomic profiling, i.e., genome-scale analysis of cell-heritable nonsequence genetic change, such as DNA methylation. The epigenome offers access to stable measurements of cellular state and to biobanked material for large-scale epidemiological studies. Some of these genome-scale technologies are beginning to be applied to create the new field of epigenetic epidemiology. PMID:19844740
Takashima, Kayoko; Mizukawa, Yumiko; Morishita, Katsumi; Okuyama, Manabu; Kasahara, Toshihiko; Toritsuka, Naoki; Miyagishima, Toshikazu; Nagao, Taku; Urushidani, Tetsuro
2006-05-08
The Toxicogenomics Project is a 5-year collaborative project by the Japanese government and pharmaceutical companies in 2002. Its aim is to construct a large-scale toxicology database of 150 compounds orally administered to rats. The test consists of a single administration test (3, 6, 9 and 24 h) and a repeated administration test (3, 7, 14 and 28 days), and the conventional toxicology data together with the gene expression data in liver as analyzed by using Affymetrix GeneChip are being accumulated. In the project, either methylcellulose or corn oil is employed as vehicle. We examined whether the vehicle itself affects the analysis of gene expression and found that corn oil alone affected the food consumption and biochemical parameters mainly related to lipid metabolism, and this accompanied typical changes in the gene expression. Most of the genes modulated by corn oil were related to cholesterol or fatty acid metabolism (e.g., CYP7A1, CYP8B1, 3-hydroxy-3-methylglutaryl-Coenzyme A reductase, squalene epoxidase, angiopoietin-like protein 4, fatty acid synthase, fatty acid binding proteins), suggesting that the response was physiologic to the oil intake. Many of the lipid-related genes showed circadian rhythm within a day, but the expression pattern of general clock genes (e.g., period 2, arylhydrocarbon nuclear receptor translocator-like, D site albumin promoter binding protein) were unaffected by corn oil, suggesting that the effects are specific for lipid metabolism. These results would be useful for usage of the database especially when drugs with different vehicle control are compared.
Liu, Jingjing; Yin, Tongming; Ye, Ning; Chen, Yingnan; Yin, Tingting; Liu, Min; Hassani, Danial
2013-01-01
Background The dioecious system is relatively rare in plants. Shrub willow is an annual flowering dioecious woody plant, and possesses many characteristics that lend it as a great model for tracking the missing pieces of sex determination evolution. To gain a global view of the genes differentially expressed in the male and female shrub willows and to develop a database for further studies, we performed a large-scale transcriptome sequencing of flower buds which were separately collected from two types of sexes. Results Totally, 1,201,931 high quality reads were obtained, with an average length of 389 bp and a total length of 467.96 Mb. The ESTs were assembled into 29,048 contigs, and 132,709 singletons. These unigenes were further functionally annotated by comparing their sequences to different proteins and functional domain databases and assigned with Gene Ontology (GO) terms. A biochemical pathway database containing 291 predicted pathways was also created based on the annotations of the unigenes. Digital expression analysis identified 806 differentially expressed genes between the male and female flower buds. And 33 of them located on the incipient sex chromosome of Salicaceae, among which, 12 genes might involve in plant sex determination empirically. These genes were worthy of special notification in future studies. Conclusions In this study, a large number of EST sequences were generated from the flower buds of a male and a female shrub willow. We also reported the differentially expressed genes between the two sex-type flowers. This work provides valuable information and sequence resources for uncovering the sex determining genes and for future functional genomics analysis of Salicaceae spp. PMID:23560075
NASA Astrophysics Data System (ADS)
Mouser, P. J.
2010-12-01
In order to develop decision-making tools for the prediction and optimization of subsurface bioremediation strategies, we must be able to link the molecular-scale activity of microorganisms involved in remediation processes with biogeochemical processes observed at the field-scale. This requires the ability to quantify changes in the in situ metabolic condition of dominant microbes and associate these changes to fluctuations in nutrient levels throughout the bioremediation process. It also necessitates a need to understand the spatiotemporal variability of the molecular-scale information to develop meaningful parameters and constraint ranges in complex bio-physio-chemical models. The expression of three Geobacter species genes (ammonium transporter (amtB), nitrogen fixation (nifD), and a housekeeping gene (recA)) were tracked at two monitoring locations that differed significantly in ammonium (NH4+) concentrations during a field-scale experiment where acetate was injected into the subsurface to simulate Geobacteraceae in a uranium-contaminated aquifer. Analysis of amtB and nifD mRNA transcript levels indicated that NH4+ was the primary form of fixed nitrogen during bioremediation. Overall expression levels of amtB were on average 8-fold higher at NH4+ concentrations of 300 μM or more than at lower NH4+ levels (average 60 μM). The degree of temporal correlation in Geobacter species mRNA expression levels was calculated at both locations using autocorrelation methods that describe the relationship between sample semi-variance and time lag. At the monitoring location with lower NH4+, a temporal correlation lag of 8 days was observed for both amtB and nifD transcript patterns. At the location where higher NH4+ levels were observed, no discernable temporal correlation lag above the sampling frequency (approximately every 2 days) was observed for amtB or nifD transcript fluctuations. Autocorrelation trends in recA expression levels at both locations indicated that while a temporal correlation in the general metabolic activity of Geobacter species may exist, considerable variability in transcript levels masked these correlations at the sampled scale. These findings suggest that when Geobacter species are dependent upon a particular nutrient such as NH4+, the time length for which their activity level relating to this nutrient condition can be predicted is significantly enhanced.
bigSCale: an analytical framework for big-scale single-cell data.
Iacono, Giovanni; Mereu, Elisabetta; Guillaumet-Adkins, Amy; Corominas, Roser; Cuscó, Ivon; Rodríguez-Esteban, Gustavo; Gut, Marta; Pérez-Jurado, Luis Alberto; Gut, Ivo; Heyn, Holger
2018-06-01
Single-cell RNA sequencing (scRNA-seq) has significantly deepened our insights into complex tissues, with the latest techniques capable of processing tens of thousands of cells simultaneously. Analyzing increasing numbers of cells, however, generates extremely large data sets, extending processing time and challenging computing resources. Current scRNA-seq analysis tools are not designed to interrogate large data sets and often lack sensitivity to identify marker genes. With bigSCale, we provide a scalable analytical framework to analyze millions of cells, which addresses the challenges associated with large data sets. To handle the noise and sparsity of scRNA-seq data, bigSCale uses large sample sizes to estimate an accurate numerical model of noise. The framework further includes modules for differential expression analysis, cell clustering, and marker identification. A directed convolution strategy allows processing of extremely large data sets, while preserving transcript information from individual cells. We evaluated the performance of bigSCale using both a biological model of aberrant gene expression in patient-derived neuronal progenitor cells and simulated data sets, which underlines the speed and accuracy in differential expression analysis. To test its applicability for large data sets, we applied bigSCale to assess 1.3 million cells from the mouse developing forebrain. Its directed down-sampling strategy accumulates information from single cells into index cell transcriptomes, thereby defining cellular clusters with improved resolution. Accordingly, index cell clusters identified rare populations, such as reelin ( Reln )-positive Cajal-Retzius neurons, for which we report previously unrecognized heterogeneity associated with distinct differentiation stages, spatial organization, and cellular function. Together, bigSCale presents a solution to address future challenges of large single-cell data sets. © 2018 Iacono et al.; Published by Cold Spring Harbor Laboratory Press.
A recombinant actinomycete, Streptomyces lividans TK23.1, expressing a pIJ702-encoded extracellular lignin peroxidase gene cloned from the chromosome of Streptomyces virodosporus T7A, was released into soil in flask- and microcosm-scale studies to determine its effects on humific...
Graham, Morag R; Smoot, Laura M; Migliaccio, Cristi A Lux; Virtaneva, Kimmo; Sturdevant, Daniel E; Porcella, Stephen F; Federle, Michael J; Adams, Gerald J; Scott, June R; Musser, James M
2002-10-15
Two-component gene regulatory systems composed of a membrane-bound sensor and cytoplasmic response regulator are important mechanisms used by bacteria to sense and respond to environmental stimuli. Group A Streptococcus, the causative agent of mild infections and life-threatening invasive diseases, produces many virulence factors that promote survival in humans. A two-component regulatory system, designated covRS (cov, control of virulence; csrRS), negatively controls expression of five proven or putative virulence factors (capsule, cysteine protease, streptokinase, streptolysin S, and streptodornase). Inactivation of covRS results in enhanced virulence in mouse models of invasive disease. Using DNA microarrays and quantitative RT-PCR, we found that CovR influences transcription of 15% (n = 271) of all chromosomal genes, including many that encode surface and secreted proteins mediating host-pathogen interactions. CovR also plays a central role in gene regulatory networks by influencing expression of genes encoding transcriptional regulators, including other two-component systems. Differential transcription of genes influenced by covR also was identified in mouse soft-tissue infection. This analysis provides a genome-scale overview of a virulence gene network in an important human pathogen and adds insight into the molecular mechanisms used by group A Streptococcus to interact with the host, promote survival, and cause disease.
Jourda, Cyril; Cardi, Céline; Gibert, Olivier; Giraldo Toro, Andrès; Ricci, Julien; Mbéguié-A-Mbéguié, Didier; Yahiaoui, Nabila
2016-01-01
Starch is the most widespread and abundant storage carbohydrate in plants. It is also a major feature of cultivated bananas as it accumulates to large amounts during banana fruit development before almost complete conversion to soluble sugars during ripening. Little is known about the structure of major gene families involved in banana starch metabolism and their evolution compared to other species. To identify genes involved in banana starch metabolism and investigate their evolutionary history, we analyzed six gene families playing a crucial role in plant starch biosynthesis and degradation: the ADP-glucose pyrophosphorylases (AGPases), starch synthases (SS), starch branching enzymes (SBE), debranching enzymes (DBE), α-amylases (AMY) and β-amylases (BAM). Using comparative genomics and phylogenetic approaches, these genes were classified into families and sub-families and orthology relationships with functional genes in Eudicots and in grasses were identified. In addition to known ancestral duplications shaping starch metabolism gene families, independent evolution in banana and grasses also occurred through lineage-specific whole genome duplications for specific sub-families of AGPase, SS, SBE, and BAM genes; and through gene-scale duplications for AMY genes. In particular, banana lineage duplications yielded a set of AGPase, SBE and BAM genes that were highly or specifically expressed in banana fruits. Gene expression analysis highlighted a complex transcriptional reprogramming of starch metabolism genes during ripening of banana fruits. A differential regulation of expression between banana gene duplicates was identified for SBE and BAM genes, suggesting that part of starch metabolism regulation in the fruit evolved in the banana lineage. PMID:27994606
Kim, Unkyu; Siegel, Rachael; Ren, Xiaodi; Gunther, Cary S; Gaasterland, Terry; Roeder, Robert G
2003-07-22
The tissue-specific transcriptional coactivator OCA-B is required for antigen-dependent B cell differentiation events, including germinal center formation. However, the identity of OCA-B target genes involved in this process is unknown. This study has used large-scale cDNA arrays to monitor changes in gene expression patterns that accompany mature B cell differentiation. B cell receptor ligation alone induces many genes involved in B cell expansion, whereas B cell receptor and helper T cell costimulation induce genes associated with B cell effector function. OCA-B expression is induced by both B cell receptor ligation alone and helper T cell costimulation, suggesting that OCA-B is involved in B cell expansion as well as B cell function. Accordingly, several genes involved in cell proliferation and signaling, such as Lck, Kcnn4, Cdc37, cyclin D3, B4galt1, and Ms4a11, have been identified as OCA-B-dependent genes. Further studies on the roles played by these genes in B cells will contribute to an understanding of B cell differentiation.
Singh, Nitesh Kumar; Ernst, Mathias; Liebscher, Volkmar; Fuellen, Georg; Taher, Leila
2016-10-20
The biological relationships both between and within the functions, processes and pathways that operate within complex biological systems are only poorly characterized, making the interpretation of large scale gene expression datasets extremely challenging. Here, we present an approach that integrates gene expression and biological annotation data to identify and describe the interactions between biological functions, processes and pathways that govern a phenotype of interest. The product is a global, interconnected network, not of genes but of functions, processes and pathways, that represents the biological relationships within the system. We validated our approach on two high-throughput expression datasets describing organismal and organ development. Our findings are well supported by the available literature, confirming that developmental processes and apoptosis play key roles in cell differentiation. Furthermore, our results suggest that processes related to pluripotency and lineage commitment, which are known to be critical for development, interact mainly indirectly, through genes implicated in more general biological processes. Moreover, we provide evidence that supports the relevance of cell spatial organization in the developing liver for proper liver function. Our strategy can be viewed as an abstraction that is useful to interpret high-throughput data and devise further experiments.
Transcriptome analysis of a wild bird reveals physiological responses to the urban environment
Watson, Hannah; Videvall, Elin; Andersson, Martin N.; Isaksson, Caroline
2017-01-01
Identifying the molecular basis of environmentally induced phenotypic variation presents exciting opportunities for furthering our understanding of how ecological processes and the environment can shape the phenotype. Urban and rural environments present free-living organisms with different challenges and opportunities, which have marked consequences for the phenotype, yet little is known about responses at the molecular level. We characterised transcriptomes from an urban and a rural population of great tits Parus major, demonstrating striking differences in gene expression profiles in both blood and liver tissues. Differentially expressed genes had functions related to immune and inflammatory responses, detoxification, protection against oxidative stress, lipid metabolism, and regulation of gene expression. Many genes linked to stress responses were expressed at higher levels in the urban birds, in accordance with our prediction that urban animals are exposed to greater environmental stress. This is one of the first studies to reveal transcriptional differences between urban- and rural-dwelling animals and suggests an important role for epigenetics in mediating environmentally induced physiological variation. The study provides valuable resources for developing further in-depth studies of the mechanisms driving phenotypic variation in the urban context at larger spatial and temporal scales. PMID:28290496
Han, Benfeng; Zhang, Shen; Zeng, Fanrong; Mao, Jianjun
2017-01-01
Background The green lacewing, Chrysopa pallens Rambur, is one of the most important natural predators because of its extensive spectrum of prey and wide distribution. However, what we know about the nutritional and reproductive physiology of this species is very scarce. Results By cDNA amplification and Illumina short-read sequencing, we analyzed transcriptomes of C. pallens female adult under starved and fed conditions. In total, 71236 unigenes were obtained with an average length of 833 bp. Four vitellogenins, three insulin-like peptides and two insulin receptors were annotated. Comparison of gene expression profiles suggested that totally 1501 genes were differentially expressed between the two nutritional statuses. KEGG orthology classification showed that these differentially expression genes (DEGs) were mapped to 241 pathways. In turn, the top 4 are ribosome, protein processing in endoplasmic reticulum, biosynthesis of amino acids and carbon metabolism, indicating a distinct difference in nutritional and reproductive signaling between the two feeding conditions. Conclusions Our study yielded large-scale molecular information relevant to C. pallens nutritional and reproductive signaling, which will contribute to mass rearing and commercial use of this predaceous insect species. PMID:28683101
Han, Benfeng; Zhang, Shen; Zeng, Fanrong; Mao, Jianjun
2017-01-01
The green lacewing, Chrysopa pallens Rambur, is one of the most important natural predators because of its extensive spectrum of prey and wide distribution. However, what we know about the nutritional and reproductive physiology of this species is very scarce. By cDNA amplification and Illumina short-read sequencing, we analyzed transcriptomes of C. pallens female adult under starved and fed conditions. In total, 71236 unigenes were obtained with an average length of 833 bp. Four vitellogenins, three insulin-like peptides and two insulin receptors were annotated. Comparison of gene expression profiles suggested that totally 1501 genes were differentially expressed between the two nutritional statuses. KEGG orthology classification showed that these differentially expression genes (DEGs) were mapped to 241 pathways. In turn, the top 4 are ribosome, protein processing in endoplasmic reticulum, biosynthesis of amino acids and carbon metabolism, indicating a distinct difference in nutritional and reproductive signaling between the two feeding conditions. Our study yielded large-scale molecular information relevant to C. pallens nutritional and reproductive signaling, which will contribute to mass rearing and commercial use of this predaceous insect species.
Evolution and Expression of Tissue Globins in Ray-Finned Fishes.
Gallagher, Michael D; Macqueen, Daniel J
2017-01-01
The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great-unrecognized diversity in the repertoire and expression of nonblood globins that has arisen during ray-finned fish evolution.
Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi
2004-02-01
To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Yu, Shijiang; Ding, Lili; Luo, Ren; Li, Xiaojiao; Yang, Juan; Liu, Haoqiang; Cong, Lin; Ran, Chun
2016-01-01
Dialeurodes citri is a major pest in citrus producing areas, and large-scale outbreaks have occurred increasingly often in recent years. Lecanicillium attenuatum is an important entomopathogenic fungus that can parasitize and kill D. citri. We separated the fungus from corpses of D. citri larvae. However, the sound immune defense system of pests makes infection by an entomopathogenic fungus difficult. Here we used RNA sequencing technology (RNA-Seq) to build a transcriptome database for D. citri and performed digital gene expression profiling to screen genes that act in the immune defense of D. citri larvae infected with a pathogenic fungus. De novo assembly generated 84,733 unigenes with mean length of 772 nt. All unigenes were searched against GO, Nr, Swiss-Prot, COG, and KEGG databases and a total of 28,190 (33.3%) unigenes were annotated. We identified 129 immunity-related unigenes in transcriptome database that were related to pattern recognition receptors, information transduction factors and response factors. From the digital gene expression profile, we identified 441 unigenes that were differentially expressed in D. citri infected with L. attenuatum. Through calculated Log2Ratio values, we identified genes for which fold changes in expression were obvious, including cuticle protein, vitellogenin, cathepsin, prophenoloxidase, clip-domain serine protease, lysozyme, and others. Subsequent quantitative real-time polymerase chain reaction analysis verified the results. The identified genes may serve as target genes for microbial control of D. citri.
Yu, Shijiang; Ding, Lili; Luo, Ren; Li, Xiaojiao; Yang, Juan; Liu, Haoqiang; Cong, Lin; Ran, Chun
2016-01-01
Dialeurodes citri is a major pest in citrus producing areas, and large-scale outbreaks have occurred increasingly often in recent years. Lecanicillium attenuatum is an important entomopathogenic fungus that can parasitize and kill D. citri. We separated the fungus from corpses of D. citri larvae. However, the sound immune defense system of pests makes infection by an entomopathogenic fungus difficult. Here we used RNA sequencing technology (RNA-Seq) to build a transcriptome database for D. citri and performed digital gene expression profiling to screen genes that act in the immune defense of D. citri larvae infected with a pathogenic fungus. De novo assembly generated 84,733 unigenes with mean length of 772 nt. All unigenes were searched against GO, Nr, Swiss-Prot, COG, and KEGG databases and a total of 28,190 (33.3%) unigenes were annotated. We identified 129 immunity-related unigenes in transcriptome database that were related to pattern recognition receptors, information transduction factors and response factors. From the digital gene expression profile, we identified 441 unigenes that were differentially expressed in D. citri infected with L. attenuatum. Through calculated Log2Ratio values, we identified genes for which fold changes in expression were obvious, including cuticle protein, vitellogenin, cathepsin, prophenoloxidase, clip-domain serine protease, lysozyme, and others. Subsequent quantitative real-time polymerase chain reaction analysis verified the results. The identified genes may serve as target genes for microbial control of D. citri. PMID:27644092
Thymidine Kinase PET Reporter Gene Imaging of Cancer Cells In Vivo.
McCracken, Melissa N
2018-01-01
Positron emission tomography (PET) is a three dimensional imaging modality that detects the accumulation of radiolabeled isotopes in vivo. Ectopic expression of a thymidine kinase reporter gene allows for the specific detection of reporter cells in vivo by imaging with the reporter specific probe. PET reporter imaging is sensitive, quantitative and can be scaled into larger tumors or animals with little to no tissue diffraction. Here, we describe how thymidine kinase PET reporter genes can be used to noninvasively image cancer cells in vivo.
TOXICOGENOMICS DRUG DISCOVERY AND THE PATHOLOGIST
Toxicogenomics, drug discovery, and pathologist.
The field of toxicogenomics, which currently focuses on the application of large-scale differential gene expression (DGE) data to toxicology, is starting to influence drug discovery and development in the pharmaceutical indu...
Brizuela, Leonardo; Richardson, Aaron; Marsischky, Gerald; Labaer, Joshua
2002-01-01
Thanks to the results of the multiple completed and ongoing genome sequencing projects and to the newly available recombination-based cloning techniques, it is now possible to build gene repositories with no precedent in their composition, formatting, and potential. This new type of gene repository is necessary to address the challenges imposed by the post-genomic era, i.e., experimentation on a genome-wide scale. We are building the FLEXGene (Full Length EXpression-ready) repository. This unique resource will contain clones representing the complete ORFeome of different organisms, including Homo sapiens as well as several pathogens and model organisms. It will consist of a comprehensive, characterized (sequence-verified), and arrayed gene repository. This resource will allow full exploitation of the genomic information by enabling genome-wide scale experimentation at the level of functional/phenotypic assays as well as at the level of protein expression, purification, and analysis. Here we describe the rationale and construction of this resource and focus on the data obtained from the Saccharomyces cerevisiae project.
He, Weiwei; Mu, Wanmeng; Jiang, Bo; Yan, Xin; Zhang, Tao
2016-04-27
A food grade recombinant Bacillus subtilis that produces d-psicose 3-epimerase (DPEase; EC 5.1.3.30) was constructed by transforming a replicative multicopy plasmid with a d-alanine racemase gene marker into B. subtilis 1A751 with the d-alanine racemase gene knocked out. The DPEase was expressed in B. subtilis without antibiotic resistance genes and without adding antibiotics during fermentation. Whole cells of the food grade recombinant B. subtilis were used to biotransform d-fructose to d-allulose. The two tandem promoters, including the HpaII and P43 promoters, increased expression levels compared to the use of one promoter, HpaII. For large-scale d-allulose production, the optimal enzyme dose was 40 enzyme activity units of dry cells per gram of d-fructose, which produced a 28.5% turnover yield in 60 min. The recombinant plasmid exhibited stability over 100 generations. This food grade recombinant B. subtilis may be used for large-scale d-allulose production in the food industry.
Hu, Yan-Hong; Chen, Xiao-Ming; Yang, Pu; Ding, Wei-Feng
2018-04-01
Ericerus pela Chavannes (Hemiptera: Coccoidae) is an economically important scale insect because the second instar males secrete a harvestable wax-like substance. In this study, we report the molecular cloning of a fatty acyl-CoA reductase gene (EpFAR) of E. pela. We predicted a 520-aa protein with the FAR family features from the deduced amino acid sequence. The EpFAR mRNA was expressed in five tested tissues, testis, alimentary canal, fat body, Malpighian tubules, and mostly in cuticle. The EpFAR protein was localized by immunofluorescence only in the wax glands and testis. EpFAR expression in High Five insect cells documented the recombinant EpFAR reduced 26-0:(S) CoA and to its corresponding alcohol. The data illuminate the molecular mechanism for fatty alcohol biosynthesis in a beneficial insect, E. pela. © 2017 Wiley Periodicals, Inc.
Li, Qi-Gang; He, Yong-Han; Wu, Huan; Yang, Cui-Ping; Pu, Shao-Yan; Fan, Song-Qing; Jiang, Li-Ping; Shen, Qiu-Shuo; Wang, Xiao-Xiong; Chen, Xiao-Qiong; Yu, Qin; Li, Ying; Sun, Chang; Wang, Xiangting; Zhou, Jumin; Li, Hai-Peng; Chen, Yong-Bin; Kong, Qing-Peng
2017-01-01
Heterogeneity in transcriptional data hampers the identification of differentially expressed genes (DEGs) and understanding of cancer, essentially because current methods rely on cross-sample normalization and/or distribution assumption-both sensitive to heterogeneous values. Here, we developed a new method, Cross-Value Association Analysis (CVAA), which overcomes the limitation and is more robust to heterogeneous data than the other methods. Applying CVAA to a more complex pan-cancer dataset containing 5,540 transcriptomes discovered numerous new DEGs and many previously rarely explored pathways/processes; some of them were validated, both in vitro and in vivo , to be crucial in tumorigenesis, e.g., alcohol metabolism ( ADH1B ), chromosome remodeling ( NCAPH ) and complement system ( Adipsin ). Together, we present a sharper tool to navigate large-scale expression data and gain new mechanistic insights into tumorigenesis.
2014-01-01
Background In complex large-scale experiments, in addition to simultaneously considering a large number of features, multiple hypotheses are often being tested for each feature. This leads to a problem of multi-dimensional multiple testing. For example, in gene expression studies over ordered categories (such as time-course or dose-response experiments), interest is often in testing differential expression across several categories for each gene. In this paper, we consider a framework for testing multiple sets of hypothesis, which can be applied to a wide range of problems. Results We adopt the concept of the overall false discovery rate (OFDR) for controlling false discoveries on the hypothesis set level. Based on an existing procedure for identifying differentially expressed gene sets, we discuss a general two-step hierarchical hypothesis set testing procedure, which controls the overall false discovery rate under independence across hypothesis sets. In addition, we discuss the concept of the mixed-directional false discovery rate (mdFDR), and extend the general procedure to enable directional decisions for two-sided alternatives. We applied the framework to the case of microarray time-course/dose-response experiments, and proposed three procedures for testing differential expression and making multiple directional decisions for each gene. Simulation studies confirm the control of the OFDR and mdFDR by the proposed procedures under independence and positive correlations across genes. Simulation results also show that two of our new procedures achieve higher power than previous methods. Finally, the proposed methodology is applied to a microarray dose-response study, to identify 17 β-estradiol sensitive genes in breast cancer cells that are induced at low concentrations. Conclusions The framework we discuss provides a platform for multiple testing procedures covering situations involving two (or potentially more) sources of multiplicity. The framework is easy to use and adaptable to various practical settings that frequently occur in large-scale experiments. Procedures generated from the framework are shown to maintain control of the OFDR and mdFDR, quantities that are especially relevant in the case of multiple hypothesis set testing. The procedures work well in both simulations and real datasets, and are shown to have better power than existing methods. PMID:24731138
Tatro, Erick T; Scott, Erick R; Nguyen, Timothy B; Salaria, Shahid; Banerjee, Sugato; Moore, David J; Masliah, Eliezer; Achim, Cristian L; Everall, Ian P
2010-04-26
HIV infection disturbs the central nervous system (CNS) through inflammation and glial activation. Evidence suggests roles for microRNA (miRNA) in host defense and neuronal homeostasis, though little is known about miRNAs' role in HIV CNS infection. MiRNAs are non-coding RNAs that regulate gene translation through post-transcriptional mechanisms. Messenger-RNA profiling alone is insufficient to elucidate the dynamic dance of molecular expression of the genome. We sought to clarify RNA alterations in the frontal cortex (FC) of HIV-infected individuals and those concurrently infected and diagnosed with major depressive disorder (MDD). This report is the first published study of large-scale miRNA profiling from human HIV-infected FC. The goals of this study were to: 1. Identify changes in miRNA expression that occurred in the frontal cortex (FC) of HIV individuals, 2. Determine whether miRNA expression profiles of the FC could differentiate HIV from HIV/MDD, and 3. Adapt a method to meaningfully integrate gene expression data and miRNA expression data in clinical samples. We isolated RNA from the FC (n = 3) of three separate groups (uninfected controls, HIV, and HIV/MDD) and then pooled the RNA within each group for use in large-scale miRNA profiling. RNA from HIV and HIV/MDD patients (n = 4 per group) were also used for non-pooled mRNA analysis on Affymetrix U133 Plus 2.0 arrays. We then utilized a method for integrating the two datasets in a Target Bias Analysis. We found miRNAs of three types: A) Those with many dysregulated mRNA targets of less stringent statistical significance, B) Fewer dysregulated target-genes of highly stringent statistical significance, and C) unclear bias. In HIV/MDD, more miRNAs were downregulated than in HIV alone. Specific miRNA families at targeted chromosomal loci were dysregulated. The dysregulated miRNAs clustered on Chromosomes 14, 17, 19, and X. A small subset of dysregulated genes had many 3' untranslated region (3'UTR) target-sites for dysregulated miRNAs. We provide evidence that certain miRNAs serve as key elements in gene regulatory networks in HIV-infected FC and may be implicated in neurobehavioral disorder. Finally, our data indicates that some genes may serve as hubs of miRNA activity.
NASA Astrophysics Data System (ADS)
Scholz, Jan; Dejori, Mathäus; Stetter, Martin; Greiner, Martin
2005-05-01
The impact of observational noise on the analysis of scale-free networks is studied. Various noise sources are modeled as random link removal, random link exchange and random link addition. Emphasis is on the resulting modifications for the node-degree distribution and for a functional ranking based on betweenness centrality. The implications for estimated gene-expressed networks for childhood acute lymphoblastic leukemia are discussed.
Knoll-Gellida, Anja; André, Michèle; Gattegno, Tamar; Forgue, Jean; Admon, Arie; Babin, Patrick J
2006-01-01
Background The ability of an oocyte to develop into a viable embryo depends on the accumulation of specific maternal information and molecules, such as RNAs and proteins. A serial analysis of gene expression (SAGE) was carried out in parallel with proteomic analysis on fully-grown ovarian follicles from zebrafish (Danio rerio). The data obtained were compared with ovary/follicle/egg molecular phenotypes of other animals, published or available in public sequence databases. Results Sequencing of 27,486 SAGE tags identified 11,399 different ones, including 3,329 tags with an occurrence superior to one. Fifty-eight genes were expressed at over 0.15% of the total population and represented 17.34% of the mRNA population identified. The three most expressed transcripts were a rhamnose-binding lectin, beta-actin 2, and a transcribed locus similar to the H2B histone family. Comparison with the large-scale expressed sequence tags sequencing approach revealed highly expressed transcripts that were not previously known to be expressed at high levels in fish ovaries, like the short-sized polarized metallothionein 2 transcript. A higher sensitivity for the detection of transcripts with a characterized maternal genetic contribution was also demonstrated compared to large-scale sequencing of cDNA libraries. Ferritin heavy polypeptide 1, heat shock protein 90-beta, lactate dehydrogenase B4, beta-actin isoforms, tubulin beta 2, ATP synthase subunit 9, together with 40 S ribosomal protein S27a, were common highly-expressed transcripts of vertebrate ovary/unfertilized egg. Comparison of transcriptome and proteome data revealed that transcript levels provide little predictive value with respect to the extent of protein abundance. All the proteins identified by proteomic analysis of fully-grown zebrafish follicles had at least one transcript counterpart, with two exceptions: eosinophil chemotactic cytokine and nothepsin. Conclusion This study provides a complete sequence data set of maternal mRNA stored in zebrafish germ cells at the end of oogenesis. This catalogue contains highly-expressed transcripts that are part of a vertebrate ovarian expressed gene signature. Comparison of transcriptome and proteome data identified downregulated transcripts or proteins potentially incorporated in the oocyte by endocytosis. The molecular phenotype described provides groundwork for future experimental approaches aimed at identifying functionally important stored maternal transcripts and proteins involved in oogenesis and early stages of embryo development. PMID:16526958
Macro optical projection tomography for large scale 3D imaging of plant structures and gene activity
Lee, Karen J. I.; Calder, Grant M.; Hindle, Christopher R.; Newman, Jacob L.; Robinson, Simon N.; Avondo, Jerome J. H. Y.
2017-01-01
Abstract Optical projection tomography (OPT) is a well-established method for visualising gene activity in plants and animals. However, a limitation of conventional OPT is that the specimen upper size limit precludes its application to larger structures. To address this problem we constructed a macro version called Macro OPT (M-OPT). We apply M-OPT to 3D live imaging of gene activity in growing whole plants and to visualise structural morphology in large optically cleared plant and insect specimens up to 60 mm tall and 45 mm deep. We also show how M-OPT can be used to image gene expression domains in 3D within fixed tissue and to visualise gene activity in 3D in clones of growing young whole Arabidopsis plants. A further application of M-OPT is to visualise plant-insect interactions. Thus M-OPT provides an effective 3D imaging platform that allows the study of gene activity, internal plant structures and plant-insect interactions at a macroscopic scale. PMID:28025317
Orthogonal control of expression mean and variance by epigenetic features at different genomic loci
Dey, Siddharth S.; Foley, Jonathan E.; Limsirichai, Prajit; ...
2015-05-05
While gene expression noise has been shown to drive dramatic phenotypic variations, the molecular basis for this variability in mammalian systems is not well understood. Gene expression has been shown to be regulated by promoter architecture and the associated chromatin environment. However, the exact contribution of these two factors in regulating expression noise has not been explored. Using a dual-reporter lentiviral model system, we deconvolved the influence of the promoter sequence to systematically study the contribution of the chromatin environment at different genomic locations in regulating expression noise. By integrating a large-scale analysis to quantify mRNA levels by smFISH andmore » protein levels by flow cytometry in single cells, we found that mean expression and noise are uncorrelated across genomic locations. Furthermore, we showed that this independence could be explained by the orthogonal control of mean expression by the transcript burst size and noise by the burst frequency. Finally, we showed that genomic locations displaying higher expression noise are associated with more repressed chromatin, thereby indicating the contribution of the chromatin environment in regulating expression noise.« less
Distal-less induces elemental color patterns in Junonia butterfly wings.
Dhungel, Bidur; Ohno, Yoshikazu; Matayoshi, Rie; Iwasaki, Mayo; Taira, Wataru; Adhikari, Kiran; Gurung, Raj; Otaki, Joji M
2016-01-01
The border ocellus, or eyespot, is a conspicuous color pattern element in butterfly wings. For two decades, it has been hypothesized that transcription factors such as Distal-less (Dll) are responsible for eyespot pattern development in butterfly wings, based on their expression in the prospective eyespots. In particular, it has been suggested that Dll is a determinant for eyespot size. However, functional evidence for this hypothesis has remained incomplete, due to technical difficulties. Here, we show that ectopically expressed Dll induces ectopic elemental color patterns in the adult wings of the blue pansy butterfly, Junonia orithya (Lepidoptera, Nymphalidae). Using baculovirus-mediated gene transfer, we misexpressed Dll protein fused with green fluorescent protein (GFP) in pupal wings, resulting in ectopic color patterns, but not the formation of intact eyespots. Induced changes included clusters of black and orange scales (a basic feature of eyespot patterns), black and gray scales, and inhibition of cover scale development. In contrast, ectopic expression of GFP alone did not induce any color pattern changes using the same baculovirus-mediated gene transfer system. These results suggest that Dll plays an instructive role in the development of color pattern elements in butterfly wings, although Dll alone may not be sufficient to induce a complete eyespot. This study thus experimentally supports the hypothesis of Dll function in eyespot development.
Zeng, Fansuo; Sun, Fengkun; Li, Leilei; Liu, Kun; Zhan, Yaguang
2014-01-01
Evidence supporting nitric oxide (NO) as a mediator of plant biochemistry continues to grow, but its functions at the molecular level remains poorly understood and, in some cases, controversial. To study the role of NO at the transcriptional level in Betula platyphylla cells, we conducted a genome-scale transcriptome analysis of these cells. The transcriptome of untreated birch cells and those treated by sodium nitroprusside (SNP) were analyzed using the Solexa sequencing. Data were collected by sequencing cDNA libraries of birch cells, which had a long period to adapt to the suspension culture conditions before SNP-treated cells and untreated cells were sampled. Among the 34,100 UniGenes detected, BLASTX search revealed that 20,631 genes showed significant (E-values≤10−5) sequence similarity with proteins from the NR-database. Numerous expressed sequence tags (i.e., 1374) were identified as differentially expressed between the 12 h SNP-treated cells and control cells samples: 403 up-regulated and 971 down-regulated. From this, we specifically examined a core set of NO-related transcripts. The altered expression levels of several transcripts, as determined by transcriptome analysis, was confirmed by qRT-PCR. The results of transcriptome analysis, gene expression quantification, the content of triterpenoid and activities of defensive enzymes elucidated NO has a significant effect on many processes including triterpenoid production, carbohydrate metabolism and cell wall biosynthesis. PMID:25551661
Psychological Well-Being and the Human Conserved Transcriptional Response to Adversity
Fredrickson, Barbara L.; Grewen, Karen M.; Algoe, Sara B.; Firestine, Ann M.; Arevalo, Jesusa M. G.; Ma, Jeffrey; Cole, Steve W.
2015-01-01
Research in human social genomics has identified a conserved transcriptional response to adversity (CTRA) characterized by up-regulated expression of pro-inflammatory genes and down-regulated expression of Type I interferon- and antibody-related genes. This report seeks to identify the specific aspects of positive psychological well-being that oppose such effects and predict reduced CTRA gene expression. In a new confirmation study of 122 healthy adults that replicated the approach of a previously reported discovery study, mixed effect linear model analyses identified a significant inverse association between expression of CTRA indicator genes and a summary measure of eudaimonic well-being from the Mental Health Continuum – Short Form. Analyses of a 2- representation of eudaimonia converged in finding correlated psychological and social subdomains of eudaimonic well-being to be the primary carriers of CTRA associations. Hedonic well-being showed no consistent CTRA association independent of eudaimonic well-being, and summary measures integrating hedonic and eudaimonic well-being showed less stable CTRA associations than did focal measures of eudaimonia (psychological and social well-being). Similar results emerged from analyses of pooled discovery and confirmation samples (n = 198). Similar results also emerged from analyses of a second new generalization study of 107 healthy adults that included the more detailed Ryff Scales of Psychological Well-being and found this more robust measure of eudaimonic well-being to also associate with reduced CTRA gene expression. Five of the 6 major sub-domains of psychological well-being predicted reduced CTRA gene expression when analyzed separately, and 3 remained distinctively prognostic in mutually adjusted analyses. All associations were independent of demographic characteristics, health-related confounders, and RNA indicators of leukocyte subset distribution. These results identify specific sub-dimensions of eudaimonic well-being as promising targets for future interventions to mitigate CTRA gene expression, and provide no support for any independent favorable contribution from hedonic well-being. PMID:25811656
Huang, Xianzhong; Yang, Lifei; Jin, Yuhuan; Lin, Jun; Liu, Fang
2017-01-01
Arabidopsis pumila is an ephemeral plant, and a close relative of the model plant Arabidopsis thaliana , but it possesses higher photosynthetic efficiency, higher propagation rate, and higher salinity tolerance compared to those A. thaliana , thus providing a candidate plant system for gene mining for environmental adaption and salt tolerance. However, A. pumila is an under-explored resource for understanding the genetic mechanisms underlying abiotic stress adaptation. To improve our understanding of the molecular and genetic mechanisms of salt stress adaptation, more than 19,900 clones randomly selected from a cDNA library constructed previously from leaf tissue exposed to high-salinity shock were sequenced. A total of 16,014 high-quality expressed sequence tags (ESTs) were generated, which have been deposited in the dbEST GenBank under accession numbers JZ932319 to JZ948332. Clustering and assembly of these ESTs resulted in the identification of 8,835 unique sequences, consisting of 2,469 contigs and 6,366 singletons. The blastx results revealed 8,011 unigenes with significant similarity to known genes, while only 425 unigenes remained uncharacterized. Functional classification demonstrated an abundance of unigenes involved in binding, catalytic, structural or transporter activities, and in pathways of energy, carbohydrate, amino acid, or lipid metabolism. At least seven main classes of genes were related to salt-tolerance among the 8,835 unigenes. Many previously reported salt tolerance genes were also manifested in this library, for example VP1, H + -ATPase, NHX1, SOS2, SOS3, NAC, MYB, ERF, LEA, P5CS1 . In addition, 251 transcription factors were identified from the library, classified into 42 families. Lastly, changes in expression of the 12 most abundant unigenes, 12 transcription factor genes, and 19 stress-related genes in the first 24 h of exposure to high-salinity stress conditions were monitored by qRT-PCR. The large-scale EST library obtained in this study provides first-hand information on gene sequences expressed in young leaves of A. pumila exposed to salt shock. The rapid discovery of known or unknown genes related to salinity stress response in A. pumila will facilitate the understanding of complex adaptive mechanisms for ephemerals.
Prokaryotic Gene Clusters: A Rich Toolbox for Synthetic Biology
Fischbach, Michael; Voigt, Christopher A.
2014-01-01
Bacteria construct elaborate nanostructures, obtain nutrients and energy from diverse sources, synthesize complex molecules, and implement signal processing to react to their environment. These complex phenotypes require the coordinated action of multiple genes, which are often encoded in a contiguous region of the genome, referred to as a gene cluster. Gene clusters sometimes contain all of the genes necessary and sufficient for a particular function. As an evolutionary mechanism, gene clusters facilitate the horizontal transfer of the complete function between species. Here, we review recent work on a number of clusters whose functions are relevant to biotechnology. Engineering these clusters has been hindered by their regulatory complexity, the need to balance the expression of many genes, and a lack of tools to design and manipulate DNA at this scale. Advances in synthetic biology will enable the large-scale bottom-up engineering of the clusters to optimize their functions, wake up cryptic clusters, or to transfer them between organisms. Understanding and manipulating gene clusters will move towards an era of genome engineering, where multiple functions can be “mixed-and-matched” to create a designer organism. PMID:21154668
Functional regression method for whole genome eQTL epistasis analysis with sequencing data.
Xu, Kelin; Jin, Li; Xiong, Momiao
2017-05-18
Epistasis plays an essential rule in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position level read count curves. A single number for measuring gene expression which is widely used for microarray measured gene expression analysis is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pair-wises SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and use all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genome Project. The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have a broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice of the epistatic analysis with sequencing data.
Selinger, D A; Chandler, V L
1999-12-21
The b locus encodes a transcription factor that regulates the expression of genes that produce purple anthocyanin pigment. Different b alleles are expressed in distinct tissues, causing tissue-specific anthocyanin production. Understanding how phenotypic diversity is produced and maintained at the b locus should provide models for how other regulatory genes, including those that influence morphological traits and development, evolve. We have investigated how different levels and patterns of pigmentation have evolved by determining the phenotypic and evolutionary relationships between 18 alleles that represent the diversity of b alleles in Zea mays. Although most of these alleles have few phenotypic differences, five alleles have very distinct tissue-specific patterns of pigmentation. Superimposing the phenotypes on the molecular phylogeny reveals that the alleles with strong and distinctive patterns of expression are closely related to alleles with weak expression, implying that the distinctive patterns have arisen recently. We have identified apparent insertions in three of the five phenotypically distinct alleles, and the fourth has unique upstream restriction fragment length polymorphisms relative to closely related alleles. The insertion in B-Peru has been shown to be responsible for its unique expression and, in the other two alleles, the presence of the insertion correlates with the phenotype. These results suggest that major changes in gene expression are probably the result of large-scale changes in DNA sequence and/or structure most likely mediated by transposable elements.
Müller, Christian; Schillert, Arne; Röthemeier, Caroline; Trégouët, David-Alexandre; Proust, Carole; Binder, Harald; Pfeiffer, Norbert; Beutel, Manfred; Lackner, Karl J.; Schnabel, Renate B.; Tiret, Laurence; Wild, Philipp S.; Blankenberg, Stefan
2016-01-01
Technical variation plays an important role in microarray-based gene expression studies, and batch effects explain a large proportion of this noise. It is therefore mandatory to eliminate technical variation while maintaining biological variability. Several strategies have been proposed for the removal of batch effects, although they have not been evaluated in large-scale longitudinal gene expression data. In this study, we aimed at identifying a suitable method for batch effect removal in a large study of microarray-based longitudinal gene expression. Monocytic gene expression was measured in 1092 participants of the Gutenberg Health Study at baseline and 5-year follow up. Replicates of selected samples were measured at both time points to identify technical variability. Deming regression, Passing-Bablok regression, linear mixed models, non-linear models as well as ReplicateRUV and ComBat were applied to eliminate batch effects between replicates. In a second step, quantile normalization prior to batch effect correction was performed for each method. Technical variation between batches was evaluated by principal component analysis. Associations between body mass index and transcriptomes were calculated before and after batch removal. Results from association analyses were compared to evaluate maintenance of biological variability. Quantile normalization, separately performed in each batch, combined with ComBat successfully reduced batch effects and maintained biological variability. ReplicateRUV performed perfectly in the replicate data subset of the study, but failed when applied to all samples. All other methods did not substantially reduce batch effects in the replicate data subset. Quantile normalization plus ComBat appears to be a valuable approach for batch correction in longitudinal gene expression data. PMID:27272489
2012-01-01
Background Colorectal carcinomas (CRC) carry massive genetic and transcriptional alterations that influence multiple cellular pathways. The study of proteins whose loss-of-function (LOF) alters the growth of CRC cells can be used to further understand the cellular processes cancer cells depend upon for survival. Results A small-scale RNAi screen of ~400 genes conducted in SW480 CRC cells identified several candidate genes as required for the viability of CRC cells, most prominently CASP8AP2/FLASH. To understand the function of this gene in maintaining the viability of CRC cells in an unbiased manner, we generated gene specific expression profiles following RNAi. Silencing of CASP8AP2/FLASH resulted in altered expression of over 2500 genes enriched for genes associated with cellular growth and proliferation. Loss of CASP8AP2/FLASH function was significantly associated with altered transcription of the genes encoding the replication-dependent histone proteins as a result of the expression of the non-canonical polyA variants of these transcripts. Silencing of CASP8AP2/FLASH also mediated enrichment of changes in the expression of targets of the NFκB and MYC transcription factors. These findings were confirmed by whole transcriptome analysis of CASP8AP2/FLASH silenced cells at multiple time points. Finally, we identified and validated that CASP8AP2/FLASH LOF increases the expression of neurofilament heavy polypeptide (NEFH), a protein recently linked to regulation of the AKT1/ß-catenin pathway. Conclusions We have used unbiased RNAi based approaches to identify and characterize the function of CASP8AP2/FLASH, a protein not previously reported as required for cell survival. This study further defines the role CASP8AP2/FLASH plays in the regulating expression of the replication-dependent histones and shows that its LOF results in broad and reproducible effects on the transcriptome of colorectal cancer cells including the induction of expression of the recently described tumor suppressor gene NEFH. PMID:22216762
Hummon, Amanda B; Pitt, Jason J; Camps, Jordi; Emons, Georg; Skube, Susan B; Huppi, Konrad; Jones, Tamara L; Beissbarth, Tim; Kramer, Frank; Grade, Marian; Difilippantonio, Michael J; Ried, Thomas; Caplen, Natasha J
2012-01-04
Colorectal carcinomas (CRC) carry massive genetic and transcriptional alterations that influence multiple cellular pathways. The study of proteins whose loss-of-function (LOF) alters the growth of CRC cells can be used to further understand the cellular processes cancer cells depend upon for survival. A small-scale RNAi screen of ~400 genes conducted in SW480 CRC cells identified several candidate genes as required for the viability of CRC cells, most prominently CASP8AP2/FLASH. To understand the function of this gene in maintaining the viability of CRC cells in an unbiased manner, we generated gene specific expression profiles following RNAi. Silencing of CASP8AP2/FLASH resulted in altered expression of over 2500 genes enriched for genes associated with cellular growth and proliferation. Loss of CASP8AP2/FLASH function was significantly associated with altered transcription of the genes encoding the replication-dependent histone proteins as a result of the expression of the non-canonical polyA variants of these transcripts. Silencing of CASP8AP2/FLASH also mediated enrichment of changes in the expression of targets of the NFκB and MYC transcription factors. These findings were confirmed by whole transcriptome analysis of CASP8AP2/FLASH silenced cells at multiple time points. Finally, we identified and validated that CASP8AP2/FLASH LOF increases the expression of neurofilament heavy polypeptide (NEFH), a protein recently linked to regulation of the AKT1/ß-catenin pathway. We have used unbiased RNAi based approaches to identify and characterize the function of CASP8AP2/FLASH, a protein not previously reported as required for cell survival. This study further defines the role CASP8AP2/FLASH plays in the regulating expression of the replication-dependent histones and shows that its LOF results in broad and reproducible effects on the transcriptome of colorectal cancer cells including the induction of expression of the recently described tumor suppressor gene NEFH.
Schmidt-Heck, Wolfgang; Wönne, Eva C; Hiller, Thomas; Menzel, Uwe; Koczan, Dirk; Damm, Georg; Seehofer, Daniel; Knöspel, Fanny; Freyer, Nora; Guthke, Reinhard; Dooley, Steven; Zeilinger, Katrin
2017-05-01
The liver is the major site for alcohol metabolism in the body and therefore the primary target organ for ethanol (EtOH)-induced toxicity. In this study, we investigated the in vitro response of human liver cells to different EtOH concentrations in a perfused bioartificial liver device that mimics the complex architecture of the natural organ. Primary human liver cells were cultured in the bioartificial liver device and treated for 24 hours with medium containing 150 mM (low), 300 mM (medium), or 600 mM (high) EtOH, while a control culture was kept untreated. Gene expression patterns for each EtOH concentration were monitored using Affymetrix Human Gene 1.0 ST Gene chips. Scaled expression profiles of differentially expressed genes (DEGs) were clustered using Fuzzy c-means algorithm. In addition, functional classification methods, KEGG pathway mapping and also a machine learning approach (Random Forest) were utilized. A number of 966 (150 mM EtOH), 1,334 (300 mM EtOH), or 4,132 (600 mM EtOH) genes were found to be differentially expressed. Dose-response relationships of the identified clusters of co-expressed genes showed a monotonic, threshold, or nonmonotonic (hormetic) behavior. Functional classification of DEGs revealed that low or medium EtOH concentrations operate adaptation processes, while alterations observed for the high EtOH concentration reflect the response to cellular damage. The genes displaying a hormetic response were functionally characterized by overrepresented "cellular ketone metabolism" and "carboxylic acid metabolism." Altered expression of the genes BAHD1 and H3F3B was identified as sufficient to classify the samples according to the applied EtOH doses. Different pathways of metabolic and epigenetic regulation are affected by EtOH exposition and partly undergo hormetic regulation in the bioartificial liver device. Gene expression changes observed at high EtOH concentrations reflect in some aspects the situation of alcoholic hepatitis in humans. Copyright © 2017 by the Research Society on Alcoholism.
Expression of the Arabidopsis thaliana BBX32 gene in soybean increases grain yield.
Preuss, Sasha B; Meister, Robert; Xu, Qingzhang; Urwin, Carl P; Tripodi, Federico A; Screen, Steven E; Anil, Veena S; Zhu, Shuquan; Morrell, James A; Liu, Grace; Ratcliffe, Oliver J; Reuber, T Lynne; Khanna, Rajnish; Goldman, Barry S; Bell, Erin; Ziegler, Todd E; McClerren, Amanda L; Ruff, Thomas G; Petracek, Marie E
2012-01-01
Crop yield is a highly complex quantitative trait. Historically, successful breeding for improved grain yield has led to crop plants with improved source capacity, altered plant architecture, and increased resistance to abiotic and biotic stresses. To date, transgenic approaches towards improving crop grain yield have primarily focused on protecting plants from herbicide, insects, or disease. In contrast, we have focused on identifying genes that, when expressed in soybean, improve the intrinsic ability of the plant to yield more. Through the large scale screening of candidate genes in transgenic soybean, we identified an Arabidopsis thaliana B-box domain gene (AtBBX32) that significantly increases soybean grain yield year after year in multiple transgenic events in multi-location field trials. In order to understand the underlying physiological changes that are associated with increased yield in transgenic soybean, we examined phenotypic differences in two AtBBX32-expressing lines and found increases in plant height and node, flower, pod, and seed number. We propose that these phenotypic changes are likely the result of changes in the timing of reproductive development in transgenic soybean that lead to the increased duration of the pod and seed development period. Consistent with the role of BBX32 in A. thaliana in regulating light signaling, we show that the constitutive expression of AtBBX32 in soybean alters the abundance of a subset of gene transcripts in the early morning hours. In particular, AtBBX32 alters transcript levels of the soybean clock genes GmTOC1 and LHY-CCA1-like2 (GmLCL2). We propose that through the expression of AtBBX32 and modulation of the abundance of circadian clock genes during the transition from dark to light, the timing of critical phases of reproductive development are altered. These findings demonstrate a specific role for AtBBX32 in modulating soybean development, and demonstrate the validity of expressing single genes in crops to deliver increased agricultural productivity.
Large-scale gene function analysis with the PANTHER classification system.
Mi, Huaiyu; Muruganujan, Anushya; Casagrande, John T; Thomas, Paul D
2013-08-01
The PANTHER (protein annotation through evolutionary relationship) classification system (http://www.pantherdb.org/) is a comprehensive system that combines gene function, ontology, pathways and statistical analysis tools that enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics or gene expression experiments. The system is built with 82 complete genomes organized into gene families and subfamilies, and their evolutionary relationships are captured in phylogenetic trees, multiple sequence alignments and statistical models (hidden Markov models or HMMs). Genes are classified according to their function in several different ways: families and subfamilies are annotated with ontology terms (Gene Ontology (GO) and PANTHER protein class), and sequences are assigned to PANTHER pathways. The PANTHER website includes a suite of tools that enable users to browse and query gene functions, and to analyze large-scale experimental data with a number of statistical tests. It is widely used by bench scientists, bioinformaticians, computer scientists and systems biologists. In the 2013 release of PANTHER (v.8.0), in addition to an update of the data content, we redesigned the website interface to improve both user experience and the system's analytical capability. This protocol provides a detailed description of how to analyze genome-wide experimental data with the PANTHER classification system.
Micro-Scale Genomic DNA Copy Number Aberrations as Another Means of Mutagenesis in Breast Cancer
Chao, Hann-Hsiang; He, Xiaping; Parker, Joel S.; Zhao, Wei; Perou, Charles M.
2012-01-01
Introduction In breast cancer, the basal-like subtype has high levels of genomic instability relative to other breast cancer subtypes with many basal-like-specific regions of aberration. There is evidence that this genomic instability extends to smaller scale genomic aberrations, as shown by a previously described micro-deletion event in the PTEN gene in the Basal-like SUM149 breast cancer cell line. Methods We sought to identify if small regions of genomic DNA copy number changes exist by using a high density, gene-centric Comparative Genomic Hybridizations (CGH) array on cell lines and primary tumors. A custom tiling array for CGH (244,000 probes, 200 bp tiling resolution) was created to identify small regions of genomic change, which was focused on previously identified basal-like-specific, and general cancer genes. Tumor genomic DNA from 94 patients and 2 breast cancer cell lines was labeled and hybridized to these arrays. Aberrations were called using SWITCHdna and the smallest 25% of SWITCHdna-defined genomic segments were called micro-aberrations (<64 contiguous probes, ∼ 15 kb). Results Our data showed that primary tumor breast cancer genomes frequently contained many small-scale copy number gains and losses, termed micro-aberrations, most of which are undetectable using typical-density genome-wide aCGH arrays. The basal-like subtype exhibited the highest incidence of these events. These micro-aberrations sometimes altered expression of the involved gene. We confirmed the presence of the PTEN micro-amplification in SUM149 and by mRNA-seq showed that this resulted in loss of expression of all exons downstream of this event. Micro-aberrations disproportionately affected the 5′ regions of the affected genes, including the promoter region, and high frequency of micro-aberrations was associated with poor survival. Conclusion Using a high-probe-density, gene-centric aCGH microarray, we present evidence of small-scale genomic aberrations that can contribute to gene inactivation. These events may contribute to tumor formation through mechanisms not detected using conventional DNA copy number analyses. PMID:23284754
Bertrand, Erin M; McCrow, John P; Moustafa, Ahmed; Zheng, Hong; McQuaid, Jeffrey B; Delmont, Tom O; Post, Anton F; Sipler, Rachel E; Spackeen, Jenna L; Xu, Kai; Bronk, Deborah A; Hutchins, David A; Allen, Andrew E
2015-08-11
Southern Ocean primary productivity plays a key role in global ocean biogeochemistry and climate. At the Southern Ocean sea ice edge in coastal McMurdo Sound, we observed simultaneous cobalamin and iron limitation of surface water phytoplankton communities in late Austral summer. Cobalamin is produced only by bacteria and archaea, suggesting phytoplankton-bacterial interactions must play a role in this limitation. To characterize these interactions and investigate the molecular basis of multiple nutrient limitation, we examined transitions in global gene expression over short time scales, induced by shifts in micronutrient availability. Diatoms, the dominant primary producers, exhibited transcriptional patterns indicative of co-occurring iron and cobalamin deprivation. The major contributor to cobalamin biosynthesis gene expression was a gammaproteobacterial population, Oceanospirillaceae ASP10-02a. This group also contributed significantly to metagenomic cobalamin biosynthesis gene abundance throughout Southern Ocean surface waters. Oceanospirillaceae ASP10-02a displayed elevated expression of organic matter acquisition and cell surface attachment-related genes, consistent with a mutualistic relationship in which they are dependent on phytoplankton growth to fuel cobalamin production. Separate bacterial groups, including Methylophaga, appeared to rely on phytoplankton for carbon and energy sources, but displayed gene expression patterns consistent with iron and cobalamin deprivation. This suggests they also compete with phytoplankton and are important cobalamin consumers. Expression patterns of siderophore- related genes offer evidence for bacterial influences on iron availability as well. The nature and degree of this episodic colimitation appear to be mediated by a series of phytoplankton-bacterial interactions in both positive and negative feedback loops.
Bertrand, Erin M.; McCrow, John P.; Moustafa, Ahmed; Zheng, Hong; McQuaid, Jeffrey B.; Delmont, Tom O.; Post, Anton F.; Sipler, Rachel E.; Spackeen, Jenna L.; Xu, Kai; Bronk, Deborah A.; Hutchins, David A.; Allen, Andrew E.
2015-01-01
Southern Ocean primary productivity plays a key role in global ocean biogeochemistry and climate. At the Southern Ocean sea ice edge in coastal McMurdo Sound, we observed simultaneous cobalamin and iron limitation of surface water phytoplankton communities in late Austral summer. Cobalamin is produced only by bacteria and archaea, suggesting phytoplankton–bacterial interactions must play a role in this limitation. To characterize these interactions and investigate the molecular basis of multiple nutrient limitation, we examined transitions in global gene expression over short time scales, induced by shifts in micronutrient availability. Diatoms, the dominant primary producers, exhibited transcriptional patterns indicative of co-occurring iron and cobalamin deprivation. The major contributor to cobalamin biosynthesis gene expression was a gammaproteobacterial population, Oceanospirillaceae ASP10-02a. This group also contributed significantly to metagenomic cobalamin biosynthesis gene abundance throughout Southern Ocean surface waters. Oceanospirillaceae ASP10-02a displayed elevated expression of organic matter acquisition and cell surface attachment-related genes, consistent with a mutualistic relationship in which they are dependent on phytoplankton growth to fuel cobalamin production. Separate bacterial groups, including Methylophaga, appeared to rely on phytoplankton for carbon and energy sources, but displayed gene expression patterns consistent with iron and cobalamin deprivation. This suggests they also compete with phytoplankton and are important cobalamin consumers. Expression patterns of siderophore- related genes offer evidence for bacterial influences on iron availability as well. The nature and degree of this episodic colimitation appear to be mediated by a series of phytoplankton–bacterial interactions in both positive and negative feedback loops. PMID:26221022
Applications of Proteomic Technologies to Toxicology
Proteomics is the large-scale study of gene expression at the protein level. This cutting edge technology has been extensively applied to toxicology research recently. The up-to-date development of proteomics has presented the toxicology community with an unprecedented opportunit...
ProbFAST: Probabilistic functional analysis system tool.
Silva, Israel T; Vêncio, Ricardo Z N; Oliveira, Thiago Y K; Molfetta, Greice A; Silva, Wilson A
2010-03-30
The post-genomic era has brought new challenges regarding the understanding of the organization and function of the human genome. Many of these challenges are centered on the meaning of differential gene regulation under distinct biological conditions and can be performed by analyzing the Multiple Differential Expression (MDE) of genes associated with normal and abnormal biological processes. Currently MDE analyses are limited to usual methods of differential expression initially designed for paired analysis. We proposed a web platform named ProbFAST for MDE analysis which uses Bayesian inference to identify key genes that are intuitively prioritized by means of probabilities. A simulated study revealed that our method gives a better performance when compared to other approaches and when applied to public expression data, we demonstrated its flexibility to obtain relevant genes biologically associated with normal and abnormal biological processes. ProbFAST is a free accessible web-based application that enables MDE analysis on a global scale. It offers an efficient methodological approach for MDE analysis of a set of genes that are turned on and off related to functional information during the evolution of a tumor or tissue differentiation. ProbFAST server can be accessed at http://gdm.fmrp.usp.br/probfast.
ProbFAST: Probabilistic Functional Analysis System Tool
2010-01-01
Background The post-genomic era has brought new challenges regarding the understanding of the organization and function of the human genome. Many of these challenges are centered on the meaning of differential gene regulation under distinct biological conditions and can be performed by analyzing the Multiple Differential Expression (MDE) of genes associated with normal and abnormal biological processes. Currently MDE analyses are limited to usual methods of differential expression initially designed for paired analysis. Results We proposed a web platform named ProbFAST for MDE analysis which uses Bayesian inference to identify key genes that are intuitively prioritized by means of probabilities. A simulated study revealed that our method gives a better performance when compared to other approaches and when applied to public expression data, we demonstrated its flexibility to obtain relevant genes biologically associated with normal and abnormal biological processes. Conclusions ProbFAST is a free accessible web-based application that enables MDE analysis on a global scale. It offers an efficient methodological approach for MDE analysis of a set of genes that are turned on and off related to functional information during the evolution of a tumor or tissue differentiation. ProbFAST server can be accessed at http://gdm.fmrp.usp.br/probfast. PMID:20353576
Smith, Ian; Greenside, Peyton G; Natoli, Ted; Lahr, David L; Wadden, David; Tirosh, Itay; Narayan, Rajiv; Root, David E; Golub, Todd R; Subramanian, Aravind; Doench, John G
2017-11-01
The application of RNA interference (RNAi) to mammalian cells has provided the means to perform phenotypic screens to determine the functions of genes. Although RNAi has revolutionized loss-of-function genetic experiments, it has been difficult to systematically assess the prevalence and consequences of off-target effects. The Connectivity Map (CMAP) represents an unprecedented resource to study the gene expression consequences of expressing short hairpin RNAs (shRNAs). Analysis of signatures for over 13,000 shRNAs applied in 9 cell lines revealed that microRNA (miRNA)-like off-target effects of RNAi are far stronger and more pervasive than generally appreciated. We show that mitigating off-target effects is feasible in these datasets via computational methodologies to produce a consensus gene signature (CGS). In addition, we compared RNAi technology to clustered regularly interspaced short palindromic repeat (CRISPR)-based knockout by analysis of 373 single guide RNAs (sgRNAs) in 6 cells lines and show that the on-target efficacies are comparable, but CRISPR technology is far less susceptible to systematic off-target effects. These results will help guide the proper use and analysis of loss-of-function reagents for the determination of gene function.
CellLineNavigator: a workbench for cancer cell line analysis
Krupp, Markus; Itzel, Timo; Maass, Thorsten; Hildebrandt, Andreas; Galle, Peter R.; Teufel, Andreas
2013-01-01
The CellLineNavigator database, freely available at http://www.medicalgenomics.org/celllinenavigator, is a web-based workbench for large scale comparisons of a large collection of diverse cell lines. It aims to support experimental design in the fields of genomics, systems biology and translational biomedical research. Currently, this compendium holds genome wide expression profiles of 317 different cancer cell lines, categorized into 57 different pathological states and 28 individual tissues. To enlarge the scope of CellLineNavigator, the database was furthermore closely linked to commonly used bioinformatics databases and knowledge repositories. To ensure easy data access and search ability, a simple data and an intuitive querying interface were implemented. It allows the user to explore and filter gene expression, focusing on pathological or physiological conditions. For a more complex search, the advanced query interface may be used to query for (i) differentially expressed genes; (ii) pathological or physiological conditions; or (iii) gene names or functional attributes, such as Kyoto Encyclopaedia of Genes and Genomes pathway maps. These queries may also be combined. Finally, CellLineNavigator allows additional advanced analysis of differentially regulated genes by a direct link to the Database for Annotation, Visualization and Integrated Discovery (DAVID) Bioinformatics Resources. PMID:23118487
Asamizu, E; Nakamura, Y; Sato, S; Tabata, S
2000-06-30
For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.
Kittas, Aristotelis; Delobelle, Aurélien; Schmitt, Sabrina; Breuhahn, Kai; Guziolowski, Carito; Grabe, Niels
2016-01-01
An effective means to analyze mRNA expression data is to take advantage of established knowledge from pathway databases, using methods such as pathway-enrichment analyses. However, pathway databases are not case-specific and expression data could be used to infer gene-regulation patterns in the context of specific pathways. In addition, canonical pathways may not always describe the signaling mechanisms properly, because interactions can frequently occur between genes in different pathways. Relatively few methods have been proposed to date for generating and analyzing such networks, preserving the causality between gene interactions and reasoning over the qualitative logic of regulatory effects. We present an algorithm (MCWalk) integrated with a logic programming approach, to discover subgraphs in large-scale signaling networks by random walks in a fully automated pipeline. As an exemplary application, we uncover the signal transduction mechanisms in a gene interaction network describing hepatocyte growth factor-stimulated cell migration and proliferation from gene-expression measured with microarray and RT-qPCR using in-house perturbation experiments in a keratinocyte-fibroblast co-culture. The resulting subgraphs illustrate possible associations of hepatocyte growth factor receptor c-Met nodes, differentially expressed genes and cellular states. Using perturbation experiments and Answer Set programming, we are able to select those which are more consistent with the experimental data. We discover key regulator nodes by measuring the frequency with which they are traversed when connecting signaling between receptors and significantly regulated genes and predict their expression-shift consistently with the measured data. The Java implementation of MCWalk is publicly available under the MIT license at: https://bitbucket.org/akittas/biosubg. © 2015 FEBS.
Leslie, Trent; Baucom, Regina S.
2014-01-01
Human-mediated selection can lead to rapid evolution in very short time scales, and the evolution of herbicide resistance in agricultural weeds is an excellent example of this phenomenon. The common morning glory, Ipomoea purpurea, is resistant to the herbicide glyphosate, but genetic investigations of this trait have been hampered by the lack of genomic resources for this species. Here, we present the annotated transcriptome of the common morning glory, Ipomoea purpurea, along with an examination of whole genome expression profiling to assess potential gene expression differences between three artificially selected herbicide resistant lines and three susceptible lines. The assembled Ipomoea transcriptome reported in this work contains 65,459 assembled transcripts, ~28,000 of which were functionally annotated by assignment to Gene Ontology categories. Our RNA-seq survey using this reference transcriptome identified 19 differentially expressed genes associated with resistance—one of which, a cytochrome P450, belongs to a large plant family of genes involved in xenobiotic detoxification. The differentially expressed genes also broadly implicated receptor-like kinases, which were down-regulated in the resistant lines, and other growth and defense genes, which were up-regulated in resistant lines. Interestingly, the target of glyphosate—EPSP synthase—was not overexpressed in the resistant Ipomoea lines as in other glyphosate resistant weeds. Overall, this work identifies potential candidate resistance loci for future investigations and dramatically increases genomic resources for this species. The assembled transcriptome presented herein will also provide a valuable resource to the Ipomoea community, as well as to those interested in utilizing the close relationship between the Convolvulaceae and the Solanaceae for phylogenetic and comparative genomics examinations. PMID:25155274
Leslie, Trent; Baucom, Regina S
2014-08-25
Human-mediated selection can lead to rapid evolution in very short time scales, and the evolution of herbicide resistance in agricultural weeds is an excellent example of this phenomenon. The common morning glory, Ipomoea purpurea, is resistant to the herbicide glyphosate, but genetic investigations of this trait have been hampered by the lack of genomic resources for this species. Here, we present the annotated transcriptome of the common morning glory, Ipomoea purpurea, along with an examination of whole genome expression profiling to assess potential gene expression differences between three artificially selected herbicide resistant lines and three susceptible lines. The assembled Ipomoea transcriptome reported in this work contains 65,459 assembled transcripts, ~28,000 of which were functionally annotated by assignment to Gene Ontology categories. Our RNA-seq survey using this reference transcriptome identified 19 differentially expressed genes associated with resistance-one of which, a cytochrome P450, belongs to a large plant family of genes involved in xenobiotic detoxification. The differentially expressed genes also broadly implicated receptor-like kinases, which were down-regulated in the resistant lines, and other growth and defense genes, which were up-regulated in resistant lines. Interestingly, the target of glyphosate-EPSP synthase-was not overexpressed in the resistant Ipomoea lines as in other glyphosate resistant weeds. Overall, this work identifies potential candidate resistance loci for future investigations and dramatically increases genomic resources for this species. The assembled transcriptome presented herein will also provide a valuable resource to the Ipomoea community, as well as to those interested in utilizing the close relationship between the Convolvulaceae and the Solanaceae for phylogenetic and comparative genomics examinations. Copyright © 2014 Leslie and Baucom.
Novakovic, Boris; Evain-Brion, Danièle; Murthi, Padma; Fournier, Thiery; Saffery, Richard
2017-06-01
Placental functioning relies on the appropriate differentiation of progenitor villous cytotrophoblasts (CTBs) into extravillous cytotrophoblasts (EVCTs), including invasive EVCTs, and the multinucleated syncytiotrophoblast (ST) layer. This is accompanied by a general move away from a proliferative, immature phenotype. Genome-scale expression studies have provided valuable insight into genes that are associated with the shift to both an invasive EVCT and ST phenotype, whereas genome-scale DNA methylation analysis has shown that differentiation to ST involves widespread methylation shifts, which are counteracted by low oxygen. In the current study, we sought to identify DNA methylation variation that is associated with transition from CTB to ST in vitro and from a noninvasive to invasive EVCT phenotype after culture on Matrigel. Of the several hundred differentially methylated regions that were identified in each comparison, the majority showed a loss of methylation with differentiation. This included a large differentially methylated region (DMR) in the gene body of death domain-associated protein 6 ( DAXX ), which lost methylation during both CTB syncytialization to ST and EVCT differentiation to invasive EVCT. Comparison to publicly available methylation array data identified the same DMR as among the most consistently differentially methylated genes in placental samples from preeclampsia pregnancies. Of interest, in vitro culture of CTB or ST in low oxygen increases methylation in the same region, which correlates with delayed differentiation. Analysis of combined epigenomics signatures confirmed DAXX DMR as a likely regulatory element, and direct gene expression analysis identified a positive association between methylation at this site and DAXX expression levels. The widespread dynamic nature of DAXX methylation in association with trophoblast differentiation and placenta-associated pathologies is consistent with an important role for this gene in proper placental development and function.-Novakovic, B., Evain-Brion, D., Murthi, P., Fournier, T., Saffery, R. Variable DAXX gene methylation is a common feature of placental trophoblast differentiation, preeclampsia, and response to hypoxia. © FASEB.
FUNDAMENTALS OF VITAMIN D HORMONE-REGULATED GENE EXPRESSION
Pike, J. Wesley; Meyer, Mark B.
2014-01-01
Initial research focused upon several known genetic targets provided early insight into the mechanism of action of the vitamin D hormone (1,25-dihydroxyvitamin D3 (1,25(OH)2D3)). Recently, however, a series of technical advances involving the coupling of chromatin immunoprecipitation (ChIP) to unbiased methodologies that initially involved tiled DNA microarrays (ChIP-chip analysis) and now Next Generation DNA Sequencing techniques (ChIP-Seq analysis) has opened new avenues of research into the mechanisms through which 1,25(OH)2D3 regulates gene expression. In this review, we summarize briefly the results of this early work and then focus on more recent studies in which ChIP-chip and ChIP-seq analyses have been used to explore the mechanisms of 1,25(OH)2D3 action on a genome-wide scale providing specific target genes as examples. The results of this work have advanced our understanding of the mechanisms involved at both genetic and epigenetic levels and have revealed a series of new principles through which the vitamin D hormone functions to control the expression of genes. PMID:24239506
Mechanisms of stable lipid loss in a social insect
Ament, Seth A.; Chan, Queenie W.; Wheeler, Marsha M.; Nixon, Scott E.; Johnson, S. Peir; Rodriguez-Zas, Sandra L.; Foster, Leonard J.; Robinson, Gene E.
2011-01-01
SUMMARY Worker honey bees undergo a socially regulated, highly stable lipid loss as part of their behavioral maturation. We used large-scale transcriptomic and proteomic experiments, physiological experiments and RNA interference to explore the mechanistic basis for this lipid loss. Lipid loss was associated with thousands of gene expression changes in abdominal fat bodies. Many of these genes were also regulated in young bees by nutrition during an initial period of lipid gain. Surprisingly, in older bees, which is when maximum lipid loss occurs, diet played less of a role in regulating fat body gene expression for components of evolutionarily conserved nutrition-related endocrine systems involving insulin and juvenile hormone signaling. By contrast, fat body gene expression in older bees was regulated more strongly by evolutionarily novel regulatory factors, queen mandibular pheromone (a honey bee-specific social signal) and vitellogenin (a conserved yolk protein that has evolved novel, maturation-related functions in the bee), independent of nutrition. These results demonstrate that conserved molecular pathways can be manipulated to achieve stable lipid loss through evolutionarily novel regulatory processes. PMID:22031746
Mechanisms of stable lipid loss in a social insect.
Ament, Seth A; Chan, Queenie W; Wheeler, Marsha M; Nixon, Scott E; Johnson, S Peir; Rodriguez-Zas, Sandra L; Foster, Leonard J; Robinson, Gene E
2011-11-15
Worker honey bees undergo a socially regulated, highly stable lipid loss as part of their behavioral maturation. We used large-scale transcriptomic and proteomic experiments, physiological experiments and RNA interference to explore the mechanistic basis for this lipid loss. Lipid loss was associated with thousands of gene expression changes in abdominal fat bodies. Many of these genes were also regulated in young bees by nutrition during an initial period of lipid gain. Surprisingly, in older bees, which is when maximum lipid loss occurs, diet played less of a role in regulating fat body gene expression for components of evolutionarily conserved nutrition-related endocrine systems involving insulin and juvenile hormone signaling. By contrast, fat body gene expression in older bees was regulated more strongly by evolutionarily novel regulatory factors, queen mandibular pheromone (a honey bee-specific social signal) and vitellogenin (a conserved yolk protein that has evolved novel, maturation-related functions in the bee), independent of nutrition. These results demonstrate that conserved molecular pathways can be manipulated to achieve stable lipid loss through evolutionarily novel regulatory processes.
Use of the Fluidigm C1 platform for RNA sequencing of single mouse pancreatic islet cells.
Xin, Yurong; Kim, Jinrang; Ni, Min; Wei, Yi; Okamoto, Haruka; Lee, Joseph; Adler, Christina; Cavino, Katie; Murphy, Andrew J; Yancopoulos, George D; Lin, Hsin Chieh; Gromada, Jesper
2016-03-22
This study provides an assessment of the Fluidigm C1 platform for RNA sequencing of single mouse pancreatic islet cells. The system combines microfluidic technology and nanoliter-scale reactions. We sequenced 622 cells, allowing identification of 341 islet cells with high-quality gene expression profiles. The cells clustered into populations of α-cells (5%), β-cells (92%), δ-cells (1%), and pancreatic polypeptide cells (2%). We identified cell-type-specific transcription factors and pathways primarily involved in nutrient sensing and oxidation and cell signaling. Unexpectedly, 281 cells had to be removed from the analysis due to low viability, low sequencing quality, or contamination resulting in the detection of more than one islet hormone. Collectively, we provide a resource for identification of high-quality gene expression datasets to help expand insights into genes and pathways characterizing islet cell types. We reveal limitations in the C1 Fluidigm cell capture process resulting in contaminated cells with altered gene expression patterns. This calls for caution when interpreting single-cell transcriptomics data using the C1 Fluidigm system.
Evans, Tyler G; Hammill, Edd; Kaukinen, Karia; Schulze, Angela D; Patterson, David A; English, Karl K; Curtis, Janelle M R; Miller, Kristina M
2011-11-01
Environmental shifts accompanying salmon spawning migrations from ocean feeding grounds to natal freshwater streams can be severe, with the underlying stress often cited as a cause of increased mortality. Here, a salmonid microarray was used to characterize changes in gene expression occurring between ocean and river habitats in gill and liver tissues of wild migrating sockeye salmon (Oncorhynchus nerka Walbaum) returning to spawn in the Fraser River, British Columbia, Canada. Expression profiles indicate that the transcriptome of migrating salmon is strongly affected by shifting abiotic and biotic conditions encountered along migration routes. Conspicuous shifts in gene expression associated with changing salinity, temperature, pathogen exposure and dissolved oxygen indicate that these environmental variables most strongly impact physiology during spawning migrations. Notably, transcriptional changes related to osmoregulation were largely preparatory and occurred well before salmon encountered freshwater. In the river environment, differential expression of genes linked with elevated temperatures indicated that thermal regimes within the Fraser River are approaching tolerance limits for adult salmon. To empirically correlate gene expression with survival, biopsy sampling of gill tissue and transcriptomic profiling were combined with telemetry. Many genes correlated with environmental variables were differentially expressed between premature mortalities and successful migrants. Parametric survival analyses demonstrated a broad-scale transcriptional regulator, cofactor required for Sp1 transcriptional activation (CRSP), to be significantly predictive of survival. As the environmental characteristics of salmon habitats continue to change, establishing how current environmental conditions influence salmon physiology under natural conditions is critical to conserving this ecologically and economically important fish species. © 2011 Blackwell Publishing Ltd.
Bao, Yun-Juan; Liang, Zhong; Mayfield, Jeffrey A; Lee, Shaun W; Ploplis, Victoria A; Castellino, Francis J
2015-10-01
The two-component control of virulence (Cov) regulator (R)-sensor (S) (CovRS) regulates the virulence of Streptococcus pyogenes (group A Streptococcus [GAS]). Inactivation of CovS during infection switches the pathogenicity of GAS to a more invasive form by regulating transcription of diverse virulence genes via CovR. However, the manner in which CovRS controls virulence through expression of extended gene families has not been fully determined. In the current study, the CovS-regulated gene expression profiles of a hypervirulent emm23 GAS strain (M23ND/CovS negative [M23ND/CovS(-)]) and a noninvasive isogenic strain (M23ND/CovS(+)), under different growth conditions, were investigated. RNA sequencing identified altered expression of ∼ 349 genes (18% of the chromosome). The data demonstrated that M23ND/CovS(-) achieved hypervirulence by allowing enhanced expression of genes responsible for antiphagocytosis (e.g., hasABC), by abrogating expression of toxin genes (e.g., speB), and by compromising gene products with dispensable functions (e.g., sfb1). Among these genes, several (e.g., parE and parC) were not previously reported to be regulated by CovRS. Furthermore, the study revealed that CovS also modulated the expression of a broad spectrum of metabolic genes that maximized nutrient utilization and energy metabolism during growth and dissemination, where the bacteria encounter large variations in available nutrients, thus restructuring metabolism of GAS for adaption to diverse growth environments. From constructing a genome-scale metabolic model, we identified 16 nonredundant metabolic gene modules that constitute unique nutrient sources. These genes were proposed to be essential for pathogen growth and are likely associated with GAS virulence. The genome-wide prediction of genes associated with virulence identifies new candidate genes that potentially contribute to GAS virulence. The CovRS system modulates transcription of ∼ 18% of the genes in the Streptococcus pyogenes genome. Mutations that inactivate CovR or CovS enhance the virulence of this bacterium. We determined complete transcriptomes of a naturally CovS-inactivated invasive deep tissue isolate of an emm23 strain of S. pyogenes (M23ND) and its complemented avirulent variant (CovS(+)). We identified diverse virulence genes whose altered expression revealed a genetic switching of a nonvirulent form of M23ND to a highly virulent strain. Furthermore, we also systematically uncovered for the first time the comparative levels of expression of a broad spectrum of metabolic genes, which reflected different metabolic needs of the bacterium as it invaded deeper tissue of the human host. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
GECKO: a complete large-scale gene expression analysis platform.
Theilhaber, Joachim; Ulyanov, Anatoly; Malanthara, Anish; Cole, Jack; Xu, Dapeng; Nahf, Robert; Heuer, Michael; Brockel, Christoph; Bushnell, Steven
2004-12-10
Gecko (Gene Expression: Computation and Knowledge Organization) is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing approximately 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph), in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (approximately 100 users) and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.
Pey, Jon; Valgepea, Kaspar; Rubio, Angel; Beasley, John E; Planes, Francisco J
2013-12-08
The study of cellular metabolism in the context of high-throughput -omics data has allowed us to decipher novel mechanisms of importance in biotechnology and health. To continue with this progress, it is essential to efficiently integrate experimental data into metabolic modeling. We present here an in-silico framework to infer relevant metabolic pathways for a particular phenotype under study based on its gene/protein expression data. This framework is based on the Carbon Flux Path (CFP) approach, a mixed-integer linear program that expands classical path finding techniques by considering additional biophysical constraints. In particular, the objective function of the CFP approach is amended to account for gene/protein expression data and influence obtained paths. This approach is termed integrative Carbon Flux Path (iCFP). We show that gene/protein expression data also influences the stoichiometric balancing of CFPs, which provides a more accurate picture of active metabolic pathways. This is illustrated in both a theoretical and real scenario. Finally, we apply this approach to find novel pathways relevant in the regulation of acetate overflow metabolism in Escherichia coli. As a result, several targets which could be relevant for better understanding of the phenomenon leading to impaired acetate overflow are proposed. A novel mathematical framework that determines functional pathways based on gene/protein expression data is presented and validated. We show that our approach is able to provide new insights into complex biological scenarios such as acetate overflow in Escherichia coli.
Kassir, Yona; Stuart, David T
2017-01-01
The budding yeast Saccharomyces cerevisiae has a long history as a model organism for studies of meiosis and the cell cycle. The popularity of this yeast as a model is in large part due to the variety of genetic and cytological approaches that can be effectively performed with the cells. Cultures of the cells can be induced to synchronously progress through meiosis and sporulation allowing large-scale gene expression and biochemical studies to be performed. Additionally, the spore tetrads resulting from meiosis make it possible to characterize the haploid products of meiosis allowing investigation of meiotic recombination and chromosome segregation. Here we describe genetic methods for analysis progression of S. cerevisiae through meiosis and sporulation with an emphasis on strategies for the genetic analysis of regulators of meiosis-specific genes.
Predicting selective drug targets in cancer through metabolic networks
Folger, Ori; Jerby, Livnat; Frezza, Christian; Gottlieb, Eyal; Ruppin, Eytan; Shlomi, Tomer
2011-01-01
The interest in studying metabolic alterations in cancer and their potential role as novel targets for therapy has been rejuvenated in recent years. Here, we report the development of the first genome-scale network model of cancer metabolism, validated by correctly identifying genes essential for cellular proliferation in cancer cell lines. The model predicts 52 cytostatic drug targets, of which 40% are targeted by known, approved or experimental anticancer drugs, and the rest are new. It further predicts combinations of synthetic lethal drug targets, whose synergy is validated using available drug efficacy and gene expression measurements across the NCI-60 cancer cell line collection. Finally, potential selective treatments for specific cancers that depend on cancer type-specific downregulation of gene expression and somatic mutations are compiled. PMID:21694718
The CRISPR/Cas9 system enables genome editing and somatic cell genetic screens in mammalian cells. We performed genome-scale loss-of-function screens in 33 cancer cell lines to identify genes essential for proliferation/survival and found a strong correlation between increased gene copy number and decreased cell viability after genome editing. Within regions of copy-number gain, CRISPR/Cas9 targeting of both expressed and unexpressed genes, as well as intergenic loci, led to significantly decreased cell proliferation through induction of a G2 cell-cycle arrest.
RNA Study Using DNA Nanotechnology.
Tadakuma, Hisashi; Masubuchi, Takeya; Ueda, Takuya
2016-01-01
Transcription is one of the fundamental steps of gene expression, where RNA polymerases (RNAPs) bind to their template genes and make RNAs. In addition to RNAP and the template gene, many molecules such as transcription factors are involved. The interaction and the effect of these factors depend on the geometry. Molecular layout of these factors, RNAP and gene is thus important. DNA nanotechnology is a promising technology that allows controlling of the molecular layout in the range of nanometer to micrometer scale with nanometer resolution; thus, it is expected to expand the RNA study beyond the current limit. Copyright © 2016 Elsevier Inc. All rights reserved.
Vukmirovic, Milica; Herazo-Maya, Jose D; Blackmon, John; Skodric-Trifunovic, Vesna; Jovanovic, Dragana; Pavlovic, Sonja; Stojsic, Jelena; Zeljkovic, Vesna; Yan, Xiting; Homer, Robert; Stefanovic, Branko; Kaminski, Naftali
2017-01-12
Idiopathic Pulmonary Fibrosis (IPF) is a lethal lung disease of unknown etiology. A major limitation in transcriptomic profiling of lung tissue in IPF has been a dependence on snap-frozen fresh tissues (FF). In this project we sought to determine whether genome scale transcript profiling using RNA Sequencing (RNA-Seq) could be applied to archived Formalin-Fixed Paraffin-Embedded (FFPE) IPF tissues. We isolated total RNA from 7 IPF and 5 control FFPE lung tissues and performed 50 base pair paired-end sequencing on Illumina 2000 HiSeq. TopHat2 was used to map sequencing reads to the human genome. On average ~62 million reads (53.4% of ~116 million reads) were mapped per sample. 4,131 genes were differentially expressed between IPF and controls (1,920 increased and 2,211 decreased (FDR < 0.05). We compared our results to differentially expressed genes calculated from a previously published dataset generated from FF tissues analyzed on Agilent microarrays (GSE47460). The overlap of differentially expressed genes was very high (760 increased and 1,413 decreased, FDR < 0.05). Only 92 differentially expressed genes changed in opposite directions. Pathway enrichment analysis performed using MetaCore confirmed numerous IPF relevant genes and pathways including extracellular remodeling, TGF-beta, and WNT. Gene network analysis of MMP7, a highly differentially expressed gene in both datasets, revealed the same canonical pathways and gene network candidates in RNA-Seq and microarray data. For validation by NanoString nCounter® we selected 35 genes that had a fold change of 2 in at least one dataset (10 discordant, 10 significantly differentially expressed in one dataset only and 15 concordant genes). High concordance of fold change and FDR was observed for each type of the samples (FF vs FFPE) with both microarrays (r = 0.92) and RNA-Seq (r = 0.90) and the number of discordant genes was reduced to four. Our results demonstrate that RNA sequencing of RNA obtained from archived FFPE lung tissues is feasible. The results obtained from FFPE tissue are highly comparable to FF tissues. The ability to perform RNA-Seq on archived FFPE IPF tissues should greatly enhance the availability of tissue biopsies for research in IPF.
Inferring Gene Regulatory Networks by Singular Value Decomposition and Gravitation Field Algorithm
Zheng, Ming; Wu, Jia-nan; Huang, Yan-xin; Liu, Gui-xia; Zhou, You; Zhou, Chun-guang
2012-01-01
Reconstruction of gene regulatory networks (GRNs) is of utmost interest and has become a challenge computational problem in system biology. However, every existing inference algorithm from gene expression profiles has its own advantages and disadvantages. In particular, the effectiveness and efficiency of every previous algorithm is not high enough. In this work, we proposed a novel inference algorithm from gene expression data based on differential equation model. In this algorithm, two methods were included for inferring GRNs. Before reconstructing GRNs, singular value decomposition method was used to decompose gene expression data, determine the algorithm solution space, and get all candidate solutions of GRNs. In these generated family of candidate solutions, gravitation field algorithm was modified to infer GRNs, used to optimize the criteria of differential equation model, and search the best network structure result. The proposed algorithm is validated on both the simulated scale-free network and real benchmark gene regulatory network in networks database. Both the Bayesian method and the traditional differential equation model were also used to infer GRNs, and the results were used to compare with the proposed algorithm in our work. And genetic algorithm and simulated annealing were also used to evaluate gravitation field algorithm. The cross-validation results confirmed the effectiveness of our algorithm, which outperforms significantly other previous algorithms. PMID:23226565
Zhu, Ying; Zhang, Yun-Xia; Liu, Wen-Wen; Ma, Yan; Fang, Qun; Yao, Bo
2015-04-01
This paper describes a nanoliter droplet array-based single-cell reverse transcription quantitative PCR (RT-qPCR) assay method for quantifying gene expression in individual cells. By sequentially printing nanoliter-scale droplets on microchip using a microfluidic robot, all liquid-handling operations including cell encapsulation, lysis, reverse transcription, and quantitative PCR with real-time fluorescence detection, can be automatically achieved. The inhibition effect of cell suspension buffer on RT-PCR assay was comprehensively studied to achieve high-sensitivity gene quantification. The present system was applied in the quantitative measurement of expression level of mir-122 in single Huh-7 cells. A wide distribution of mir-122 expression in single cells from 3061 copies/cell to 79998 copies/cell was observed, showing a high level of cell heterogeneity. With the advantages of full-automation in liquid-handling, simple system structure, and flexibility in achieving multi-step operations, the present method provides a novel liquid-handling mode for single cell gene expression analysis, and has significant potentials in transcriptional identification and rare cell analysis.
Zhu, Ying; Zhang, Yun-Xia; Liu, Wen-Wen; Ma, Yan; Fang, Qun; Yao, Bo
2015-01-01
This paper describes a nanoliter droplet array-based single-cell reverse transcription quantitative PCR (RT-qPCR) assay method for quantifying gene expression in individual cells. By sequentially printing nanoliter-scale droplets on microchip using a microfluidic robot, all liquid-handling operations including cell encapsulation, lysis, reverse transcription, and quantitative PCR with real-time fluorescence detection, can be automatically achieved. The inhibition effect of cell suspension buffer on RT-PCR assay was comprehensively studied to achieve high-sensitivity gene quantification. The present system was applied in the quantitative measurement of expression level of mir-122 in single Huh-7 cells. A wide distribution of mir-122 expression in single cells from 3061 copies/cell to 79998 copies/cell was observed, showing a high level of cell heterogeneity. With the advantages of full-automation in liquid-handling, simple system structure, and flexibility in achieving multi-step operations, the present method provides a novel liquid-handling mode for single cell gene expression analysis, and has significant potentials in transcriptional identification and rare cell analysis. PMID:25828383
DEIVA: a web application for interactive visual analysis of differential gene expression profiles.
Harshbarger, Jayson; Kratz, Anton; Carninci, Piero
2017-01-07
Differential gene expression (DGE) analysis is a technique to identify statistically significant differences in RNA abundance for genes or arbitrary features between different biological states. The result of a DGE test is typically further analyzed using statistical software, spreadsheets or custom ad hoc algorithms. We identified a need for a web-based system to share DGE statistical test results, and locate and identify genes in DGE statistical test results with a very low barrier of entry. We have developed DEIVA, a free and open source, browser-based single page application (SPA) with a strong emphasis on being user friendly that enables locating and identifying single or multiple genes in an immediate, interactive, and intuitive manner. By design, DEIVA scales with very large numbers of users and datasets. Compared to existing software, DEIVA offers a unique combination of design decisions that enable inspection and analysis of DGE statistical test results with an emphasis on ease of use.
A PCR primer bank for quantitative gene expression analysis.
Wang, Xiaowei; Seed, Brian
2003-12-15
Although gene expression profiling by microarray analysis is a useful tool for assessing global levels of transcriptional activity, variability associated with the data sets usually requires that observed differences be validated by some other method, such as real-time quantitative polymerase chain reaction (real-time PCR). However, non-specific amplification of non-target genes is frequently observed in the latter, confounding the analysis in approximately 40% of real-time PCR attempts when primer-specific labels are not used. Here we present an experimentally validated algorithm for the identification of transcript-specific PCR primers on a genomic scale that can be applied to real-time PCR with sequence-independent detection methods. An online database, PrimerBank, has been created for researchers to retrieve primer information for their genes of interest. PrimerBank currently contains 147 404 primers encompassing most known human and mouse genes. The primer design algorithm has been tested by conventional and real-time PCR for a subset of 112 primer pairs with a success rate of 98.2%.
Discovery of time-delayed gene regulatory networks based on temporal gene expression profiling
Li, Xia; Rao, Shaoqi; Jiang, Wei; Li, Chuanxing; Xiao, Yun; Guo, Zheng; Zhang, Qingpu; Wang, Lihong; Du, Lei; Li, Jing; Li, Li; Zhang, Tianwen; Wang, Qing K
2006-01-01
Background It is one of the ultimate goals for modern biological research to fully elucidate the intricate interplays and the regulations of the molecular determinants that propel and characterize the progression of versatile life phenomena, to name a few, cell cycling, developmental biology, aging, and the progressive and recurrent pathogenesis of complex diseases. The vast amount of large-scale and genome-wide time-resolved data is becoming increasing available, which provides the golden opportunity to unravel the challenging reverse-engineering problem of time-delayed gene regulatory networks. Results In particular, this methodological paper aims to reconstruct regulatory networks from temporal gene expression data by using delayed correlations between genes, i.e., pairwise overlaps of expression levels shifted in time relative each other. We have thus developed a novel model-free computational toolbox termed TdGRN (Time-delayed Gene Regulatory Network) to address the underlying regulations of genes that can span any unit(s) of time intervals. This bioinformatics toolbox has provided a unified approach to uncovering time trends of gene regulations through decision analysis of the newly designed time-delayed gene expression matrix. We have applied the proposed method to yeast cell cycling and human HeLa cell cycling and have discovered most of the underlying time-delayed regulations that are supported by multiple lines of experimental evidence and that are remarkably consistent with the current knowledge on phase characteristics for the cell cyclings. Conclusion We established a usable and powerful model-free approach to dissecting high-order dynamic trends of gene-gene interactions. We have carefully validated the proposed algorithm by applying it to two publicly available cell cycling datasets. In addition to uncovering the time trends of gene regulations for cell cycling, this unified approach can also be used to study the complex gene regulations related to the development, aging and progressive pathogenesis of a complex disease where potential dependences between different experiment units might occurs. PMID:16420705
Complex modulation of androgen responsive gene expression by methoxyacetic acid
2011-01-01
Background Optimal androgen signaling is critical for testicular development and spermatogenesis. Methoxyacetic acid (MAA), the primary active metabolite of the industrial chemical ethylene glycol monomethyl ether, disrupts spermatogenesis and causes testicular atrophy. Transcriptional trans-activation studies have indicated that MAA can enhance androgen receptor activity, however, whether MAA actually impacts the expression of androgen-responsive genes in vivo, and which genes might be affected is not known. Methods A mouse TM3 Leydig cell line that stably expresses androgen receptor (TM3-AR) was prepared and analyzed by transcriptional profiling to identify target gene interactions between MAA and testosterone on a global scale. Results MAA is shown to have widespread effects on androgen-responsive genes, affecting processes ranging from apoptosis to ion transport, cell adhesion, phosphorylation and transcription, with MAA able to enhance, as well as antagonize, androgenic responses. Moreover, testosterone is shown to exert both positive and negative effects on MAA gene responses. Motif analysis indicated that binding sites for FOX, HOX, LEF/TCF, STAT5 and MEF2 family transcription factors are among the most highly enriched in genes regulated by testosterone and MAA. Notably, 65 FOXO targets were repressed by testosterone or showed repression enhanced by MAA with testosterone; these include 16 genes associated with developmental processes, six of which are Hox genes. Conclusions These findings highlight the complex interactions between testosterone and MAA, and provide insight into the effects of MAA exposure on androgen-dependent processes in a Leydig cell model. PMID:21453523
Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm
Seaver, Samuel M. D.; Bradbury, Louis M. T.; Frelin, Océane; Zarecki, Raphy; Ruppin, Eytan; Hanson, Andrew D.; Henry, Christopher S.
2015-01-01
There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions and possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes. PMID:25806041
Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm
Seaver, Samuel M.D.; Bradbury, Louis M.T.; Frelin, Océane; ...
2015-03-10
There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions andmore » possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes.« less
Particle Radiation signals the Expression of Genes in stress-associated Pathways
NASA Astrophysics Data System (ADS)
Blakely, E.; Chang, P.; Bjornstad, K.; Dosanjh, M.; Cherbonnel, C.; Rosen, C.
The explosive development of microarray screening methods has propelled genome research in a variety of biological systems allowing investigators to examine large-scale alterations in gene expression for research in toxicology pathology and therapy The radiation environment in space is complex and encompasses a variety of highly energetic and charged particles Estimation of biological responses after exposure to these types of radiation is important for NASA in their plans for long-term manned space missions Instead of using the 10 000 gene arrays that are in the marketplace we have chosen to examine particle radiation-induced changes in gene expression using a focused DNA microarray system to study the expression of about 100 genes specifically associated with both the upstream and downstream aspects of the TP53 stress-responsive pathway Genes that are regulated by TP53 include functional clusters that are implicated in cell cycle arrest apoptosis and DNA repair A cultured human lens epithelial cell model Blakely et al IOVS 41 3808 2000 was used for these studies Additional human normal and radiosensitive fibroblast cell lines have also been examined Lens cells were grown on matrix-coated substrate and exposed to 55 MeV u protons at the 88 cyclotron in LBNL or 1 GeV u Iron ions at the NASA Space Radiation Laboratory The other cells lines were grown on conventional tissue culture plasticware RNA and proteins were harvested at different times after irradiation RNA was isolated from sham-treated or select irradiated populations
Han, F; Liu, Y; Guo, L Q; Zeng, X L; Liu, Z M; Lin, J F
2010-11-01
FIP-gsi, a fungal immunomodulatory protein found in Ganoderma sinense, has antitumour, anti-allergy and immunomodulatory activities and is regulated by the fip-gsi gene. In this study, we aimed to express the fip-gsi gene from G. sinense in Coprinopsis cinerea to increase yield of FIPs-gsi. A fungal expression vector pBfip-gsi containing the gpd promoter from Agaricus bisporus and the fip-gsi gene from the G. sinense was constructed and transformed into C. cinerea. PCR and Southern blotting analysis verified the successful integration of the exogenous gene fip-gsi into the genome of C. cinerea. RT-PCR and Northern blotting analysis confirmed that the fip-gsi gene was transcribed in C. cinerea. The yield of the FIP-gsi protein reached 314mg kg(-1) fresh mycelia. The molecular weight of the FIP-gsi was 13kDa, and the FIP-gsi was capable of hemagglutinating mouse red blood cells, but no such activity was observed towards human red blood cells in vitro. The fip-gsi from G. sinense has been successfully translated in C. cinerea, and the yield of bioactive FIP-gsi protein was high. This is the first report using the C. cinerea for the heterologous expression of FIP-gsi protein and it might supply a basis for large-scale production of the protein. © 2010 The Authors. Journal of Applied Microbiology © 2010 The Society for Applied Microbiology.
Wei, Chuan-Bao; Wei, Yang-Yang; Yang, Yu; Liu, Shi-Liang; Hu, Hao-Yu; He, Yue
2011-10-01
To prepare antiserum against Fritillary virus Y (FVY) CP for detecting FVY and study serological relationships with other viruses. Specific primer was designed according to Genbank (accession: AM039800) to amplify CP gene of FVY infecting Thunberg fritillary. Sequence relationship with other potyviruses was made by Blast. The CP gene was inserted into pSBET and expressed in Escherichia coli BL21 (DE3) plys E strain. The object protein was purified by 12% SDS-PAGE firstly and subsequently 5% - 20% gradient SDS-PAGE. The antiserum against the CP was raised in mouse and its specificity was confirmed by Western blot analysis. The reactivity of the antiserum produced to FVY CP was tested by Western blot against the over-expressed coat proteins of 17 potyviruses. The ability to combine with nature FVY particles was confirmed by ELISA analysis. It shared 81.2% nucleotide acids identities with TrVY (Tricyrtis virus Y, AY 864850) CP gene, 68.1% with SMV-P (Soybean mosaic virus Pinellia strain, AJ507388. 2) CP gene and 67.2% with ZYMV (Zucchini yellow mosaic virus Luan isolate) CP gene. The prepared antiserum was special to FVY CP, also reacted moderately to the expressed CP of SMV-P (Soybean mosaic virus Pinellia strain) and weakly to that of ZYMV (Zucchini yellow mosaic virus Luan isolate). The antibody could combine to nature FVY particles and the antiserum is suitable for FVY detection by ELISA in large scale.
Anatskaya, Olga V; Vinogradov, Alexander E
2007-01-01
To elucidate the functional significance of genome multiplication in somatic tissues, we performed a large-scale analysis of ploidy-associated changes in expression of non-tissue-specific (i.e., broadly expressed) genes in the heart and liver of human and mouse (6585 homologous genes were analyzed). These species have inverse patterns of polyploidization in cardiomyocytes and hepatocytes. The between-species comparison of two pairs of homologous tissues with crisscross contrast in ploidy levels allows the removal of the effects of species and tissue specificity on the profile of gene activity. The different tests performed from the standpoint of modular biology revealed a consistent picture of ploidy-associated alteration in a wide range of functional gene groups. The major effects consisted of hypoxia-inducible factor-triggered changes in main cellular processes and signaling pathways, activation of defense against DNA lesions, acceleration of protein turnover and transcription, and the impairment of apoptosis, the immune response, and cytoskeleton maintenance. We also found a severe decline in aerobic respiration and stimulation of sugar and fatty acid metabolism. These metabolic rearrangements create a special type of metabolism that can be considered intermediate between aerobic and anaerobic. The metabolic and physiological changes revealed (reflected in the alteration of gene expression) help explain the unique ability of polyploid tissues to combine proliferation and differentiation, which are separated in diploid tissues. We argue that genome multiplication promotes cell survival and tissue regeneration under stressful conditions.
NIBBS-search for fast and accurate prediction of phenotype-biased metabolic systems.
Schmidt, Matthew C; Rocha, Andrea M; Padmanabhan, Kanchana; Shpanskaya, Yekaterina; Banfield, Jill; Scott, Kathleen; Mihelcic, James R; Samatova, Nagiza F
2012-01-01
Understanding of genotype-phenotype associations is important not only for furthering our knowledge on internal cellular processes, but also essential for providing the foundation necessary for genetic engineering of microorganisms for industrial use (e.g., production of bioenergy or biofuels). However, genotype-phenotype associations alone do not provide enough information to alter an organism's genome to either suppress or exhibit a phenotype. It is important to look at the phenotype-related genes in the context of the genome-scale network to understand how the genes interact with other genes in the organism. Identification of metabolic subsystems involved in the expression of the phenotype is one way of placing the phenotype-related genes in the context of the entire network. A metabolic system refers to a metabolic network subgraph; nodes are compounds and edges labels are the enzymes that catalyze the reaction. The metabolic subsystem could be part of a single metabolic pathway or span parts of multiple pathways. Arguably, comparative genome-scale metabolic network analysis is a promising strategy to identify these phenotype-related metabolic subsystems. Network Instance-Based Biased Subgraph Search (NIBBS) is a graph-theoretic method for genome-scale metabolic network comparative analysis that can identify metabolic systems that are statistically biased toward phenotype-expressing organismal networks. We set up experiments with target phenotypes like hydrogen production, TCA expression, and acid-tolerance. We show via extensive literature search that some of the resulting metabolic subsystems are indeed phenotype-related and formulate hypotheses for other systems in terms of their role in phenotype expression. NIBBS is also orders of magnitude faster than MULE, one of the most efficient maximal frequent subgraph mining algorithms that could be adjusted for this problem. Also, the set of phenotype-biased metabolic systems output by NIBBS comes very close to the set of phenotype-biased subgraphs output by an exact maximally-biased subgraph enumeration algorithm ( MBS-Enum ). The code (NIBBS and the module to visualize the identified subsystems) is available at http://freescience.org/cs/NIBBS.
NIBBS-Search for Fast and Accurate Prediction of Phenotype-Biased Metabolic Systems
Padmanabhan, Kanchana; Shpanskaya, Yekaterina; Banfield, Jill; Scott, Kathleen; Mihelcic, James R.; Samatova, Nagiza F.
2012-01-01
Understanding of genotype-phenotype associations is important not only for furthering our knowledge on internal cellular processes, but also essential for providing the foundation necessary for genetic engineering of microorganisms for industrial use (e.g., production of bioenergy or biofuels). However, genotype-phenotype associations alone do not provide enough information to alter an organism's genome to either suppress or exhibit a phenotype. It is important to look at the phenotype-related genes in the context of the genome-scale network to understand how the genes interact with other genes in the organism. Identification of metabolic subsystems involved in the expression of the phenotype is one way of placing the phenotype-related genes in the context of the entire network. A metabolic system refers to a metabolic network subgraph; nodes are compounds and edges labels are the enzymes that catalyze the reaction. The metabolic subsystem could be part of a single metabolic pathway or span parts of multiple pathways. Arguably, comparative genome-scale metabolic network analysis is a promising strategy to identify these phenotype-related metabolic subsystems. Network Instance-Based Biased Subgraph Search (NIBBS) is a graph-theoretic method for genome-scale metabolic network comparative analysis that can identify metabolic systems that are statistically biased toward phenotype-expressing organismal networks. We set up experiments with target phenotypes like hydrogen production, TCA expression, and acid-tolerance. We show via extensive literature search that some of the resulting metabolic subsystems are indeed phenotype-related and formulate hypotheses for other systems in terms of their role in phenotype expression. NIBBS is also orders of magnitude faster than MULE, one of the most efficient maximal frequent subgraph mining algorithms that could be adjusted for this problem. Also, the set of phenotype-biased metabolic systems output by NIBBS comes very close to the set of phenotype-biased subgraphs output by an exact maximally-biased subgraph enumeration algorithm ( MBS-Enum ). The code (NIBBS and the module to visualize the identified subsystems) is available at http://freescience.org/cs/NIBBS. PMID:22589706
Corrales-Guerrero, Laura; Tal, Asaf; Arbel-Goren, Rinat; Mariscal, Vicente; Flores, Enrique; Herrero, Antonia; Stavans, Joel
2015-04-01
Under nitrogen deprivation, filaments of the cyanobacterium Anabaena undergo a process of development, resulting in a one-dimensional pattern of nitrogen-fixing heterocysts separated by about ten photosynthetic vegetative cells. Many aspects of gene expression before nitrogen deprivation and during the developmental process remain to be elucidated. Furthermore, the coupling of gene expression fluctuations between cells along a multicellular filament is unknown. We studied the statistics of fluctuations of gene expression of HetR, a transcription factor essential for heterocyst differentiation, both under steady-state growth in nitrogen-rich conditions and at different times following nitrogen deprivation, using a chromosomally-encoded translational hetR-gfp fusion. Statistical analysis of fluorescence at the individual cell level in wild-type and mutant filaments demonstrates that expression fluctuations of hetR in nearby cells are coupled, with a characteristic spatial range of circa two to three cells, setting the scale for cellular interactions along a filament. Correlations between cells predominantly arise from intercellular molecular transfer and less from cell division. Fluctuations after nitrogen step-down can build up on those under nitrogen-replete conditions. We found that under nitrogen-rich conditions, basal, steady-state expression of the HetR inhibitor PatS, cell-cell communication influenced by the septal protein SepJ and positive HetR auto-regulation are essential determinants of fluctuations in hetR expression and its distribution along filaments. A comparison between the expression of hetR-gfp under nitrogen-rich and nitrogen-poor conditions highlights the differences between the two HetR inhibitors PatS and HetN, as well as the differences in specificity between the septal proteins SepJ and FraC/FraD. Activation, inhibition and cell-cell communication lie at the heart of developmental processes. Our results show that proteins involved in these basic ingredients combine together in the presence of inevitable stochasticity in gene expression, to control the coupled fluctuations of gene expression that give rise to a one-dimensional developmental pattern in this organism.
Scale-up of recombinant Opc protein production in Escherichia coli for a meningococcal vaccine.
Pérez, Raúl Espinosa; Lasa, Alexis Musacchio; Rodríguez, Ricardo Silva; Menéndez, Evelin Caballero; Suárez, José García; Balaguer, Héctor Díaz
2006-12-15
Opc is an outer membrane protein from Neisseria meningitidis present in meningococcal vaccine preparations. The opc gene, codifying for this protein, was cloned in to Escherichia coli and the Opc protein was expressed under the control of a tryptophan promoter. The recombinant strain was grown in batch cultures. Opc was expressed as inclusion bodies at about 32% of the total cellular protein. We examined the scale-up culture conditions for the production of the recombinant Opc. The scale-up process was performed from 1.5 l to 50 l culture, using first, the constant power per unit of volume (P/V) as main scaling criteria, and then the oxygen mass transfer coefficient (K(L)a) scaling criteria to adjust the optimal aeration conditions. A final productivity of 52 mgl(-1)h(-1) was obtained at the 50l culture scale compared with the 49 mgl(-1)h(-1) productivity at 1.5l laboratory scale.
Larter, Maximilian; Dunbar-Wallis, Amy; Berardi, Andrea E; Smith, Stacey D
2018-06-07
The predictability of evolution, or whether lineages repeatedly follow the same evolutionary trajectories during phenotypic convergence remains an open question of evolutionary biology. In this study, we investigate evolutionary convergence at the biochemical pathway level and test the predictability of evolution using floral anthocyanin pigmentation, a trait with a well-understood genetic and regulatory basis. We reconstructed the evolution of floral anthocyanin content across 28 species of the Andean clade Iochrominae (Solanaceae) and investigated how shifts in pigmentation are related to changes in expression of 7 key anthocyanin pathway genes. We used phylogenetic multivariate analysis of gene expression to test for phenotypic and developmental convergence at a macroevolutionary scale. Our results show that the four independent losses of the ancestral pigment delphinidin involved convergent losses of expression of the three late pathway genes (F3'5'h, Dfr and Ans). Transitions between pigment types affecting floral hue (e.g. blue to red) involve changes to the expression of branching genes F3'h and F3'5'h, while the expression levels of early steps of the pathway are strongly conserved in all species. These patterns support the idea that the macroevolution of floral pigmentation follows predictable evolutionary trajectories to reach convergent phenotype space, repeatedly involving regulatory changes. This is likely driven by constraints at the pathway level, such as pleiotropy and regulatory structure.
A proposed metric for assessing the measurement quality of individual microarrays
Kim, Kyoungmi; Page, Grier P; Beasley, T Mark; Barnes, Stephen; Scheirer, Katherine E; Allison, David B
2006-01-01
Background High-density microarray technology is increasingly applied to study gene expression levels on a large scale. Microarray experiments rely on several critical steps that may introduce error and uncertainty in analyses. These steps include mRNA sample extraction, amplification and labeling, hybridization, and scanning. In some cases this may be manifested as systematic spatial variation on the surface of microarray in which expression measurements within an individual array may vary as a function of geographic position on the array surface. Results We hypothesized that an index of the degree of spatiality of gene expression measurements associated with their physical geographic locations on an array could indicate the summary of the physical reliability of the microarray. We introduced a novel way to formulate this index using a statistical analysis tool. Our approach regressed gene expression intensity measurements on a polynomial response surface of the microarray's Cartesian coordinates. We demonstrated this method using a fixed model and presented results from real and simulated datasets. Conclusion We demonstrated the potential of such a quantitative metric for assessing the reliability of individual arrays. Moreover, we showed that this procedure can be incorporated into laboratory practice as a means to set quality control specifications and as a tool to determine whether an array has sufficient quality to be retained in terms of spatial correlation of gene expression measurements. PMID:16430768
Huang, Jinjin; Xia, Ji; Yang, Zhen; Guan, Feifei; Cui, Di; Guan, Guohua; Jiang, Wei; Li, Ying
2014-01-01
We previously cloned a 1,3-specific lipase gene from the fungus Rhizomucor miehei and expressed it in methylotrophic yeast Pichia pastoris strain GS115. The enzyme produced (termed RML) was able to catalyze methanolysis of soybean oil and showed strong position specificity. However, the enzyme activity and amount of enzyme produced were not adequate for industrial application. Our goal in the present study was to improve the enzyme properties of RML in order to apply it for the conversion of microalgae oil to biofuel. Several new expression plasmids were constructed by adding the propeptide of the target gene, optimizing the signal peptide, and varying the number of target gene copies. Each plasmid was transformed separately into P. pastoris strain X-33. Screening by flask culture showed maximal (21.4-fold increased) enzyme activity for the recombinant strain with two copies of the target gene; the enzyme was termed Lipase GH2. The expressed protein with the propeptide (pRML) was a stable glycosylated protein, because of glycosylation sites in the propeptide. Quantitative real-time RT-PCR analysis revealed two major reasons for the increase in enzyme activity: (1) the modified recombinant expression system gave an increased transcription level of the target gene (rml), and (2) the enzyme was suitable for expression in host cells without causing endoplasmic reticulum (ER) stress. The modified enzyme had improved thermostability and methanol or ethanol tolerance, and was applicable directly as free lipase (fermentation supernatant) in the catalytic esterification and transesterification reaction. After reaction for 24 hours at 30°C, the conversion rate of microalgae oil to biofuel was above 90%. Our experimental results show that signal peptide optimization in the expression plasmid, addition of the gene propeptide, and proper gene dosage significantly increased RML expression level and enhanced the enzymatic properties. The target enzyme was the major component of fermentation supernatant and was stable for over six months at 4°C. The modified free lipase is potentially applicable for industrial-scale conversion of microalgae oil to biodiesel.
ExpressionDB: An open source platform for distributing genome-scale datasets.
Hughes, Laura D; Lewis, Scott A; Hughes, Michael E
2017-01-01
RNA-sequencing (RNA-seq) and microarrays are methods for measuring gene expression across the entire transcriptome. Recent advances have made these techniques practical and affordable for essentially any laboratory with experience in molecular biology. A variety of computational methods have been developed to decrease the amount of bioinformatics expertise necessary to analyze these data. Nevertheless, many barriers persist which discourage new labs from using functional genomics approaches. Since high-quality gene expression studies have enduring value as resources to the entire research community, it is of particular importance that small labs have the capacity to share their analyzed datasets with the research community. Here we introduce ExpressionDB, an open source platform for visualizing RNA-seq and microarray data accommodating virtually any number of different samples. ExpressionDB is based on Shiny, a customizable web application which allows data sharing locally and online with customizable code written in R. ExpressionDB allows intuitive searches based on gene symbols, descriptions, or gene ontology terms, and it includes tools for dynamically filtering results based on expression level, fold change, and false-discovery rates. Built-in visualization tools include heatmaps, volcano plots, and principal component analysis, ensuring streamlined and consistent visualization to all users. All of the scripts for building an ExpressionDB with user-supplied data are freely available on GitHub, and the Creative Commons license allows fully open customization by end-users. We estimate that a demo database can be created in under one hour with minimal programming experience, and that a new database with user-supplied expression data can be completed and online in less than one day.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kenny, Paraic A.; Lee, Genee Y.; Myers, Connie A.
2007-01-31
3D cell cultures are rapidly becoming the method of choice for the physiologically relevant modeling of many aspects of non-malignant and malignant cell behavior ex vivo. Nevertheless, only a limited number of distinct cell types have been evaluated in this assay to date. Here we report the first large scale comparison of the transcriptional profiles and 3D cell culture phenotypes of a substantial panel of human breast cancer cell lines. Each cell line adopts a colony morphology of one of four main classes in 3D culture. These morphologies reflect, at least in part, the underlying gene expression profile and proteinmore » expression patterns of the cell lines, and distinct morphologies were also associated with tumor cell invasiveness and with cell lines originating from metastases. We further demonstrate that consistent differences in genes encoding signal transduction proteins emerge when even tumor cells are cultured in 3D microenvironments.« less
Wang, Zhuo; Jin, Shuilin; Liu, Guiyou; Zhang, Xiurui; Wang, Nan; Wu, Deliang; Hu, Yang; Zhang, Chiping; Jiang, Qinghua; Xu, Li; Wang, Yadong
2017-05-23
The development of single-cell RNA sequencing has enabled profound discoveries in biology, ranging from the dissection of the composition of complex tissues to the identification of novel cell types and dynamics in some specialized cellular environments. However, the large-scale generation of single-cell RNA-seq (scRNA-seq) data collected at multiple time points remains a challenge to effective measurement gene expression patterns in transcriptome analysis. We present an algorithm based on the Dynamic Time Warping score (DTWscore) combined with time-series data, that enables the detection of gene expression changes across scRNA-seq samples and recovery of potential cell types from complex mixtures of multiple cell types. The DTWscore successfully classify cells of different types with the most highly variable genes from time-series scRNA-seq data. The study was confined to methods that are implemented and available within the R framework. Sample datasets and R packages are available at https://github.com/xiaoxiaoxier/DTWscore .
Evolution and Expression of Tissue Globins in Ray-Finned Fishes
Gallagher, Michael D.
2017-01-01
The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great-unrecognized diversity in the repertoire and expression of nonblood globins that has arisen during ray-finned fish evolution. PMID:28173090
Wu, Qingjun; Wang, Shaoli; Xie, Wen; Zhu, Xun; Baxter, Simon W.; Zhou, Xuguo; Jurat-Fuentes, Juan Luis; Zhang, Youjun
2015-01-01
Insecticidal crystal toxins derived from the soil bacterium Bacillus thuringiensis (Bt) are widely used as biopesticide sprays or expressed in transgenic crops to control insect pests. However, large-scale use of Bt has led to field-evolved resistance in several lepidopteran pests. Resistance to Bt Cry1Ac toxin in the diamondback moth, Plutella xylostella (L.), was previously mapped to a multigenic resistance locus (BtR-1). Here, we assembled the 3.15 Mb BtR-1 locus and found high-level resistance to Cry1Ac and Bt biopesticide in four independent P. xylostella strains were all associated with differential expression of a midgut membrane-bound alkaline phosphatase (ALP) outside this locus and a suite of ATP-binding cassette transporter subfamily C (ABCC) genes inside this locus. The interplay between these resistance genes is controlled by a previously uncharacterized trans-regulatory mechanism via the mitogen-activated protein kinase (MAPK) signaling pathway. Molecular, biochemical, and functional analyses have established ALP as a functional Cry1Ac receptor. Phenotypic association experiments revealed that the recessive Cry1Ac resistance was tightly linked to down-regulation of ALP, ABCC2 and ABCC3, whereas it was not linked to up-regulation of ABCC1. Silencing of ABCC2 and ABCC3 in susceptible larvae reduced their susceptibility to Cry1Ac but did not affect the expression of ALP, whereas suppression of MAP4K4, a constitutively transcriptionally-activated MAPK upstream gene within the BtR-1 locus, led to a transient recovery of gene expression thereby restoring the susceptibility in resistant larvae. These results highlight a crucial role for ALP and ABCC genes in field-evolved resistance to Cry1Ac and reveal a novel trans-regulatory signaling mechanism responsible for modulating the expression of these pivotal genes in P. xylostella. PMID:25875245
Construction and application of a bovine immune-endocrine cDNA microarray.
Tao, Wenjing; Mallard, Bonnie; Karrow, Niel; Bridle, Byram
2004-09-01
A variety of commercial DNA arrays specific for humans and rodents are widely available; however, microarrays containing well-characterized genes to study pathway-specific gene expression are not as accessible for domestic animals, such as cattle, sheep and pigs. Therefore, a small-scale application-targeted bovine immune-endocrine cDNA array was developed to evaluate genetic pathways involved in the immune-endocrine axis of cattle during periods of altered homeostasis provoked by physiological or environmental stressors, such as infection, vaccination or disease. For this purpose, 167 cDNA sequences corresponding to immune, endocrine and inflammatory response genes were collected and categorized. Positive controls included 5 housekeeping genes (glyceraldehydes-3-phosphate dehydrogenase, hypoxanthine phosphoribosyltransferase, ribosomal protein L19, beta-actin, beta2-microglobulin) and bovine genomic DNA. Negative controls were a bacterial gene (Rhodococcus equi 17-kDa virulence-associated protein) and a partial sequence of the plasmid pACYC177. In addition, RNA extracted from un-stimulated, as well as superantigen (Staphylococcus aureus enterotoxin-A, S. aureus Cowan Pansorbin Cells) and mitogen-stimulated (LPS, ConA) bovine blood leukocytes was mixed, reverse transcribed and PCR amplified using gene-specific primers. The endocrine-associated genes were amplified from cDNA derived from un-stimulated bovine hypothalamus, pituitary, adrenal and thyroid gland tissues. The array was constructed in 4 repeating grids of 180 duplicated spots by coupling the PCR amplified 213-630 bp gene fragments onto poly-l-lysine coated glass slides. The bovine immune-endocrine arrays were standardized and preliminary gene expression profiles generated using Cy3 and Cy5 labelled cDNA from un-stimulated and ConA (5 microg/ml) stimulated PBMC of 4 healthy Holstein cows (2-4 replicate arrays/cow) in a time course study. Mononuclear cell-derived cytokine and chemokine (IL-2, IL-1alpha, TNFalpha, IFN-gamma, TGFbeta-1, MCP-1, MCP-2 and MIP-3alpha) mRNA exhibited a repeatable and consistently low expression in un-stimulated cells and at least a two-fold increased expression following 6 and 24 h ConA stimulation as compared to 0 h un-stimulated controls. In contrast, expression of antigen presenting molecules, MHC-DR, MHC-DQ and MHC-DY, were consistently at least two-fold lower following 6 and 24 h ConA stimulation. The only endocrine gene with differential expression following ConA stimulation was prolactin. Additionally, due to the high level of genetic homology between ovine, swine and bovine genes, RNA similarly acquired from sheep and pigs was evaluated and similar gene expression patterns were noted. These data demonstrate that this application-targeted array containing a set of well characterized genes can be used to determine the relative gene expression corresponding to immune-endocrine responses of cattle and related species, sheep and pigs.
Seo, Sang Woo; Gao, Ye; Kim, Donghyuk; Szubin, Richard; Yang, Jina; Cho, Byung-Kwan; Palsson, Bernhard O
2017-05-19
A transcription factor (TF), OmpR, plays a critical role in transcriptional regulation of the osmotic stress response in bacteria. Here, we reveal a genome-scale OmpR regulon in Escherichia coli K-12 MG1655. Integrative data analysis reveals that a total of 37 genes in 24 transcription units (TUs) belong to OmpR regulon. Among them, 26 genes show more than two-fold changes in expression level in an OmpR knock-out strain. Specifically, we find that: 1) OmpR regulates mostly membrane-located gene products involved in diverse fundamental biological processes, such as narU (encoding nitrate/nitrite transporter), ompX (encoding outer membrane protein X), and nuoN (encoding NADH:ubiquinone oxidoreductase); 2) by investigating co-regulation of entire sets of genes regulated by other stress-response TFs, stresses are surprisingly independently regulated among each other; and, 3) a detailed investigation of the physiological roles of the newly discovered OmpR regulon genes reveals that activation of narU represents a novel strategy to significantly improve osmotic stress tolerance of E. coli. Thus, the genome-scale approach to elucidating regulons comprehensively identifies regulated genes and leads to fundamental discoveries related to stress responses.
Wang, Chenggang; Du, Xuezhu; Mou, Zhonglin
2016-01-01
Mediator is a highly conserved protein complex that functions as a transcriptional coactivator in RNA polymerase II (RNAPII)-mediated transcription. The Arabidopsis Mediator complex has recently been implicated in plant immune responses. Here, we compared salicylic acid (SA)-, methyl jasmonate (MeJA)-, and the ethylene (ET) precursor 1-aminocyclopropane-1-carboxylic acid (ACC)-induced defense and/or wound-responsive gene expression in 14 Arabidopsis Mediator subunit mutants. Our results show that MED14, MED15, and MED16 are required for SA-activated expression of the defense marker gene PATHOEGNESIS-RELATED GENE1 , MED25 is required for MeJA-induced expression of the wound-responsive marker gene VEGATATIVE STORAGE PROTEIN1 ( VSP1 ), MED8, MED14, MED15, MED16, MED18, MED20a, MED25, MED31, and MED33A/B (MED33a and MED33B) are required for MeJA-induced expression of the defense maker gene PLANT DEFENSIN1.2 ( PDF1.2 ), and MED8, MED14, MED15, MED16, MED25, and MED33A/B are also required for ACC-triggered expression of PDF1.2 . Furthermore, we investigated the involvement of MED14, MED15, and MED16 in plant defense signaling crosstalk and found that MED14, MED15, and MED16 are required for SA- and ET-mediated suppression of MeJA-induced VSP1 expression. This result suggests that MED14, MED15, and MED16 not only relay defense signaling from the SA and JA/ET defense pathways to the RNAPII transcription machinery, but also fine-tune defense signaling crosstalk. Finally, we show that MED33A/B contributes to the necrotrophic fungal pathogen Botrytis cinerea- induced expression of the defense genes PDF1.2, HEVEIN-LIKE , and BASIC CHITINASE and is required for full-scale basal resistance to B. cinerea , demonstrating a positive role for MED33 in plant immunity against necrotrophic fungal pathogens.
Fibroblast extracellular matrix gene expression in response to keratinocyte-releasable stratifin.
Ghaffari, Abdi; Li, Yunyaun; Karami, Ali; Ghaffari, Mazyar; Tredget, Edward E; Ghahary, Aziz
2006-05-15
Termination of wound-healing process requires a fine balance between connective tissue deposition and its hydrolysis. Previously, we have demonstrated that keratinocyte-releasable stratifin, also known as 14-3-3 sigma protein, stimulates collagenase (MMP-1) expression in dermal fibroblasts. However, role of extracellular stratifin in regulation of extracellular matrix (ECM) factors and other matrix metalloproteinases (MMPs) in dermal fibroblast remains unexplored. To address this question, large-scale ECM gene expression profile were analyzed in human dermal fibroblasts co-cultured with keratinocytes or treated with recombinant stratifin. Superarray pathway-specific microarrays were utilized to identify upregulation or downregulation of 96 human ECM and adhesion molecule genes. RT-PCR and Western blot were used to validate microarray expression profiles of selected genes. Comparison of gene profiles with the appropriate controls showed a significant (more than twofold) increase in expression of collagenase-1, stromelysin-1 and -2, neutrophil collagenase, and membrane type 5 MMP in dermal fibroblasts treated with stratifin or co-cultured with keratinocytes. Expression of type I collagen and fibronectin genes decreased in the same fibroblasts. The results of a dose-response experiment showed that stratifin stimulates the expression of stromelysin-1 (MMP-3) mRNA by dermal fibroblasts in a concentration-dependent fashion. Furthermore, Western blot analysis of fibroblast-conditioned medium showed a peak in MMP-3 protein levels 48 h following treatment with recombinant stratifin. In a lasting-effect study, MMP-3 protein was detected in fibroblast-condition medium for up to 72 h post removal of stratifin. In conclusion, our results suggest that keratinocyte-releasable stratifin plays a major role in induction of ECM degradation by dermal fibroblasts through stimulation of key MMPs, such as MMP-1 and MMP-3. Therefore, stratifin protein may prove to be a useful target for clinical intervention in controlling excessive wound healing in fibrotic conditions. Copyright 2006 Wiley-Liss, Inc.
Kim, Ji-Yeon; Lee, Eunjin; Park, Kyunghee; Park, Woong-Yang; Jung, Hae Hyun; Ahn, Jin Seok; Im, Young-Hyuck; Park, Yeon Hee
2017-04-25
Breast cancer (BC) has been genetically profiled through large-scale genome analyses. However, the role and clinical implications of genetic alterations in metastatic BC (MBC) have not been evaluated. Therefore, we conducted whole-exome sequencing (WES) and RNA-Seq of 37 MBC samples and targeted deep sequencing of another 29 MBCs. We evaluated somatic mutations from WES and targeted sequencing and assessed gene expression and performed pathway analysis from RNA-Seq. In this analysis, PIK3CA was the most commonly mutated gene in estrogen receptor (ER)-positive BC, while in ER-negative BC, TP53 was the most commonly mutated gene (p = 0.018 and p < 0.001, respectively). TP53 stopgain/loss and frameshift mutation was related to low expression of TP53 in contrast nonsynonymous mutation was related to high expression. The impact of TP53 mutation on clinical outcome varied with regard to ER status. In ER-positive BCs, wild type TP53 had a better prognosis than mutated TP53 (median overall survival (OS) (wild type vs. mutated): 88.5 ± 54.4 vs. 32.6 ± 10.7 (months), p = 0.002). In contrast, mutated TP53 had a protective effect in ER-negative BCs (median OS: 0.10 vs. 32.6 ± 8.2, p = 0.026). However, PIK3CA mutation did not affect patient survival. In gene expression analysis, CALM1, a potential regulator of AKT, was highly expressed in PIK3CA-mutated BCs. In conclusion, mutation of TP53 was associated with expression status and affect clinical outcome according to ER status in MBC. Although mutation of PIK3CA was not related to survival in this study, mutation of PIK3CA altered the expression of other genes and pathways including CALM1 and may be a potential predictive marker of PI3K inhibitor effectiveness.
Phylogenomic detection and functional prediction of genes potentially important for plant meiosis.
Zhang, Luoyan; Kong, Hongzhi; Ma, Hong; Yang, Ji
2018-02-15
Meiosis is a specialized type of cell division necessary for sexual reproduction in eukaryotes. A better understanding of the cytological procedures of meiosis has been achieved by comprehensive cytogenetic studies in plants, while the genetic mechanisms regulating meiotic progression remain incompletely understood. The increasing accumulation of complete genome sequences and large-scale gene expression datasets has provided a powerful resource for phylogenomic inference and unsupervised identification of genes involved in plant meiosis. By integrating sequence homology and expression data, 164, 131, 124 and 162 genes potentially important for meiosis were identified in the genomes of Arabidopsis thaliana, Oryza sativa, Selaginella moellendorffii and Pogonatum aloides, respectively. The predicted genes were assigned to 45 meiotic GO terms, and their functions were related to different processes occurring during meiosis in various organisms. Most of the predicted meiotic genes underwent lineage-specific duplication events during plant evolution, with about 30% of the predicted genes retaining only a single copy in higher plant genomes. The results of this study provided clues to design experiments for better functional characterization of meiotic genes in plants, promoting the phylogenomic approach to the evolutionary dynamics of the plant meiotic machineries. Copyright © 2017 Elsevier B.V. All rights reserved.
Lequerré, Thierry; Bansard, Carine; Vittecoq, Olivier; Derambure, Céline; Hiron, Martine; Daveau, Maryvonne; Tron, François; Ayral, Xavier; Biga, Norman; Auquit-Auckbur, Isabelle; Chiocchia, Gilles; Le Loët, Xavier; Salier, Jean-Philippe
2009-01-01
Introduction Rheumatoid arthritis (RA) is a heterogeneous disease and its underlying molecular mechanisms are still poorly understood. Because previous microarray studies have only focused on long-standing (LS) RA compared to osteoarthritis, we aimed to compare the molecular profiles of early and LS RA versus control synovia. Methods Synovial biopsies were obtained by arthroscopy from 15 patients (4 early untreated RA, 4 treated LS RA and 7 controls, who had traumatic or mechanical lesions). Extracted mRNAs were used for large-scale gene-expression profiling. The different gene-expression combinations identified by comparison of profiles of early, LS RA and healthy synovia were linked to the biological processes involved in each situation. Results Three combinations of 719, 116 and 52 transcripts discriminated, respectively, early from LS RA, and early or LS RA from healthy synovia. We identified several gene clusters and distinct molecular signatures specifically expressed during early or LS RA, thereby suggesting the involvement of different pathophysiological mechanisms during the course of RA. Conclusions Early and LS RA have distinct molecular signatures with different biological processes participating at different times during the course of the disease. These results suggest that better knowledge of the main biological processes involved at a given RA stage might help to choose the most appropriate treatment. PMID:19563633
Open-Porous Hydroxyapatite Scaffolds for Three-Dimensional Culture of Human Adult Liver Cells
Schmelzer, Eva; Over, Patrick; Nettleship, Ian; Gerlach, Joerg C.
2016-01-01
Liver cell culture within three-dimensional structures provides an improved culture system for various applications in basic research, pharmacological screening, and implantable or extracorporeal liver support. Biodegradable calcium-based scaffolds in such systems could enhance liver cell functionality by providing endothelial and hepatic cell support through locally elevated calcium levels, increased surface area for cell attachment, and allowing three-dimensional tissue restructuring. Open-porous hydroxyapatite scaffolds were fabricated and seeded with primary adult human liver cells, which were embedded within or without gels of extracellular matrix protein collagen-1 or hyaluronan. Metabolic functions were assessed after 5, 15, and 28 days. Longer-term cultures exhibited highest cell numbers and liver specific gene expression when cultured on hydroxyapatite scaffolds in collagen-1. Endothelial gene expression was induced in cells cultured on scaffolds without extracellular matrix proteins. Hydroxyapatite induced gene expression for cytokeratin-19 when cells were cultured in collagen-1 gel while culture in hyaluronan increased cytokeratin-19 gene expression independent of the use of scaffold in long-term culture. The implementation of hydroxyapatite composites with extracellular matrices affected liver cell cultures and cell differentiation depending on the type of matrix protein and the presence of a scaffold. The hydroxyapatite scaffolds enable scale-up of hepatic three-dimensional culture models for regenerative medicine applications. PMID:27403430
Ivanov, Sergey V.; Kuzmin, Igor; Wei, Ming-Hui; Pack, Svetlana; Geil, Laura; Johnson, Bruce E.; Stanbridge, Eric J.; Lerman, Michael I.
1998-01-01
To discover genes involved in von Hippel-Lindau (VHL)-mediated carcinogenesis, we used renal cell carcinoma cell lines stably transfected with wild-type VHL-expressing transgenes. Large-scale RNA differential display technology applied to these cell lines identified several differentially expressed genes, including an alpha carbonic anhydrase gene, termed CA12. The deduced protein sequence was classified as a one-pass transmembrane CA possessing an apparently intact catalytic domain in the extracellular CA module. Reintroduced wild-type VHL strongly inhibited the overexpression of the CA12 gene in the parental renal cell carcinoma cell lines. Similar results were obtained with CA9, encoding another transmembrane CA with an intact catalytic domain. Although both domains of the VHL protein contribute to regulation of CA12 expression, the elongin binding domain alone could effectively regulate CA9 expression. We mapped CA12 and CA9 loci to chromosome bands 15q22 and 17q21.2 respectively, regions prone to amplification in some human cancers. Additional experiments are needed to define the role of CA IX and CA XII enzymes in the regulation of pH in the extracellular microenvironment and its potential impact on cancer cell growth. PMID:9770531
NASA Astrophysics Data System (ADS)
Werthmann, Britta; Marwan, Wolfgang
2017-11-01
The developmental switch to sporulation in Physarum polycephalum is a phytochrome-mediated far-red light-induced cell fate decision that synchronously encompasses the entire multinucleate plasmodial cell and is associated with extensive reprogramming of the transcriptome. By repeatedly taking samples of single cells after delivery of a light stimulus pulse, we analysed differential gene expression in two mutant strains and in a heterokaryon of the two strains all of which display a different propensity for making the cell fate decision. Multidimensional scaling of the gene expression data revealed individually different single cell trajectories eventually leading to sporulation. Characterization of the trajectories as walks through states of gene expression discretized by hierarchical clustering allowed the reconstruction of Petri nets that model and predict the observed behavior. Structural analyses of the Petri nets indicated stimulus- and genotype-dependence of both, single cell trajectories and of the quasipotential landscape through which these trajectories are taken. The Petri net-based approach to the analysis and decomposition of complex cellular responses and of complex mutant phenotypes may provide a scaffold for the data-driven reconstruction of causal molecular mechanisms that shape the topology of the quasipotential landscape.
Zhao, Chen; Huang, Ying; Guo, Chao; Yang, Bolei; Zhang, Yan; Lan, Zhou; Guan, Xiong; Song, Yuan; Zhang, Xiaolin
2017-01-01
Spinosyns are a group of macrolide insecticides produced by Saccharopolyspora spinosa. Although S. spinosa can be used for industrial-scale production of spinosyns, this might suffer from several limitations, mainly related to its long growth cycle, low fermentation biomass, and inefficient utilization of starch. It is crucial to generate a robust strain for further spinosyn production and the development of spinosyn derivatives. A BAC vector, containing the whole biosynthetic gene cluster for spinosyn (74 kb) and the elements required for conjugal transfer and site-specific integration, was introduced into different Streptomyces hosts in order to obtain heterologous spinosyn-producing strains. The exconjugants of different Streptomyces strains did not show spinosyn production unless the rhamnose biosynthesis genes from S. spinosa genomic DNA were present and expressed under the control of a strong constitutive ermE*p promoter. Using this heterologous expression system resulted in yields of 1 μg/mL and 1.5 μg/mL spinosyns in Streptomyces coelicolor and Streptomyces lividans, respectively. This report demonstrates spinosyn production in 2 Streptomyces strains and stresses the essential role of rhamnose in this process. This work also provides a potential alternative route for producing spinosyn analogs by means of genetic manipulation in the heterologous hosts. © 2017 S. Karger AG, Basel.
Stillman, Jonathon H; Tagmount, Abderrahmane
2009-10-01
Central predictions of climate warming models include increased climate variability and increased severity of heat waves. Physiological acclimatization in populations across large-scale ecological gradients in habitat temperature fluctuation is an important factor to consider in detecting responses to climate change related increases in thermal fluctuation. We measured in vivo cardiac thermal maxima and used microarrays to profile transcriptome heat and cold stress responses in cardiac tissue of intertidal zone porcelain crabs across biogeographic and seasonal gradients in habitat temperature fluctuation. We observed acclimatization dependent induction of heat shock proteins, as well as unknown genes with heat shock protein-like expression profiles. Thermal acclimatization had the largest effect on heat stress responses of extensin-like, beta tubulin, and unknown genes. For these genes, crabs acclimatized to thermally variable sites had higher constitutive expression than specimens from low variability sites, but heat stress dramatically induced expression in specimens from low variability sites and repressed expression in specimens from highly variable sites. Our application of ecological transcriptomics has yielded new biomarkers that may represent sensitive indicators of acclimatization to habitat temperature fluctuation. Our study also has identified novel genes whose further description may yield novel understanding of cellular responses to thermal acclimatization or thermal stress.
Zhang, J D; Berntenis, N; Roth, A; Ebeling, M
2014-06-01
Gene signatures of drug-induced toxicity are of broad interest, but they are often identified from small-scale, single-time point experiments, and are therefore of limited applicability. To address this issue, we performed multivariate analysis of gene expression, cell-based assays, and histopathological data in the TG-GATEs (Toxicogenomics Project-Genomics Assisted Toxicity Evaluation system) database. Data mining highlights four genes-EGR1, ATF3, GDF15 and FGF21-that are induced 2 h after drug administration in human and rat primary hepatocytes poised to eventually undergo cytotoxicity-induced cell death. Modelling and simulation reveals that these early stress-response genes form a functional network with evolutionarily conserved structure and intrinsic dynamics. This is underlined by the fact that early induction of this network in vivo predicts drug-induced liver and kidney pathology with high accuracy. Our findings demonstrate the value of early gene-expression signatures in predicting and understanding compound-induced toxicity. The identified network can empower first-line tests that reduce animal use and costs of safety evaluation.
Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex.
Konermann, Silvana; Brigham, Mark D; Trevino, Alexandro E; Joung, Julia; Abudayyeh, Omar O; Barcena, Clea; Hsu, Patrick D; Habib, Naomi; Gootenberg, Jonathan S; Nishimasu, Hiroshi; Nureki, Osamu; Zhang, Feng
2015-01-29
Systematic interrogation of gene function requires the ability to perturb gene expression in a robust and generalizable manner. Here we describe structure-guided engineering of a CRISPR-Cas9 complex to mediate efficient transcriptional activation at endogenous genomic loci. We used these engineered Cas9 activation complexes to investigate single-guide RNA (sgRNA) targeting rules for effective transcriptional activation, to demonstrate multiplexed activation of ten genes simultaneously, and to upregulate long intergenic non-coding RNA (lincRNA) transcripts. We also synthesized a library consisting of 70,290 guides targeting all human RefSeq coding isoforms to screen for genes that, upon activation, confer resistance to a BRAF inhibitor. The top hits included genes previously shown to be able to confer resistance, and novel candidates were validated using individual sgRNA and complementary DNA overexpression. A gene expression signature based on the top screening hits correlated with markers of BRAF inhibitor resistance in cell lines and patient-derived samples. These results collectively demonstrate the potential of Cas9-based activators as a powerful genetic perturbation technology.
Unity in defence: honeybee workers exhibit conserved molecular responses to diverse pathogens.
Doublet, Vincent; Poeschl, Yvonne; Gogol-Döring, Andreas; Alaux, Cédric; Annoscia, Desiderato; Aurori, Christian; Barribeau, Seth M; Bedoya-Reina, Oscar C; Brown, Mark J F; Bull, James C; Flenniken, Michelle L; Galbraith, David A; Genersch, Elke; Gisder, Sebastian; Grosse, Ivo; Holt, Holly L; Hultmark, Dan; Lattorff, H Michael G; Le Conte, Yves; Manfredini, Fabio; McMahon, Dino P; Moritz, Robin F A; Nazzi, Francesco; Niño, Elina L; Nowick, Katja; van Rij, Ronald P; Paxton, Robert J; Grozinger, Christina M
2017-03-02
Organisms typically face infection by diverse pathogens, and hosts are thought to have developed specific responses to each type of pathogen they encounter. The advent of transcriptomics now makes it possible to test this hypothesis and compare host gene expression responses to multiple pathogens at a genome-wide scale. Here, we performed a meta-analysis of multiple published and new transcriptomes using a newly developed bioinformatics approach that filters genes based on their expression profile across datasets. Thereby, we identified common and unique molecular responses of a model host species, the honey bee (Apis mellifera), to its major pathogens and parasites: the Microsporidia Nosema apis and Nosema ceranae, RNA viruses, and the ectoparasitic mite Varroa destructor, which transmits viruses. We identified a common suite of genes and conserved molecular pathways that respond to all investigated pathogens, a result that suggests a commonality in response mechanisms to diverse pathogens. We found that genes differentially expressed after infection exhibit a higher evolutionary rate than non-differentially expressed genes. Using our new bioinformatics approach, we unveiled additional pathogen-specific responses of honey bees; we found that apoptosis appeared to be an important response following microsporidian infection, while genes from the immune signalling pathways, Toll and Imd, were differentially expressed after Varroa/virus infection. Finally, we applied our bioinformatics approach and generated a gene co-expression network to identify highly connected (hub) genes that may represent important mediators and regulators of anti-pathogen responses. Our meta-analysis generated a comprehensive overview of the host metabolic and other biological processes that mediate interactions between insects and their pathogens. We identified key host genes and pathways that respond to phylogenetically diverse pathogens, representing an important source for future functional studies as well as offering new routes to identify or generate pathogen resilient honey bee stocks. The statistical and bioinformatics approaches that were developed for this study are broadly applicable to synthesize information across transcriptomic datasets. These approaches will likely have utility in addressing a variety of biological questions.
NASA Astrophysics Data System (ADS)
Li, Qiong; Tang, Qing; Xu, Xudong; Gao, Hong
2010-11-01
Genetic engineering in filamentous N2-fixing cyanobacteria usually involves Anabaena sp. PCC 7120 and several other non-aggregating species. Mass culture and harvest of such species are more energy consuming relative to aggregating species. To establish a gene transfer system for aggregating species, we tested many species of Anabaena and Nostoc, and identified Nostoc muscorum FACHB244 as a species that can be genetically manipulated using the conjugative gene transfer system. To promote biodegradation of organophosphorus pollutants in aquatic environments, we introduced a plasmid containing the organophosphorus-degradation gene ( opd) into Anabaena sp. PCC 7120 and Nostoc muscorum FACHB244 by conjugation. The opd gene was driven by a strong promoter, P psbA . From both species, we obtained transgenic strains having organophosphorus-degradation activities. At 25°C, the whole-cell activities of the transgenic Anabaena and Nostoc strains were 0.163±0.001 and 0.289±0.042 unit/μg Chl a, respectively. However, most colonies resulting from the gene transfer showed no activity. PCR and DNA sequencing revealed deletions or rearrangements in the plasmid in some of the colonies. Expression of the green fluorescent protein gene from the same promoter in Anabaena sp. PCC 7120 showed similar results. These results suggest that there is the potential to promote the degradation of organophosphorus pollutants with transgenic cyanobacteria and that selection of high-expression transgenic colonies is important for genetic engineering of Anabaena and Nostoc species. For the first time, we established a gene transfer and expression system in an aggregating filamentous N2-fixing cyanobacterium. The genetic manipulation system of Nostoc muscorum FACHB244 could be utilized in the elimination of pollutants and large-scale production of valuable proteins or metabolites.
Muñoz-Nortes, Tamara; Pérez-Pérez, José Manuel; Ponce, María Rosa; Candela, Héctor; Micol, José Luis
2017-03-01
The characterization of mutants with altered leaf shape and pigmentation has previously allowed the identification of nuclear genes that encode plastid-localized proteins that perform essential functions in leaf growth and development. A large-scale screen previously allowed us to isolate ethyl methanesulfonate-induced mutants with small rosettes and pale green leaves with prominent marginal teeth, which were assigned to a phenotypic class that we dubbed Angulata. The molecular characterization of the 12 genes assigned to this phenotypic class should help us to advance our understanding of the still poorly understood relationship between chloroplast biogenesis and leaf morphogenesis. In this article, we report the phenotypic and molecular characterization of the angulata7-1 (anu7-1) mutant of Arabidopsis thaliana, which we found to be a hypomorphic allele of the EMB2737 gene, which was previously known only for its embryonic-lethal mutations. ANU7 encodes a plant-specific protein that contains a domain similar to the central cysteine-rich domain of DnaJ proteins. The observed genetic interaction of anu7-1 with a loss-of-function allele of GENOMES UNCOUPLED1 suggests that the anu7-1 mutation triggers a retrograde signal that leads to changes in the expression of many genes that normally function in the chloroplasts. Many such genes are expressed at higher levels in anu7-1 rosettes, with a significant overrepresentation of those required for the expression of plastid genome genes. Like in other mutants with altered expression of plastid-encoded genes, we found that anu7-1 exhibits defects in the arrangement of thylakoidal membranes, which appear locally unappressed. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.