Sample records for gene expression exist

  1. From Coexpression to Coregulation: An Approach to Inferring Transcriptional Regulation Among Gene Classes from Large-Scale Expression Data

    NASA Technical Reports Server (NTRS)

    Mjolsness, Eric; Castano, Rebecca; Mann, Tobias; Wold, Barbara

    2000-01-01

    We provide preliminary evidence that existing algorithms for inferring small-scale gene regulation networks from gene expression data can be adapted to large-scale gene expression data coming from hybridization microarrays. The essential steps are (I) clustering many genes by their expression time-course data into a minimal set of clusters of co-expressed genes, (2) theoretically modeling the various conditions under which the time-courses are measured using a continuous-time analog recurrent neural network for the cluster mean time-courses, (3) fitting such a regulatory model to the cluster mean time courses by simulated annealing with weight decay, and (4) analysing several such fits for commonalities in the circuit parameter sets including the connection matrices. This procedure can be used to assess the adequacy of existing and future gene expression time-course data sets for determining transcriptional regulatory relationships such as coregulation.

  2. Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma

    PubMed Central

    Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

    2007-01-01

    Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis. PMID:18305825

  3. Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma.

    PubMed

    Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

    2007-12-30

    Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis.

  4. Aberrant Gene Expression in Humans

    PubMed Central

    Yang, Ence; Ji, Guoli; Brinkmeyer-Langford, Candice L.; Cai, James J.

    2015-01-01

    Gene expression as an intermediate molecular phenotype has been a focus of research interest. In particular, studies of expression quantitative trait loci (eQTL) have offered promise for understanding gene regulation through the discovery of genetic variants that explain variation in gene expression levels. Existing eQTL methods are designed for assessing the effects of common variants, but not rare variants. Here, we address the problem by establishing a novel analytical framework for evaluating the effects of rare or private variants on gene expression. Our method starts from the identification of outlier individuals that show markedly different gene expression from the majority of a population, and then reveals the contributions of private SNPs to the aberrant gene expression in these outliers. Using population-scale mRNA sequencing data, we identify outlier individuals using a multivariate approach. We find that outlier individuals are more readily detected with respect to gene sets that include genes involved in cellular regulation and signal transduction, and less likely to be detected with respect to the gene sets with genes involved in metabolic pathways and other fundamental molecular functions. Analysis of polymorphic data suggests that private SNPs of outlier individuals are enriched in the enhancer and promoter regions of corresponding aberrantly-expressed genes, suggesting a specific regulatory role of private SNPs, while the commonly-occurring regulatory genetic variants (i.e., eQTL SNPs) show little evidence of involvement. Additional data suggest that non-genetic factors may also underlie aberrant gene expression. Taken together, our findings advance a novel viewpoint relevant to situations wherein common eQTLs fail to predict gene expression when heritable, rare inter-individual variation exists. The analytical framework we describe, taking into consideration the reality of differential phenotypic robustness, may be valuable for investigating complex traits and conditions. PMID:25617623

  5. GENE EXPRESSION NETWORKS

    EPA Science Inventory

    "Gene expression network" is the term used to describe the interplay, simple or complex, between two or more gene products in performing a specific cellular function. Although the delineation of such networks is complicated by the existence of multiple and subtle types of intera...

  6. Systems analysis of cis-regulatory motifs in C4 photosynthesis genes using maize and rice leaf transcriptomic data during a process of de-etiolation

    PubMed Central

    Xu, Jiajia; Bräutigam, Andrea; Weber, Andreas P. M.; Zhu, Xin-Guang

    2016-01-01

    Identification of potential cis-regulatory motifs controlling the development of C4 photosynthesis is a major focus of current research. In this study, we used time-series RNA-seq data collected from etiolated maize and rice leaf tissues sampled during a de-etiolation process to systematically characterize the expression patterns of C4-related genes and to further identify potential cis elements in five different genomic regions (i.e. promoter, 5′UTR, 3′UTR, intron, and coding sequence) of C4 orthologous genes. The results demonstrate that although most of the C4 genes show similar expression patterns, a number of them, including chloroplast dicarboxylate transporter 1, aspartate aminotransferase, and triose phosphate transporter, show shifted expression patterns compared with their C3 counterparts. A number of conserved short DNA motifs between maize C4 genes and their rice orthologous genes were identified not only in the promoter, 5′UTR, 3′UTR, and coding sequences, but also in the introns of core C4 genes. We also identified cis-regulatory motifs that exist in maize C4 genes and also in genes showing similar expression patterns as maize C4 genes but that do not exist in rice C3 orthologs, suggesting a possible recruitment of pre-existing cis-elements from genes unrelated to C4 photosynthesis into C4 photosynthesis genes during C4 evolution. PMID:27436282

  7. A comparison of brain gene expression levels in domesticated and wild animals.

    PubMed

    Albert, Frank W; Somel, Mehmet; Carneiro, Miguel; Aximu-Petri, Ayinuer; Halbwax, Michel; Thalmann, Olaf; Blanco-Aguiar, Jose A; Plyusnina, Irina Z; Trut, Lyudmila; Villafuerte, Rafael; Ferrand, Nuno; Kaiser, Sylvia; Jensen, Per; Pääbo, Svante

    2012-09-01

    Domestication has led to similar changes in morphology and behavior in several animal species, raising the question whether similarities between different domestication events also exist at the molecular level. We used mRNA sequencing to analyze genome-wide gene expression patterns in brain frontal cortex in three pairs of domesticated and wild species (dogs and wolves, pigs and wild boars, and domesticated and wild rabbits). We compared the expression differences with those between domesticated guinea pigs and a distant wild relative (Cavia aperea) as well as between two lines of rats selected for tameness or aggression towards humans. There were few gene expression differences between domesticated and wild dogs, pigs, and rabbits (30-75 genes (less than 1%) of expressed genes were differentially expressed), while guinea pigs and C. aperea differed more strongly. Almost no overlap was found between the genes with differential expression in the different domestication events. In addition, joint analyses of all domesticated and wild samples provided only suggestive evidence for the existence of a small group of genes that changed their expression in a similar fashion in different domesticated species. The most extreme of these shared expression changes include up-regulation in domesticates of SOX6 and PROM1, two modulators of brain development. There was almost no overlap between gene expression in domesticated animals and the tame and aggressive rats. However, two of the genes with the strongest expression differences between the rats (DLL3 and DHDH) were located in a genomic region associated with tameness and aggression, suggesting a role in influencing tameness. In summary, the majority of brain gene expression changes in domesticated animals are specific to the given domestication event, suggesting that the causative variants of behavioral domestication traits may likewise be different.

  8. A Comparison of Brain Gene Expression Levels in Domesticated and Wild Animals

    PubMed Central

    Albert, Frank W.; Somel, Mehmet; Carneiro, Miguel; Aximu-Petri, Ayinuer; Halbwax, Michel; Thalmann, Olaf; Blanco-Aguiar, Jose A.; Trut, Lyudmila; Villafuerte, Rafael; Ferrand, Nuno; Kaiser, Sylvia; Jensen, Per; Pääbo, Svante

    2012-01-01

    Domestication has led to similar changes in morphology and behavior in several animal species, raising the question whether similarities between different domestication events also exist at the molecular level. We used mRNA sequencing to analyze genome-wide gene expression patterns in brain frontal cortex in three pairs of domesticated and wild species (dogs and wolves, pigs and wild boars, and domesticated and wild rabbits). We compared the expression differences with those between domesticated guinea pigs and a distant wild relative (Cavia aperea) as well as between two lines of rats selected for tameness or aggression towards humans. There were few gene expression differences between domesticated and wild dogs, pigs, and rabbits (30–75 genes (less than 1%) of expressed genes were differentially expressed), while guinea pigs and C. aperea differed more strongly. Almost no overlap was found between the genes with differential expression in the different domestication events. In addition, joint analyses of all domesticated and wild samples provided only suggestive evidence for the existence of a small group of genes that changed their expression in a similar fashion in different domesticated species. The most extreme of these shared expression changes include up-regulation in domesticates of SOX6 and PROM1, two modulators of brain development. There was almost no overlap between gene expression in domesticated animals and the tame and aggressive rats. However, two of the genes with the strongest expression differences between the rats (DLL3 and DHDH) were located in a genomic region associated with tameness and aggression, suggesting a role in influencing tameness. In summary, the majority of brain gene expression changes in domesticated animals are specific to the given domestication event, suggesting that the causative variants of behavioral domestication traits may likewise be different. PMID:23028369

  9. Discovering mutated driver genes through a robust and sparse co-regularized matrix factorization framework with prior information from mRNA expression patterns and interaction network.

    PubMed

    Xi, Jianing; Wang, Minghui; Li, Ao

    2018-06-05

    Discovery of mutated driver genes is one of the primary objective for studying tumorigenesis. To discover some relatively low frequently mutated driver genes from somatic mutation data, many existing methods incorporate interaction network as prior information. However, the prior information of mRNA expression patterns are not exploited by these existing network-based methods, which is also proven to be highly informative of cancer progressions. To incorporate prior information from both interaction network and mRNA expressions, we propose a robust and sparse co-regularized nonnegative matrix factorization to discover driver genes from mutation data. Furthermore, our framework also conducts Frobenius norm regularization to overcome overfitting issue. Sparsity-inducing penalty is employed to obtain sparse scores in gene representations, of which the top scored genes are selected as driver candidates. Evaluation experiments by known benchmarking genes indicate that the performance of our method benefits from the two type of prior information. Our method also outperforms the existing network-based methods, and detect some driver genes that are not predicted by the competing methods. In summary, our proposed method can improve the performance of driver gene discovery by effectively incorporating prior information from interaction network and mRNA expression patterns into a robust and sparse co-regularized matrix factorization framework.

  10. Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

    PubMed Central

    Caldwell, Rachel; Lin, Yan-Xia; Zhang, Ren

    2015-01-01

    There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length. PMID:26114098

  11. Mining functionally relevant gene sets for analyzing physiologically novel clinical expression data.

    PubMed

    Turcan, Sevin; Vetter, Douglas E; Maron, Jill L; Wei, Xintao; Slonim, Donna K

    2011-01-01

    Gene set analyses have become a standard approach for increasing the sensitivity of transcriptomic studies. However, analytical methods incorporating gene sets require the availability of pre-defined gene sets relevant to the underlying physiology being studied. For novel physiological problems, relevant gene sets may be unavailable or existing gene set databases may bias the results towards only the best-studied of the relevant biological processes. We describe a successful attempt to mine novel functional gene sets for translational projects where the underlying physiology is not necessarily well characterized in existing annotation databases. We choose targeted training data from public expression data repositories and define new criteria for selecting biclusters to serve as candidate gene sets. Many of the discovered gene sets show little or no enrichment for informative Gene Ontology terms or other functional annotation. However, we observe that such gene sets show coherent differential expression in new clinical test data sets, even if derived from different species, tissues, and disease states. We demonstrate the efficacy of this method on a human metabolic data set, where we discover novel, uncharacterized gene sets that are diagnostic of diabetes, and on additional data sets related to neuronal processes and human development. Our results suggest that our approach may be an efficient way to generate a collection of gene sets relevant to the analysis of data for novel clinical applications where existing functional annotation is relatively incomplete.

  12. Analysis of Cytoskeletal and Motility Proteins in the Sea Urchin Genome Assembly

    PubMed Central

    RL, Morris; MP, Hoffman; RA, Obar; SS, McCafferty; IR, Gibbons; AD, Leone; J, Cool; EL, Allgood; AM, Musante; KM, Judkins; BJ, Rossetti; AP, Rawson; DR, Burgess

    2007-01-01

    The sea urchin embryo is a classical model system for studying the role of the cytoskeleton in such events as fertilization, mitosis, cleavage, cell migration and gastrulation. We have conducted an analysis of gene models derived from the Strongylocentrotus purpuratus genome assembly and have gathered strong evidence for the existence of multiple gene families encoding cytoskeletal proteins and their regulators in sea urchin. While many cytoskeletal genes have been cloned from sea urchin with sequences already existing in public databases, genome analysis reveals a significantly higher degree of diversity within certain gene families. Furthermore, genes are described corresponding to homologs of cytoskeletal proteins not previously documented in sea urchins. To illustrate the varying degree of sequence diversity that exists within cytoskeletal gene families, we conducted an analysis of genes encoding actins, specific actin-binding proteins, myosins, tubulins, kinesins, dyneins, specific microtubule-associated proteins, and intermediate filaments. We conducted ontological analysis of select genes to better understand the relatedness of urchin cytoskeletal genes to those of other deuterostomes. We analyzed developmental expression (EST) data to confirm the existence of select gene models and to understand their differential expression during various stages of early development. PMID:17027957

  13. Integrating mean and variance heterogeneities to identify differentially expressed genes.

    PubMed

    Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

    2016-12-06

    In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment-wide significant MVDE genes. Our results indicate tremendous potential gain of integrating informative variance heterogeneity after adjusting for global confounders and background data structure. The proposed informative integration test better summarizes the impacts of condition change on expression distributions of susceptible genes than do the existent competitors. Therefore, particular attention should be paid to explicitly exploit the variance heterogeneity induced by condition change in functional genomics analysis.

  14. Systems analysis of cis-regulatory motifs in C4 photosynthesis genes using maize and rice leaf transcriptomic data during a process of de-etiolation.

    PubMed

    Xu, Jiajia; Bräutigam, Andrea; Weber, Andreas P M; Zhu, Xin-Guang

    2016-09-01

    Identification of potential cis-regulatory motifs controlling the development of C4 photosynthesis is a major focus of current research. In this study, we used time-series RNA-seq data collected from etiolated maize and rice leaf tissues sampled during a de-etiolation process to systematically characterize the expression patterns of C4-related genes and to further identify potential cis elements in five different genomic regions (i.e. promoter, 5'UTR, 3'UTR, intron, and coding sequence) of C4 orthologous genes. The results demonstrate that although most of the C4 genes show similar expression patterns, a number of them, including chloroplast dicarboxylate transporter 1, aspartate aminotransferase, and triose phosphate transporter, show shifted expression patterns compared with their C3 counterparts. A number of conserved short DNA motifs between maize C4 genes and their rice orthologous genes were identified not only in the promoter, 5'UTR, 3'UTR, and coding sequences, but also in the introns of core C4 genes. We also identified cis-regulatory motifs that exist in maize C4 genes and also in genes showing similar expression patterns as maize C4 genes but that do not exist in rice C3 orthologs, suggesting a possible recruitment of pre-existing cis-elements from genes unrelated to C4 photosynthesis into C4 photosynthesis genes during C4 evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  15. Genome-wide identification, characterization of sugar transporter genes in the silkworm Bombyx mori and role in Bombyx mori nucleopolyhedrovirus (BmNPV) infection.

    PubMed

    Govindaraj, Lekha; Gupta, Tania; Esvaran, Vijaya Gowri; Awasthi, Arvind Kumar; Ponnuvel, Kangayam M

    2016-04-01

    Sugar transporters play an essential role in controlling carbohydrate transport and are responsible for mediating the movement of sugars into cells. These genes exist as large multigene families within the insect genome. In insects, sugar transporters not only have a role in sugar transport, but may also act as receptors for virus entry. Genome-wide annotation of silkworm Bombyx mori (B. mori) revealed 100 putative sugar transporter (BmST) genes exists as a large multigene family and were classified into 11 sub families, through phylogenetic analysis. Chromosomes 27, 26 and 20 were found to possess the highest number of BmST paralogous genes, harboring 22, 7 and 6 genes, respectively. These genes occurred in clusters exhibiting the phenomenon of tandem gene duplication. The ovary, silk gland, hemocytes, midgut and malphigian tubules were the different tissues/cells enriched with BmST gene expression. The BmST gene BGIBMGA001498 had maximum EST transcripts of 134 and expressed exclusively in the malphigian tubule. The expression of EST transcripts of the BmST clustered genes on chromosome 27 was distributed in various tissues like testis, ovary, silk gland, malphigian tubule, maxillary galea, prothoracic gland, epidermis, fat body and midgut. Three sugar transporter genes (BmST) were constitutively expressed in the susceptible race and were down regulated upon BmNPV infection at 12h post infection (hpi). The expression pattern of these three genes was validated through real-time PCR in the midgut tissues at different time intervals from 0 to 30hpi. In the susceptible B. mori race, expression of sugar transporter genes was constitutively expressed making the host succumb to viral infection. Copyright © 2015 Elsevier B.V. All rights reserved.

  16. A mesh generation and machine learning framework for Drosophila gene expression pattern image analysis

    PubMed Central

    2013-01-01

    Background Multicellular organisms consist of cells of many different types that are established during development. Each type of cell is characterized by the unique combination of expressed gene products as a result of spatiotemporal gene regulation. Currently, a fundamental challenge in regulatory biology is to elucidate the gene expression controls that generate the complex body plans during development. Recent advances in high-throughput biotechnologies have generated spatiotemporal expression patterns for thousands of genes in the model organism fruit fly Drosophila melanogaster. Existing qualitative methods enhanced by a quantitative analysis based on computational tools we present in this paper would provide promising ways for addressing key scientific questions. Results We develop a set of computational methods and open source tools for identifying co-expressed embryonic domains and the associated genes simultaneously. To map the expression patterns of many genes into the same coordinate space and account for the embryonic shape variations, we develop a mesh generation method to deform a meshed generic ellipse to each individual embryo. We then develop a co-clustering formulation to cluster the genes and the mesh elements, thereby identifying co-expressed embryonic domains and the associated genes simultaneously. Experimental results indicate that the gene and mesh co-clusters can be correlated to key developmental events during the stages of embryogenesis we study. The open source software tool has been made available at http://compbio.cs.odu.edu/fly/. Conclusions Our mesh generation and machine learning methods and tools improve upon the flexibility, ease-of-use and accuracy of existing methods. PMID:24373308

  17. Discovering high-resolution patterns of differential DNA methylation that correlate with gene expression changes

    PubMed Central

    VanderKraats, Nathan D.; Hiken, Jeffrey F.; Decker, Keith F.; Edwards, John R.

    2013-01-01

    Methylation of the CpG-rich region (CpG island) overlapping a gene’s promoter is a generally accepted mechanism for silencing expression. While recent technological advances have enabled measurement of DNA methylation and expression changes genome-wide, only modest correlations between differential methylation at gene promoters and expression have been found. We hypothesize that stronger associations are not observed because existing analysis methods oversimplify their representation of the data and do not capture the diversity of existing methylation patterns. Recently, other patterns such as CpG island shore methylation and long partially hypomethylated domains have also been linked with gene silencing. Here, we detail a new approach for discovering differential methylation patterns associated with expression change using genome-wide high-resolution methylation data: we represent differential methylation as an interpolated curve, or signature, and then identify groups of genes with similarly shaped signatures and corresponding expression changes. Our technique uncovers a diverse set of patterns that are conserved across embryonic stem cell and cancer data sets. Overall, we find strong associations between these methylation patterns and expression. We further show that an extension of our method also outperforms other approaches by generating a longer list of genes with higher quality associations between differential methylation and expression. PMID:23748561

  18. Transcriptome dynamics along axolotl regenerative development are consistent with an extensive reduction in gene expression heterogeneity in dedifferentiated cells

    PubMed Central

    2017-01-01

    Although in recent years the study of gene expression variation in the absence of genetic or environmental cues or gene expression heterogeneity has intensified considerably, many basic and applied biological fields still remain unaware of how useful the study of gene expression heterogeneity patterns might be for the characterization of biological systems and/or processes. Largely based on the modulator effect chromatin compaction has for gene expression heterogeneity and the extensive changes in chromatin compaction known to occur for specialized cells that are naturally or artificially induced to revert to less specialized states or dedifferentiate, I recently hypothesized that processes that concur with cell dedifferentiation would show an extensive reduction in gene expression heterogeneity. The confirmation of the existence of such trend could be of wide interest because of the biomedical and biotechnological relevance of cell dedifferentiation-based processes, i.e., regenerative development, cancer, human induced pluripotent stem cells, or plant somatic embryogenesis. Here, I report the first empirical evidence consistent with the existence of an extensive reduction in gene expression heterogeneity for processes that concur with cell dedifferentiation by analyzing transcriptome dynamics along forearm regenerative development in Ambystoma mexicanum or axolotl. Also, I briefly discuss on the utility of the study of gene expression heterogeneity dynamics might have for the characterization of cell dedifferentiation-based processes, and the engineering of tools that afforded better monitoring and modulating such processes. Finally, I reflect on how a transitional reduction in gene expression heterogeneity for dedifferentiated cells can promote a long-term increase in phenotypic heterogeneity following cell dedifferentiation with potential adverse effects for biomedical and biotechnological applications. PMID:29134148

  19. Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications.

    PubMed

    Yu, Fang; Chen, Ming-Hui; Kuo, Lynn; Talbott, Heather; Davis, John S

    2015-08-07

    Recently, the Bayesian method becomes more popular for analyzing high dimensional gene expression data as it allows us to borrow information across different genes and provides powerful estimators for evaluating gene expression levels. It is crucial to develop a simple but efficient gene selection algorithm for detecting differentially expressed (DE) genes based on the Bayesian estimators. In this paper, by extending the two-criterion idea of Chen et al. (Chen M-H, Ibrahim JG, Chi Y-Y. A new class of mixture models for differential gene expression in DNA microarray data. J Stat Plan Inference. 2008;138:387-404), we propose two new gene selection algorithms for general Bayesian models and name these new methods as the confident difference criterion methods. One is based on the standardized differences between two mean expression values among genes; the other adds the differences between two variances to it. The proposed confident difference criterion methods first evaluate the posterior probability of a gene having different gene expressions between competitive samples and then declare a gene to be DE if the posterior probability is large. The theoretical connection between the proposed first method based on the means and the Bayes factor approach proposed by Yu et al. (Yu F, Chen M-H, Kuo L. Detecting differentially expressed genes using alibrated Bayes factors. Statistica Sinica. 2008;18:783-802) is established under the normal-normal-model with equal variances between two samples. The empirical performance of the proposed methods is examined and compared to those of several existing methods via several simulations. The results from these simulation studies show that the proposed confident difference criterion methods outperform the existing methods when comparing gene expressions across different conditions for both microarray studies and sequence-based high-throughput studies. A real dataset is used to further demonstrate the proposed methodology. In the real data application, the confident difference criterion methods successfully identified more clinically important DE genes than the other methods. The confident difference criterion method proposed in this paper provides a new efficient approach for both microarray studies and sequence-based high-throughput studies to identify differentially expressed genes.

  20. Synaptic genes are extensively downregulated across multiple brain regions in normal human aging and Alzheimer’s disease

    PubMed Central

    Berchtold, Nicole C.; Coleman, Paul D.; Cribbs, David H.; Rogers, Joseph; Gillen, Daniel L.; Cotman, Carl W.

    2014-01-01

    Synapses are essential for transmitting, processing, and storing information, all of which decline in aging and Alzheimer’s disease (AD). Because synapse loss only partially accounts for the cognitive declines seen in aging and AD, we hypothesized that existing synapses might undergo molecular changes that reduce their functional capacity. Microarrays were used to evaluate expression profiles of 340 synaptic genes in aging (20–99 years) and AD across 4 brain regions from 81 cases. The analysis revealed an unexpectedly large number of significant expression changes in synapse-related genes in aging, with many undergoing progressive downregulation across aging and AD. Functional classification of the genes showing altered expression revealed that multiple aspects of synaptic function are affected, notably synaptic vesicle trafficking and release, neurotransmitter receptors and receptor trafficking, postsynaptic density scaffolding, cell adhesion regulating synaptic stability, and neuromodulatory systems. The widespread declines in synaptic gene expression in normal aging suggests that function of existing synapses might be impaired, and that a common set of synaptic genes are vulnerable to change in aging and AD. PMID:23273601

  1. Isoflurane is a suitable alternative to ether for anesthetizing rats prior to euthanasia for gene expression analysis.

    PubMed

    Nakatsu, Noriyuki; Igarashi, Yoshinobu; Aoshi, Taiki; Hamaguchi, Isao; Saito, Masumichi; Mizukami, Takuo; Momose, Haruka; Ishii, Ken J; Yamada, Hiroshi

    2017-01-01

    Diethyl ether (ether) had been widely used in Japan for anesthesia, despite its explosive properties and toxicity to both humans and animals. We also had used ether as an anesthetic for euthanizing rats for research in the Toxicogenomics Project (TGP). Because the use of ether for these purposes will likely cease, it is required to select an alternative anesthetic which is validated for consistency with existing TGP data acquired under ether anesthesia. We therefore compared two alternative anesthetic candidates, isoflurane and pentobarbital, with ether in terms of hematological findings, serum biochemical parameters, and gene expressions. As a result, few differences among the three agents were observed. In hematological and serum biochemistry analysis, no significant changes were found. In gene expression analysis, four known genes were extracted as differentially expressed genes in the liver of rats anesthetized with ether, isoflurane, or pentobarbital. However, no significant relationships were detected using gene ontology, pathway, or gene enrichment analyses by DAVID and TargetMine. Surprisingly, although it was expected that the lung would be affected by administration via inhalation, only one differentially expressed gene was extracted in the lung. Taken together, our data indicate that there are no significant differences among ether, isoflurane, and pentobarbital with respect to effects on hematological parameters, serum biochemistry parameters, and gene expression. Based on its smallest affect to existing data and its safety profile for humans and animals, we suggest isoflurane as a suitable alternative anesthetic for use in rat euthanasia in toxicogenomics analysis.

  2. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.

    PubMed

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K

    2016-11-30

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer's disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer's disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer's disease brains. The biological pathways associated with Alzheimer's disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature.

  3. Characterization of an In Vivo Z-DNA Detection Probe Based on a Cell Nucleus Accumulating Intrabody.

    PubMed

    Gulis, Galina; Silva, Izabel Cristina Rodrigues; Sousa, Herdson Renney; Sousa, Isabel Garcia; Bezerra, Maryani Andressa Gomes; Quilici, Luana Salgado; Maranhao, Andrea Queiroz; Brigido, Marcelo Macedo

    2016-09-01

    Left-handed Z-DNA is a physiologically unstable DNA conformation, and its existence in vivo can be attributed to localized torsional distress. Despite evidence for the existence of Z-DNA in vivo, its precise role in the control of gene expression is not fully understood. Here, an in vivo probe based on an anti-Z-DNA intrabody is proposed for native Z-DNA detection. The probe was used for chromatin immunoprecipitation of potential Z-DNA-forming sequences in the human genome. One of the isolated putative Z-DNA-forming sequences was cloned upstream of a reporter gene expression cassette under control of the CMV promoter. The reporter gene encoded an antibody fragment fused to GFP. Transient co-transfection of this vector along with the Z-probe coding vector improved reporter gene expression. This improvement was demonstrated by measuring reporter gene mRNA and protein levels and the amount of fluorescence in co-transfected CHO-K1 cells. These results suggest that the presence of the anti-Z-DNA intrabody can interfere with a Z-DNA-containing reporter gene expression. Therefore, this in vivo probe for the detection of Z-DNA could be used for global correlation of Z-DNA-forming sequences and gene expression regulation.

  4. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer’s Disease

    PubMed Central

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K.

    2016-01-01

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer’s disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer’s disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer’s disease brains. The biological pathways associated with Alzheimer’s disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature. PMID:27901073

  5. Gene coexpression measures in large heterogeneous samples using count statistics.

    PubMed

    Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan

    2014-11-18

    With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.

  6. CRISPR/Cas9 mediates efficient conditional mutagenesis in Drosophila.

    PubMed

    Xue, Zhaoyu; Wu, Menghua; Wen, Kejia; Ren, Menda; Long, Li; Zhang, Xuedi; Gao, Guanjun

    2014-09-05

    Existing transgenic RNA interference (RNAi) methods greatly facilitate functional genome studies via controlled silencing of targeted mRNA in Drosophila. Although the RNAi approach is extremely powerful, concerns still linger about its low efficiency. Here, we developed a CRISPR/Cas9-mediated conditional mutagenesis system by combining tissue-specific expression of Cas9 driven by the Gal4/upstream activating site system with various ubiquitously expressed guide RNA transgenes to effectively inactivate gene expression in a temporally and spatially controlled manner. Furthermore, by including multiple guide RNAs in a transgenic vector to target a single gene, we achieved a high degree of gene mutagenesis in specific tissues. The CRISPR/Cas9-mediated conditional mutagenesis system provides a simple and effective tool for gene function analysis, and complements the existing RNAi approach. Copyright © 2014 Xue et al.

  7. Transcriptomic Analysis and the Expression of Disease-Resistant Genes in Oryza meyeriana under Native Condition

    PubMed Central

    He, Bin; Tao, Xiang; Gu, Yinghong; Wei, Changhe; Cheng, Xiaojie; Xiao, Suqin; Cheng, Zaiquan; Zhang, Yizheng

    2015-01-01

    Oryza meyeriana (O. meyeriana), with a GG genome type (2n = 24), accumulated plentiful excellent characteristics with respect to resistance to many diseases such as rice shade and blast, even immunity to bacterial blight. It is very important to know if the diseases-resistant genes exist and express in this wild rice under native conditions. However, limited genomic or transcriptomic data of O. meyeriana are currently available. In this study, we present the first comprehensive characterization of the O. meyeriana transcriptome using RNA-seq and obtained 185,323 contigs with an average length of 1,692 bp and an N50 of 2,391 bp. Through differential expression analysis, it was found that there were most tissue-specifically expressed genes in roots, and next to stems and leaves. By similarity search against protein databases, 146,450 had at least a significant alignment to existed gene models. Comparison with the Oryza sativa (japonica-type Nipponbare and indica-type 93–11) genomes revealed that 13% of the O. meyeriana contigs had not been detected in O. sativa. Many diseases-resistant genes, such as bacterial blight resistant, blast resistant, rust resistant, fusarium resistant, cyst nematode resistant and downy mildew gene, were mined from the transcriptomic database. There are two kinds of rice bacterial blight-resistant genes (Xa1 and Xa26) differentially or specifically expressed in O. meyeriana. The 4 Xa1 contigs were all only expressed in root, while three of Xa26 contigs have the highest expression level in leaves, two of Xa26 contigs have the highest expression profile in stems and one of Xa26 contigs was expressed dominantly in roots. The transcriptomic database of O. meyeriana has been constructed and many diseases-resistant genes were found to express under native condition, which provides a foundation for future discovery of a number of novel genes and provides a basis for studying the molecular mechanisms associated with disease resistance in O. meyeriana. PMID:26640944

  8. CGO: utilizing and integrating gene expression microarray data in clinical research and data management.

    PubMed

    Bumm, Klaus; Zheng, Mingzhong; Bailey, Clyde; Zhan, Fenghuang; Chiriva-Internati, M; Eddlemon, Paul; Terry, Julian; Barlogie, Bart; Shaughnessy, John D

    2002-02-01

    Clinical GeneOrganizer (CGO) is a novel windows-based archiving, organization and data mining software for the integration of gene expression profiling in clinical medicine. The program implements various user-friendly tools and extracts data for further statistical analysis. This software was written for Affymetrix GeneChip *.txt files, but can also be used for any other microarray-derived data. The MS-SQL server version acts as a data mart and links microarray data with clinical parameters of any other existing database and therefore represents a valuable tool for combining gene expression analysis and clinical disease characteristics.

  9. Integrated Analyses of Gene Expression Profiles Digs out Common Markers for Rheumatic Diseases

    PubMed Central

    Wang, Lan; Wu, Long-Fei; Lu, Xin; Mo, Xing-Bo; Tang, Zai-Xiang; Lei, Shu-Feng; Deng, Fei-Yan

    2015-01-01

    Objective Rheumatic diseases have some common symptoms. Extensive gene expression studies, accumulated thus far, have successfully identified signature molecules for each rheumatic disease, individually. However, whether there exist shared factors across rheumatic diseases has yet to be tested. Methods We collected and utilized 6 public microarray datasets covering 4 types of representative rheumatic diseases including rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis, and osteoarthritis. Then we detected overlaps of differentially expressed genes across datasets and performed a meta-analysis aiming at identifying common differentially expressed genes that discriminate between pathological cases and normal controls. To further gain insights into the functions of the identified common differentially expressed genes, we conducted gene ontology enrichment analysis and protein-protein interaction analysis. Results We identified a total of eight differentially expressed genes (TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, PRF1), each associated with at least 3 of the 4 studied rheumatic diseases. Meta-analysis warranted the significance of the eight genes and highlighted the general significance of four genes (CX3CR1, LY96, TLR5, and PRF1). Protein-protein interaction and gene ontology enrichment analyses indicated that the eight genes interact with each other to exert functions related to immune response and immune regulation. Conclusion The findings support that there exist common factors underlying rheumatic diseases. For rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis and osteoarthritis diseases, those common factors include TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, and PRF1. In-depth studies on these common factors may provide keys to understanding the pathogenesis and developing intervention strategies for rheumatic diseases. PMID:26352601

  10. Selection and evaluation of reference genes for expression studies with quantitative PCR in the model fungus Neurospora crassa under different environmental conditions in continuous culture.

    PubMed

    Cusick, Kathleen D; Fitzgerald, Lisa A; Pirlo, Russell K; Cockrell, Allison L; Petersen, Emily R; Biffinger, Justin C

    2014-01-01

    Neurospora crassa has served as a model organism for studying circadian pathways and more recently has gained attention in the biofuel industry due to its enhanced capacity for cellulase production. However, in order to optimize N. crassa for biotechnological applications, metabolic pathways during growth under different environmental conditions must be addressed. Reverse-transcription quantitative PCR (RT-qPCR) is a technique that provides a high-throughput platform from which to measure the expression of a large set of genes over time. The selection of a suitable reference gene is critical for gene expression studies using relative quantification, as this strategy is based on normalization of target gene expression to a reference gene whose expression is stable under the experimental conditions. This study evaluated twelve candidate reference genes for use with N. crassa when grown in continuous culture bioreactors under different light and temperature conditions. Based on combined stability values from NormFinder and Best Keeper software packages, the following are the most appropriate reference genes under conditions of: (1) light/dark cycling: btl, asl, and vma1; (2) all-dark growth: btl, tbp, vma1, and vma2; (3) temperature flux: btl, vma1, act, and asl; (4) all conditions combined: vma1, vma2, tbp, and btl. Since N. crassa exists as different cell types (uni- or multi-nucleated), expression changes in a subset of the candidate genes was further assessed using absolute quantification. A strong negative correlation was found to exist between ratio and threshold cycle (CT) values, demonstrating that CT changes serve as a reliable reflection of transcript, and not gene copy number, fluctuations. The results of this study identified genes that are appropriate for use as reference genes in RT-qPCR studies with N. crassa and demonstrated that even with the presence of different cell types, relative quantification is an acceptable method for measuring gene expression changes during growth in bioreactors.

  11. Cell cycle gene expression networks discovered using systems biology: Significance in carcinogenesis

    PubMed Central

    Scott, RE; Ghule, PN; Stein, JL; Stein, GS

    2015-01-01

    The early stages of carcinogenesis are linked to defects in the cell cycle. A series of cell cycle checkpoints are involved in this process. The G1/S checkpoint that serves to integrate the control of cell proliferation and differentiation is linked to carcinogenesis and the mitotic spindle checkpoint with the development of chromosomal instability. This paper presents the outcome of systems biology studies designed to evaluate if networks of covariate cell cycle gene transcripts exist in proliferative mammalian tissues including mice, rats and humans. The GeneNetwork website that contains numerous gene expression datasets from different species, sexes and tissues represents the foundational resource for these studies (www.genenetwork.org). In addition, WebGestalt, a gene ontology tool, facilitated the identification of expression networks of genes that co-vary with key cell cycle targets, especially Cdc20 and Plk1 (www.bioinfo.vanderbilt.edu/webgestalt). Cell cycle expression networks of such covariate mRNAs exist in multiple proliferative tissues including liver, lung, pituitary, adipose and lymphoid tissues among others but not in brain or retina that have low proliferative potential. Sixty-three covariate cell cycle gene transcripts (mRNAs) compose the average cell cycle network with p = e−13 to e−36. Cell cycle expression networks show species, sex and tissue variability and they are enriched in mRNA transcripts associated with mitosis many of which are associated with chromosomal instability. PMID:25808367

  12. Identification of Novel Tissue-Specific Genes by Analysis of Microarray Databases: A Human and Mouse Model

    PubMed Central

    Suh, Yeunsu; Davis, Michael E.; Lee, Kichoon

    2013-01-01

    Understanding the tissue-specific pattern of gene expression is critical in elucidating the molecular mechanisms of tissue development, gene function, and transcriptional regulations of biological processes. Although tissue-specific gene expression information is available in several databases, follow-up strategies to integrate and use these data are limited. The objective of the current study was to identify and evaluate novel tissue-specific genes in human and mouse tissues by performing comparative microarray database analysis and semi-quantitative PCR analysis. We developed a powerful approach to predict tissue-specific genes by analyzing existing microarray data from the NCBI′s Gene Expression Omnibus (GEO) public repository. We investigated and confirmed tissue-specific gene expression in the human and mouse kidney, liver, lung, heart, muscle, and adipose tissue. Applying our novel comparative microarray approach, we confirmed 10 kidney, 11 liver, 11 lung, 11 heart, 8 muscle, and 8 adipose specific genes. The accuracy of this approach was further verified by employing semi-quantitative PCR reaction and by searching for gene function information in existing publications. Three novel tissue-specific genes were discovered by this approach including AMDHD1 (amidohydrolase domain containing 1) in the liver, PRUNE2 (prune homolog 2) in the heart, and ACVR1C (activin A receptor, type IC) in adipose tissue. We further confirmed the tissue-specific expression of these 3 novel genes by real-time PCR. Among them, ACVR1C is adipose tissue-specific and adipocyte-specific in adipose tissue, and can be used as an adipocyte developmental marker. From GEO profiles, we predicted the processes in which AMDHD1 and PRUNE2 may participate. Our approach provides a novel way to identify new sets of tissue-specific genes and to predict functions in which they may be involved. PMID:23741331

  13. Lentivirus-mediated platelet gene therapy of murine hemophilia A with pre-existing anti-FVIII immunity

    PubMed Central

    Kuether, E. L.; Schroeder, J. A.; Fahs, S. A.; Cooley, B. C.; Chen, Y.; Montgomery, R. R.; Wilcox, D. A.; Shi, Q.

    2012-01-01

    Summary Background The development of inhibitory antibodies, referred to as inhibitors, against exogenous FVIII in a significant subset of patients with hemophilia A remains a persistent challenge to the efficacy of protein replacement therapy. Our previous studies using the transgenic approach provided proof-of-principle that platelet-specific expression could be successful for treating hemophilia A in the presence of inhibitory antibodies. Objective To investigate a clinically translatable approach for platelet gene therapy of hemophilia A with pre-existing inhibitors. Methods Platelet-FVIII expression in pre-immunized FVIIInull mice was introduced by transplantation of lentivirus-transduced bone marrow or enriched hematopoietic stem cells. FVIII expression was determined by a chromogenic assay. The transgene copy number per cell was quantitated by real time PCR. Inhibitor titer was measured by Bethesda assay. Phenotypic correction was assessed by the tail clipping assay and an electrolytic-induced venous injury model. Integration sites were analyzed by LAM-PCR. Results Therapeutic levels of platelet-FVIII expression were sustained long-term without evoking an anti-FVIII memory response in the transduced pre-immunized recipients. The tail clip survival test and the electrolytic injury model confirmed that hemostasis was improved in the treated animals. Sequential bone marrow transplants showed sustained platelet-FVIII expression resulting in phenotypic correction in pre-immunized secondary and tertiary recipients. Conclusions Lentivirus-mediated platelet-specific gene transfer improves hemostasis in hemophilic A mice with pre-existing inhibitors, indicating that this approach may be a promising strategy for gene therapy of hemophilia A even in the high-risk setting of pre-existing inhibitory antibodies. PMID:22632092

  14. Existence of a photoinducible phase for ovarian development and photoperiod-related alteration of clock gene expression in a damselfish.

    PubMed

    Takeuchi, Yuki; Hada, Noriko; Imamura, Satoshi; Hur, Sung-Pyo; Bouchekioua, Selma; Takemura, Akihiro

    2015-10-01

    The sapphire devil, Chrysiptera cyanea, is a reef-associated damselfish and their ovarian development can be induced by a long photoperiod. In this study, we demonstrated the existence of a photoinducible phase for the photoperiodic ovarian development in the sapphire devil. Induction of ovarian development under night-interruption light schedules and Nanda-Hamner cycles revealed that the photoinducible phase appeared in a circadian manner between ZT12 and ZT13. To characterize the effect of photoperiod on clock gene expression in the brain of this species, we determined the expression levels of the sdPer1, sdPer2, sdCry1, and sdCry2 clock genes under constant light and dark conditions (LL and DD) and photoperiodic (short and long photoperiods). The expression of sdPer1 exhibited clear circadian oscillation under both LL and DD conditions, while sdPer2 and sdCry1 expression levels were lower under DD than under LL conditions and sdCry2 expression was lower under LL than under DD conditions. These results suggest a key role for sdPer1 in circadian clock cycling and that sdPer2, sdCry1, and sdCry2 are light-responsive clock genes in the sapphire devil. After 1 week under a long photoperiod, we observed photoperiod-related changes in sdPer1, sdPer2, and sdCry2 expression, but not in sdCry1 expression. These results suggest that the expression patterns of some clock genes exhibit seasonal variation according to seasonal changes in day length and that such seasonal alteration of clock gene expression may contribute to seasonal recognition by the sapphire devil. Copyright © 2015 Elsevier Inc. All rights reserved.

  15. ROKU: a novel method for identification of tissue-specific genes.

    PubMed

    Kadota, Koji; Ye, Jiazhen; Nakai, Yuji; Terada, Tohru; Shimizu, Kentaro

    2006-06-12

    One of the important goals of microarray research is the identification of genes whose expression is considerably higher or lower in some tissues than in others. We would like to have ways of identifying such tissue-specific genes. We describe a method, ROKU, which selects tissue-specific patterns from gene expression data for many tissues and thousands of genes. ROKU ranks genes according to their overall tissue specificity using Shannon entropy and detects tissues specific to each gene if any exist using an outlier detection method. We evaluated the capacity for the detection of various specific expression patterns using synthetic and real data. We observed that ROKU was superior to a conventional entropy-based method in its ability to rank genes according to overall tissue specificity and to detect genes whose expression pattern are specific only to objective tissues. ROKU is useful for the detection of various tissue-specific expression patterns. The framework is also directly applicable to the selection of diagnostic markers for molecular classification of multiple classes.

  16. Sexual dimorphism in clock genes expression in human adipose tissue

    USDA-ARS?s Scientific Manuscript database

    This study was carried out to investigate whether sex-related differences exist in the adipocyte expression of clock genes from subcutaneous abdominal and visceral fat depots in severely obese patients. METHODS: We investigated 16 morbidly obese patients, eight men and eight women (mean age 45 +/- 2...

  17. Bacillus anthracis genome organization in light of whole transcriptome sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.

    2010-03-22

    Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computationalmore » predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.« less

  18. Receptor Signaling Directs Global Recruitment of Pre-existing Transcription Factors to Inducible Elements.

    PubMed

    Cockerill, Peter N

    2016-12-01

    Gene expression programs are largely regulated by the tissue-specific expression of lineage-defining transcription factors or by the inducible expression of transcription factors in response to specific stimuli. Here I will review our own work over the last 20 years to show how specific activation signals also lead to the wide-spread re-distribution of pre-existing constitutive transcription factors to sites undergoing chromatin reorganization. I will summarize studies showing that activation of kinase signaling pathways creates open chromatin regions that recruit pre-existing factors which were previously unable to bind to closed chromatin. As models I will draw upon genes activated or primed by receptor signaling in memory T cells, and genes activated by cytokine receptor mutations in acute myeloid leukemia. I also summarize a hit-and-run model of stable epigenetic reprograming in memory T cells, mediated by transient Activator Protein 1 (AP-1) binding, which enables the accelerated activation of inducible enhancers.

  19. Transcriptome analysis of the whitefly, Bemisia tabaci MEAM1 on tomato infected with the crinivirus, Tomato chlorosis virus, identifies a temporal shift in gene expression and differential regulation of novel orphan genes

    USDA-ARS?s Scientific Manuscript database

    Whiteflies threaten agricultural crop production worldwide, are polyphagous in nature, and transmit hundreds of plant viruses. Little information exists on how whitefly gene expression is altered due to feeding on plants infected with a semipersistently transmitted virus. Tomato chlorosis virus (T...

  20. A new approach to enhance the performance of decision tree for classifying gene expression data.

    PubMed

    Hassan, Md; Kotagiri, Ramamohanarao

    2013-12-20

    Gene expression data classification is a challenging task due to the large dimensionality and very small number of samples. Decision tree is one of the popular machine learning approaches to address such classification problems. However, the existing decision tree algorithms use a single gene feature at each node to split the data into its child nodes and hence might suffer from poor performance specially when classifying gene expression dataset. By using a new decision tree algorithm where, each node of the tree consists of more than one gene, we enhance the classification performance of traditional decision tree classifiers. Our method selects suitable genes that are combined using a linear function to form a derived composite feature. To determine the structure of the tree we use the area under the Receiver Operating Characteristics curve (AUC). Experimental analysis demonstrates higher classification accuracy using the new decision tree compared to the other existing decision trees in literature. We experimentally compare the effect of our scheme against other well known decision tree techniques. Experiments show that our algorithm can substantially boost the classification performance of the decision tree.

  1. Collective Dynamics of Specific Gene Ensembles Crucial for Neutrophil Differentiation: The Existence of Genome Vehicles Revealed

    PubMed Central

    Giuliani, Alessandro; Tomita, Masaru

    2010-01-01

    Cell fate decision remarkably generates specific cell differentiation path among the multiple possibilities that can arise through the complex interplay of high-dimensional genome activities. The coordinated action of thousands of genes to switch cell fate decision has indicated the existence of stable attractors guiding the process. However, origins of the intracellular mechanisms that create “cellular attractor” still remain unknown. Here, we examined the collective behavior of genome-wide expressions for neutrophil differentiation through two different stimuli, dimethyl sulfoxide (DMSO) and all-trans-retinoic acid (atRA). To overcome the difficulties of dealing with single gene expression noises, we grouped genes into ensembles and analyzed their expression dynamics in correlation space defined by Pearson correlation and mutual information. The standard deviation of correlation distributions of gene ensembles reduces when the ensemble size is increased following the inverse square root law, for both ensembles chosen randomly from whole genome and ranked according to expression variances across time. Choosing the ensemble size of 200 genes, we show the two probability distributions of correlations of randomly selected genes for atRA and DMSO responses overlapped after 48 hours, defining the neutrophil attractor. Next, tracking the ranked ensembles' trajectories, we noticed that only certain, not all, fall into the attractor in a fractal-like manner. The removal of these genome elements from the whole genomes, for both atRA and DMSO responses, destroys the attractor providing evidence for the existence of specific genome elements (named “genome vehicle”) responsible for the neutrophil attractor. Notably, within the genome vehicles, genes with low or moderate expression changes, which are often considered noisy and insignificant, are essential components for the creation of the neutrophil attractor. Further investigations along with our findings might provide a comprehensive mechanistic view of cell fate decision. PMID:20725638

  2. Direct multiplexed measurement of gene expression with color-coded probe pairs.

    PubMed

    Geiss, Gary K; Bumgarner, Roger E; Birditt, Brian; Dahl, Timothy; Dowidar, Naeem; Dunaway, Dwayne L; Fell, H Perry; Ferree, Sean; George, Renee D; Grogan, Tammy; James, Jeffrey J; Maysuria, Malini; Mitton, Jeffrey D; Oliveri, Paola; Osborn, Jennifer L; Peng, Tao; Ratcliffe, Amber L; Webster, Philippa J; Davidson, Eric H; Hood, Leroy; Dimitrov, Krassen

    2008-03-01

    We describe a technology, the NanoString nCounter gene expression system, which captures and counts individual mRNA transcripts. Advantages over existing platforms include direct measurement of mRNA expression levels without enzymatic reactions or bias, sensitivity coupled with high multiplex capability, and digital readout. Experiments performed on 509 human genes yielded a replicate correlation coefficient of 0.999, a detection limit between 0.1 fM and 0.5 fM, and a linear dynamic range of over 500-fold. Comparison of the NanoString nCounter gene expression system with microarrays and TaqMan PCR demonstrated that the nCounter system is more sensitive than microarrays and similar in sensitivity to real-time PCR. Finally, a comparison of transcript levels for 21 genes across seven samples measured by the nCounter system and SYBR Green real-time PCR demonstrated similar patterns of gene expression at all transcript levels.

  3. Stochastic gene expression in Arabidopsis thaliana.

    PubMed

    Araújo, Ilka Schultheiß; Pietsch, Jessica Magdalena; Keizer, Emma Mathilde; Greese, Bettina; Balkunde, Rachappa; Fleck, Christian; Hülskamp, Martin

    2017-12-14

    Although plant development is highly reproducible, some stochasticity exists. This developmental stochasticity may be caused by noisy gene expression. Here we analyze the fluctuation of protein expression in Arabidopsis thaliana. Using the photoconvertible KikGR marker, we show that the protein expressions of individual cells fluctuate over time. A dual reporter system was used to study extrinsic and intrinsic noise of marker gene expression. We report that extrinsic noise is higher than intrinsic noise and that extrinsic noise in stomata is clearly lower in comparison to several other tissues/cell types. Finally, we show that cells are coupled with respect to stochastic protein expression in young leaves, hypocotyls and roots but not in mature leaves. Our data indicate that stochasticity of gene expression can vary between tissues/cell types and that it can be coupled in a non-cell-autonomous manner.

  4. TEMPORAL GENE INDUCTION PATTERNS IN SHEEPSHEAD MINNOWS EXPOSED TO 17-ESTRADIOL

    EPA Science Inventory

    Gene arrays provide a powerful method to examine changes in gene expression in fish due to chemical exposures in the environment. In this study, we expanded an existing gene array for sheepshead minnows (Cyprinodon variegatus) (SHM) and used it to examine temporal changes in gene...

  5. eQTL Mapping Using RNA-seq Data

    PubMed Central

    Hu, Yijuan

    2012-01-01

    As RNA-seq is replacing gene expression microarrays to assess genome-wide transcription abundance, gene expression Quantitative Trait Locus (eQTL) studies using RNA-seq have emerged. RNA-seq delivers two novel features that are important for eQTL studies. First, it provides information on allele-specific expression (ASE), which is not available from gene expression microarrays. Second, it generates unprecedentedly rich data to study RNA-isoform expression. In this paper, we review current methods for eQTL mapping using ASE and discuss some future directions. We also review existing works that use RNA-seq data to study RNA-isoform expression and we discuss the gaps between these works and isoform-specific eQTL mapping. PMID:23667399

  6. The G-quadruplex augments translation in the 5' untranslated region of transforming growth factor β2.

    PubMed

    Agarwala, Prachi; Pandey, Satyaprakash; Mapa, Koyeli; Maiti, Souvik

    2013-03-05

    Transforming growth factor β2 (TGFβ2) is a versatile cytokine with a prominent role in cell migration, invasion, cellular development, and immunomodulation. TGFβ2 promotes the malignancy of tumors by inducing epithelial-mesenchymal transition, angiogenesis, and immunosuppression. As it is well-documented that nucleic acid secondary structure can regulate gene expression, we assessed whether any secondary motif regulates its expression at the post-transcriptional level. Bioinformatics analysis predicts an existence of a 23-nucleotide putative G-quadruplex sequence (PG4) in the 5' untranslated region (UTR) of TGFβ2 mRNA. The ability of this stretch of sequence to form a highly stable, intramolecular parallel quadruplex was demonstrated using ultraviolet and circular dichroism spectroscopy. Footprinting studies further validated its existence in the presence of a neighboring nucleotide sequence. Following structural characterization, we evaluated the biological relevance of this secondary motif using a dual luciferase assay. Although PG4 inhibits the expression of the reporter gene, its presence in the context of the entire 5' UTR sequence interestingly enhances gene expression. Mutation or removal of the G-quadruplex sequence from the 5' UTR of the gene diminished the level of expression of this gene at the translational level. Thus, here we highlight an activating role of the G-quadruplex in modulating gene expression of TGFβ2 at the translational level and its potential to be used as a target for the development of therapeutics against cancer.

  7. A Self-Directed Method for Cell-Type Identification and Separation of Gene Expression Microarrays

    PubMed Central

    Zuckerman, Neta S.; Noam, Yair; Goldsmith, Andrea J.; Lee, Peter P.

    2013-01-01

    Gene expression analysis is generally performed on heterogeneous tissue samples consisting of multiple cell types. Current methods developed to separate heterogeneous gene expression rely on prior knowledge of the cell-type composition and/or signatures - these are not available in most public datasets. We present a novel method to identify the cell-type composition, signatures and proportions per sample without need for a-priori information. The method was successfully tested on controlled and semi-controlled datasets and performed as accurately as current methods that do require additional information. As such, this method enables the analysis of cell-type specific gene expression using existing large pools of publically available microarray datasets. PMID:23990767

  8. ExAtlas: An interactive online tool for meta-analysis of gene expression data.

    PubMed

    Sharov, Alexei A; Schlessinger, David; Ko, Minoru S H

    2015-12-01

    We have developed ExAtlas, an on-line software tool for meta-analysis and visualization of gene expression data. In contrast to existing software tools, ExAtlas compares multi-component data sets and generates results for all combinations (e.g. all gene expression profiles versus all Gene Ontology annotations). ExAtlas handles both users' own data and data extracted semi-automatically from the public repository (GEO/NCBI database). ExAtlas provides a variety of tools for meta-analyses: (1) standard meta-analysis (fixed effects, random effects, z-score, and Fisher's methods); (2) analyses of global correlations between gene expression data sets; (3) gene set enrichment; (4) gene set overlap; (5) gene association by expression profile; (6) gene specificity; and (7) statistical analysis (ANOVA, pairwise comparison, and PCA). ExAtlas produces graphical outputs, including heatmaps, scatter-plots, bar-charts, and three-dimensional images. Some of the most widely used public data sets (e.g. GNF/BioGPS, Gene Ontology, KEGG, GAD phenotypes, BrainScan, ENCODE ChIP-seq, and protein-protein interaction) are pre-loaded and can be used for functional annotations.

  9. Macronutrients and the FTO gene expression in hypothalamus; a systematic review of experimental studies.

    PubMed

    Doaei, Saeid; Kalantari, Naser; Mohammadi, Nastaran Keshavarz; Tabesh, Ghasem Azizi; Gholamalizadeh, Maryam

    The various studies have examined the relationship between FTO gene expression and macronutrients levels. In order to obtain better viewpoint from this interactions, all of existing studies were reviewed systematically. All published papers have been obtained and reviewed using standard and sensitive keywords from databases such as CINAHL, Embase, PubMed, PsycInfo, and the Cochrane, from 1990 to 2016. The results indicated that all of 6 studies that met the inclusion criteria (from a total of 428 published article) found FTO gene expression changes at short-term follow-ups. Four of six studies found an increased FTO gene expression after calorie restriction, while two of them indicated decreased FTO gene expression. The effect of protein, carbohydrate and fat were separately assessed and suggested by all of six studies. In Conclusion, The level of FTO gene expression in hypothalamus is related to macronutrients levels. Future research should evaluate the long-term impact of dietary interventions. Copyright © 2017. Published by Elsevier B.V.

  10. A Stable Thoracic Hox Code and Epimorphosis Characterize Posterior Regeneration in Capitella teleta

    PubMed Central

    de Jong, Danielle M.; Seaver, Elaine C.

    2016-01-01

    Regeneration, the ability to replace lost tissues and body parts following traumatic injury, occurs widely throughout the animal tree of life. Regeneration occurs either by remodeling of pre-existing tissues, through addition of new cells by cell division, or a combination of both. We describe a staging system for posterior regeneration in the annelid, Capitella teleta, and use the C. teleta Hox gene code as markers of regional identity for regenerating tissue along the anterior-posterior axis. Following amputation of different posterior regions of the animal, a blastema forms and by two days, proliferating cells are detected by EdU incorporation, demonstrating that epimorphosis occurs during posterior regeneration of C. teleta. Neurites rapidly extend into the blastema, and gradually become organized into discrete nerves before new ganglia appear approximately seven days after amputation. In situ hybridization shows that seven of the ten Hox genes examined are expressed in the blastema, suggesting roles in patterning the newly forming tissue, although neither spatial nor temporal co-linearity was detected. We hypothesized that following amputation, Hox gene expression in pre-existing segments would be re-organized to scale, and the remaining fragment would express the complete suite of Hox genes. Surprisingly, most Hox genes display stable expression patterns in the ganglia of pre-existing tissue following amputation at multiple axial positions, indicating general stability of segmental identity. However, the three Hox genes, CapI-lox4, CapI-lox2 and CapI-Post2, each shift its anterior expression boundary by one segment, and each shift includes a subset of cells in the ganglia. This expression shift depends upon the axial position of the amputation. In C. teleta, thoracic segments exhibit stable positional identity with limited morphallaxis, in contrast with the extensive body remodeling that occurs during regeneration of some other annelids, planarians and acoel flatworms. PMID:26894631

  11. yStreX: yeast stress expression database

    PubMed Central

    Wanichthanarak, Kwanjeera; Nookaew, Intawat; Petranovic, Dina

    2014-01-01

    Over the past decade genome-wide expression analyses have been often used to study how expression of genes changes in response to various environmental stresses. Many of these studies (such as effects of oxygen concentration, temperature stress, low pH stress, osmotic stress, depletion or limitation of nutrients, addition of different chemical compounds, etc.) have been conducted in the unicellular Eukaryal model, yeast Saccharomyces cerevisiae. However, the lack of a unifying or integrated, bioinformatics platform that would permit efficient and rapid use of all these existing data remain an important issue. To facilitate research by exploiting existing transcription data in the field of yeast physiology, we have developed the yStreX database. It is an online repository of analyzed gene expression data from curated data sets from different studies that capture genome-wide transcriptional changes in response to diverse environmental transitions. The first aim of this online database is to facilitate comparison of cross-platform and cross-laboratory gene expression data. Additionally, we performed different expression analyses, meta-analyses and gene set enrichment analyses; and the results are also deposited in this database. Lastly, we constructed a user-friendly Web interface with interactive visualization to provide intuitive access and to display the queried data for users with no background in bioinformatics. Database URL: http://www.ystrexdb.com PMID:25024351

  12. A Novel Paramyxovirus?

    PubMed Central

    García-Sastre, Adolfo; Palese, Peter

    2005-01-01

    In public databases, we identified sequences reported as human genes expressed in kidney mesangial cells. The similarity of these genes to paramyxovirus matrix, fusion, and phosphoprotein genes suggests that they are derived from a novel paramyxovirus. These genes are sufficiently unique to suggest the existence of a novel paramyxovirus genus. PMID:15705331

  13. Evolution and cell-type specificity of human-specific genes preferentially expressed in progenitors of fetal neocortex.

    PubMed

    Florio, Marta; Heide, Michael; Pinson, Anneline; Brandl, Holger; Albert, Mareike; Winkler, Sylke; Wimberger, Pauline; Huttner, Wieland B; Hiller, Michael

    2018-03-21

    Understanding the molecular basis that underlies the expansion of the neocortex during primate, and notably human, evolution requires the identification of genes that are particularly active in the neural stem and progenitor cells of the developing neocortex. Here, we have used existing transcriptome datasets to carry out a comprehensive screen for protein-coding genes preferentially expressed in progenitors of fetal human neocortex. We show that 15 human-specific genes exhibit such expression, and many of them evolved distinct neural progenitor cell-type expression profiles and levels compared to their ancestral paralogs. Functional studies on one such gene, NOTCH2NL , demonstrate its ability to promote basal progenitor proliferation in mice. An additional 35 human genes with progenitor-enriched expression are shown to have orthologs only in primates. Our study provides a resource of genes that are promising candidates to exert specific, and novel, roles in neocortical development during primate, and notably human, evolution. © 2018, Florio et al.

  14. Evolution and cell-type specificity of human-specific genes preferentially expressed in progenitors of fetal neocortex

    PubMed Central

    Pinson, Anneline; Brandl, Holger; Albert, Mareike; Winkler, Sylke; Wimberger, Pauline

    2018-01-01

    Understanding the molecular basis that underlies the expansion of the neocortex during primate, and notably human, evolution requires the identification of genes that are particularly active in the neural stem and progenitor cells of the developing neocortex. Here, we have used existing transcriptome datasets to carry out a comprehensive screen for protein-coding genes preferentially expressed in progenitors of fetal human neocortex. We show that 15 human-specific genes exhibit such expression, and many of them evolved distinct neural progenitor cell-type expression profiles and levels compared to their ancestral paralogs. Functional studies on one such gene, NOTCH2NL, demonstrate its ability to promote basal progenitor proliferation in mice. An additional 35 human genes with progenitor-enriched expression are shown to have orthologs only in primates. Our study provides a resource of genes that are promising candidates to exert specific, and novel, roles in neocortical development during primate, and notably human, evolution. PMID:29561261

  15. HOX genes in human lung: altered expression in primary pulmonary hypertension and emphysema.

    PubMed

    Golpon, H A; Geraci, M W; Moore, M D; Miller, H L; Miller, G J; Tuder, R M; Voelkel, N F

    2001-03-01

    HOX genes belong to the large family of homeodomain genes that function as transcription factors. Animal studies indicate that they play an essential role in lung development. We investigated the expression pattern of HOX genes in human lung tissue by using microarray and degenerate reverse transcriptase-polymerase chain reaction survey techniques. HOX genes predominantly from the 3' end of clusters A and B were expressed in normal human adult lung and among them HOXA5 was the most abundant, followed by HOXB2 and HOXB6. In fetal (12 weeks old) and diseased lung specimens (emphysema, primary pulmonary hypertension) additional HOX genes from clusters C and D were expressed. Using in situ hybridization, transcripts for HOXA5 were predominantly found in alveolar septal and epithelial cells, both in normal and diseased lungs. A 2.5-fold increase in HOXA5 mRNA expression was demonstrated by quantitative reverse transcriptase-polymerase chain reaction in primary pulmonary hypertension lung specimens when compared to normal lung tissue. In conclusion, we demonstrate that HOX genes are selectively expressed in the human lung. Differences in the pattern of HOX gene expression exist among fetal, adult, and diseased lung specimens. The altered pattern of HOX gene expression may contribute to the development of pulmonary diseases.

  16. Tightly Regulated Expression of Autographa californica Multicapsid Nucleopolyhedrovirus Immediate Early Genes Emerges from Their Interactions and Possible Collective Behaviors

    PubMed Central

    Taka, Hitomi; Asano, Shin-ichiro; Matsuura, Yoshiharu; Bando, Hisanori

    2015-01-01

    To infect their hosts, DNA viruses must successfully initiate the expression of viral genes that control subsequent viral gene expression and manipulate the host environment. Viral genes that are immediately expressed upon infection play critical roles in the early infection process. In this study, we investigated the expression and regulation of five canonical regulatory immediate-early (IE) genes of Autographa californica multicapsid nucleopolyhedrovirus: ie0, ie1, ie2, me53, and pe38. A systematic transient gene-expression analysis revealed that these IE genes are generally transactivators, suggesting the existence of a highly interactive regulatory network. A genetic analysis using gene knockout viruses demonstrated that the expression of these IE genes was tolerant to the single deletions of activator IE genes in the early stage of infection. A network graph analysis on the regulatory relationships observed in the transient expression analysis suggested that the robustness of IE gene expression is due to the organization of the IE gene regulatory network and how each IE gene is activated. However, some regulatory relationships detected by the genetic analysis were contradictory to those observed in the transient expression analysis, especially for IE0-mediated regulation. Statistical modeling, combined with genetic analysis using knockout alleles for ie0 and ie1, showed that the repressor function of ie0 was due to the interaction between ie0 and ie1, not ie0 itself. Taken together, these systematic approaches provided insight into the topology and nature of the IE gene regulatory network. PMID:25816136

  17. Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

    PubMed

    Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

    2014-01-01

    Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.

  18. Accelerated recruitment of new brain development genes into the human genome.

    PubMed

    Zhang, Yong E; Landback, Patrick; Vibranovski, Maria D; Long, Manyuan

    2011-10-01

    How the human brain evolved has attracted tremendous interests for decades. Motivated by case studies of primate-specific genes implicated in brain function, we examined whether or not the young genes, those emerging genome-wide in the lineages specific to the primates or rodents, showed distinct spatial and temporal patterns of transcription compared to old genes, which had existed before primate and rodent split. We found consistent patterns across different sources of expression data: there is a significantly larger proportion of young genes expressed in the fetal or infant brain of humans than in mouse, and more young genes in humans have expression biased toward early developing brains than old genes. Most of these young genes are expressed in the evolutionarily newest part of human brain, the neocortex. Remarkably, we also identified a number of human-specific genes which are expressed in the prefrontal cortex, which is implicated in complex cognitive behaviors. The young genes upregulated in the early developing human brain play diverse functional roles, with a significant enrichment of transcription factors. Genes originating from different mechanisms show a similar expression bias in the developing brain. Moreover, we found that the young genes upregulated in early brain development showed rapid protein evolution compared to old genes also expressed in the fetal brain. Strikingly, genes expressed in the neocortex arose soon after its morphological origin. These four lines of evidence suggest that positive selection for brain function may have contributed to the origination of young genes expressed in the developing brain. These data demonstrate a striking recruitment of new genes into the early development of the human brain.

  19. Effect of various classes of pesticides on expression of stress genes in transgenic C. elegans model of Parkinson's disease.

    PubMed

    Jadiya, Pooja; Mir, Snober S; Nazir, Aamir

    2012-12-01

    Neurodegenerative diseases are known to be associated with genetic and environmental factors. The multifactorial Parkinson's disease (PD) is triggered and/or further worsened by exposure to certain pesticides. Existing literature suggests a link between pesticide exposure and increased incidence of PD. We carried out the present study to look into the stress gene expression pattern of transgenic Caenorhabditis elegans (C. elegans) model of PD after exposure to pesticides from different classes. Expression level of sod-1, sod-2, sod-3, hsp-70, hsp-60, and hsp-16.2 stress responsive genes was determined using qPCR. Our findings demonstrate that the expression of stress related genes does not follow a generalized pattern to different toxicants; rather each pesticide class has a specific expression signature.

  20. ROKU: a novel method for identification of tissue-specific genes

    PubMed Central

    Kadota, Koji; Ye, Jiazhen; Nakai, Yuji; Terada, Tohru; Shimizu, Kentaro

    2006-01-01

    Background One of the important goals of microarray research is the identification of genes whose expression is considerably higher or lower in some tissues than in others. We would like to have ways of identifying such tissue-specific genes. Results We describe a method, ROKU, which selects tissue-specific patterns from gene expression data for many tissues and thousands of genes. ROKU ranks genes according to their overall tissue specificity using Shannon entropy and detects tissues specific to each gene if any exist using an outlier detection method. We evaluated the capacity for the detection of various specific expression patterns using synthetic and real data. We observed that ROKU was superior to a conventional entropy-based method in its ability to rank genes according to overall tissue specificity and to detect genes whose expression pattern are specific only to objective tissues. Conclusion ROKU is useful for the detection of various tissue-specific expression patterns. The framework is also directly applicable to the selection of diagnostic markers for molecular classification of multiple classes. PMID:16764735

  1. Discrete domains of gene expression in germinal layers distinguish the development of gyrencephaly

    PubMed Central

    de Juan Romero, Camino; Bruder, Carl; Tomasello, Ugo; Sanz-Anquela, José Miguel; Borrell, Víctor

    2015-01-01

    Gyrencephalic species develop folds in the cerebral cortex in a stereotypic manner, but the genetic mechanisms underlying this patterning process are unknown. We present a large-scale transcriptomic analysis of individual germinal layers in the developing cortex of the gyrencephalic ferret, comparing between regions prospective of fold and fissure. We find unique transcriptional signatures in each germinal compartment, where thousands of genes are differentially expressed between regions, including ∼80% of genes mutated in human cortical malformations. These regional differences emerge from the existence of discrete domains of gene expression, which occur at multiple locations across the developing cortex of ferret and human, but not the lissencephalic mouse. Complex expression patterns emerge late during development and map the eventual location of folds or fissures. Protomaps of gene expression within germinal layers may contribute to define cortical folds or functional areas, but our findings demonstrate that they distinguish the development of gyrencephalic cortices. PMID:25916825

  2. Overcoming confounded controls in the analysis of gene expression data from microarray experiments.

    PubMed

    Bhattacharya, Soumyaroop; Long, Dang; Lyons-Weiler, James

    2003-01-01

    A potential limitation of data from microarray experiments exists when improper control samples are used. In cancer research, comparisons of tumour expression profiles to those from normal samples is challenging due to tissue heterogeneity (mixed cell populations). A specific example exists in a published colon cancer dataset, in which tissue heterogeneity was reported among the normal samples. In this paper, we show how to overcome or avoid the problem of using normal samples that do not derive from the same tissue of origin as the tumour. We advocate an exploratory unsupervised bootstrap analysis that can reveal unexpected and undesired, but strongly supported, clusters of samples that reflect tissue differences instead of tumour versus normal differences. All of the algorithms used in the analysis, including the maximum difference subset algorithm, unsupervised bootstrap analysis, pooled variance t-test for finding differentially expressed genes and the jackknife to reduce false positives, are incorporated into our online Gene Expression Data Analyzer ( http:// bioinformatics.upmc.edu/GE2/GEDA.html ).

  3. Monoallelic Gene Expression in Mammals.

    PubMed

    Chess, Andrew

    2016-11-23

    Monoallelic expression not due to cis-regulatory sequence polymorphism poses an intriguing problem in epigenetics because it requires the unequal treatment of two segments of DNA that are present in the same nucleus and that can indeed have absolutely identical sequences. Here, I focus on a few recent developments in the field of monoallelic expression that are of particular interest and raise interesting questions for future work. One development is regarding analyses of imprinted genes, in which recent work suggests the possibility that intriguing networks of imprinted genes exist and are important for genetic and physiological studies. Another issue that has been raised in recent years by a number of publications is the question of how skewed allelic expression should be for it to be designated as monoallelic expression and, further, what methods are appropriate or inappropriate for analyzing genomic data to examine allele-specific expression. Perhaps the most exciting recent development in mammalian monoallelic expression is a clever and carefully executed analysis of genetic diversity of autosomal genes subject to random monoallelic expression (RMAE), which provides compelling evidence for distinct evolutionary forces acting on random monoallelically expressed genes.

  4. Time-series RNA-seq analysis package (TRAP) and its application to the analysis of rice, Oryza sativa L. ssp. Japonica, upon drought stress.

    PubMed

    Jo, Kyuri; Kwon, Hawk-Bin; Kim, Sun

    2014-06-01

    Measuring expression levels of genes at the whole genome level can be useful for many purposes, especially for revealing biological pathways underlying specific phenotype conditions. When gene expression is measured over a time period, we have opportunities to understand how organisms react to stress conditions over time. Thus many biologists routinely measure whole genome level gene expressions at multiple time points. However, there are several technical difficulties for analyzing such whole genome expression data. In addition, these days gene expression data is often measured by using RNA-sequencing rather than microarray technologies and then analysis of expression data is much more complicated since the analysis process should start with mapping short reads and produce differentially activated pathways and also possibly interactions among pathways. In addition, many useful tools for analyzing microarray gene expression data are not applicable for the RNA-seq data. Thus a comprehensive package for analyzing time series transcriptome data is much needed. In this article, we present a comprehensive package, Time-series RNA-seq Analysis Package (TRAP), integrating all necessary tasks such as mapping short reads, measuring gene expression levels, finding differentially expressed genes (DEGs), clustering and pathway analysis for time-series data in a single environment. In addition to implementing useful algorithms that are not available for RNA-seq data, we extended existing pathway analysis methods, ORA and SPIA, for time series analysis and estimates statistical values for combined dataset by an advanced metric. TRAP also produces visual summary of pathway interactions. Gene expression change labeling, a practical clustering method used in TRAP, enables more accurate interpretation of the data when combined with pathway analysis. We applied our methods on a real dataset for the analysis of rice (Oryza sativa L. Japonica nipponbare) upon drought stress. The result showed that TRAP was able to detect pathways more accurately than several existing methods. TRAP is available at http://biohealth.snu.ac.kr/software/TRAP/. Copyright © 2014 Elsevier Inc. All rights reserved.

  5. Transcriptomic Analysis and Meta-Analysis of Human Granulosa and Cumulus Cells

    PubMed Central

    Burnik Papler, Tanja; Vrtacnik Bokal, Eda; Maver, Ales; Kopitar, Andreja Natasa; Lovrečić, Luca

    2015-01-01

    Specific gene expression in oocytes and its surrounding cumulus (CC) and granulosa (GC) cells is needed for successful folliculogenesis and oocyte maturation. The aim of the present study was to compare genome-wide gene expression and biological functions of human GC and CC. Individual GC and CC were derived from 37 women undergoing IVF procedures. Gene expression analysis was performed using microarrays, followed by a meta-analysis. Results were validated using quantitative real-time PCR. There were 6029 differentially expressed genes (q < 10−4); of which 650 genes had a log2 FC ≥ 2. After the meta-analysis there were 3156 genes differentially expressed. Among these there were genes that have previously not been reported in human somatic follicular cells, like prokineticin 2 (PROK2), higher expressed in GC, and pregnancy up-regulated nonubiquitous CaM kinase (PNCK), higher expressed in CC. Pathways like inflammatory response and angiogenesis were enriched in GC, whereas in CC, cell differentiation and multicellular organismal development were among enriched pathways. In conclusion, transcriptomes of GC and CC as well as biological functions, are distinctive for each cell subpopulation. By describing novel genes like PROK2 and PNCK, expressed in GC and CC, we upgraded the existing data on human follicular biology. PMID:26313571

  6. HOX Genes in Human Lung

    PubMed Central

    Golpon, Heiko A.; Geraci, Mark W.; Moore, Mark D.; Miller, Heidi L.; Miller, Gary J.; Tuder, Rubin M.; Voelkel, Norbert F.

    2001-01-01

    HOX genes belong to the large family of homeodomain genes that function as transcription factors. Animal studies indicate that they play an essential role in lung development. We investigated the expression pattern of HOX genes in human lung tissue by using microarray and degenerate reverse transcriptase-polymerase chain reaction survey techniques. HOX genes predominantly from the 3′ end of clusters A and B were expressed in normal human adult lung and among them HOXA5 was the most abundant, followed by HOXB2 and HOXB6. In fetal (12 weeks old) and diseased lung specimens (emphysema, primary pulmonary hypertension) additional HOX genes from clusters C and D were expressed. Using in situ hybridization, transcripts for HOXA5 were predominantly found in alveolar septal and epithelial cells, both in normal and diseased lungs. A 2.5-fold increase in HOXA5 mRNA expression was demonstrated by quantitative reverse transcriptase-polymerase chain reaction in primary pulmonary hypertension lung specimens when compared to normal lung tissue. In conclusion, we demonstrate that HOX genes are selectively expressed in the human lung. Differences in the pattern of HOX gene expression exist among fetal, adult, and diseased lung specimens. The altered pattern of HOX gene expression may contribute to the development of pulmonary diseases. PMID:11238043

  7. Validation of the β-amy1 transcription profiling assay and selection of reference genes suited for a RT-qPCR assay in developing barley caryopsis.

    PubMed

    Ovesná, Jaroslava; Kučera, Ladislav; Vaculová, Kateřina; Štrymplová, Kamila; Svobodová, Ilona; Milella, Luigi

    2012-01-01

    Reverse transcription coupled with real-time quantitative PCR (RT-qPCR) is a frequently used method for gene expression profiling. Reference genes (RGs) are commonly employed to normalize gene expression data. A limited information exist on the gene expression and profiling in developing barley caryopsis. Expression stability was assessed by measuring the cycle threshold (Ct) range and applying both the GeNorm (pair-wise comparison of geometric means) and Normfinder (model-based approach) principles for the calculation. Here, we have identified a set of four RGs suitable for studying gene expression in the developing barley caryopsis. These encode the proteins GAPDH, HSP90, HSP70 and ubiquitin. We found a correlation between the frequency of occurrence of a transcript in silico and its suitability as an RG. This set of RGs was tested by comparing the normalized level of β-amylase (β-amy1) transcript with directly measured quantities of the BMY1 gene product in the developing barley caryopsis. This panel of genes could be used for other gene expression studies, as well as to optimize β-amy1 analysis for study of the impact of β-amy1 expression upon barley end-use quality.

  8. An Integrated Approach for RNA-seq Data Normalization.

    PubMed

    Yang, Shengping; Mercante, Donald E; Zhang, Kun; Fang, Zhide

    2016-01-01

    DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis of gene expression generated by RNA-seq technology. Successful normalization reduces/removes unwanted nonbiological variations in the data, while keeping meaningful information intact. However, as far as we know, no attempt has been made to adjust for the variation due to DNA copy number changes in RNA-seq data normalization. In this article, we propose an integrated approach for RNA-seq data normalization. Comparisons show that the proposed normalization can improve power for downstream differentially expressed gene detection and generate more biologically meaningful results in gene profiling. In addition, our findings show that due to the effects of copy number changes, some housekeeping genes are not always suitable internal controls for studying gene expression. Using information from DNA copy number, integrated approach is successful in reducing noises due to both biological and nonbiological causes in RNA-seq data, thus increasing the accuracy of gene profiling.

  9. PhotoMorphs™: A Novel Light-Activated Reagent for Controlling Gene Expression in Zebrafish

    PubMed Central

    Tomasini, Amber J.; Schuler, Aaron D.; Zebala, John A.; Mayer, Alan N.

    2009-01-01

    Manipulating gene expression in zebrafish is critical for exploiting the full potential of this vertebrate model organism. Morpholino oligos are the most commonly employed antisense technology for knocking down gene expression. However, morpholinos suffer from a lack of control over the timing and location of knockdown. In this report, we describe a novel light-activatable knockdown reagent called PhotoMorph™. PhotoMorphs can be generated from existing morpholinos by hybridization with a complementary caging strand containing a photocleavable linkage. The caging strand neutralizes the morpholino activity until irradiation of the PhotoMorph with UV light releases the morpholino. We generated PhotoMorphs to target genes encoding enhanced green fluorescent protein (EGFP), No tail, and E-cadherin to illustrate the utility of this approach. Temporal control of gene expression with PhotoMorphs permitted us to circumvent the early lethal phenotype of E-cadherin knockdown. A splice-blocking PhotoMorph directed to the rheb gene showed light-dependent gene knockdown up to 72 hpf. PhotoMorphs thus offer a new class of laboratory reagents suitable for the spatiotemporal control of gene expression in the zebrafish. PMID:19644983

  10. Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering

    PubMed Central

    Sun, Peng; Speicher, Nora K.; Röttger, Richard; Guo, Jiong; Baumbach, Jan

    2014-01-01

    Abstract The explosion of the biological data has dramatically reformed today's biological research. The need to integrate and analyze high-dimensional biological data on a large scale is driving the development of novel bioinformatics approaches. Biclustering, also known as ‘simultaneous clustering’ or ‘co-clustering’, has been successfully utilized to discover local patterns in gene expression data and similar biomedical data types. Here, we contribute a new heuristic: ‘Bi-Force’. It is based on the weighted bicluster editing model, to perform biclustering on arbitrary sets of biological entities, given any kind of pairwise similarities. We first evaluated the power of Bi-Force to solve dedicated bicluster editing problems by comparing Bi-Force with two existing algorithms in the BiCluE software package. We then followed a biclustering evaluation protocol in a recent review paper from Eren et al. (2013) (A comparative analysis of biclustering algorithms for gene expressiondata. Brief. Bioinform., 14:279–292.) and compared Bi-Force against eight existing tools: FABIA, QUBIC, Cheng and Church, Plaid, BiMax, Spectral, xMOTIFs and ISA. To this end, a suite of synthetic datasets as well as nine large gene expression datasets from Gene Expression Omnibus were analyzed. All resulting biclusters were subsequently investigated by Gene Ontology enrichment analysis to evaluate their biological relevance. The distinct theoretical foundation of Bi-Force (bicluster editing) is more powerful than strict biclustering. We thus outperformed existing tools with Bi-Force at least when following the evaluation protocols from Eren et al. Bi-Force is implemented in Java and integrated into the open source software package of BiCluE. The software as well as all used datasets are publicly available at http://biclue.mpi-inf.mpg.de. PMID:24682815

  11. Isolation of Novel CreERT2-Driver Lines in Zebrafish Using an Unbiased Gene Trap Approach

    PubMed Central

    Jungke, Peggy; Hammer, Juliane; Hans, Stefan; Brand, Michael

    2015-01-01

    Gene manipulation using the Cre/loxP-recombinase system has been successfully employed in zebrafish to study gene functions and lineage relationships. Recently, gene trapping approaches have been applied to produce large collections of transgenic fish expressing conditional alleles in various tissues. However, the limited number of available cell- and tissue-specific Cre/CreERT2-driver lines still constrains widespread application in this model organism. To enlarge the pool of existing CreERT2-driver lines, we performed a genome-wide gene trap screen using a Tol2-based mCherry-T2a-CreERT2 (mCT2aC) gene trap vector. This cassette consists of a splice acceptor and a mCherry-tagged variant of CreERT2 which enables simultaneous labeling of the trapping event, as well as CreERT2 expression from the endogenous promoter. Using this strategy, we generated 27 novel functional CreERT2-driver lines expressing in a cell- and tissue-specific manner during development and adulthood. This study summarizes the analysis of the generated CreERT2-driver lines with respect to functionality, expression, integration, as well as associated phenotypes. Our results significantly enlarge the existing pool of CreERT2-driver lines in zebrafish and combined with Cre–dependent effector lines, the new CreERT2-driver lines will be important tools to manipulate the zebrafish genome. PMID:26083735

  12. Diametrical clustering for identifying anti-correlated gene clusters.

    PubMed

    Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

    2003-09-01

    Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i). re-partitioning the genes and (ii). computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.

  13. Potential miRNA regulators of differential HPG axis gene expression between low egg producing and high egg producing turkey hens

    USDA-ARS?s Scientific Manuscript database

    Expression differences exist in key genes of the hypothalamo-pituitary-gonadal (HPG) axis in low egg producing hens (LEPH) and high egg producing hens (HEPH); however, regulation of these differences is unknown. MicroRNAs (miRNAs) are small non-coding RNAs that play a role in post-transcriptional re...

  14. Reference genes for gene expression studies in wheat flag leaves grown under different farming conditions

    PubMed Central

    2011-01-01

    Background Internal control genes with highly uniform expression throughout the experimental conditions are required for accurate gene expression analysis as no universal reference genes exists. In this study, the expression stability of 24 candidate genes from Triticum aestivum cv. Cubus flag leaves grown under organic and conventional farming systems was evaluated in two locations in order to select suitable genes that can be used for normalization of real-time quantitative reverse-transcription PCR (RT-qPCR) reactions. The genes were selected among the most common used reference genes as well as genes encoding proteins involved in several metabolic pathways. Findings Individual genes displayed different expression rates across all samples assayed. Applying geNorm, a set of three potential reference genes were suitable for normalization of RT-qPCR reactions in winter wheat flag leaves cv. Cubus: TaFNRII (ferredoxin-NADP(H) oxidoreductase; AJ457980.1), ACT2 (actin 2; TC234027), and rrn26 (a putative homologue to RNA 26S gene; AL827977.1). In addition of these three genes that were also top-ranked by NormFinder, two extra genes: CYP18-2 (Cyclophilin A, AY456122.1) and TaWIN1 (14-3-3 like protein, AB042193) were most consistently stably expressed. Furthermore, we showed that TaFNRII, ACT2, and CYP18-2 are suitable for gene expression normalization in other two winter wheat varieties (Tommi and Centenaire) grown under three treatments (organic, conventional and no nitrogen) and a different environment than the one tested with cv. Cubus. Conclusions This study provides a new set of reference genes which should improve the accuracy of gene expression analyses when using wheat flag leaves as those related to the improvement of nitrogen use efficiency for cereal production. PMID:21951810

  15. Unlocking the potential of publicly available microarray data using inSilicoDb and inSilicoMerging R/Bioconductor packages.

    PubMed

    Taminau, Jonatan; Meganck, Stijn; Lazar, Cosmin; Steenhoff, David; Coletta, Alain; Molter, Colin; Duque, Robin; de Schaetzen, Virginie; Weiss Solís, David Y; Bersini, Hugues; Nowé, Ann

    2012-12-24

    With an abundant amount of microarray gene expression data sets available through public repositories, new possibilities lie in combining multiple existing data sets. In this new context, analysis itself is no longer the problem, but retrieving and consistently integrating all this data before delivering it to the wide variety of existing analysis tools becomes the new bottleneck. We present the newly released inSilicoMerging R/Bioconductor package which, together with the earlier released inSilicoDb R/Bioconductor package, allows consistent retrieval, integration and analysis of publicly available microarray gene expression data sets. Inside the inSilicoMerging package a set of five visual and six quantitative validation measures are available as well. By providing (i) access to uniformly curated and preprocessed data, (ii) a collection of techniques to remove the batch effects between data sets from different sources, and (iii) several validation tools enabling the inspection of the integration process, these packages enable researchers to fully explore the potential of combining gene expression data for downstream analysis. The power of using both packages is demonstrated by programmatically retrieving and integrating gene expression studies from the InSilico DB repository [https://insilicodb.org/app/].

  16. Identification of SSEA-1 expressing enhanced reprogramming (SEER) cells in porcine embryonic fibroblasts

    PubMed Central

    Li, Dong; Secher, Jan O.; Mashayekhi, Kaveh; Nielsen, Troels T.; Hyttel, Poul; Freude, Kristine K.

    2017-01-01

    ABSTRACT Previous research has shown that a subpopulation of cells within cultured human dermal fibroblasts, termed multilineage-differentiating stress enduring (Muse) cells, are preferentially reprogrammed into induced pluripotent stem cells. However, controversy exists over whether these cells are the only cells capable of being reprogrammed from a heterogeneous population of fibroblasts. Similarly, there is little research to suggest such cells may exist in embryonic tissues or other species. To address if such a cell population exists in pigs, we investigated porcine embryonic fibroblast populations (pEFs) and identified heterogeneous expression of several key cell surface markers. Strikingly, we discovered a small population of stage-specific embryonic antigen 1 positive cells (SSEA-1+) in Danish Landrace and Göttingen minipig pEFs, which were absent in the Yucatan pEFs. Furthermore, reprogramming of SSEA-1+ sorted pEFs led to higher reprogramming efficiency. Subsequent transcriptome profiling of the SSEA-1+ vs. the SSEA-1neg cell fraction revealed highly comparable gene signatures. However several genes that were found to be upregulated in the SSEA-1+ cells were similarly expressed in mesenchymal stem cells (MSCs). We therefore termed these cells SSEA-1 Expressing Enhanced Reprogramming (SEER) cells. Interestingly, SEER cells were more effective at differentiating into osteocytes and chondrocytes in vitro. We conclude that SEER cells are more amenable for reprogramming and that the expression of mesenchymal stem cell genes is advantageous in the reprogramming process. This data provides evidence supporting the elite theory and helps to delineate which cell types and specific genes are important for reprogramming in the pig. PMID:28426281

  17. Gene Selection and Cancer Classification: A Rough Sets Based Approach

    NASA Astrophysics Data System (ADS)

    Sun, Lijun; Miao, Duoqian; Zhang, Hongyun

    Indentification of informative gene subsets responsible for discerning between available samples of gene expression data is an important task in bioinformatics. Reducts, from rough sets theory, corresponding to a minimal set of essential genes for discerning samples, is an efficient tool for gene selection. Due to the compuational complexty of the existing reduct algoritms, feature ranking is usually used to narrow down gene space as the first step and top ranked genes are selected . In this paper,we define a novel certierion based on the expression level difference btween classes and contribution to classification of the gene for scoring genes and present a algorithm for generating all possible reduct from informative genes.The algorithm takes the whole attribute sets into account and find short reduct with a significant reduction in computational complexity. An exploration of this approach on benchmark gene expression data sets demonstrates that this approach is successful for selecting high discriminative genes and the classification accuracy is impressive.

  18. Gravity Plays an Important Role in Muscle Development and the Differentiation of Contractile Protein Phenotype

    NASA Technical Reports Server (NTRS)

    Adams, Gregory A.; Haddad, Fadia; Baldwin, Kenneth M.

    2003-01-01

    Several muscles in the body exist mainly to work against gravity. Whether gravity is important in the development of these muscles is not known. By examining the basic proteins that compose muscle, questions about the role of gravity in muscle development can be answered. Myosin heavy chains (MHCs) are a family of proteins critically important for muscle contraction. Several types of MHCs exist (e.g., neonatal, slow, fast), and each type is produced by a particular gene. Neonatal MHCs are produced early in life. Slow MHCs are important in antigravity muscles, and fast MHCs are found in fast-twitch power muscles. The gene that is turned on or expressed will determine which MHC is produced. Early in development, antigravity skeletal muscles (muscles that work against gravity) normally produce a combination of the neonatal/embryonic MHCs. The expression of these primitive MHCs is repressed early in development; and the adult slow and fast MHC genes become fully expressed. We tested the hypothesis that weightbearing activity is critical for inducing the normal expression of the slow MHC gene typically expressed in adult antigravity muscles. Also, we hypothesized that thyroid hormone, but not opposition to gravity, is necessary for expressing the adult fast IIb MHC gene essential for high-intensity muscle performance. Groups of normal thyroid and thyroid-deficient neonatal rats were studied after their return from the 16-day Neurolab mission and compared to matched controls. The results suggest: (1) Weightlessness impaired body and limb skeletal muscle growth in both normal and thyroid-deficient animals. Antigravity muscles were impaired more than those used primarily for locomotion andor nonweightbearing activity. (2) Systemic and muscle expression of insulin-like growth factor-I (IGF-I), an important body and tissue growth factor, was depressed in flight animals. (3) Normal slow, type I MHC gene expression was markedly repressed in the normal thyroid flight group. (4) Fast IIb MHC gene expression was enhanced in fast-twitch muscles of normal thyroid animals exposed to spaceflight; however, thyroid deficiency markedly repressed expression of this gene independently of spaceflight. In summary, the absence of gravity, when imposed at critical stages of development, impaired body and skeletal muscle growth, as well as expression of the MHC gene family of motor proteins. This suggests that normal weightbearing activity is essential for establishing body and muscle growth in neonatal animals, and for expressing the motor gene essential for supporting antigravity functions.

  19. Noise in gene expression is coupled to growth rate.

    PubMed

    Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran

    2015-12-01

    Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle-regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. © 2015 Keren et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Noise in gene expression is coupled to growth rate

    PubMed Central

    Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran

    2015-01-01

    Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle–regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. PMID:26355006

  1. MALDI-TOF mass spectrometry for quantitative gene expression analysis of acid responses in Staphylococcus aureus.

    PubMed

    Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild

    2009-07-01

    Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.

  2. RNA-sequence data normalization through in silico prediction of reference genes: the bacterial response to DNA damage as case study.

    PubMed

    Berghoff, Bork A; Karlsson, Torgny; Källman, Thomas; Wagner, E Gerhart H; Grabherr, Manfred G

    2017-01-01

    Measuring how gene expression changes in the course of an experiment assesses how an organism responds on a molecular level. Sequencing of RNA molecules, and their subsequent quantification, aims to assess global gene expression changes on the RNA level (transcriptome). While advances in high-throughput RNA-sequencing (RNA-seq) technologies allow for inexpensive data generation, accurate post-processing and normalization across samples is required to eliminate any systematic noise introduced by the biochemical and/or technical processes. Existing methods thus either normalize on selected known reference genes that are invariant in expression across the experiment, assume that the majority of genes are invariant, or that the effects of up- and down-regulated genes cancel each other out during the normalization. Here, we present a novel method, moose 2 , which predicts invariant genes in silico through a dynamic programming (DP) scheme and applies a quadratic normalization based on this subset. The method allows for specifying a set of known or experimentally validated invariant genes, which guides the DP. We experimentally verified the predictions of this method in the bacterium Escherichia coli , and show how moose 2 is able to (i) estimate the expression value distances between RNA-seq samples, (ii) reduce the variation of expression values across all samples, and (iii) to subsequently reveal new functional groups of genes during the late stages of DNA damage. We further applied the method to three eukaryotic data sets, on which its performance compares favourably to other methods. The software is implemented in C++ and is publicly available from http://grabherr.github.io/moose2/. The proposed RNA-seq normalization method, moose 2 , is a valuable alternative to existing methods, with two major advantages: (i) in silico prediction of invariant genes provides a list of potential reference genes for downstream analyses, and (ii) non-linear artefacts in RNA-seq data are handled adequately to minimize variations between replicates.

  3. Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks.

    PubMed

    Ghanat Bari, Mehrab; Ung, Choong Yong; Zhang, Cheng; Zhu, Shizhen; Li, Hu

    2017-08-01

    Emerging evidence indicates the existence of a new class of cancer genes that act as "signal linkers" coordinating oncogenic signals between mutated and differentially expressed genes. While frequently mutated oncogenes and differentially expressed genes, which we term Class I cancer genes, are readily detected by most analytical tools, the new class of cancer-related genes, i.e., Class II, escape detection because they are neither mutated nor differentially expressed. Given this hypothesis, we developed a Machine Learning-Assisted Network Inference (MALANI) algorithm, which assesses all genes regardless of expression or mutational status in the context of cancer etiology. We used 8807 expression arrays, corresponding to 9 cancer types, to build more than 2 × 10 8 Support Vector Machine (SVM) models for reconstructing a cancer network. We found that ~3% of ~19,000 not differentially expressed genes are Class II cancer gene candidates. Some Class II genes that we found, such as SLC19A1 and ATAD3B, have been recently reported to associate with cancer outcomes. To our knowledge, this is the first study that utilizes both machine learning and network biology approaches to uncover Class II cancer genes in coordinating functionality in cancer networks and will illuminate our understanding of how genes are modulated in a tissue-specific network contribute to tumorigenesis and therapy development.

  4. A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research.

    PubMed

    Weidner, Christopher; Steinfath, Matthias; Wistorf, Elisa; Oelgeschläger, Michael; Schneider, Marlon R; Schönfelder, Gilbert

    2017-08-16

    Recent studies that compared transcriptomic datasets of human diseases with datasets from mouse models using traditional gene-to-gene comparison techniques resulted in contradictory conclusions regarding the relevance of animal models for translational research. A major reason for the discrepancies between different gene expression analyses is the arbitrary filtering of differentially expressed genes. Furthermore, the comparison of single genes between different species and platforms often is limited by technical variance, leading to misinterpretation of the con/discordance between data from human and animal models. Thus, standardized approaches for systematic data analysis are needed. To overcome subjective gene filtering and ineffective gene-to-gene comparisons, we recently demonstrated that gene set enrichment analysis (GSEA) has the potential to avoid these problems. Therefore, we developed a standardized protocol for the use of GSEA to distinguish between appropriate and inappropriate animal models for translational research. This protocol is not suitable to predict how to design new model systems a-priori, as it requires existing experimental omics data. However, the protocol describes how to interpret existing data in a standardized manner in order to select the most suitable animal model, thus avoiding unnecessary animal experiments and misleading translational studies.

  5. Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat

    PubMed Central

    2014-01-01

    Background Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution ‘nullisomic-tetrasomic’ lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. Results We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. Conclusions We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution. PMID:24726045

  6. Stacking transgenes in forest trees.

    PubMed

    Halpin, Claire; Boerjan, Wout

    2003-08-01

    Huge potential exists for improving plant raw materials and foodstuffs via metabolic engineering. To date, progress has mostly been limited to modulating the expression of single genes of well-studied pathways, such as the lignin biosynthetic pathway, in model species. However, a recent report illustrates a new level of sophistication in metabolic engineering by overexpressing one lignin enzyme while simultaneously suppressing the expression of another lignin gene in a tree, aspen. This novel approach to multi-gene manipulation has succeeded in concurrently improving several wood-quality traits.

  7. [Research on the expression of hemolysin genes of Leptospira in vivo by genechip].

    PubMed

    Zhao, Hui; Bao, Lang

    2012-07-01

    To explore the expression of hemolysin genes of Leptospira in infected host. Amplified the gene segment of hemolysin genes from the genome of Leptospira by PCR for gene probe. Manufacture genechip by the VersArray Chipwriter systerm. The total RNAs of Leptospira before and after infection host were extracted, reversely transcribed to cDNA, after the random PCR, the products were marked with HEX and CY5 respectively, and hybridized to genechip to demonstrate the expression of hemolysin genes of Leptospira. The hemolysin genes LA1029 (Ratio = 0.65), LA1027 (Ratio = 0.53) were up-regulated after infection of host; LA3540 (Ratio = 1.88), LA3937 (Ratio = 5.58), LA1029 (Ratio = 3.00) were up-regulated and LA4004 (Ratio = 0.67) was down-regulated in live than in blood; LA3937 (Ratio = 2.28), LA1029 (Ratio = 2.20) were up-regulated in kidney than in blood. The expression level of hemolysin genes exist observable differences with inducement in vivo and in different organs. These suggested that these genes are probably involved in the pathogenesis and and disease progression.

  8. Bayesian median regression for temporal gene expression data

    NASA Astrophysics Data System (ADS)

    Yu, Keming; Vinciotti, Veronica; Liu, Xiaohui; 't Hoen, Peter A. C.

    2007-09-01

    Most of the existing methods for the identification of biologically interesting genes in a temporal expression profiling dataset do not fully exploit the temporal ordering in the dataset and are based on normality assumptions for the gene expression. In this paper, we introduce a Bayesian median regression model to detect genes whose temporal profile is significantly different across a number of biological conditions. The regression model is defined by a polynomial function where both time and condition effects as well as interactions between the two are included. MCMC-based inference returns the posterior distribution of the polynomial coefficients. From this a simple Bayes factor test is proposed to test for significance. The estimation of the median rather than the mean, and within a Bayesian framework, increases the robustness of the method compared to a Hotelling T2-test previously suggested. This is shown on simulated data and on muscular dystrophy gene expression data.

  9. The 3’-Jα Region of the TCRα Locus Bears Gene Regulatory Activity in Thymic and Peripheral T Cells

    PubMed Central

    Kučerová-Levisohn, Martina; Knirr, Stefan; Mejia, Rosa I.; Ortiz, Benjamin D.

    2015-01-01

    Much progress has been made in understanding the important cis-mediated controls on mouse TCRα gene function, including identification of the Eα enhancer and TCRα locus control region (LCR). Nevertheless, previous data have suggested that other cis-regulatory elements may reside in the locus outside of the Eα/LCR. Based on prior findings, we hypothesized the existence of gene regulatory elements in a 3.9-kb region 5’ of the Cα exons. Using DNase hypersensitivity assays and TCRα BAC reporter transgenes in mice, we detected gene regulatory activity within this 3.9-kb region. This region is active in both thymic and peripheral T cells, and selectively affects upstream, but not downstream, gene expression. Together, these data indicate the existence of a novel cis-acting regulatory complex that contributes to TCRα transgene expression in vivo. The active chromatin sites we discovered within this region would remain in the locus after TCRα gene rearrangement, and thus may contribute to endogenous TCRα gene activity, particularly in peripheral T cells, where the Eα element has been found to be inactive. PMID:26177549

  10. Protists and the Wild, Wild West of Gene Expression: New Frontiers, Lawlessness, and Misfits.

    PubMed

    Smith, David Roy; Keeling, Patrick J

    2016-09-08

    The DNA double helix has been called one of life's most elegant structures, largely because of its universality, simplicity, and symmetry. The expression of information encoded within DNA, however, can be far from simple or symmetric and is sometimes surprisingly variable, convoluted, and wantonly inefficient. Although exceptions to the rules exist in certain model systems, the true extent to which life has stretched the limits of gene expression is made clear by nonmodel systems, particularly protists (microbial eukaryotes). The nuclear and organelle genomes of protists are subject to the most tangled forms of gene expression yet identified. The complicated and extravagant picture of the underlying genetics of eukaryotic microbial life changes how we think about the flow of genetic information and the evolutionary processes shaping it. Here, we discuss the origins, diversity, and growing interest in noncanonical protist gene expression and its relationship to genomic architecture.

  11. Chromatin Configuration Determines Cell Responses to Hormone Stimuli | Center for Cancer Research

    Cancer.gov

    Ever since selective gene expression was established as the central driver of cell behavior, researchers have been working to understand the forces that control gene transcription. Aberrant gene expression can cause or promote many diseases, including cancer, and alterations in gene expression are the goal of many therapeutic agents. Recent work has focused on the potential role of chromatin structure as a contributor to gene regulation. Chromatin can exist in a tightly packed/inaccessible or loose/accessible configuration depending on the interactions between DNA and its associated proteins. Patterns of chromatin structure can differ between cell types and can also change within cells in response to certain signals. Cancer researchers are particularly interested in the role of chromatin in gene regulation because many of the genomic regions found to be associated with cancer risk are in open chromatin structures.

  12. A comparative analysis of biclustering algorithms for gene expression data

    PubMed Central

    Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V.

    2013-01-01

    The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters. PMID:22772837

  13. Picking Cell Lines for High-Throughput Transcriptomic Toxicity ...

    EPA Pesticide Factsheets

    High throughput, whole genome transcriptomic profiling is a promising approach to comprehensively evaluate chemicals for potential biological effects. To be useful for in vitro toxicity screening, gene expression must be quantified in a set of representative cell types that captures the diversity of potential responses across chemicals. The ideal dataset to select these cell types would consist of hundreds of cell types treated with thousands of chemicals, but does not yet exist. However, basal gene expression data may be useful as a surrogate for representing the relevant biological space necessary for cell type selection. The goal of this study was to identify a small (< 20) number of cell types that capture a large, quantifiable fraction of basal gene expression diversity. Three publicly available collections of Affymetrix U133+2.0 cellular gene expression data were used: 1) 59 cell lines from the NCI60 set; 2) 303 primary cell types from the Mabbott et al (2013) expression atlas; and 3) 1036 cell lines from the Cancer Cell Line Encyclopedia. The data were RMA normalized, log-transformed, and the probe sets mapped to HUGO gene identifiers. The results showed that <20 cell lines capture only a small fraction of the total diversity in basal gene expression when evaluated using either the entire set of 20960 HUGO genes or a subset of druggable genes likely to be chemical targets. The fraction of the total gene expression variation explained was consistent when

  14. Tombusvirus-based vector systems to permit over-expression of genes or that serve as sensors of antiviral RNA silencing in plants.

    PubMed

    Shamekova, Malika; Mendoza, Maria R; Hsieh, Yi-Cheng; Lindbo, John; Omarov, Rustem T; Scholthof, Herman B

    2014-03-01

    A next generation Tomato bushy stunt virus (TBSV) coat protein gene replacement vector system is described that can be applied by either RNA inoculation or through agroinfiltration. A vector expressing GFP rapidly yields high levels of transient gene expression in inoculated leaves of various plant species, as illustrated for Nicotiana benthamiana, cowpea, tomato, pepper, and lettuce. A start-codon mutation to down-regulate the dose of the P19 silencing suppressor reduces GFP accumulation, whereas mutations that result in undetectable levels of P19 trigger rapid silencing of GFP. Compared to existing virus vectors the TBSV system has a unique combination of a very broad host range, rapid and high levels of replication and gene expression, and the ability to regulate its suppressor. These features are attractive for quick transient assays in numerous plant species for over-expression of genes of interest, or as a sensor to monitor the efficacy of antiviral RNA silencing. Copyright © 2014. Published by Elsevier Inc.

  15. Selection of reference genes for gene expression studies in heart failure for left and right ventricles.

    PubMed

    Li, Mengmeng; Rao, Man; Chen, Kai; Zhou, Jianye; Song, Jiangping

    2017-07-15

    Real-time quantitative reverse transcriptase-PCR (qRT-PCR) is a feasible tool for determining gene expression profiles, but the accuracy and reliability of the results depends on the stable expression of selected housekeeping genes in different samples. By far, researches on stable housekeeping genes in human heart failure samples are rare. Moreover the effect of heart failure on the expression of housekeeping genes in right and left ventricles is yet to be studied. Therefore we aim to provide stable housekeeping genes for both ventricles in heart failure and normal heart samples. In this study, we selected seven commonly used housekeeping genes as candidates. By using the qRT-PCR, the expression levels of ACTB, RAB7A, GAPDH, REEP5, RPL5, PSMB4 and VCP in eight heart failure and four normal heart samples were assessed. The stability of candidate housekeeping genes was evaluated by geNorm and Normfinder softwares. GAPDH showed the least variation in all heart samples. Results also indicated the difference of gene expression existed in heart failure left and right ventricles. GAPDH had the highest expression stability in both heart failure and normal heart samples. We also propose using different sets of housekeeping genes for left and right ventricles respectively. The combination of RPL5, GAPDH and PSMB4 is suitable for the right ventricle and the combination of GAPDH, REEP5 and RAB7A is suitable for the left ventricle. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. A measure of the signal-to-noise ratio of microarray samples and studies using gene correlations.

    PubMed

    Venet, David; Detours, Vincent; Bersini, Hugues

    2012-01-01

    The quality of gene expression data can vary dramatically from platform to platform, study to study, and sample to sample. As reliable statistical analysis rests on reliable data, determining such quality is of the utmost importance. Quality measures to spot problematic samples exist, but they are platform-specific, and cannot be used to compare studies. As a proxy for quality, we propose a signal-to-noise ratio for microarray data, the "Signal-to-Noise Applied to Gene Expression Experiments", or SNAGEE. SNAGEE is based on the consistency of gene-gene correlations. We applied SNAGEE to a compendium of 80 large datasets on 37 platforms, for a total of 24,380 samples, and assessed the signal-to-noise ratio of studies and samples. This allowed us to discover serious issues with three studies. We show that signal-to-noise ratios of both studies and samples are linked to the statistical significance of the biological results. We showed that SNAGEE is an effective way to measure data quality for most types of gene expression studies, and that it often outperforms existing techniques. Furthermore, SNAGEE is platform-independent and does not require raw data files. The SNAGEE R package is available in BioConductor.

  17. A Single-Cell Approach to the Elusive Latent Human Cytomegalovirus Transcriptome.

    PubMed

    Goodrum, Felicia; McWeeney, Shannon

    2018-06-12

    Herpesvirus latency has been difficult to understand molecularly due to low levels of viral genomes and gene expression. In the case of the betaherpesvirus human cytomegalovirus (HCMV), this is further complicated by the heterogeneity inherent to hematopoietic subpopulations harboring genomes and, as a consequence, the various patterns of infection that simultaneously exist in a host, ranging from latent to lytic. Single-cell RNA sequencing (scRNA-seq) provides tremendous potential in measuring the gene expression profiles of heterogeneous cell populations for a wide range of applications, including in studies of cancer, immunology, and infectious disease. A recent study by Shnayder et al. (mBio 9:e00013-18, 2018, https://doi.org/10.1128/mBio.00013-18) utilized scRNA-seq to define transcriptomal characteristics of HCMV latency. They conclude that latency-associated gene expression is similar to the late lytic viral program but at lower levels of expression. The study highlights the numerous challenges, from the definition of latency to the analysis of scRNA-seq, that exist in defining a latent transcriptome. Copyright © 2018 Goodrum and McWeeney.

  18. Circular RNA and gene expression profiles in gastric cancer based on microarray chip technology.

    PubMed

    Sui, Weiguo; Shi, Zhoufang; Xue, Wen; Ou, Minglin; Zhu, Ying; Chen, Jiejing; Lin, Hua; Liu, Fuhua; Dai, Yong

    2017-03-01

    The aim of the present study was to screen gastric cancer (GC) tissue and adjacent tissue for differences in mRNA and circular (circRNA) expression, to analyze the differences in circRNA and mRNA expression, and to investigate the circRNA expression in gastric carcinoma and its mechanism. circRNA and mRNA differential expression profiles generated using Agilent microarray technology were analyzed in the GC tissues and adjacent tissues. qRT-PCR was used to verify the differential expression of circRNAs and mRNAs according to the interactions between circRNAs and miRNAs as well as the possible existence of miRNA and mRNA interactions. We found that: i) the circRNA expression profile revealed 1,285 significant differences in circRNA expression, with circRNA expression downregulated in 594 samples and upregulated in 691 samples via interactions with miRNAs. The qRT-PCR validation experiments showed that hsa_circRNA_400071, hsa_circRNA_000543 and hsa_circRNA_001959 expression was consistent with the microarray analysis results. ii) 29,112 genes were found in the GC tissues and adjacent tissues, including 5,460 differentially expressed genes. Among them, 2,390 differentially expressed genes were upregulated and 3,070 genes were downregulated. Gene Ontology (GO) analysis of the differentially expressed genes revealed these genes involved in biological process classification, cellular component classification and molecular function classification. Pathway analysis of the differentially expressed genes identified 83 significantly enriched genes, including 28 upregulated genes and 55 downregulated genes. iii) 69 differentially expressed circRNAs were found that might adsorb specific miRNAs to regulate the expression of their target gene mRNAs. The conclusions are: i) differentially expressed circRNAs had corresponding miRNA binding sites. These circRNAs regulated the expression of target genes through interactions with miRNAs and might become new molecular biomarkers for GC in the future. ii) Differentially expressed genes may be involved in the occurrence of GC via a variety of mechanisms. iii) CD44, CXXC5, MYH9, MALAT1 and other genes may have important implications for the occurrence and development of GC through the regulation, interaction, and mutual influence of circRNA-miRNA-mRNA via different mechanisms.

  19. Regulation of ecmF gene expression and genetic hierarchy among STATa, CudA, and MybC on several prestalk A-specific gene expressions in Dictyostelium.

    PubMed

    Saga, Yukika; Inamura, Tomoka; Shimada, Nao; Kawata, Takefumi

    2016-05-01

    STATa, a Dictyostelium homologue of metazoan signal transducer and activator of transcription, is important for the organizer function in the tip region of the migrating Dictyostelium slug. We previously showed that ecmF gene expression depends on STATa in prestalk A (pstA) cells, where STATa is activated. Deletion and site-directed mutagenesis analysis of the ecmF/lacZ fusion gene in wild-type and STATa null strains identified an imperfect inverted repeat sequence, ACAAATANTATTTGT, as a STATa-responsive element. An upstream sequence element was required for efficient expression in the rear region of pstA zone; an element downstream of the inverted repeat was necessary for sufficient prestalk expression during culmination. Band shift analyses using purified STATa protein detected no sequence-specific binding to those ecmF elements. The only verified upregulated target gene of STATa is cudA gene; CudA directly activates expL7 gene expression in prestalk cells. However, ecmF gene expression was almost unaffected in a cudA null mutant. Several previously reported putative STATa target genes were also expressed in cudA null mutant but were downregulated in STATa null mutant. Moreover, mybC, which encodes another transcription factor, belonged to this category, and ecmF expression was downregulated in a mybC null mutant. These findings demonstrate the existence of a genetic hierarchy for pstA-specific genes, which can be classified into two distinct STATa downstream pathways, CudA dependent and independent. The ecmF expression is indirectly upregulated by STATa in a CudA-independent activation manner but dependent on MybC, whose expression is positively regulated by STATa. © 2016 Japanese Society of Developmental Biologists.

  20. Sequential and combinatorial roles of maf family genes define proper lens development.

    PubMed

    Reza, Hasan Mahmud; Urano, Atsuyo; Shimada, Naoko; Yasuda, Kunio

    2007-01-16

    Maf proteins have been shown to play pivotal roles in lens development in vertebrates. The developing chick lens expresses at least three large Maf proteins. However, the transcriptional relationship among the three large maf genes and their various roles in transactivating the downstream genes largely remain to be elucidated. Chick embryos were electroporated with wild-type L-maf, c-maf, and mafB by in ovo electroporation, and their effects on gene expression were determined by in situ hybridization using specific probes or by immunostaining. Endogenous gene expression was determined using nonelectroporated samples. A regulation mechanism exists among the members of maf family gene. An early-expressed member of this gene family typically stimulates the expression of later-expressed members. We also examined the regulation of various lens-expressing genes with a focus on the interaction between different Maf proteins. We found that the transcriptional ability of Maf proteins varies, even when the target is the same, in parallel with their discrete functions. L-Maf and c-Maf have no effect on E-cadherin expression, whereas MafB enhances its expression and thereby impedes lens vesicle formation. This study also revealed that Maf proteins can regulate the expression of gap junction genes, connexins, and their interacting partner, major intrinsic protein (MIP), during lens development. Misexpression of L-Maf and c-Maf induces ectopic expression of Cx43 and MIP; in contrast, MafB appears to have no effect on Cx43, but induces MIP significantly as evidenced from our gain-of-function experiments. Our results indicate that large Maf function is indispensable for chick lens initiation and development. In addition, L-Maf positively regulates most of the essential genes in this program and directs a series of molecular events leading to proper formation of the lens.

  1. GeneMesh: a web-based microarray analysis tool for relating differentially expressed genes to MeSH terms.

    PubMed

    Jani, Saurin D; Argraves, Gary L; Barth, Jeremy L; Argraves, W Scott

    2010-04-01

    An important objective of DNA microarray-based gene expression experimentation is determining inter-relationships that exist between differentially expressed genes and biological processes, molecular functions, cellular components, signaling pathways, physiologic processes and diseases. Here we describe GeneMesh, a web-based program that facilitates analysis of DNA microarray gene expression data. GeneMesh relates genes in a query set to categories available in the Medical Subject Headings (MeSH) hierarchical index. The interface enables hypothesis driven relational analysis to a specific MeSH subcategory (e.g., Cardiovascular System, Genetic Processes, Immune System Diseases etc.) or unbiased relational analysis to broader MeSH categories (e.g., Anatomy, Biological Sciences, Disease etc.). Genes found associated with a given MeSH category are dynamically linked to facilitate tabular and graphical depiction of Entrez Gene information, Gene Ontology information, KEGG metabolic pathway diagrams and intermolecular interaction information. Expression intensity values of groups of genes that cluster in relation to a given MeSH category, gene ontology or pathway can be displayed as heat maps of Z score-normalized values. GeneMesh operates on gene expression data derived from a number of commercial microarray platforms including Affymetrix, Agilent and Illumina. GeneMesh is a versatile web-based tool for testing and developing new hypotheses through relating genes in a query set (e.g., differentially expressed genes from a DNA microarray experiment) to descriptors making up the hierarchical structure of the National Library of Medicine controlled vocabulary thesaurus, MeSH. The system further enhances the discovery process by providing links between sets of genes associated with a given MeSH category to a rich set of html linked tabular and graphic information including Entrez Gene summaries, gene ontologies, intermolecular interactions, overlays of genes onto KEGG pathway diagrams and heatmaps of expression intensity values. GeneMesh is freely available online at http://proteogenomics.musc.edu/genemesh/.

  2. Defining global neuroendocrine gene expression patterns associated with reproductive seasonality in fish.

    PubMed

    Zhang, Dapeng; Xiong, Huiling; Mennigen, Jan A; Popesku, Jason T; Marlatt, Vicki L; Martyniuk, Christopher J; Crump, Kate; Cossins, Andrew R; Xia, Xuhua; Trudeau, Vance L

    2009-06-05

    Many vertebrates, including the goldfish, exhibit seasonal reproductive rhythms, which are a result of interactions between external environmental stimuli and internal endocrine systems in the hypothalamo-pituitary-gonadal axis. While it is long believed that differential expression of neuroendocrine genes contributes to establishing seasonal reproductive rhythms, no systems-level investigation has yet been conducted. In the present study, by analyzing multiple female goldfish brain microarray datasets, we have characterized global gene expression patterns for a seasonal cycle. A core set of genes (873 genes) in the hypothalamus were identified to be differentially expressed between May, August and December, which correspond to physiologically distinct stages that are sexually mature (prespawning), sexual regression, and early gonadal redevelopment, respectively. Expression changes of these genes are also shared by another brain region, the telencephalon, as revealed by multivariate analysis. More importantly, by examining one dataset obtained from fish in October who were kept under long-daylength photoperiod (16 h) typical of the springtime breeding season (May), we observed that the expression of identified genes appears regulated by photoperiod, a major factor controlling vertebrate reproductive cyclicity. Gene ontology analysis revealed that hormone genes and genes functionally involved in G-protein coupled receptor signaling pathway and transmission of nerve impulses are significantly enriched in an expression pattern, whose transition is located between prespawning and sexually regressed stages. The existence of seasonal expression patterns was verified for several genes including isotocin, ependymin II, GABA(A) gamma2 receptor, calmodulin, and aromatase b by independent samplings of goldfish brains from six seasonal time points and real-time PCR assays. Using both theoretical and experimental strategies, we report for the first time global gene expression patterns throughout a breeding season which may account for dynamic neuroendocrine regulation of seasonal reproductive development.

  3. Defining Global Neuroendocrine Gene Expression Patterns Associated with Reproductive Seasonality in Fish

    PubMed Central

    Mennigen, Jan A.; Popesku, Jason T.; Marlatt, Vicki L.; Martyniuk, Christopher J.; Crump, Kate; Cossins, Andrew R.; Xia, Xuhua; Trudeau, Vance L.

    2009-01-01

    Background Many vertebrates, including the goldfish, exhibit seasonal reproductive rhythms, which are a result of interactions between external environmental stimuli and internal endocrine systems in the hypothalamo-pituitary-gonadal axis. While it is long believed that differential expression of neuroendocrine genes contributes to establishing seasonal reproductive rhythms, no systems-level investigation has yet been conducted. Methodology/Principal Findings In the present study, by analyzing multiple female goldfish brain microarray datasets, we have characterized global gene expression patterns for a seasonal cycle. A core set of genes (873 genes) in the hypothalamus were identified to be differentially expressed between May, August and December, which correspond to physiologically distinct stages that are sexually mature (prespawning), sexual regression, and early gonadal redevelopment, respectively. Expression changes of these genes are also shared by another brain region, the telencephalon, as revealed by multivariate analysis. More importantly, by examining one dataset obtained from fish in October who were kept under long-daylength photoperiod (16 h) typical of the springtime breeding season (May), we observed that the expression of identified genes appears regulated by photoperiod, a major factor controlling vertebrate reproductive cyclicity. Gene ontology analysis revealed that hormone genes and genes functionally involved in G-protein coupled receptor signaling pathway and transmission of nerve impulses are significantly enriched in an expression pattern, whose transition is located between prespawning and sexually regressed stages. The existence of seasonal expression patterns was verified for several genes including isotocin, ependymin II, GABAA gamma2 receptor, calmodulin, and aromatase b by independent samplings of goldfish brains from six seasonal time points and real-time PCR assays. Conclusions/Significance Using both theoretical and experimental strategies, we report for the first time global gene expression patterns throughout a breeding season which may account for dynamic neuroendocrine regulation of seasonal reproductive development. PMID:19503831

  4. Spectral biclustering of microarray data: coclustering genes and conditions.

    PubMed

    Kluger, Yuval; Basri, Ronen; Chang, Joseph T; Gerstein, Mark

    2003-04-01

    Global analyses of RNA expression levels are useful for classifying genes and overall phenotypes. Often these classification problems are linked, and one wants to find "marker genes" that are differentially expressed in particular sets of "conditions." We have developed a method that simultaneously clusters genes and conditions, finding distinctive "checkerboard" patterns in matrices of gene expression data, if they exist. In a cancer context, these checkerboards correspond to genes that are markedly up- or downregulated in patients with particular types of tumors. Our method, spectral biclustering, is based on the observation that checkerboard structures in matrices of expression data can be found in eigenvectors corresponding to characteristic expression patterns across genes or conditions. In addition, these eigenvectors can be readily identified by commonly used linear algebra approaches, in particular the singular value decomposition (SVD), coupled with closely integrated normalization steps. We present a number of variants of the approach, depending on whether the normalization over genes and conditions is done independently or in a coupled fashion. We then apply spectral biclustering to a selection of publicly available cancer expression data sets, and examine the degree to which the approach is able to identify checkerboard structures. Furthermore, we compare the performance of our biclustering methods against a number of reasonable benchmarks (e.g., direct application of SVD or normalized cuts to raw data).

  5. Optimal Scaling of Digital Transcriptomes

    PubMed Central

    Glusman, Gustavo; Caballero, Juan; Robinson, Max; Kutlu, Burak; Hood, Leroy

    2013-01-01

    Deep sequencing of transcriptomes has become an indispensable tool for biology, enabling expression levels for thousands of genes to be compared across multiple samples. Since transcript counts scale with sequencing depth, counts from different samples must be normalized to a common scale prior to comparison. We analyzed fifteen existing and novel algorithms for normalizing transcript counts, and evaluated the effectiveness of the resulting normalizations. For this purpose we defined two novel and mutually independent metrics: (1) the number of “uniform” genes (genes whose normalized expression levels have a sufficiently low coefficient of variation), and (2) low Spearman correlation between normalized expression profiles of gene pairs. We also define four novel algorithms, one of which explicitly maximizes the number of uniform genes, and compared the performance of all fifteen algorithms. The two most commonly used methods (scaling to a fixed total value, or equalizing the expression of certain ‘housekeeping’ genes) yielded particularly poor results, surpassed even by normalization based on randomly selected gene sets. Conversely, seven of the algorithms approached what appears to be optimal normalization. Three of these algorithms rely on the identification of “ubiquitous” genes: genes expressed in all the samples studied, but never at very high or very low levels. We demonstrate that these include a “core” of genes expressed in many tissues in a mutually consistent pattern, which is suitable for use as an internal normalization guide. The new methods yield robustly normalized expression values, which is a prerequisite for the identification of differentially expressed and tissue-specific genes as potential biomarkers. PMID:24223126

  6. The transcriptional landscape of age in human peripheral blood

    PubMed Central

    Peters, Marjolein J.; Joehanes, Roby; Pilling, Luke C.; Schurmann, Claudia; Conneely, Karen N.; Powell, Joseph; Reinmaa, Eva; Sutphin, George L.; Zhernakova, Alexandra; Schramm, Katharina; Wilson, Yana A.; Kobes, Sayuko; Tukiainen, Taru; Nalls, Michael A.; Hernandez, Dena G.; Cookson, Mark R.; Gibbs, Raphael J.; Hardy, John; Ramasamy, Adaikalavan; Zonderman, Alan B.; Dillman, Allissa; Traynor, Bryan; Smith, Colin; Longo, Dan L.; Trabzuni, Daniah; Troncoso, Juan; van der Brug, Marcel; Weale, Michael E.; O'Brien, Richard; Johnson, Robert; Walker, Robert; Zielke, Ronald H.; Arepalli, Sampath; Ryten, Mina; Singleton, Andrew B.; Ramos, Yolande F.; Göring, Harald H. H.; Fornage, Myriam; Liu, Yongmei; Gharib, Sina A.; Stranger, Barbara E.; De Jager, Philip L.; Aviv, Abraham; Levy, Daniel; Murabito, Joanne M.; Munson, Peter J.; Huan, Tianxiao; Hofman, Albert; Uitterlinden, André G.; Rivadeneira, Fernando; van Rooij, Jeroen; Stolk, Lisette; Broer, Linda; Verbiest, Michael M. P. J.; Jhamai, Mila; Arp, Pascal; Metspalu, Andres; Tserel, Liina; Milani, Lili; Samani, Nilesh J.; Peterson, Pärt; Kasela, Silva; Codd, Veryan; Peters, Annette; Ward-Caviness, Cavin K.; Herder, Christian; Waldenberger, Melanie; Roden, Michael; Singmann, Paula; Zeilinger, Sonja; Illig, Thomas; Homuth, Georg; Grabe, Hans-Jörgen; Völzke, Henry; Steil, Leif; Kocher, Thomas; Murray, Anna; Melzer, David; Yaghootkar, Hanieh; Bandinelli, Stefania; Moses, Eric K.; Kent, Jack W.; Curran, Joanne E.; Johnson, Matthew P.; Williams-Blangero, Sarah; Westra, Harm-Jan; McRae, Allan F.; Smith, Jennifer A.; Kardia, Sharon L. R.; Hovatta, Iiris; Perola, Markus; Ripatti, Samuli; Salomaa, Veikko; Henders, Anjali K.; Martin, Nicholas G.; Smith, Alicia K.; Mehta, Divya; Binder, Elisabeth B.; Nylocks, K Maria; Kennedy, Elizabeth M.; Klengel, Torsten; Ding, Jingzhong; Suchy-Dicey, Astrid M.; Enquobahrie, Daniel A.; Brody, Jennifer; Rotter, Jerome I.; Chen, Yii-Der I.; Houwing-Duistermaat, Jeanine; Kloppenburg, Margreet; Slagboom, P. Eline; Helmer, Quinta; den Hollander, Wouter; Bean, Shannon; Raj, Towfique; Bakhshi, Noman; Wang, Qiao Ping; Oyston, Lisa J.; Psaty, Bruce M.; Tracy, Russell P.; Montgomery, Grant W.; Turner, Stephen T.; Blangero, John; Meulenbelt, Ingrid; Ressler, Kerry J.; Yang, Jian; Franke, Lude; Kettunen, Johannes; Visscher, Peter M.; Neely, G. Gregory; Korstanje, Ron; Hanson, Robert L.; Prokisch, Holger; Ferrucci, Luigi; Esko, Tonu; Teumer, Alexander; van Meurs, Joyce B. J.; Johnson, Andrew D.

    2015-01-01

    Disease incidences increase with age, but the molecular characteristics of ageing that lead to increased disease susceptibility remain inadequately understood. Here we perform a whole-blood gene expression meta-analysis in 14,983 individuals of European ancestry (including replication) and identify 1,497 genes that are differentially expressed with chronological age. The age-associated genes do not harbor more age-associated CpG-methylation sites than other genes, but are instead enriched for the presence of potentially functional CpG-methylation sites in enhancer and insulator regions that associate with both chronological age and gene expression levels. We further used the gene expression profiles to calculate the ‘transcriptomic age' of an individual, and show that differences between transcriptomic age and chronological age are associated with biological features linked to ageing, such as blood pressure, cholesterol levels, fasting glucose, and body mass index. The transcriptomic prediction model adds biological relevance and complements existing epigenetic prediction models, and can be used by others to calculate transcriptomic age in external cohorts. PMID:26490707

  7. The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.

    PubMed

    Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun

    2017-03-27

    Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.

  8. ROLES OF CELL-INTRINSIC AND MICROENVIRONMENTAL FACTORS IN PHOTORECEPTOR CELL DIFFERENTIATION

    PubMed Central

    Bradford, Rebecca L.; Wang, Chenwei; Zack, Donald J.; Adler, Ruben

    2005-01-01

    Photoreceptor differentiation requires the coordinated expression of numerous genes. It is unknown whether those genes share common regulatory mechanisms or are independently regulated by distinct mechanisms. To distinguish between these scenarios, we have used in situ hybridization, RT-PCR and real time PCR to analyze the expression of visual pigments and other photoreceptor-specific genes during chick embryo retinal development in ovo, as well as in retinal cell cultures treated with molecules that regulate the expression of particular visual pigments. In ovo, onset of gene expression was asynchronous, becoming detectable at the time of photoreceptor generation (ED 5–8) for some photoreceptor genes, but only around the time of outer segment formation (ED 14–16) for others. Treatment of retinal cell cultures with activin, staurosporine or CNTF selectively induced or down-regulated specific visual pigment genes, but many cognate rod- or cone-specific genes were not affected by the treatments. These results indicate that many photoreceptor genes are independently regulated during development, are consistent with the existence of at least two distinct stages of gene expression during photoreceptor differentiation, suggest that intrinsic, coordinated regulation of a cascade of gene expression triggered by a commitment to the photoreceptor fate is not a general mechanism of photoreceptor differentiation, and imply that using a single photoreceptor-specific “marker” as a proxy to identify photoreceptor cell fate is problematic. PMID:16120439

  9. Meta-analysis of cancer gene expression signatures reveals new cancer genes, SAGE tags and tumor associated regions of co-regulation

    PubMed Central

    Kavak, Erşen; Ünlü, Mustafa; Nistér, Monica; Koman, Ahmet

    2010-01-01

    Cancer is among the major causes of human death and its mechanism(s) are not fully understood. We applied a novel meta-analysis approach to multiple sets of merged serial analysis of gene expression and microarray cancer data in order to analyze transcriptome alterations in human cancer. Our methodology, which we denote ‘COgnate Gene Expression patterNing in tumours’ (COGENT), unmasked numerous genes that were differentially expressed in multiple cancers. COGENT detected well-known tumor-associated (TA) genes such as TP53, EGFR and VEGF, as well as many multi-cancer, but not-yet-tumor-associated genes. In addition, we identified 81 co-regulated regions on the human genome (RIDGEs) by using expression data from all cancers. Some RIDGEs (28%) consist of paralog genes while another subset (30%) are specifically dysregulated in tumors but not in normal tissues. Furthermore, a significant number of RIDGEs are associated with GC-rich regions on the genome. All assembled data is freely available online (www.oncoreveal.org) as a tool implementing COGENT analysis of multi-cancer genes and RIDGEs. These findings engender a deeper understanding of cancer biology by demonstrating the existence of a pool of under-studied multi-cancer genes and by highlighting the cancer-specificity of some TA-RIDGEs. PMID:20621981

  10. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES

    PubMed Central

    Zhu, Lingxue; Lei, Jing; Devlin, Bernie; Roeder, Kathryn

    2017-01-01

    Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with Sparse Principal Component Analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene “relationship” matrices that are of practical interest, such as the weighted adjacency matrices. PMID:29081874

  11. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES.

    PubMed

    Zhu, Lingxue; Lei, Jing; Devlin, Bernie; Roeder, Kathryn

    2017-09-01

    Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with Sparse Principal Component Analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene "relationship" matrices that are of practical interest, such as the weighted adjacency matrices.

  12. Promoter architecture dictates cell-to-cell variability in gene expression.

    PubMed

    Jones, Daniel L; Brewster, Robert C; Phillips, Rob

    2014-12-19

    Variability in gene expression among genetically identical cells has emerged as a central preoccupation in the study of gene regulation; however, a divide exists between the predictions of molecular models of prokaryotic transcriptional regulation and genome-wide experimental studies suggesting that this variability is indifferent to the underlying regulatory architecture. We constructed a set of promoters in Escherichia coli in which promoter strength, transcription factor binding strength, and transcription factor copy numbers are systematically varied, and used messenger RNA (mRNA) fluorescence in situ hybridization to observe how these changes affected variability in gene expression. Our parameter-free models predicted the observed variability; hence, the molecular details of transcription dictate variability in mRNA expression, and transcriptional noise is specifically tunable and thus represents an evolutionarily accessible phenotypic parameter. Copyright © 2014, American Association for the Advancement of Science.

  13. Array data extractor (ADE): a LabVIEW program to extract and merge gene array data.

    PubMed

    Kurtenbach, Stefan; Kurtenbach, Sarah; Zoidl, Georg

    2013-12-01

    Large data sets from gene expression array studies are publicly available offering information highly valuable for research across many disciplines ranging from fundamental to clinical research. Highly advanced bioinformatics tools have been made available to researchers, but a demand for user-friendly software allowing researchers to quickly extract expression information for multiple genes from multiple studies persists. Here, we present a user-friendly LabVIEW program to automatically extract gene expression data for a list of genes from multiple normalized microarray datasets. Functionality was tested for 288 class A G protein-coupled receptors (GPCRs) and expression data from 12 studies comparing normal and diseased human hearts. Results confirmed known regulation of a beta 1 adrenergic receptor and further indicate novel research targets. Although existing software allows for complex data analyses, the LabVIEW based program presented here, "Array Data Extractor (ADE)", provides users with a tool to retrieve meaningful information from multiple normalized gene expression datasets in a fast and easy way. Further, the graphical programming language used in LabVIEW allows applying changes to the program without the need of advanced programming knowledge.

  14. BFDCA: A Comprehensive Tool of Using Bayes Factor for Differential Co-Expression Analysis.

    PubMed

    Wang, Duolin; Wang, Juexin; Jiang, Yuexu; Liang, Yanchun; Xu, Dong

    2017-02-03

    Comparing the gene-expression profiles between biological conditions is useful for understanding gene regulation underlying complex phenotypes. Along this line, analysis of differential co-expression (DC) has gained attention in the recent years, where genes under one condition have different co-expression patterns compared with another. We developed an R package Bayes Factor approach for Differential Co-expression Analysis (BFDCA) for DC analysis. BFDCA is unique in integrating various aspects of DC patterns (including Shift, Cross, and Re-wiring) into one uniform Bayes factor. We tested BFDCA using simulation data and experimental data. Simulation results indicate that BFDCA outperforms existing methods in accuracy and robustness of detecting DC pairs and DC modules. Results of using experimental data suggest that BFDCA can cluster disease-related genes into functional DC subunits and estimate the regulatory impact of disease-related genes well. BFDCA also achieves high accuracy in predicting case-control phenotypes by using significant DC gene pairs as markers. BFDCA is publicly available at http://dx.doi.org/10.17632/jdz4vtvnm3.1. Copyright © 2016 Elsevier Ltd. All rights reserved.

  15. Genes@Work: an efficient algorithm for pattern discovery and multivariate feature selection in gene expression data.

    PubMed

    Lepre, Jorge; Rice, J Jeremy; Tu, Yuhai; Stolovitzky, Gustavo

    2004-05-01

    Despite the growing literature devoted to finding differentially expressed genes in assays probing different tissues types, little attention has been paid to the combinatorial nature of feature selection inherent to large, high-dimensional gene expression datasets. New flexible data analysis approaches capable of searching relevant subgroups of genes and experiments are needed to understand multivariate associations of gene expression patterns with observed phenotypes. We present in detail a deterministic algorithm to discover patterns of multivariate gene associations in gene expression data. The patterns discovered are differential with respect to a control dataset. The algorithm is exhaustive and efficient, reporting all existent patterns that fit a given input parameter set while avoiding enumeration of the entire pattern space. The value of the pattern discovery approach is demonstrated by finding a set of genes that differentiate between two types of lymphoma. Moreover, these genes are found to behave consistently in an independent dataset produced in a different laboratory using different arrays, thus validating the genes selected using our algorithm. We show that the genes deemed significant in terms of their multivariate statistics will be missed using other methods. Our set of pattern discovery algorithms including a user interface is distributed as a package called Genes@Work. This package is freely available to non-commercial users and can be downloaded from our website (http://www.research.ibm.com/FunGen).

  16. Validation of reference genes for quantitative RT-PCR studies of gene expression in perennial ryegrass (Lolium perenne L.)

    PubMed Central

    2010-01-01

    Background Perennial ryegrass (Lolium perenne L.) is an important pasture and turf crop. Biotechniques such as gene expression studies are being employed to improve traits in this temperate grass. Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) is among the best methods available for determining changes in gene expression. Before analysis of target gene expression, it is essential to select an appropriate normalisation strategy to control for non-specific variation between samples. Reference genes that have stable expression at different biological and physiological states can be effectively used for normalisation; however, their expression stability must be validated before use. Results Existing Serial Analysis of Gene Expression data were queried to identify six moderately expressed genes that had relatively stable gene expression throughout the year. These six candidate reference genes (eukaryotic elongation factor 1 alpha, eEF1A; TAT-binding protein homolog 1, TBP-1; eukaryotic translation initiation factor 4 alpha, eIF4A; YT521-B-like protein family protein, YT521-B; histone 3, H3; ubiquitin-conjugating enzyme, E2) were validated for qRT-PCR normalisation in 442 diverse perennial ryegrass (Lolium perenne L.) samples sourced from field- and laboratory-grown plants under a wide range of experimental conditions. Eukaryotic EF1A is encoded by members of a multigene family exhibiting differential expression and necessitated the expression analysis of different eEF1A encoding genes; a highly expressed eEF1A (h), a moderately, but stably expressed eEF1A (s), and combined expression of multigene eEF1A (m). NormFinder identified eEF1A (s) and YT521-B as the best combination of two genes for normalisation of gene expression data in perennial ryegrass following different defoliation management in the field. Conclusions This study is unique in the magnitude of samples tested with the inclusion of numerous field-grown samples, helping pave the way to conduct gene expression studies in perennial biomass crops under field-conditions. From our study several stably expressed reference genes have been validated. This provides useful candidates for reference gene selection in perennial ryegrass under conditions other than those tested here. PMID:20089196

  17. Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations.

    PubMed

    Wu, Shuang; Liu, Zhi-Ping; Qiu, Xing; Wu, Hulin

    2014-01-01

    The immune response to viral infection is regulated by an intricate network of many genes and their products. The reverse engineering of gene regulatory networks (GRNs) using mathematical models from time course gene expression data collected after influenza infection is key to our understanding of the mechanisms involved in controlling influenza infection within a host. A five-step pipeline: detection of temporally differentially expressed genes, clustering genes into co-expressed modules, identification of network structure, parameter estimate refinement, and functional enrichment analysis, is developed for reconstructing high-dimensional dynamic GRNs from genome-wide time course gene expression data. Applying the pipeline to the time course gene expression data from influenza-infected mouse lungs, we have identified 20 distinct temporal expression patterns in the differentially expressed genes and constructed a module-based dynamic network using a linear ODE model. Both intra-module and inter-module annotations and regulatory relationships of our inferred network show some interesting findings and are highly consistent with existing knowledge about the immune response in mice after influenza infection. The proposed method is a computationally efficient, data-driven pipeline bridging experimental data, mathematical modeling, and statistical analysis. The application to the influenza infection data elucidates the potentials of our pipeline in providing valuable insights into systematic modeling of complicated biological processes.

  18. Comparative Bacterial Proteomics: Analysis of the Core Genome Concept

    PubMed Central

    Callister, Stephen J.; McCue, Lee Ann; Turse, Joshua E.; Monroe, Matthew E.; Auberry, Kenneth J.; Smith, Richard D.; Adkins, Joshua N.; Lipton, Mary S.

    2008-01-01

    While comparative bacterial genomic studies commonly predict a set of genes indicative of common ancestry, experimental validation of the existence of this core genome requires extensive measurement and is typically not undertaken. Enabled by an extensive proteome database developed over six years, we have experimentally verified the expression of proteins predicted from genomic ortholog comparisons among 17 environmental and pathogenic bacteria. More exclusive relationships were observed among the expressed protein content of phenotypically related bacteria, which is indicative of the specific lifestyles associated with these organisms. Although genomic studies can establish relative orthologous relationships among a set of bacteria and propose a set of ancestral genes, our proteomics study establishes expressed lifestyle differences among conserved genes and proposes a set of expressed ancestral traits. PMID:18253490

  19. A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test.

    PubMed

    Zhang, Qingyang

    2018-05-16

    Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.

  20. RAS oncogene-mediated deregulation of the transcriptome: from molecular signature to function.

    PubMed

    Schäfer, Reinhold; Sers, Christine

    2011-01-01

    Transcriptome analysis of cancer cells has developed into a standard procedure to elucidate multiple features of the malignant process and to link gene expression to clinical properties. Gene expression profiling based on microarrays provides essentially correlative information and needs to be transferred to the functional level in order to understand the activity and contribution of individual genes or sets of genes as elements of the gene signature. To date, there exist significant gaps in the functional understanding of gene expression profiles. Moreover, the processes that drive the profound transcriptional alterations that characterize cancer cells remain mainly elusive. We have used pathway-restricted gene expression profiles derived from RAS oncogene-transformed cells and from RAS-expressing cancer cells to identify regulators downstream of the MAPK pathway.We describe the role of epigenetic regulation exemplified by the control of several immune genes in generic cell lines and colorectal cancer cells, particularly the functional interaction between signaling and DNA methylation. Moreover, we assess the role of the architectural transcription factor high mobility AT-hook 2 (HMGA2) as a regulator of the RAS-responsive transcriptome in ovarian epithelial cells. Finally, we describe an integrated approach combining pathway interference in colorectal cancer cells, gene expression profiling and computational analysis of regulatory elements of deregulated target genes. This strategy resulted in the identification of Y-box binding protein 1 (YBX1) as a regulator of MAPK-dependent proliferation and gene expression. The implications for a therapeutic application of HMGA2 gene silencing and the role of YBX1 as a prognostic factor are discussed.

  1. A novel harmony search-K means hybrid algorithm for clustering gene expression data

    PubMed Central

    Nazeer, KA Abdul; Sebastian, MP; Kumar, SD Madhu

    2013-01-01

    Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k- ¬means clustering algorithm is widely used for many practical applications. But the original k-¬means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-¬means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms. PMID:23390351

  2. A novel harmony search-K means hybrid algorithm for clustering gene expression data.

    PubMed

    Nazeer, Ka Abdul; Sebastian, Mp; Kumar, Sd Madhu

    2013-01-01

    Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k- ¬means clustering algorithm is widely used for many practical applications. But the original k-¬means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-¬means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms.

  3. IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

    PubMed Central

    Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

    2017-01-01

    Abstract Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656

  4. TIPMaP: a web server to establish transcript isoform profiles from reliable microarray probes.

    PubMed

    Chitturi, Neelima; Balagannavar, Govindkumar; Chandrashekar, Darshan S; Abinaya, Sadashivam; Srini, Vasan S; Acharya, Kshitish K

    2013-12-27

    Standard 3' Affymetrix gene expression arrays have contributed a significantly higher volume of existing gene expression data than other microarray platforms. These arrays were designed to identify differentially expressed genes, but not their alternatively spliced transcript forms. No resource can currently identify expression pattern of specific mRNA forms using these microarray data, even though it is possible to do this. We report a web server for expression profiling of alternatively spliced transcripts using microarray data sets from 31 standard 3' Affymetrix arrays for human, mouse and rat species. The tool has been experimentally validated for mRNAs transcribed or not-detected in a human disease condition (non-obstructive azoospermia, a male infertility condition). About 4000 gene expression datasets were downloaded from a public repository. 'Good probes' with complete coverage and identity to latest reference transcript sequences were first identified. Using them, 'Transcript specific probe-clusters' were derived for each platform and used to identify expression status of possible transcripts. The web server can lead the user to datasets corresponding to specific tissues, conditions via identifiers of the microarray studies or hybridizations, keywords, official gene symbols or reference transcript identifiers. It can identify, in the tissues and conditions of interest, about 40% of known transcripts as 'transcribed', 'not-detected' or 'differentially regulated'. Corresponding additional information for probes, genes, transcripts and proteins can be viewed too. We identified the expression of transcripts in a specific clinical condition and validated a few of these transcripts by experiments (using reverse transcription followed by polymerase chain reaction). The experimental observations indicated higher agreements with the web server results, than contradictions. The tool is accessible at http://resource.ibab.ac.in/TIPMaP. The newly developed online tool forms a reliable means for identification of alternatively spliced transcript-isoforms that may be differentially expressed in various tissues, cell types or physiological conditions. Thus, by making better use of existing data, TIPMaP avoids the dependence on precious tissue-samples, in experiments with a goal to establish expression profiles of alternative splice forms--at least in some cases.

  5. Concerted action of two dlx paralogs in sensory placode formation.

    PubMed

    Solomon, Keely S; Fritz, Andreas

    2002-07-01

    Sensory placodes are ectodermal thickenings that give rise to elements of the vertebrate cranial sensory nervous system, including the inner ear and nose. Although mutations have been described in humans, mice and zebrafish that perturb ear and nose development, no mutation is known to prevent sensory placode formation. Thus, it has been postulated that a functional redundancy exists in the genetic mechanisms that govern sensory placode development. We describe a zebrafish deletion mutation, b380, which results in a lack of both otic and olfactory placodes. The b380 deletion removes several known genes and expressed sequence tags, including dlx3 and dlx7, two transcription factors that share a homoeobox domain similar in sequence to the Drosophila Distal-less gene. dlx3 and dlx7 are expressed in an overlapping pattern in the regions that produce the otic and olfactory placodes in zebrafish. We present evidence suggesting that it is specifically the removal of these two genes that leads to the otic and olfactory phenotype of b380 mutants. Using morpholinos, antisense oligonucleotides that effectively block translation of target genes, we find that functional reduction of both dlx genes contributes to placode loss. Expression patterns of the otic marker pax2.1, olfactory marker anxV and eya1, a marker of both placodes, in morpholino-injected embryos recapitulate the reduced expression of these genes seen in b380 mutants. We also examine expression of dlx3 and dlx7 in the morpholino-injected embryos and present evidence for existence of auto- and cross-regulatory control of expression among these genes. We demonstrate that dlx3 is necessary and sufficient for proper otic and olfactory placode development. However, our results indicate that dlx3 and dlx7 act in concert and their importance in placode formation is only revealed by inactivating both paralogs.

  6. Diversity in Expression of Phosphorus (P) Responsive Genes in Cucumis melo L

    PubMed Central

    Fita, Ana; Bowen, Helen C.; Hayden, Rory M.; Nuez, Fernando; Picó, Belén; Hammond, John P.

    2012-01-01

    Background Phosphorus (P) is a major limiting nutrient for plant growth in many soils. Studies in model species have identified genes involved in plant adaptations to low soil P availability. However, little information is available on the genetic bases of these adaptations in vegetable crops. In this respect, sequence data for melon now makes it possible to identify melon orthologues of candidate P responsive genes, and the expression of these genes can be used to explain the diversity in the root system adaptation to low P availability, recently observed in this species. Methodology and Findings Transcriptional responses to P starvation were studied in nine diverse melon accessions by comparing the expression of eight candidate genes (Cm-PAP10.1, Cm-PAP10.2, Cm-RNS1, Cm-PPCK1, Cm-transferase, Cm-SQD1, Cm-DGD1 and Cm-SPX2) under P replete and P starved conditions. Differences among melon accessions were observed in response to P starvation, including differences in plant morphology, P uptake, P use efficiency (PUE) and gene expression. All studied genes were up regulated under P starvation conditions. Differences in the expression of genes involved in P mobilization and remobilization (Cm-PAP10.1, Cm-PAP10.2 and Cm-RNS1) under P starvation conditions explained part of the differences in P uptake and PUE among melon accessions. The levels of expression of the other studied genes were diverse among melon accessions, but contributed less to the phenotypical response of the accessions. Conclusions This is the first time that these genes have been described in the context of P starvation responses in melon. There exists significant diversity in gene expression levels and P use efficiency among melon accessions as well as significant correlations between gene expression levels and phenotypical measurements. PMID:22536378

  7. Comparative Analysis of Gene Expression for Convergent Evolution of Camera Eye Between Octopus and Human

    PubMed Central

    Ogura, Atsushi; Ikeo, Kazuho; Gojobori, Takashi

    2004-01-01

    Although the camera eye of the octopus is very similar to that of humans, phylogenetic and embryological analyses have suggested that their camera eyes have been acquired independently. It has been known as a typical example of convergent evolution. To study the molecular basis of convergent evolution of camera eyes, we conducted a comparative analysis of gene expression in octopus and human camera eyes. We sequenced 16,432 ESTs of the octopus eye, leading to 1052 nonredundant genes that have matches in the protein database. Comparing these 1052 genes with 13,303 already-known ESTs of the human eye, 729 (69.3%) genes were commonly expressed between the human and octopus eyes. On the contrary, when we compared octopus eye ESTs with human connective tissue ESTs, the expression similarity was quite low. To trace the evolutionary changes that are potentially responsible for camera eye formation, we also compared octopus-eye ESTs with the completed genome sequences of other organisms. We found that 1019 out of the 1052 genes had already existed at the common ancestor of bilateria, and 875 genes were conserved between humans and octopuses. It suggests that a larger number of conserved genes and their similar gene expression may be responsible for the convergent evolution of the camera eye. PMID:15289475

  8. Personality and gene expression: Do individual differences exist in the leukocyte transcriptome?

    PubMed

    Vedhara, Kavita; Gill, Sana; Eldesouky, Lameese; Campbell, Bruce K; Arevalo, Jesusa M G; Ma, Jeffrey; Cole, Steven W

    2015-02-01

    The temporal and situational stability of personality has led generations of researchers to hypothesize that personality may have enduring effects on health, but the biological mechanisms of such relationships remain poorly understood. In the present study, we utilized a functional genomics approach to examine the relationship between the 5 major dimensions of personality and patterns of gene expression as predicted by 'behavioural immune response' theory. We specifically focussed on two sets of genes previously linked to stress, threat, and adverse socio-environmental conditions: pro-inflammatory genes and genes involved in Type I interferon and antibody responses. An opportunity sample of 121 healthy individuals was recruited (86 females; mean age 24 years). Individuals completed a validated measure of personality; questions relating to current health behaviours; and provided a 5ml sample of peripheral blood for gene expression analysis. Extraversion was associated with increased expression of pro-inflammatory genes and Conscientiousness was associated with reduced expression of pro-inflammatory genes. Both associations were independent of health behaviours, negative affect, and leukocyte subset distributions. Antiviral and antibody-related gene expression was not associated with any personality dimension. The present data shed new light on the long-observed epidemiological associations between personality, physical health, and human longevity. Further research is required to elucidate the biological mechanisms underlying these associations. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Transcriptome-Wide Identification of Reference Genes for Expression Analysis of Soybean Responses to Drought Stress along the Day.

    PubMed

    Marcolino-Gomes, Juliana; Rodrigues, Fabiana Aparecida; Fuganti-Pagliarini, Renata; Nakayama, Thiago Jonas; Ribeiro Reis, Rafaela; Bouças Farias, Jose Renato; Harmon, Frank G; Correa Molinari, Hugo Bruno; Correa Molinari, Mayla Daiane; Nepomuceno, Alexandre

    2015-01-01

    The soybean transcriptome displays strong variation along the day in optimal growth conditions and also in response to adverse circumstances, like drought stress. However, no study conducted to date has presented suitable reference genes, with stable expression along the day, for relative gene expression quantification in combined studies on drought stress and diurnal oscillations. Recently, water deficit responses have been associated with circadian clock oscillations at the transcription level, revealing the existence of hitherto unknown processes and increasing the demand for studies on plant responses to drought stress and its oscillation during the day. We performed data mining from a transcriptome-wide background using microarrays and RNA-seq databases to select an unpublished set of candidate reference genes, specifically chosen for the normalization of gene expression in studies on soybean under both drought stress and diurnal oscillations. Experimental validation and stability analysis in soybean plants submitted to drought stress and sampled during a 24 h timecourse showed that four of these newer reference genes (FYVE, NUDIX, Golgin-84 and CYST) indeed exhibited greater expression stability than the conventionally used housekeeping genes (ELF1-β and β-actin) under these conditions. We also demonstrated the effect of using reference candidate genes with different stability values to normalize the relative expression data from a drought-inducible soybean gene (DREB5) evaluated in different periods of the day.

  10. Personality and gene expression: Do individual differences exist in the leukocyte transcriptome?

    PubMed Central

    Vedhara, Kavita; Gill, Sana; Eldesouky, Lameese; Campbell, Bruce K.; Arevalo, Jesusa M. G.; Ma, Jeffrey; Cole, Steven W.

    2014-01-01

    Background The temporal and situational stability of personality has led generations of researchers to hypothesise that personality may have enduring effects on health, but the biological mechanisms of such relationships remain poorly understood. In the present study, we utilized a functional genomics approach to examine the relationship between the 5 major dimensions of personality and patterns of gene expression as predicted by ‘behavioural immune response’ theory. We specifically focussed on two sets of genes previously linked to stress, threat, and adverse socio-environmental conditions: pro-inflammatory genes and genes involved in Type I interferon and antibody responses. Methods An opportunity sample of 121 healthy individuals was recruited (86 females; mean age 24 years). Individuals completed a validated measure of personality; questions relating to current health behaviours; and provided a 5 ml sample of peripheral blood for gene expression analysis. Results Extraversion was associated with increased expression of pro-inflammatory genes and Conscientiousness was associated with reduced expression of pro-inflammatory genes. Both associations were independent of health behaviours, negative affect, and leukocyte subset distributions. Antiviral and antibody-related gene expression was not associated with any personality dimension. Conclusions The present data shed new light on the long-observed epidemiological associations between personality, physical health, and human longevity. Further research is required to elucidate the biological mechanisms underlying these associations. PMID:25459894

  11. Circadian clock gene LATE ELONGATED HYPOCOTYL directly regulates the timing of floral scent emission in Petunia

    PubMed Central

    Fenske, Myles P.; Hewett Hazelton, Kristen D.; Hempton, Andrew K.; Shim, Jae Sung; Yamamoto, Breanne M.; Riffell, Jeffrey A.; Imaizumi, Takato

    2015-01-01

    Flowers present a complex display of signals to attract pollinators, including the emission of floral volatiles. Volatile emission is highly regulated, and many species restrict emissions to specific times of the day. This rhythmic emission of scent is regulated by the circadian clock; however, the mechanisms have remained unknown. In Petunia hybrida, volatile emissions are dominated by products of the floral volatile benzenoid/phenylpropanoid (FVBP) metabolic pathway. Here we demonstrate that the circadian clock gene P. hybrida LATE ELONGATED HYPOCOTYL (LHY; PhLHY) regulates the daily expression patterns of the FVBP pathway genes and floral volatile production. PhLHY expression peaks in the morning, antiphasic to the expression of P. hybrida GIGANTEA (PhGI), the master scent regulator ODORANT1 (ODO1), and many other evening-expressed FVBP genes. Overexpression phenotypes of PhLHY in Arabidopsis caused an arrhythmic clock phenotype, which resembles those of LHY overexpressors. In Petunia, constitutive expression of PhLHY depressed the expression levels of PhGI, ODO1, evening-expressed FVBP pathway genes, and FVBP emission in flowers. Additionally, in the Petunia lines in which PhLHY expression was reduced, the timing of peak expression of PhGI, ODO1, and the FVBP pathway genes advanced to the morning. Moreover, PhLHY protein binds to cis-regulatory elements called evening elements that exist in promoters of ODO1 and other FVBP genes. Thus, our results imply that PhLHY directly sets the timing of floral volatile emission by restricting the expression of ODO1 and other FVBP genes to the evening in Petunia. PMID:26124104

  12. Circadian clock gene LATE ELONGATED HYPOCOTYL directly regulates the timing of floral scent emission in Petunia.

    PubMed

    Fenske, Myles P; Hewett Hazelton, Kristen D; Hempton, Andrew K; Shim, Jae Sung; Yamamoto, Breanne M; Riffell, Jeffrey A; Imaizumi, Takato

    2015-08-04

    Flowers present a complex display of signals to attract pollinators, including the emission of floral volatiles. Volatile emission is highly regulated, and many species restrict emissions to specific times of the day. This rhythmic emission of scent is regulated by the circadian clock; however, the mechanisms have remained unknown. In Petunia hybrida, volatile emissions are dominated by products of the floral volatile benzenoid/phenylpropanoid (FVBP) metabolic pathway. Here we demonstrate that the circadian clock gene P. hybrida LATE ELONGATED HYPOCOTYL (LHY; PhLHY) regulates the daily expression patterns of the FVBP pathway genes and floral volatile production. PhLHY expression peaks in the morning, antiphasic to the expression of P. hybrida GIGANTEA (PhGI), the master scent regulator ODORANT1 (ODO1), and many other evening-expressed FVBP genes. Overexpression phenotypes of PhLHY in Arabidopsis caused an arrhythmic clock phenotype, which resembles those of LHY overexpressors. In Petunia, constitutive expression of PhLHY depressed the expression levels of PhGI, ODO1, evening-expressed FVBP pathway genes, and FVBP emission in flowers. Additionally, in the Petunia lines in which PhLHY expression was reduced, the timing of peak expression of PhGI, ODO1, and the FVBP pathway genes advanced to the morning. Moreover, PhLHY protein binds to cis-regulatory elements called evening elements that exist in promoters of ODO1 and other FVBP genes. Thus, our results imply that PhLHY directly sets the timing of floral volatile emission by restricting the expression of ODO1 and other FVBP genes to the evening in Petunia.

  13. Differentially co-expressed interacting protein pairs discriminate samples under distinct stages of HIV type 1 infection.

    PubMed

    Yoon, Dukyong; Kim, Hyosil; Suh-Kim, Haeyoung; Park, Rae Woong; Lee, KiYoung

    2011-01-01

    Microarray analyses based on differentially expressed genes (DEGs) have been widely used to distinguish samples across different cellular conditions. However, studies based on DEGs have not been able to clearly determine significant differences between samples of pathophysiologically similar HIV-1 stages, e.g., between acute and chronic progressive (or AIDS) or between uninfected and clinically latent stages. We here suggest a novel approach to allow such discrimination based on stage-specific genetic features of HIV-1 infection. Our approach is based on co-expression changes of genes known to interact. The method can identify a genetic signature for a single sample as contrasted with existing protein-protein-based analyses with correlational designs. Our approach distinguishes each sample using differentially co-expressed interacting protein pairs (DEPs) based on co-expression scores of individual interacting pairs within a sample. The co-expression score has positive value if two genes in a sample are simultaneously up-regulated or down-regulated. And the score has higher absolute value if expression-changing ratios are similar between the two genes. We compared characteristics of DEPs with that of DEGs by evaluating their usefulness in separation of HIV-1 stage. And we identified DEP-based network-modules and their gene-ontology enrichment to find out the HIV-1 stage-specific gene signature. Based on the DEP approach, we observed clear separation among samples from distinct HIV-1 stages using clustering and principal component analyses. Moreover, the discrimination power of DEPs on the samples (70-100% accuracy) was much higher than that of DEGs (35-45%) using several well-known classifiers. DEP-based network analysis also revealed the HIV-1 stage-specific network modules; the main biological processes were related to "translation," "RNA splicing," "mRNA, RNA, and nucleic acid transport," and "DNA metabolism." Through the HIV-1 stage-related modules, changing stage-specific patterns of protein interactions could be observed. DEP-based method discriminated the HIV-1 infection stages clearly, and revealed a HIV-1 stage-specific gene signature. The proposed DEP-based method might complement existing DEG-based approaches in various microarray expression analyses.

  14. Rce1, a novel transcriptional repressor, regulates cellulase gene expression by antagonizing the transactivator Xyr1 in Trichoderma reesei.

    PubMed

    Cao, Yanli; Zheng, Fanglin; Wang, Lei; Zhao, Guolei; Chen, Guanjun; Zhang, Weixin; Liu, Weifeng

    2017-07-01

    Cellulase gene expression in the model cellulolytic fungus Trichoderma reesei is supposed to be controlled by an intricate regulatory network involving multiple transcription factors. Here, we identified a novel transcriptional repressor of cellulase gene expression, Rce1. Disruption of the rce1 gene not only facilitated the induced expression of cellulase genes but also led to a significant delay in terminating the induction process. However, Rce1 did not participate in Cre1-mediated catabolite repression. Electrophoretic mobility shift (EMSA) and DNase I footprinting assays in combination with chromatin immunoprecipitation (ChIP) demonstrated that Rce1 could bind directly to a cbh1 (cellobiohydrolase 1-encoding) gene promoter region containing a cluster of Xyr1 binding sites. Furthermore, competitive binding assays revealed that Rce1 antagonized Xyr1 from binding to the cbh1 promoter. These results indicate that intricate interactions exist between a variety of transcription factors to ensure tight and energy-efficient regulation of cellulase gene expression in T. reesei. This study also provides important clues regarding increased cellulase production in T. reesei. © 2017 John Wiley & Sons Ltd.

  15. Multiscale Embedded Gene Co-expression Network Analysis

    PubMed Central

    Song, Won-Min; Zhang, Bin

    2015-01-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. PMID:26618778

  16. Multiscale Embedded Gene Co-expression Network Analysis.

    PubMed

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  17. Investigation of Seasonal and Latitudinal Effects on the Expression of Clock Genes in Drosophila

    NASA Astrophysics Data System (ADS)

    Hosseini, Seyede Sanaz; Nazarimehr, Fahimeh; Jafari, Sajad

    The primary goal in this work is to develop a dynamical model capturing the influence of seasonal and latitudinal variations on the expression of Drosophila clock genes. To this end, we study a specific dynamical system with strange attractors that exhibit changes of Drosophila activity in a range of latitudes and across different seasons. Bifurcations of this system are analyzed to peruse the effect of season and latitude on the behavior of clock genes. Existing experimental data collected from the activity of Drosophila melanogaster corroborate the dynamical model.

  18. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus)

    PubMed Central

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-01-01

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269

  19. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus).

    PubMed

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-02-23

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.

  20. Evolution of the bovine lysozyme gene family: changes in gene expression and reversion of function.

    PubMed

    Irwin, D M

    1995-09-01

    Recruitment of lysozyme to a digestive function in ruminant artiodactyls is associated with amplification of the gene. At least four of the approximately ten genes are expressed in the stomach, and several are expressed in nonstomach tissues. Characterization of additional lysozymelike sequences in the bovine genome has identified most, if not all, of the members of this gene family. There are at least six stomachlike lysozyme genes, two of which are pseudogenes. The stomach lysozyme pseudogenes show a pattern of concerted evolution similar to that of the functional stomach genes. At least four nonstomach lysozyme genes exist. The nonstomach lysozyme genes are not monophyletic. A gene encoding a tracheal lysozyme was isolated, and the stomach lysozyme of advanced ruminants was found to be more closely related to the tracheal lysozyme than to the stomach lysozyme of the camel or other nonstomach lysozyme genes of ruminants. The tracheal lysozyme shares with stomach lysozymes of advanced ruminants the deletion of amino acid 103, and several other adaptive sequence characteristics of stomach lysozymes. I suggest here that tracheal lysozyme has reverted from a functional stomach lysozyme. Tracheal lysozyme then represents a second instance of a change in lysozyme gene expression and function within ruminants.

  1. Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering.

    PubMed

    Sun, Peng; Speicher, Nora K; Röttger, Richard; Guo, Jiong; Baumbach, Jan

    2014-05-01

    The explosion of the biological data has dramatically reformed today's biological research. The need to integrate and analyze high-dimensional biological data on a large scale is driving the development of novel bioinformatics approaches. Biclustering, also known as 'simultaneous clustering' or 'co-clustering', has been successfully utilized to discover local patterns in gene expression data and similar biomedical data types. Here, we contribute a new heuristic: 'Bi-Force'. It is based on the weighted bicluster editing model, to perform biclustering on arbitrary sets of biological entities, given any kind of pairwise similarities. We first evaluated the power of Bi-Force to solve dedicated bicluster editing problems by comparing Bi-Force with two existing algorithms in the BiCluE software package. We then followed a biclustering evaluation protocol in a recent review paper from Eren et al. (2013) (A comparative analysis of biclustering algorithms for gene expressiondata. Brief. Bioinform., 14:279-292.) and compared Bi-Force against eight existing tools: FABIA, QUBIC, Cheng and Church, Plaid, BiMax, Spectral, xMOTIFs and ISA. To this end, a suite of synthetic datasets as well as nine large gene expression datasets from Gene Expression Omnibus were analyzed. All resulting biclusters were subsequently investigated by Gene Ontology enrichment analysis to evaluate their biological relevance. The distinct theoretical foundation of Bi-Force (bicluster editing) is more powerful than strict biclustering. We thus outperformed existing tools with Bi-Force at least when following the evaluation protocols from Eren et al. Bi-Force is implemented in Java and integrated into the open source software package of BiCluE. The software as well as all used datasets are publicly available at http://biclue.mpi-inf.mpg.de. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Homeobox genes and melatonin synthesis: regulatory roles of the cone-rod homeobox transcription factor in the rodent pineal gland.

    PubMed

    Rohde, Kristian; Møller, Morten; Rath, Martin Fredensborg

    2014-01-01

    Nocturnal synthesis of melatonin in the pineal gland is controlled by a circadian rhythm in arylalkylamine N-acetyltransferase (AANAT) enzyme activity. In the rodent, Aanat gene expression displays a marked circadian rhythm; release of norepinephrine in the gland at night causes a cAMP-based induction of Aanat transcription. However, additional transcriptional control mechanisms exist. Homeobox genes, which are generally known to encode transcription factors controlling developmental processes, are also expressed in the mature rodent pineal gland. Among these, the cone-rod homeobox (CRX) transcription factor is believed to control pineal-specific Aanat expression. Based on recent advances in our understanding of Crx in the rodent pineal gland, we here suggest that homeobox genes play a role in adult pineal physiology both by ensuring pineal-specific Aanat expression and by facilitating cAMP response element-based circadian melatonin production.

  3. Reduced mycorrhizal colonization (rmc) tomato mutant lacks expression of SymRK signaling pathway genes

    PubMed Central

    Nair, Aswathy; Bhargava, Sujata

    2012-01-01

    Comparison of the expression of 13 genes involved in arbuscular mycorrhizal (AM) symbiosis was performed in a wild type tomato (Solanum lycopersicum cv 76R) and its reduced mycorrhizal colonization mutant rmc in response to colonization with Glomus fasiculatum. Four defense-related genes were induced to a similar extent in the mutant and wild type AM colonized plants, indicating a systemic response to AM colonization. Genes related to nutrient exchange between the symbiont partners showed higher expression in the AM roots of wild type plants than the mutant plants, which correlated with their arbuscular frequency. A symbiosis receptor kinase that is involved in both nodulation and AM symbiosis was not expressed in the rmc mutant. The fact that some colonization was observed in rmc was suggestive of the existence of an alternate colonization signaling pathway for AM symbiosis in this mutant. PMID:23221680

  4. Homeobox Genes and Melatonin Synthesis: Regulatory Roles of the Cone-Rod Homeobox Transcription Factor in the Rodent Pineal Gland

    PubMed Central

    Rath, Martin Fredensborg

    2014-01-01

    Nocturnal synthesis of melatonin in the pineal gland is controlled by a circadian rhythm in arylalkylamine N-acetyltransferase (AANAT) enzyme activity. In the rodent, Aanat gene expression displays a marked circadian rhythm; release of norepinephrine in the gland at night causes a cAMP-based induction of Aanat transcription. However, additional transcriptional control mechanisms exist. Homeobox genes, which are generally known to encode transcription factors controlling developmental processes, are also expressed in the mature rodent pineal gland. Among these, the cone-rod homeobox (CRX) transcription factor is believed to control pineal-specific Aanat expression. Based on recent advances in our understanding of Crx in the rodent pineal gland, we here suggest that homeobox genes play a role in adult pineal physiology both by ensuring pineal-specific Aanat expression and by facilitating cAMP response element-based circadian melatonin production. PMID:24877149

  5. GeneNetFinder2: Improved Inference of Dynamic Gene Regulatory Relations with Multiple Regulators.

    PubMed

    Han, Kyungsook; Lee, Jeonghoon

    2016-01-01

    A gene involved in complex regulatory interactions may have multiple regulators since gene expression in such interactions is often controlled by more than one gene. Another thing that makes gene regulatory interactions complicated is that regulatory interactions are not static, but change over time during the cell cycle. Most research so far has focused on identifying gene regulatory relations between individual genes in a particular stage of the cell cycle. In this study we developed a method for identifying dynamic gene regulations of several types from the time-series gene expression data. The method can find gene regulations with multiple regulators that work in combination or individually as well as those with single regulators. The method has been implemented as the second version of GeneNetFinder (hereafter called GeneNetFinder2) and tested on several gene expression datasets. Experimental results with gene expression data revealed the existence of genes that are not regulated by individual genes but rather by a combination of several genes. Such gene regulatory relations cannot be found by conventional methods. Our method finds such regulatory relations as well as those with multiple, independent regulators or single regulators, and represents gene regulatory relations as a dynamic network in which different gene regulatory relations are shown in different stages of the cell cycle. GeneNetFinder2 is available at http://bclab.inha.ac.kr/GeneNetFinder and will be useful for modeling dynamic gene regulations with multiple regulators.

  6. A qRT-PCR assay for the expression of all Mal d 1 isoallergen genes

    PubMed Central

    2013-01-01

    Background A considerable number of individuals suffer from oral allergy syndrome (OAS) to apple, resulting in the avoidance of apple consumption. Apple cultivars differ greatly in their allergenic properties, but knowledge of the causes for such differences is incomplete. Mal d 1 is considered the major apple allergen. For Mal d 1, a wide range of isoallergens and variants exist, and they are encoded by a large gene family. To identify the specific proteins/genes that are potentially involved in the allergy, we developed a PCR assay to monitor the expression of each individual Mal d 1 gene. Gene-specific primer pairs were designed for the exploitation of sequence differences among Mal d 1 genes. The specificity of these primers was validated using both in silico and in vitro techniques. Subsequently, this assay was applied to the peel and flesh of fruits from the two cultivars ‘Florina’ and ‘Gala’. Results We successfully developed gene-specific primer pairs for each of the 31 Mal d 1 genes and incorporated them into a qRT-PCR assay. The results from the application of the assay showed that 11 genes were not expressed in fruit. In addition, differential expression was observed among the Mal d 1 genes that were expressed in the fruit. Moreover, the expression levels were tissue and cultivar dependent. Conclusion The assay developed in this study facilitated the first characterisation of the expression levels of all known Mal d 1 genes in a gene-specific manner. Using this assay on different fruit tissues and cultivars, we obtained knowledge concerning gene relevance in allergenicity. This study provides new perspectives for research on both plant breeding and immunotherapy. PMID:23522122

  7. GENE EXPRESSION CHANGES IN MOUSE BLADDER TISSUE IN RESPONSE TO INORGANIC ARSENIC

    EPA Science Inventory

    Chronic human exposures to high arsenic concentrations are associated with lung, skin, and bladder cancer. Considerable controversy exists concerning arsenic mode of action and low dose extrapolation. This investigation was designed to identify dose-response changes in gene expre...

  8. TTG2 controls the developmental regulation of seed coat tannins in Arabidopsis by regulating vacuolar transport steps in the proanthocyanidin pathway.

    PubMed

    Gonzalez, Antonio; Brown, Matthew; Hatlestad, Greg; Akhavan, Neda; Smith, Tyler; Hembd, Austin; Moore, Joshua; Montes, David; Mosley, Trenell; Resendez, Juan; Nguyen, Huy; Wilson, Lyndsey; Campbell, Annabelle; Sudarshan, Duncan; Lloyd, Alan

    2016-11-01

    The brown color of Arabidopsis seeds is caused by the deposition of proanthocyanidins (PAs or condensed tannins) in their inner testa layer. A transcription factor complex consisting of TT2, TT8 and TTG1 controls expression of PA biosynthetic genes, just as similar TTG1-dependent complexes have been shown to control flavonoid pigment pathway gene expression in general. However, PA synthesis is controlled by at least one other gene. TTG2 mutants lack the pigmentation found in wild-type seeds, but produce other flavonoid compounds, such as anthocyanins in the shoot, suggesting that TTG2 regulates genes in the PA biosynthetic branch of the flavonoid pathway. We analyzed the expression of PA biosynthetic genes within the developing seeds of ttg2-1 and wild-type plants for potential TTG2 regulatory targets. We found that expression of TT12, encoding a MATE type transporter, is dependent on TTG2 and that TTG2 can bind to the upstream regulatory region of TT12 suggesting that TTG2 directly regulates TT12. Ectopic expression of TT12 in ttg2-1 plants partially restores seed coat pigmentation. Moreover, we show that TTG2 regulation of TT12 is dependent on TTG1 and that TTG1 and TTG2 physically interact. The observation that TTG1 interacts with TTG2, a WRKY type transcription factor, proposes the existence of a novel TTG1-containing complex, and an addendum to the existing paradigm of flavonoid pathway regulation. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Evolutionarily conserved ELOVL4 gene expression in the vertebrate retina.

    PubMed

    Lagali, Pamela S; Liu, Jiafan; Ambasudhan, Rajesh; Kakuk, Laura E; Bernstein, Steven L; Seigel, Gail M; Wong, Paul W; Ayyagari, Radha

    2003-07-01

    The gene elongation of very long chain fatty acids-4 (ELOVL4) has been shown to underlie phenotypically heterogeneous forms of autosomal dominant macular degeneration. In this study, the extent of evolutionary conservation and the existence and localization of retinal expression of this gene was investigated across a wide variety of species. Southern blot analysis of genomic DNA and bioinformatic analysis using the human ELOVL4 cDNA and protein sequences, respectively, were performed to identify species in which ELOVL4 orthologues and/or homologues are present. Retinal RNA and protein extracts derived from different species were assessed by Northern hybridization and immunoblot techniques to assess evolutionary conservation of gene expression. Immunohistochemical analysis of tissue sections prepared from various mammalian retinas was performed to determine the distribution of ELOVL4 and homologous proteins within specific retinal cell layers. The existence of ELOVL4 sequence orthologues and homologues was confirmed by both Southern blot analysis and in silico searches of protein sequence databases. Phylogenetic analysis places ELOVL4 among a large family of known and putative fatty acid elongase proteins. Northern blot analysis revealed the presence of multiple transcripts corresponding to ELOVL4 homologues expressed in the retina of several different mammalian species. Conserved proteins were also detected among retinal extracts of different mammals and were found to localize predominantly to the photoreceptor cell layer within retinal tissue preparations. The ELOVL4 gene is highly conserved throughout evolution and is expressed in the photoreceptor cells of the retina in a variety of different species, which suggests that it plays a critical role in retinal cell biology.

  10. Season-dependent effects of photoperiod and temperature on circadian rhythm of arylalkylamine N-acetyltransferase2 gene expression in pineal organ of an air-breathing catfish, Clarias gariepinus.

    PubMed

    Singh, Kshetrimayum Manisana; Saha, Saurav; Gupta, Braj Bansh Prasad

    2017-08-01

    Arylalkylamine N-acetyltransferase (AANAT) activity, aanat gene expression and melatonin production have been reported to exhibit prominent circadian rhythm in the pineal organ of most species of fish. Three types of aanat genes are expressed in fish, but the fish pineal organ predominantly expresses aanat2 gene. Increase and decrease in daylength is invariably associated with increase and decrease in temperature, respectively. But so far no attempt has been made to delineate the role of photoperiod and temperature in regulation of the circadian rhythm of aanat2 gene expression in the pineal organ of any fish with special reference to seasons. Therefore, we studied effects of various lighting regimes (12L-12D, 16L-8D, 8L-16D, LL and DD) at a constant temperature (25°C) and effects of different temperatures (15°, 25° and 35°C) under a common photoperiod 12L-12D on circadian rhythm of aanat2 gene expression in the pineal organ of Clarias gariepinus during summer and winter seasons. Aanat2 gene expression in fish pineal organ was studied by measuring aanat2 mRNA levels using Real-Time PCR. Our findings indicate that the pineal organ of C. gariepinus exhibits a prominent circadian rhythm of aanat2 gene expression irrespective of photoperiods, temperatures and seasons, and the circadian rhythm of aanat2 gene expression responds differently to different photoperiods and temperatures in a season-dependent manner. Existence of circadian rhythm of aanat2 gene expression in pineal organs maintained in vitro under 12L-12D and DD conditions as well as a free running rhythm of the gene expression in pineal organ of the fish maintained under LL and DD conditions suggest that the fish pineal organ possesses an endogenous circadian oscillator, which is entrained by light-dark cycle. Copyright © 2017 Elsevier B.V. All rights reserved.

  11. Pluripotency, Differentiation, and Reprogramming: A Gene Expression Dynamics Model with Epigenetic Feedback Regulation

    PubMed Central

    Miyamoto, Tadashi; Furusawa, Chikara; Kaneko, Kunihiko

    2015-01-01

    Embryonic stem cells exhibit pluripotency: they can differentiate into all types of somatic cells. Pluripotent genes such as Oct4 and Nanog are activated in the pluripotent state, and their expression decreases during cell differentiation. Inversely, expression of differentiation genes such as Gata6 and Gata4 is promoted during differentiation. The gene regulatory network controlling the expression of these genes has been described, and slower-scale epigenetic modifications have been uncovered. Although the differentiation of pluripotent stem cells is normally irreversible, reprogramming of cells can be experimentally manipulated to regain pluripotency via overexpression of certain genes. Despite these experimental advances, the dynamics and mechanisms of differentiation and reprogramming are not yet fully understood. Based on recent experimental findings, we constructed a simple gene regulatory network including pluripotent and differentiation genes, and we demonstrated the existence of pluripotent and differentiated states from the resultant dynamical-systems model. Two differentiation mechanisms, interaction-induced switching from an expression oscillatory state and noise-assisted transition between bistable stationary states, were tested in the model. The former was found to be relevant to the differentiation process. We also introduced variables representing epigenetic modifications, which controlled the threshold for gene expression. By assuming positive feedback between expression levels and the epigenetic variables, we observed differentiation in expression dynamics. Additionally, with numerical reprogramming experiments for differentiated cells, we showed that pluripotency was recovered in cells by imposing overexpression of two pluripotent genes and external factors to control expression of differentiation genes. Interestingly, these factors were consistent with the four Yamanaka factors, Oct4, Sox2, Klf4, and Myc, which were necessary for the establishment of induced pluripotent stem cells. These results, based on a gene regulatory network and expression dynamics, contribute to our wider understanding of pluripotency, differentiation, and reprogramming of cells, and they provide a fresh viewpoint on robustness and control during development. PMID:26308610

  12. Evaluation of Reference Genes for Gene Expression Analysis Using Quantitative RT-PCR in Azospirillum brasilense

    PubMed Central

    McMillan, Mary; Pereg, Lily

    2014-01-01

    Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA) in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA) is sufficient for effective normalisation of qRT-PCR data. PMID:24841066

  13. Evaluation of reference genes for gene expression analysis using quantitative RT-PCR in Azospirillum brasilense.

    PubMed

    McMillan, Mary; Pereg, Lily

    2014-01-01

    Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA) in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA) is sufficient for effective normalisation of qRT-PCR data.

  14. An improved Pearson's correlation proximity-based hierarchical clustering for mining biological association between genes.

    PubMed

    Booma, P M; Prabhakaran, S; Dhanalakshmi, R

    2014-01-01

    Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.

  15. An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

    PubMed Central

    Booma, P. M.; Prabhakaran, S.; Dhanalakshmi, R.

    2014-01-01

    Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. PMID:25136661

  16. The biology and evolution of transposable elements in parasites.

    PubMed

    Thomas, M Carmen; Macias, Francisco; Alonso, Carlos; López, Manuel C

    2010-07-01

    Transposable elements (TEs) are dynamic elements that can reshape host genomes by generating rearrangements with the potential to create or disrupt genes, to shuffle existing genes, and to modulate their patterns of expression. In the genomes of parasites that infect mammals several TEs have been identified that probably have been maintained throughout evolution due to their contribution to gene function and regulation of gene expression. This review addresses how TEs are organized, how they colonize the genomes of mammalian parasites, the functional role these elements play in parasite biology, and the interactions between these elements and the parasite genome. Copyright 2010 Elsevier Ltd. All rights reserved.

  17. Evidence that molecular changes in cells occur before morphological alterations during the progression of breast ductal carcinoma

    PubMed Central

    Castro, Nadia P; Osório, Cynthia ABT; Torres, César; Bastos, Elen P; Mourão-Neto, Mário; Soares, Fernando A; Brentani, Helena P; Carraro, Dirce M

    2008-01-01

    Introduction Ductal carcinoma in situ (DCIS) of the breast includes a heterogeneous group of preinvasive tumors with uncertain evolution. Definition of the molecular factors necessary for progression to invasive disease is crucial to determining which lesions are likely to become invasive. To obtain insight into the molecular basis of DCIS, we compared the gene expression pattern of cells from the following samples: non-neoplastic, pure DCIS, in situ component of lesions with co-existing invasive ductal carcinoma, and invasive ductal carcinoma. Methods Forty-one samples were evaluated: four non-neoplastic, five pure DCIS, 22 in situ component of lesions with co-existing invasive ductal carcinoma, and 10 invasive ductal carcinoma. Pure cell populations were isolated using laser microdissection. Total RNA was purified, DNase treated, and amplified using the T7-based method. Microarray analysis was conducted using a customized cDNA platform. The concept of molecular divergence was applied to classify the sample groups using analysis of variance followed by Tukey's test. Results Among the tumor sample groups, cells from pure DCIS exhibited the most divergent molecular profile, consequently identifying cells from in situ component of lesions with co-existing invasive ductal carcinoma as very similar to cells from invasive lesions. Additionally, we identified 147 genes that were differentially expressed between pure DCIS and in situ component of lesions with co-existing invasive ductal carcinoma, which can discriminate samples representative of in situ component of lesions with co-existing invasive ductal carcinoma from 60% of pure DCIS samples. A gene subset was evaluated using quantitative RT-PCR, which confirmed differential expression for 62.5% and 60.0% of them using initial and partial independent sample groups, respectively. Among these genes, LOX and SULF-1 exhibited features that identify them as potential participants in the malignant process of DCIS. Conclusions We identified new genes that are potentially involved in the malignant transformation of DCIS, and our findings strongly suggest that cells from the in situ component of lesions with co-existing invasive ductal carcinoma exhibit molecular alterations that enable them to invade the surrounding tissue before morphological changes in the lesion become apparent. PMID:18928525

  18. Evidence that molecular changes in cells occur before morphological alterations during the progression of breast ductal carcinoma.

    PubMed

    Castro, Nadia P; Osório, Cynthia A B T; Torres, César; Bastos, Elen P; Mourão-Neto, Mário; Soares, Fernando A; Brentani, Helena P; Carraro, Dirce M

    2008-01-01

    Ductal carcinoma in situ (DCIS) of the breast includes a heterogeneous group of preinvasive tumors with uncertain evolution. Definition of the molecular factors necessary for progression to invasive disease is crucial to determining which lesions are likely to become invasive. To obtain insight into the molecular basis of DCIS, we compared the gene expression pattern of cells from the following samples: non-neoplastic, pure DCIS, in situ component of lesions with co-existing invasive ductal carcinoma, and invasive ductal carcinoma. Forty-one samples were evaluated: four non-neoplastic, five pure DCIS, 22 in situ component of lesions with co-existing invasive ductal carcinoma, and 10 invasive ductal carcinoma. Pure cell populations were isolated using laser microdissection. Total RNA was purified, DNase treated, and amplified using the T7-based method. Microarray analysis was conducted using a customized cDNA platform. The concept of molecular divergence was applied to classify the sample groups using analysis of variance followed by Tukey's test. Among the tumor sample groups, cells from pure DCIS exhibited the most divergent molecular profile, consequently identifying cells from in situ component of lesions with co-existing invasive ductal carcinoma as very similar to cells from invasive lesions. Additionally, we identified 147 genes that were differentially expressed between pure DCIS and in situ component of lesions with co-existing invasive ductal carcinoma, which can discriminate samples representative of in situ component of lesions with co-existing invasive ductal carcinoma from 60% of pure DCIS samples. A gene subset was evaluated using quantitative RT-PCR, which confirmed differential expression for 62.5% and 60.0% of them using initial and partial independent sample groups, respectively. Among these genes, LOX and SULF-1 exhibited features that identify them as potential participants in the malignant process of DCIS. We identified new genes that are potentially involved in the malignant transformation of DCIS, and our findings strongly suggest that cells from the in situ component of lesions with co-existing invasive ductal carcinoma exhibit molecular alterations that enable them to invade the surrounding tissue before morphological changes in the lesion become apparent.

  19. Diurnal oscillations of soybean circadian clock and drought responsive genes.

    PubMed

    Marcolino-Gomes, Juliana; Rodrigues, Fabiana Aparecida; Fuganti-Pagliarini, Renata; Bendix, Claire; Nakayama, Thiago Jonas; Celaya, Brandon; Molinari, Hugo Bruno Correa; de Oliveira, Maria Cristina Neves; Harmon, Frank G; Nepomuceno, Alexandre

    2014-01-01

    Rhythms produced by the endogenous circadian clock play a critical role in allowing plants to respond and adapt to the environment. While there is a well-established regulatory link between the circadian clock and responses to abiotic stress in model plants, little is known of the circadian system in crop species like soybean. This study examines how drought impacts diurnal oscillation of both drought responsive and circadian clock genes in soybean. Drought stress induced marked changes in gene expression of several circadian clock-like components, such as LCL1-, GmELF4- and PRR-like genes, which had reduced expression in stressed plants. The same conditions produced a phase advance of expression for the GmTOC1-like, GmLUX-like and GmPRR7-like genes. Similarly, the rhythmic expression pattern of the soybean drought-responsive genes DREB-, bZIP-, GOLS-, RAB18- and Remorin-like changed significantly after plant exposure to drought. In silico analysis of promoter regions of these genes revealed the presence of cis-elements associated both with stress and circadian clock regulation. Furthermore, some soybean genes with upstream ABRE elements were responsive to abscisic acid treatment. Our results indicate that some connection between the drought response and the circadian clock may exist in soybean since (i) drought stress affects gene expression of circadian clock components and (ii) several stress responsive genes display diurnal oscillation in soybeans.

  20. Cloning of a gene (RIG-G) associated with retinoic acid-induced differentiation of acute promyelocytic leukemia cells and representing a new member of a family of interferon-stimulated genes

    PubMed Central

    Yu, Man; Tong, Jian-Hua; Mao, Mao; Kan, Li-Xin; Liu, Meng-Min; Sun, Yi-Wu; Fu, Gang; Jing, Yong-Kui; Yu, Long; Lepaslier, Denis; Lanotte, Michel; Wang, Zhen-Yi; Chen, Zhu; Waxman, Samuel; Wang, Ya-Xin; Tan, Jia-Zhen; Chen, Sai-Juan

    1997-01-01

    In a cell line (NB4) derived from a patient with acute promyelocytic leukemia, all-trans-retinoic acid (ATRA) and interferon (IFN) induce the expression of a novel gene we call RIG-G (for retinoic acid-induced gene G). This gene codes for a 58-kDa protein containing 490 amino acids with several potential sites for post-translational modification. In untreated NB4 cells, the expression of RIG-G is undetectable. ATRA treatment induces the transcriptional expression of RIG-G relatively late (12–24 hr) in a protein synthesis-dependent manner, whereas IFN-α induces its expression early (30 min to 3 hr). Database search has revealed a high-level homology between RIG-G and several IFN-stimulated genes in human (ISG54K, ISG56K, and IFN-inducible and retinoic acid-inducible 58K gene) and some other species, defining a well conserved gene family. The gene is composed of two exons and has been mapped by fluorescence in situ hybridization to chromosome 10q24, where two other human IFN-stimulated gene members are localized. A synergistic induction of RIG-G expression in NB4 cells by combined treatment with ATRA and IFNs suggests that a collaboration exists between their respective signaling pathways. PMID:9207104

  1. Diurnal Oscillations of Soybean Circadian Clock and Drought Responsive Genes

    PubMed Central

    Marcolino-Gomes, Juliana; Rodrigues, Fabiana Aparecida; Fuganti-Pagliarini, Renata; Bendix, Claire; Nakayama, Thiago Jonas; Celaya, Brandon; Molinari, Hugo Bruno Correa; de Oliveira, Maria Cristina Neves; Harmon, Frank G.; Nepomuceno, Alexandre

    2014-01-01

    Rhythms produced by the endogenous circadian clock play a critical role in allowing plants to respond and adapt to the environment. While there is a well-established regulatory link between the circadian clock and responses to abiotic stress in model plants, little is known of the circadian system in crop species like soybean. This study examines how drought impacts diurnal oscillation of both drought responsive and circadian clock genes in soybean. Drought stress induced marked changes in gene expression of several circadian clock-like components, such as LCL1-, GmELF4- and PRR-like genes, which had reduced expression in stressed plants. The same conditions produced a phase advance of expression for the GmTOC1-like, GmLUX-like and GmPRR7-like genes. Similarly, the rhythmic expression pattern of the soybean drought-responsive genes DREB-, bZIP-, GOLS-, RAB18- and Remorin-like changed significantly after plant exposure to drought. In silico analysis of promoter regions of these genes revealed the presence of cis-elements associated both with stress and circadian clock regulation. Furthermore, some soybean genes with upstream ABRE elements were responsive to abscisic acid treatment. Our results indicate that some connection between the drought response and the circadian clock may exist in soybean since (i) drought stress affects gene expression of circadian clock components and (ii) several stress responsive genes display diurnal oscillation in soybeans. PMID:24475115

  2. The low noise limit in gene expression

    DOE PAGES

    Dar, Roy D.; Weinberger, Leor S.; Cox, Chris D.; ...

    2015-10-21

    Protein noise measurements are increasingly used to elucidate biophysical parameters. Unfortunately noise analyses are often at odds with directly measured parameters. Here we show that these inconsistencies arise from two problematic analytical choices: (i) the assumption that protein translation rate is invariant for different proteins of different abundances, which has inadvertently led to (ii) the assumption that a large constitutive extrinsic noise sets the low noise limit in gene expression. While growing evidence suggests that transcriptional bursting may set the low noise limit, variability in translational bursting has been largely ignored. We show that genome-wide systematic variation in translational efficiencymore » can-and in the case of E. coli does-control the low noise limit in gene expression. Therefore constitutive extrinsic noise is small and only plays a role in the absence of a systematic variation in translational efficiency. Lastly, these results show the existence of two distinct expression noise patterns: (1) a global noise floor uniformly imposed on all genes by expression bursting; and (2) high noise distributed to only a select group of genes.« less

  3. Identification and expression analysis of zebrafish glypicans during embryonic development.

    PubMed

    Gupta, Mansi; Brand, Michael

    2013-01-01

    Heparan sulfate Proteoglycans (HSPG) are ubiquitous molecules with indispensable functions in various biological processes. Glypicans are a family of HSPG's, characterized by a Gpi-anchor which directs them to the cell surface and/or extracellular matrix where they regulate growth factor signaling during development and disease. We report the identification and expression pattern of glypican genes from zebrafish. The zebrafish genome contains 10 glypican homologs, as opposed to six in mammals, which are highly conserved and are phylogenetically related to the mammalian genes. Some of the fish glypicans like Gpc1a, Gpc3, Gpc4, Gpc6a and Gpc6b show conserved synteny with their mammalian cognate genes. Many glypicans are expressed during the gastrulation stage, but their expression becomes more tissue specific and defined during somitogenesis stages, particularly in the developing central nervous system. Existence of multiple glypican orthologs in fish with diverse expression pattern suggests highly specialized and/or redundant function of these genes during embryonic development.

  4. VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

    PubMed

    Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

    2013-12-16

    Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis) whereby the recovered sub-networks reconfirm established plant gene functions and also identify novel associations. Together, we present valuable insights into grapevine transcriptional regulation by developing network models applicable to researchers in their prioritisation of gene candidates, for on-going study of biological processes related to grapevine development, metabolism and stress responses.

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liebhaber, S.A.; Weiss, I.; Cash, F.E.

    Synthesis of normal human hemoglobin A, {alpha}{sub 2}{beta}{sub 2}, is based upon balanced expression of genes in the {alpha}-globin gene cluster on chromosome 15 and the {beta}-globin gene cluster on chromosome 11. Full levels of erythroid-specific activation of the {beta}-globin cluster depend on sequences located at a considerable distance 5{prime} to the {beta}-globin gene, referred to as the locus-activating or dominant control region. The existence of an analogous element(s) upstream of the {alpha}-globin cluster has been suggested from observations on naturally occurring deletions and experimental studies. The authors have identified an individual with {alpha}-thalassemia in whom structurally normal {alpha}-globin genesmore » have been inactivated in cis by a discrete de novo 35-kilobase deletion located {approximately}30 kilobases 5{prime} from the {alpha}-globin gene cluster. They conclude that this deletion inactivates expression of the {alpha}-globin genes by removing one or more of the previously identified upstream regulatory sequences that are critical to expression of the {alpha}-globin genes.« less

  6. Reconstructing directed gene regulatory network by only gene expression data.

    PubMed

    Zhang, Lu; Feng, Xi Kang; Ng, Yen Kaow; Li, Shuai Cheng

    2016-08-18

    Accurately identifying gene regulatory network is an important task in understanding in vivo biological activities. The inference of such networks is often accomplished through the use of gene expression data. Many methods have been developed to evaluate gene expression dependencies between transcription factor and its target genes, and some methods also eliminate transitive interactions. The regulatory (or edge) direction is undetermined if the target gene is also a transcription factor. Some methods predict the regulatory directions in the gene regulatory networks by locating the eQTL single nucleotide polymorphism, or by observing the gene expression changes when knocking out/down the candidate transcript factors; regrettably, these additional data are usually unavailable, especially for the samples deriving from human tissues. In this study, we propose the Context Based Dependency Network (CBDN), a method that is able to infer gene regulatory networks with the regulatory directions from gene expression data only. To determine the regulatory direction, CBDN computes the influence of source to target by evaluating the magnitude changes of expression dependencies between the target gene and the others with conditioning on the source gene. CBDN extends the data processing inequality by involving the dependency direction to distinguish between direct and transitive relationship between genes. We also define two types of important regulators which can influence a majority of the genes in the network directly or indirectly. CBDN can detect both of these two types of important regulators by averaging the influence functions of candidate regulator to the other genes. In our experiments with simulated and real data, even with the regulatory direction taken into account, CBDN outperforms the state-of-the-art approaches for inferring gene regulatory network. CBDN identifies the important regulators in the predicted network: 1. TYROBP influences a batch of genes that are related to Alzheimer's disease; 2. ZNF329 and RB1 significantly regulate those 'mesenchymal' gene expression signature genes for brain tumors. By merely leveraging gene expression data, CBDN can efficiently infer the existence of gene-gene interactions as well as their regulatory directions. The constructed networks are helpful in the identification of important regulators for complex diseases.

  7. Multiple Multi-Copper Oxidase Gene Families in Basidiomycetes – What for?

    PubMed Central

    Kües, Ursula; Rühl, Martin

    2011-01-01

    Genome analyses revealed in various basidiomycetes the existence of multiple genes for blue multi-copper oxidases (MCOs). Whole genomes are now available from saprotrophs, white rot and brown rot species, plant and animal pathogens and ectomycorrhizal species. Total numbers (from 1 to 17) and types of mco genes differ between analyzed species with no easy to recognize connection of gene distribution to fungal life styles. Types of mco genes might be present in one and absent in another fungus. Distinct types of genes have been multiplied at speciation in different organisms. Phylogenetic analysis defined different subfamilies of laccases sensu stricto (specific to Agaricomycetes), classical Fe2+-oxidizing Fet3-like ferroxidases, potential ferroxidases/laccases exhibiting either one or both of these enzymatic functions, enzymes clustering with pigment MCOs and putative ascorbate oxidases. Biochemically best described are laccases sensu stricto due to their proposed roles in degradation of wood, straw and plant litter and due to the large interest in these enzymes in biotechnology. However, biological functions of laccases and other MCOs are generally little addressed. Functions in substrate degradation, symbiontic and pathogenic intercations, development, pigmentation and copper homeostasis have been put forward. Evidences for biological functions are in most instances rather circumstantial by correlations of expression. Multiple factors impede research on biological functions such as difficulties of defining suitable biological systems for molecular research, the broad and overlapping substrate spectrum multi-copper oxidases usually possess, the low existent knowledge on their natural substrates, difficulties imposed by low expression or expression of multiple enzymes, and difficulties in expressing enzymes heterologously. PMID:21966246

  8. Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

    PubMed Central

    2013-01-01

    Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. It is warrant a method that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize deregulated genes and group them into gene modules by simultaneously considering gene expression level changes and gene-gene co-regulations. When applied to both simulated and empirical data, nDGE outperforms the traditional DGE method. More specifically, when applied to smoker and non-smoker lung cancer sets, nDGE results illustrate the molecular differences between smoker and non-smoker lung cancer. PMID:24341432

  9. Between strain and tissue differences exist in global hydroxymethylation after acute ozone exposure.

    EPA Science Inventory

    Epigenetics have been increasingly recognized as a mechanism linking environment and gene expression with both normal physiologic function as well as disease states. Demethylation of cysteine residues, generally leading to gene activation, is an oxygen-dependent reaction and crea...

  10. Gene expression in Pseudomonas aeruginosa exposed to hydroxyl-radicals.

    PubMed

    Aharoni, Noa; Mamane, Hadas; Biran, Dvora; Lakretz, Anat; Ron, Eliora Z

    2018-05-01

    Recent studies have shown the efficiency of hydroxyl radicals generated via ultraviolet (UV)-based advanced oxidation processes (AOPs) combined with hydrogen peroxide (UV/H 2 O 2 ) as a treatment process in water. The effects of AOP treatments on bacterial gene expression was examined using Pseudomonas aeruginosa strain PAO1 as a model-organism bacterium. Many bacterial genes are not expressed all the time, but their expression is regulated. The regulation is at the beginning of the gene, in a genetic region called "promoter" and affects the level of transcription (synthesis of messenger RNA) and translation (synthesis of protein). The level of expression of the regulated genes can change as a function of environmental conditions, and they can be expressed more (induced, upregulated) or less (downregulated). Exposure of strain PAO1 to UV/H 2 O 2 treatment resulted in a major change in gene expression, including elevated expression of several genes. One interesting gene is PA3237, which was significantly upregulated under UV/H 2 O 2 as compared to UV or H 2 O 2 treatments alone. The induction of this gene is probably due to formation of radicals, as it is abolished in the presence of the radical scavenger tert-butanol (TBA) and is seen even when the bacteria are added after the treatment (post-treatment exposure). Upregulation of the PA3237 promoter could also be detected using a reporter gene, suggesting the use of such genetic constructs to develop biosensors for monitoring AOPs in water-treatment plants. Currently biosensors for AOPs do not exist, consequently impairing the ability to monitor these processes on-line according to radical exposure in natural waters. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. Array data extractor (ADE): a LabVIEW program to extract and merge gene array data

    PubMed Central

    2013-01-01

    Background Large data sets from gene expression array studies are publicly available offering information highly valuable for research across many disciplines ranging from fundamental to clinical research. Highly advanced bioinformatics tools have been made available to researchers, but a demand for user-friendly software allowing researchers to quickly extract expression information for multiple genes from multiple studies persists. Findings Here, we present a user-friendly LabVIEW program to automatically extract gene expression data for a list of genes from multiple normalized microarray datasets. Functionality was tested for 288 class A G protein-coupled receptors (GPCRs) and expression data from 12 studies comparing normal and diseased human hearts. Results confirmed known regulation of a beta 1 adrenergic receptor and further indicate novel research targets. Conclusions Although existing software allows for complex data analyses, the LabVIEW based program presented here, “Array Data Extractor (ADE)”, provides users with a tool to retrieve meaningful information from multiple normalized gene expression datasets in a fast and easy way. Further, the graphical programming language used in LabVIEW allows applying changes to the program without the need of advanced programming knowledge. PMID:24289243

  12. Non-invasive imaging using reporter genes altering cellular water permeability

    NASA Astrophysics Data System (ADS)

    Mukherjee, Arnab; Wu, Di; Davis, Hunter C.; Shapiro, Mikhail G.

    2016-12-01

    Non-invasive imaging of gene expression in live, optically opaque animals is important for multiple applications, including monitoring of genetic circuits and tracking of cell-based therapeutics. Magnetic resonance imaging (MRI) could enable such monitoring with high spatiotemporal resolution. However, existing MRI reporter genes based on metalloproteins or chemical exchange probes are limited by their reliance on metals or relatively low sensitivity. Here we introduce a new class of MRI reporters based on the human water channel aquaporin 1. We show that aquaporin overexpression produces contrast in diffusion-weighted MRI by increasing tissue water diffusivity without affecting viability. Low aquaporin levels or mixed populations comprising as few as 10% aquaporin-expressing cells are sufficient to produce MRI contrast. We characterize this new contrast mechanism through experiments and simulations, and demonstrate its utility in vivo by imaging gene expression in tumours. Our results establish an alternative class of sensitive, metal-free reporter genes for non-invasive imaging.

  13. Multiple HOM-C gene interactions specify cell fates in the nematode central nervous system.

    PubMed

    Salser, S J; Loer, C M; Kenyon, C

    1993-09-01

    Intricate patterns of overlapping HOM-C gene expression along the A/P axis have been observed in many organisms; however, the significance of these patterns in establishing the ultimate fates of individual cells is not well understood. We have examined the expression of the Caenorhabditis elegans Antennapedia homolog mab-5 and its role in specifying cell fates in the posterior of the ventral nerve cord. We find that the pattern of fates specified by mab-5 not only depends on mab-5 expression but also on post-translational interactions with the neighboring HOM-C gene lin-39 and a second, inferred gene activity. Where mab-5 expression overlaps with lin-39 activity, they can interact in two different ways depending on the cell type: They can either effectively neutralize one another where they are both expressed or lin-39 can predominate over mab-5. As observed for Antennapedia in Drosophila, expression of mab-5 itself is repressed by the next most posterior HOM-C gene, egl-5. Thus, a surprising diversity in HOM-C regulatory mechanisms exists within a small set of cells even in a simple organism.

  14. Querying Co-regulated Genes on Diverse Gene Expression Datasets Via Biclustering.

    PubMed

    Deveci, Mehmet; Küçüktunç, Onur; Eren, Kemal; Bozdağ, Doruk; Kaya, Kamer; Çatalyürek, Ümit V

    2016-01-01

    Rapid development and increasing popularity of gene expression microarrays have resulted in a number of studies on the discovery of co-regulated genes. One important way of discovering such co-regulations is the query-based search since gene co-expressions may indicate a shared role in a biological process. Although there exist promising query-driven search methods adapting clustering, they fail to capture many genes that function in the same biological pathway because microarray datasets are fraught with spurious samples or samples of diverse origin, or the pathways might be regulated under only a subset of samples. On the other hand, a class of clustering algorithms known as biclustering algorithms which simultaneously cluster both the items and their features are useful while analyzing gene expression data, or any data in which items are related in only a subset of their samples. This means that genes need not be related in all samples to be clustered together. Because many genes only interact under specific circumstances, biclustering may recover the relationships that traditional clustering algorithms can easily miss. In this chapter, we briefly summarize the literature using biclustering for querying co-regulated genes. Then we present a novel biclustering approach and evaluate its performance by a thorough experimental analysis.

  15. Identification of unique cis-element pattern on simulated microgravity treated Arabidopsis by in silico and gene expression

    NASA Astrophysics Data System (ADS)

    Soh, Hyuncheol; Choi, Yongsang; Lee, Taek-Kyun; Yeo, Up-Dong; Han, Kyeongsik; Auh, Chungkyun; Lee, Sukchan

    2012-08-01

    Arabidopsis gene expression microarray (44 K) was used to detect genes highly induced under simulated microgravity stress (SMS). Ten SMS-inducible genes were selected from the microarray data and these 10 genes were found to be abundantly expressed in 3-week-old plants. Nine out of the 10 SMS-inducible genes were also expressed in response to the three abiotic stresses of drought, touch, and wounding in 3-week-old Arabidopsis plants respectively. However, WRKY46 was elevated only in response to SMS. Six other WRKY genes did not respond to SMS. To clarify the characteristics of the genes expressed at high levels in response to SMS, 20 cis-elements in the promoters of the 40 selected genes including the 10 SMS-inducible genes, the 6 WRKY genes, and abiotic stress-inducible genes were analyzed and their spatial positions on each promoter were determined. Four cis-elements (M/T-G-T-P from MYB1AT or TATABOX5, GT1CONSENSUS, TATABOX5, and POLASIG1) showed a unique spatial arrangement in most SMS-inducible genes including WRKY46. Therefore the M/T-G-T-P cis-element patterns identified in the promoter of WRKY46 may play important roles in regulating gene expression in response to SMS. The presences of the cis-element patterns suggest that the order or spatial positioning of certain groups of cis-elements is more important than the existence or numbers of specific cis-elements. Taken together, our data indicate that WRKY46 is a novel SMS inducible transcription factor and the unique spatial arrangement of cis-elements shown in WRKY46 promoter may play an important role for its response to SMS.

  16. Transcriptional interference networks coordinate the expression of functionally related genes clustered in the same genomic loci

    PubMed Central

    Boldogköi, Zsolt

    2012-01-01

    The regulation of gene expression is essential for normal functioning of biological systems in every form of life. Gene expression is primarily controlled at the level of transcription, especially at the phase of initiation. Non-coding RNAs are one of the major players at every level of genetic regulation, including the control of chromatin organization, transcription, various post-transcriptional processes, and translation. In this study, the Transcriptional Interference Network (TIN) hypothesis was put forward in an attempt to explain the global expression of antisense RNAs and the overall occurrence of tandem gene clusters in the genomes of various biological systems ranging from viruses to mammalian cells. The TIN hypothesis suggests the existence of a novel layer of genetic regulation, based on the interactions between the transcriptional machineries of neighboring genes at their overlapping regions, which are assumed to play a fundamental role in coordinating gene expression within a cluster of functionally linked genes. It is claimed that the transcriptional overlaps between adjacent genes are much more widespread in genomes than is thought today. The Waterfall model of the TIN hypothesis postulates a unidirectional effect of upstream genes on the transcription of downstream genes within a cluster of tandemly arrayed genes, while the Seesaw model proposes a mutual interdependence of gene expression between the oppositely oriented genes. The TIN represents an auto-regulatory system with an exquisitely timed and highly synchronized cascade of gene expression in functionally linked genes located in close physical proximity to each other. In this study, we focused on herpesviruses. The reason for this lies in the compressed nature of viral genes, which allows a tight regulation and an easier investigation of the transcriptional interactions between genes. However, I believe that the same or similar principles can be applied to cellular organisms too. PMID:22783276

  17. Transcriptional interference networks coordinate the expression of functionally related genes clustered in the same genomic loci.

    PubMed

    Boldogköi, Zsolt

    2012-01-01

    The regulation of gene expression is essential for normal functioning of biological systems in every form of life. Gene expression is primarily controlled at the level of transcription, especially at the phase of initiation. Non-coding RNAs are one of the major players at every level of genetic regulation, including the control of chromatin organization, transcription, various post-transcriptional processes, and translation. In this study, the Transcriptional Interference Network (TIN) hypothesis was put forward in an attempt to explain the global expression of antisense RNAs and the overall occurrence of tandem gene clusters in the genomes of various biological systems ranging from viruses to mammalian cells. The TIN hypothesis suggests the existence of a novel layer of genetic regulation, based on the interactions between the transcriptional machineries of neighboring genes at their overlapping regions, which are assumed to play a fundamental role in coordinating gene expression within a cluster of functionally linked genes. It is claimed that the transcriptional overlaps between adjacent genes are much more widespread in genomes than is thought today. The Waterfall model of the TIN hypothesis postulates a unidirectional effect of upstream genes on the transcription of downstream genes within a cluster of tandemly arrayed genes, while the Seesaw model proposes a mutual interdependence of gene expression between the oppositely oriented genes. The TIN represents an auto-regulatory system with an exquisitely timed and highly synchronized cascade of gene expression in functionally linked genes located in close physical proximity to each other. In this study, we focused on herpesviruses. The reason for this lies in the compressed nature of viral genes, which allows a tight regulation and an easier investigation of the transcriptional interactions between genes. However, I believe that the same or similar principles can be applied to cellular organisms too.

  18. Genomic resources for songbird research and their use in characterizing gene expression during brain development

    PubMed Central

    Li, XiaoChing; Wang, Xiu-Jie; Tannenhauser, Jonathan; Podell, Sheila; Mukherjee, Piali; Hertel, Moritz; Biane, Jeremy; Masuda, Shoko; Nottebohm, Fernando; Gaasterland, Terry

    2007-01-01

    Vocal learning and neuronal replacement have been studied extensively in songbirds, but until recently, few molecular and genomic tools for songbird research existed. Here we describe new molecular/genomic resources developed in our laboratory. We made cDNA libraries from zebra finch (Taeniopygia guttata) brains at different developmental stages. A total of 11,000 cDNA clones from these libraries, representing 5,866 unique gene transcripts, were randomly picked and sequenced from the 3′ ends. A web-based database was established for clone tracking, sequence analysis, and functional annotations. Our cDNA libraries were not normalized. Sequencing ESTs without normalization produced many developmental stage-specific sequences, yielding insights into patterns of gene expression at different stages of brain development. In particular, the cDNA library made from brains at posthatching day 30–50, corresponding to the period of rapid song system development and song learning, has the most diverse and richest set of genes expressed. We also identified five microRNAs whose sequences are highly conserved between zebra finch and other species. We printed cDNA microarrays and profiled gene expression in the high vocal center of both adult male zebra finches and canaries (Serinus canaria). Genes differentially expressed in the high vocal center were identified from the microarray hybridization results. Selected genes were validated by in situ hybridization. Networks among the regulated genes were also identified. These resources provide songbird biologists with tools for genome annotation, comparative genomics, and microarray gene expression analysis. PMID:17426146

  19. Pan- and core- network analysis of co-expression genes in a model plant

    DOE PAGES

    He, Fei; Maslov, Sergei

    2016-12-16

    Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less

  20. Pan- and core- network analysis of co-expression genes in a model plant

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    He, Fei; Maslov, Sergei

    Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less

  1. Transcriptomic insights into the genetic basis of mammalian limb diversity.

    PubMed

    Maier, Jennifer A; Rivas-Astroza, Marcelo; Deng, Jenny; Dowling, Anna; Oboikovitz, Paige; Cao, Xiaoyi; Behringer, Richard R; Cretekos, Chris J; Rasweiler, John J; Zhong, Sheng; Sears, Karen E

    2017-03-23

    From bat wings to whale flippers, limb diversification has been crucial to the evolutionary success of mammals. We performed the first transcriptome-wide study of limb development in multiple species to explore the hypothesis that mammalian limb diversification has proceeded through the differential expression of conserved shared genes, rather than by major changes to limb patterning. Specifically, we investigated the manner in which the expression of shared genes has evolved within and among mammalian species. We assembled and compared transcriptomes of bat, mouse, opossum, and pig fore- and hind limbs at the ridge, bud, and paddle stages of development. Results suggest that gene expression patterns exhibit larger variation among species during later than earlier stages of limb development, while within species results are more mixed. Consistent with the former, results also suggest that genes expressed at later developmental stages tend to have a younger evolutionary age than genes expressed at earlier stages. A suite of key limb-patterning genes was identified as being differentially expressed among the homologous limbs of all species. However, only a small subset of shared genes is differentially expressed in the fore- and hind limbs of all examined species. Similarly, a small subset of shared genes is differentially expressed within the fore- and hind limb of a single species and among the forelimbs of different species. Taken together, results of this study do not support the existence of a phylotypic period of limb development ending at chondrogenesis, but do support the hypothesis that the hierarchical nature of development translates into increasing variation among species as development progresses.

  2. Evolution, functional differentiation, and co-expression of the RLK gene family revealed in Jilin ginseng, Panax ginseng C.A. Meyer.

    PubMed

    Lin, Yanping; Wang, Kangyu; Li, Xiangyu; Sun, Chunyu; Yin, Rui; Wang, Yanfang; Wang, Yi; Zhang, Meiping

    2018-02-21

    Most genes in a genome exist in the form of a gene family; therefore, it is necessary to have knowledge of how a gene family functions to comprehensively understand organismal biology. The receptor-like kinase (RLK)-encoding gene family is one of the most important gene families in plants. It plays important roles in biotic and abiotic stress tolerances, and growth and development. However, little is known about the functional differentiation and relationships among the gene members within a gene family in plants. This study has isolated 563 RLK genes (designated as PgRLK genes) expressed in Jilin ginseng (Panax ginseng C.A. Meyer), investigated their evolution, and deciphered their functional diversification and relationships. The PgRLK gene family is highly diverged and formed into eight types. The LRR type is the earliest and most prevalent, while only the Lec type originated after P. ginseng evolved. Furthermore, although the members of the PgRLK gene family all encode receptor-like protein kinases and share conservative domains, they are functionally very diverse, participating in numerous biological processes. The expressions of different members of the PgRLK gene family are extremely variable within a tissue, at a developmental stage and in the same cultivar, but most of the genes tend to express correlatively, forming a co-expression network. These results not only provide a deeper and comprehensive understanding of the evolution, functional differentiation and correlation of a gene family in plants, but also an RLK genic resource useful for enhanced ginseng genetic improvement.

  3. Transcriptional response of honey bee larvae infected with the bacterial pathogen Paenibacillus larvae.

    PubMed

    Cornman, Robert Scott; Lopez, Dawn; Evans, Jay D

    2013-01-01

    American foulbrood disease of honey bees is caused by the bacterium Paenibacillus larvae. Infection occurs per os in larvae and systemic infection requires a breaching of the host peritrophic matrix and midgut epithelium. Genetic variation exists for both bacterial virulence and host resistance, and a general immunity is achieved by larvae as they age, the basis of which has not been identified. To quickly identify a pool of candidate genes responsive to P. larvae infection, we sequenced transcripts from larvae inoculated with P. larvae at 12 hours post-emergence and incubated for 72 hours, and compared expression levels to a control cohort. We identified 75 genes with significantly higher expression and six genes with significantly lower expression. In addition to several antimicrobial peptides, two genes encoding peritrophic-matrix domains were also up-regulated. Extracellular matrix proteins, proteases/protease inhibitors, and members of the Osiris gene family were prevalent among differentially regulated genes. However, analysis of Drosophila homologs of differentially expressed genes revealed spatial and temporal patterns consistent with developmental asynchrony as a likely confounder of our results. We therefore used qPCR to measure the consistency of gene expression changes for a subset of differentially expressed genes. A replicate experiment sampled at both 48 and 72 hours post infection allowed further discrimination of genes likely to be involved in host response. The consistently responsive genes in our test set included a hymenopteran-specific protein tyrosine kinase, a hymenopteran specific serine endopeptidase, a cytochrome P450 (CYP9Q1), and a homolog of trynity, a zona pellucida domain protein. Of the known honey bee antimicrobial peptides, apidaecin was responsive at both time-points studied whereas hymenoptaecin was more consistent in its level of change between biological replicates and had the greatest increase in expression by RNA-seq analysis.

  4. Differential gene expression in Varroa jacobsoni mites following a host shift to European honey bees (Apis mellifera).

    PubMed

    Andino, Gladys K; Gribskov, Michael; Anderson, Denis L; Evans, Jay D; Hunt, Greg J

    2016-11-16

    Varroa mites are widely considered the biggest honey bee health problem worldwide. Until recently, Varroa jacobsoni has been found to live and reproduce only in Asian honey bee (Apis cerana) colonies, while V. destructor successfully reproduces in both A. cerana and A. mellifera colonies. However, we have identified an island population of V. jacobsoni that is highly destructive to A. mellifera, the primary species used for pollination and honey production. The ability of these populations of mites to cross the host species boundary potentially represents an enormous threat to apiculture, and is presumably due to genetic variation that exists among populations of V. jacobsoni that influences gene expression and reproductive status. In this work, we investigate differences in gene expression between populations of V. jacobsoni reproducing on A. cerana and those either reproducing or not capable of reproducing on A. mellifera, in order to gain insight into differences that allow V. jacobsoni to overcome its normal species tropism. We sequenced and assembled a de novo transcriptome of V. jacobsoni. We also performed a differential gene expression analysis contrasting biological replicates of V. jacobsoni populations that differ in their ability to reproduce on A. mellifera. Using the edgeR, EBSeq and DESeq R packages for differential gene expression analysis, we found 287 differentially expressed genes (FDR ≤ 0.05), of which 91% were up regulated in mites reproducing on A. mellifera. In addition, mites found reproducing on A. mellifera showed substantially more variation in expression among replicates. We searched for orthologous genes in public databases and were able to associate 100 of these 287 differentially expressed genes with a functional description. There is differential gene expression between the two mite groups, with more variation in gene expression among mites that were able to reproduce on A. mellifera. A small set of genes showed reduced expression in mites on the A. mellifera host, including putative transcription factors and digestive tract developmental genes. The vast majority of differentially expressed genes were up-regulated in this host. This gene set showed enrichment for genes associated with mitochondrial respiratory function and apoptosis, suggesting that mites on this host may be experiencing higher stress, and may be less optimally adapted to parasitize it. Some genes involved in reproduction and oogenesis were also overexpressed, which should be further studied in regards to this host shift.

  5. Differential gene expression between African American and European American colorectal cancer patients.

    PubMed

    Jovov, Biljana; Araujo-Perez, Felix; Sigel, Carlie S; Stratford, Jeran K; McCoy, Amber N; Yeh, Jen Jen; Keku, Temitope

    2012-01-01

    The incidence and mortality of colorectal cancer (CRC) is higher in African Americans (AAs) than other ethnic groups in the U. S., but reasons for the disparities are unknown. We performed gene expression profiling of sporadic CRCs from AAs vs. European Americans (EAs) to assess the contribution to CRC disparities. We evaluated the gene expression of 43 AA and 43 EA CRC tumors matched by stage and 40 matching normal colorectal tissues using the Agilent human whole genome 4x44K cDNA arrays. Gene and pathway analyses were performed using Significance Analysis of Microarrays (SAM), Ten-fold cross validation, and Ingenuity Pathway Analysis (IPA). SAM revealed that 95 genes were differentially expressed between AA and EA patients at a false discovery rate of ≤5%. Using IPA we determined that most prominent disease and pathway associations of differentially expressed genes were related to inflammation and immune response. Ten-fold cross validation demonstrated that following 10 genes can predict ethnicity with an accuracy of 94%: CRYBB2, PSPH, ADAL, VSIG10L, C17orf81, ANKRD36B, ZNF835, ARHGAP6, TRNT1 and WDR8. Expression of these 10 genes was validated by qRT-PCR in an independent test set of 28 patients (10 AA, 18 EA). Our results are the first to implicate differential gene expression in CRC racial disparities and indicate prominent difference in CRC inflammation between AA and EA patients. Differences in susceptibility to inflammation support the existence of distinct tumor microenvironments in these two patient populations.

  6. Differential Gene Expression between African American and European American Colorectal Cancer Patients

    PubMed Central

    Jovov, Biljana; Araujo-Perez, Felix; Sigel, Carlie S.; Stratford, Jeran K.; McCoy, Amber N.; Yeh, Jen Jen; Keku, Temitope

    2012-01-01

    The incidence and mortality of colorectal cancer (CRC) is higher in African Americans (AAs) than other ethnic groups in the U. S., but reasons for the disparities are unknown. We performed gene expression profiling of sporadic CRCs from AAs vs. European Americans (EAs) to assess the contribution to CRC disparities. We evaluated the gene expression of 43 AA and 43 EA CRC tumors matched by stage and 40 matching normal colorectal tissues using the Agilent human whole genome 4x44K cDNA arrays. Gene and pathway analyses were performed using Significance Analysis of Microarrays (SAM), Ten-fold cross validation, and Ingenuity Pathway Analysis (IPA). SAM revealed that 95 genes were differentially expressed between AA and EA patients at a false discovery rate of ≤5%. Using IPA we determined that most prominent disease and pathway associations of differentially expressed genes were related to inflammation and immune response. Ten-fold cross validation demonstrated that following 10 genes can predict ethnicity with an accuracy of 94%: CRYBB2, PSPH, ADAL, VSIG10L, C17orf81, ANKRD36B, ZNF835, ARHGAP6, TRNT1 and WDR8. Expression of these 10 genes was validated by qRT-PCR in an independent test set of 28 patients (10 AA, 18 EA). Our results are the first to implicate differential gene expression in CRC racial disparities and indicate prominent difference in CRC inflammation between AA and EA patients. Differences in susceptibility to inflammation support the existence of distinct tumor microenvironments in these two patient populations. PMID:22276153

  7. An Expression of Periodic Phenomena of Fashion on Sexual Selection Model with Conformity Genes and Memes

    NASA Astrophysics Data System (ADS)

    Mutoh, Atsuko; Tokuhara, Shinya; Kanoh, Masayoshi; Oboshi, Tamon; Kato, Shohei; Itoh, Hidenori

    It is generally thought that living things have trends in their preferences. The mechanism of occurrence of another trends in successive periods is concerned in their conformity. According to social impact theory, the minority is always exists in the group. There is a possibility that the minority make the transition to the majority by conforming agents. Because of agent's promotion of their conform actions, the majority can make the transition. We proposed an evolutionary model with both genes and memes, and elucidated the interaction between genes and memes on sexual selection. In this paper, we propose an agent model for sexual selection imported the concept of conformity. Using this model we try an environment where male agents and female agents are existed, we find that periodic phenomena of fashion are expressed. And we report the influence of conformity and differentiation on the transition of their preferences.

  8. Developing a PTEN-ERG Signature to Improve Molecular Risk Stratification in Prostate Cancer

    DTIC Science & Technology

    2017-10-01

    that there exist distinctive molecular correlates of PTEN loss in the context of ETS-negative versus ETS-positive human prostate cancers and that...distinctive molecular correlates of PTEN loss in the context of ETS-negative versus ETS-positive human PCa and that these may drive prognosis...and MSKCC cohort, correlate these data with gene expression data from the same cohort to confirm ETS status and enable full gene expression analyses of

  9. Adaptation of muscle gene expression to changes in contractile activity

    NASA Technical Reports Server (NTRS)

    Booth, F. W.; Babij, P.; Thomason, D. B.; Wong, T. S.; Morrison, P. R.

    1987-01-01

    A review of the existing literature regarding the effects of different types of physical activities on the gene expression of adult skeletal muscles leads us to conclude that each type of exercise training program has, as a result, a different phenotype, which means that there are multiple mechanisms, each producing a unique phenotype. A portion of the facts which support this position is presented and interpreted here. [Abstract translated from the original French by NASA].

  10. Transcriptome profile of a bovine respiratory disease pathogen: Mannheimia haemolytica PHL213

    PubMed Central

    2012-01-01

    Background Computational methods for structural gene annotation have propelled gene discovery but face certain drawbacks with regards to prokaryotic genome annotation. Identification of transcriptional start sites, demarcating overlapping gene boundaries, and identifying regulatory elements such as small RNA are not accurate using these approaches. In this study, we re-visit the structural annotation of Mannheimia haemolytica PHL213, a bovine respiratory disease pathogen. M. haemolytica is one of the causative agents of bovine respiratory disease that results in about $3 billion annual losses to the cattle industry. We used RNA-Seq and analyzed the data using freely-available computational methods and resources. The aim was to identify previously unannotated regions of the genome using RNA-Seq based expression profile to complement the existing annotation of this pathogen. Results Using the Illumina Genome Analyzer, we generated 9,055,826 reads (average length ~76 bp) and aligned them to the reference genome using Bowtie. The transcribed regions were analyzed using SAMTOOLS and custom Perl scripts in conjunction with BLAST searches and available gene annotation information. The single nucleotide resolution map enabled the identification of 14 novel protein coding regions as well as 44 potential novel sRNA. The basal transcription profile revealed that 2,506 of the 2,837 annotated regions were expressed in vitro, at 95.25% coverage, representing all broad functional gene categories in the genome. The expression profile also helped identify 518 potential operon structures involving 1,086 co-expressed pairs. We also identified 11 proteins with mutated/alternate start codons. Conclusions The application of RNA-Seq based transcriptome profiling to structural gene annotation helped correct existing annotation errors and identify potential novel protein coding regions and sRNA. We used computational tools to predict regulatory elements such as promoters and terminators associated with the novel expressed regions for further characterization of these novel functional elements. Our study complements the existing structural annotation of Mannheimia haemolytica PHL213 based on experimental evidence. Given the role of sRNA in virulence gene regulation and stress response, potential novel sRNA described in this study can form the framework for future studies to determine the role of sRNA, if any, in M. haemolytica pathogenesis. PMID:23046475

  11. Exploring Valid Reference Genes for Quantitative Real-time PCR Analysis in Plutella xylostella (Lepidoptera: Plutellidae)

    PubMed Central

    Fu, Wei; Xie, Wen; Zhang, Zhuo; Wang, Shaoli; Wu, Qingjun; Liu, Yong; Zhou, Xiaomao; Zhou, Xuguo; Zhang, Youjun

    2013-01-01

    Abstract: Quantitative real-time PCR (qRT-PCR), a primary tool in gene expression analysis, requires an appropriate normalization strategy to control for variation among samples. The best option is to compare the mRNA level of a target gene with that of reference gene(s) whose expression level is stable across various experimental conditions. In this study, expression profiles of eight candidate reference genes from the diamondback moth, Plutella xylostella, were evaluated under diverse experimental conditions. RefFinder, a web-based analysis tool, integrates four major computational programs including geNorm, Normfinder, BestKeeper, and the comparative ΔCt method to comprehensively rank the tested candidate genes. Elongation factor 1 (EF1) was the most suited reference gene for the biotic factors (development stage, tissue, and strain). In contrast, although appropriate reference gene(s) do exist for several abiotic factors (temperature, photoperiod, insecticide, and mechanical injury), we were not able to identify a single universal reference gene. Nevertheless, a suite of candidate reference genes were specifically recommended for selected experimental conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis of this agriculturally important insect pest. PMID:23983612

  12. A High-Throughput Data Mining of Single Nucleotide Polymorphisms in Coffea Species Expressed Sequence Tags Suggests Differential Homeologous Gene Expression in the Allotetraploid Coffea arabica1[W

    PubMed Central

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-01-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545

  13. Spectral Biclustering of Microarray Data: Coclustering Genes and Conditions

    PubMed Central

    Kluger, Yuval; Basri, Ronen; Chang, Joseph T.; Gerstein, Mark

    2003-01-01

    Global analyses of RNA expression levels are useful for classifying genes and overall phenotypes. Often these classification problems are linked, and one wants to find “marker genes” that are differentially expressed in particular sets of “conditions.” We have developed a method that simultaneously clusters genes and conditions, finding distinctive “checkerboard” patterns in matrices of gene expression data, if they exist. In a cancer context, these checkerboards correspond to genes that are markedly up- or downregulated in patients with particular types of tumors. Our method, spectral biclustering, is based on the observation that checkerboard structures in matrices of expression data can be found in eigenvectors corresponding to characteristic expression patterns across genes or conditions. In addition, these eigenvectors can be readily identified by commonly used linear algebra approaches, in particular the singular value decomposition (SVD), coupled with closely integrated normalization steps. We present a number of variants of the approach, depending on whether the normalization over genes and conditions is done independently or in a coupled fashion. We then apply spectral biclustering to a selection of publicly available cancer expression data sets, and examine the degree to which the approach is able to identify checkerboard structures. Furthermore, we compare the performance of our biclustering methods against a number of reasonable benchmarks (e.g., direct application of SVD or normalized cuts to raw data). PMID:12671006

  14. Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

    PubMed Central

    Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

    2009-01-01

    Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438

  15. Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.

    PubMed

    Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A

    2006-06-01

    To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.

  16. TA-GC cloning: A new simple and versatile technique for the directional cloning of PCR products for recombinant protein expression.

    PubMed

    Niarchos, Athanasios; Siora, Anastasia; Konstantinou, Evangelia; Kalampoki, Vasiliki; Lagoumintzis, George; Poulas, Konstantinos

    2017-01-01

    During the last few decades, the recombinant protein expression finds more and more applications. The cloning of protein-coding genes into expression vectors is required to be directional for proper expression, and versatile in order to facilitate gene insertion in multiple different vectors for expression tests. In this study, the TA-GC cloning method is proposed, as a new, simple and efficient method for the directional cloning of protein-coding genes in expression vectors. The presented method features several advantages over existing methods, which tend to be relatively more labour intensive, inflexible or expensive. The proposed method relies on the complementarity between single A- and G-overhangs of the protein-coding gene, obtained after a short incubation with T4 DNA polymerase, and T and C overhangs of the novel vector pET-BccI, created after digestion with the restriction endonuclease BccI. The novel protein-expression vector pET-BccI also facilitates the screening of transformed colonies for recombinant transformants. Evaluation experiments of the proposed TA-GC cloning method showed that 81% of the transformed colonies contained recombinant pET-BccI plasmids, and 98% of the recombinant colonies expressed the desired protein. This demonstrates that TA-GC cloning could be a valuable method for cloning protein-coding genes in expression vectors.

  17. TA-GC cloning: A new simple and versatile technique for the directional cloning of PCR products for recombinant protein expression

    PubMed Central

    Niarchos, Athanasios; Siora, Anastasia; Konstantinou, Evangelia; Kalampoki, Vasiliki; Poulas, Konstantinos

    2017-01-01

    During the last few decades, the recombinant protein expression finds more and more applications. The cloning of protein-coding genes into expression vectors is required to be directional for proper expression, and versatile in order to facilitate gene insertion in multiple different vectors for expression tests. In this study, the TA-GC cloning method is proposed, as a new, simple and efficient method for the directional cloning of protein-coding genes in expression vectors. The presented method features several advantages over existing methods, which tend to be relatively more labour intensive, inflexible or expensive. The proposed method relies on the complementarity between single A- and G-overhangs of the protein-coding gene, obtained after a short incubation with T4 DNA polymerase, and T and C overhangs of the novel vector pET-BccI, created after digestion with the restriction endonuclease BccI. The novel protein-expression vector pET-BccI also facilitates the screening of transformed colonies for recombinant transformants. Evaluation experiments of the proposed TA-GC cloning method showed that 81% of the transformed colonies contained recombinant pET-BccI plasmids, and 98% of the recombinant colonies expressed the desired protein. This demonstrates that TA-GC cloning could be a valuable method for cloning protein-coding genes in expression vectors. PMID:29091919

  18. In silico gene expression profiling in Cannabis sativa.

    PubMed

    Massimino, Luca

    2017-01-01

    The cannabis plant and its active ingredients (i.e., cannabinoids and terpenoids) have been socially stigmatized for half a century. Luckily, with more than 430,000 published scientific papers and about 600 ongoing and completed clinical trials, nowadays cannabis is employed for the treatment of many different medical conditions. Nevertheless, even if a large amount of high-throughput functional genomic data exists, most researchers feature a strong background in molecular biology but lack advanced bioinformatics skills. In this work, publicly available gene expression datasets have been analyzed giving rise to a total of 40,224 gene expression profiles taken from cannabis plant tissue at different developmental stages. The resource presented here will provide researchers with a starting point for future investigations with Cannabis sativa .

  19. Comprehensive Gene Expression Analysis of Rice Aleurone Cells: Probing the Existence of an Alternative Gibberellin Receptor1

    PubMed Central

    Yano, Kenji; Aya, Koichiro; Hirano, Ko; Ordonio, Reynante Lacsamana; Ueguchi-Tanaka, Miyako; Matsuoka, Makoto

    2015-01-01

    Current gibberellin (GA) research indicates that GA must be perceived in plant nuclei by its cognate receptor, GIBBERELLIN INSENSITIVE DWARF1 (GID1). Recognition of GA by GID1 relieves the repression mediated by the DELLA protein, a model known as the GID1-DELLA GA perception system. There have been reports of potential GA-binding proteins in the plasma membrane that perceive GA and induce α-amylase expression in cereal aleurone cells, which is mechanistically different from the GID1-DELLA system. Therefore, we examined the expression of the rice (Oryza sativa) α-amylase genes in rice mutants impaired in the GA receptor (gid1) and the DELLA repressor (slender rice1; slr1) and confirmed their lack of response to GA in gid1 mutants and constitutive expression in slr1 mutants. We also examined the expression of GA-regulated genes by genome-wide microarray and quantitative reverse transcription-polymerase chain reaction analyses and confirmed that all GA-regulated genes are modulated by the GID1-DELLA system. Furthermore, we studied the regulatory network involved in GA signaling by using a set of mutants defective in genes involved in GA perception and gene expression, namely gid1, slr1, gid2 (a GA-related F-box protein mutant), and gamyb (a GA-related trans-acting factor mutant). Almost all GA up-regulated genes were regulated by the four named GA-signaling components. On the other hand, GA down-regulated genes showed different expression patterns with respect to GID2 and GAMYB (e.g. a considerable number of genes are not controlled by GAMYB or GID2 and GAMYB). Based on these observations, we present a comprehensive discussion of the intricate network of GA-regulated genes in rice aleurone cells. PMID:25511432

  20. Evolution under canalization and the dual roles of microRNAs—A hypothesis

    PubMed Central

    Wu, Chung-I; Shen, Yang; Tang, Tian

    2009-01-01

    Canalization refers to the process by which phenotypes are stabilized within species. Evolution by natural selection can proceed efficiently only when phenotypes are canalized. The existence and identity of canalizing genes have thus been an important, but controversial topic. Recent evidence has increasingly hinted that microRNAs may be involved in canalizing gene expression. Their paradoxical properties (e.g., strongly conserved but functionally dispensable) suggest unconventional regulatory roles. We synthesized published and unpublished results and hypothesize that miRNAs may have dual functions—in gene expression tuning and in expression buffering. In tuning, miRNAs modify the mean expression level of their targets, but in buffering they merely reduce the variance around a preset mean. In light of the constant emergence of new miRNAs, we further discuss the relative importance of these two functions in evolution. PMID:19411598

  1. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

    PubMed

    Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

    2016-03-11

    Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.

  2. Gene doping.

    PubMed

    Azzazy, Hassan M E

    2010-01-01

    Gene doping abuses the legitimate approach of gene therapy. While gene therapy aims to correct genetic disorders by introducing a foreign gene to replace an existing faulty one or by manipulating existing gene(s) to achieve a therapeutic benefit, gene doping employs the same concepts to bestow performance advantages on athletes over their competitors. Recent developments in genetic engineering have contributed significantly to the progress of gene therapy research and currently numerous clinical trials are underway. Some athletes and their staff are probably watching this progress closely. Any gene that plays a role in muscle development, oxygen delivery to tissues, neuromuscular coordination, or even pain control is considered a candidate for gene dopers. Unfortunately, detecting gene doping is technically very difficult because the transgenic proteins expressed by the introduced genes are similar to their endogenous counterparts. Researchers today are racing the clock because assuring the continued integrity of sports competition depends on their ability to develop effective detection strategies in preparation for the 2012 Olympics, which may mark the appearance of genetically modified athletes.

  3. Paternal poly (ADP-ribose) metabolism modulates retention of inheritable sperm histones and early embryonic gene expression.

    PubMed

    Ihara, Motomasa; Meyer-Ficca, Mirella L; Leu, N Adrian; Rao, Shilpa; Li, Fan; Gregory, Brian D; Zalenskaya, Irina A; Schultz, Richard M; Meyer, Ralph G

    2014-05-01

    To achieve the extreme nuclear condensation necessary for sperm function, most histones are replaced with protamines during spermiogenesis in mammals. Mature sperm retain only a small fraction of nucleosomes, which are, in part, enriched on gene regulatory sequences, and recent findings suggest that these retained histones provide epigenetic information that regulates expression of a subset of genes involved in embryo development after fertilization. We addressed this tantalizing hypothesis by analyzing two mouse models exhibiting abnormal histone positioning in mature sperm due to impaired poly(ADP-ribose) (PAR) metabolism during spermiogenesis and identified altered sperm histone retention in specific gene loci genome-wide using MNase digestion-based enrichment of mononucleosomal DNA. We then set out to determine the extent to which expression of these genes was altered in embryos generated with these sperm. For control sperm, most genes showed some degree of histone association, unexpectedly suggesting that histone retention in sperm genes is not an all-or-none phenomenon and that a small number of histones may remain associated with genes throughout the genome. The amount of retained histones, however, was altered in many loci when PAR metabolism was impaired. To ascertain whether sperm histone association and embryonic gene expression are linked, the transcriptome of individual 2-cell embryos derived from such sperm was determined using microarrays and RNA sequencing. Strikingly, a moderate but statistically significant portion of the genes that were differentially expressed in these embryos also showed different histone retention in the corresponding gene loci in sperm of their fathers. These findings provide new evidence for the existence of a linkage between sperm histone retention and gene expression in the embryo.

  4. Paternal Poly (ADP-ribose) Metabolism Modulates Retention of Inheritable Sperm Histones and Early Embryonic Gene Expression

    PubMed Central

    Leu, N. Adrian; Rao, Shilpa; Li, Fan; Gregory, Brian D.; Zalenskaya, Irina A.; Schultz, Richard M.; Meyer, Ralph G.

    2014-01-01

    To achieve the extreme nuclear condensation necessary for sperm function, most histones are replaced with protamines during spermiogenesis in mammals. Mature sperm retain only a small fraction of nucleosomes, which are, in part, enriched on gene regulatory sequences, and recent findings suggest that these retained histones provide epigenetic information that regulates expression of a subset of genes involved in embryo development after fertilization. We addressed this tantalizing hypothesis by analyzing two mouse models exhibiting abnormal histone positioning in mature sperm due to impaired poly(ADP-ribose) (PAR) metabolism during spermiogenesis and identified altered sperm histone retention in specific gene loci genome-wide using MNase digestion-based enrichment of mononucleosomal DNA. We then set out to determine the extent to which expression of these genes was altered in embryos generated with these sperm. For control sperm, most genes showed some degree of histone association, unexpectedly suggesting that histone retention in sperm genes is not an all-or-none phenomenon and that a small number of histones may remain associated with genes throughout the genome. The amount of retained histones, however, was altered in many loci when PAR metabolism was impaired. To ascertain whether sperm histone association and embryonic gene expression are linked, the transcriptome of individual 2-cell embryos derived from such sperm was determined using microarrays and RNA sequencing. Strikingly, a moderate but statistically significant portion of the genes that were differentially expressed in these embryos also showed different histone retention in the corresponding gene loci in sperm of their fathers. These findings provide new evidence for the existence of a linkage between sperm histone retention and gene expression in the embryo. PMID:24810616

  5. Evidence for participation of GCS1 in fertilization of the starlet sea anemone Nematostella vectensis: Implication of a common mechanism of sperm–egg fusion in plants and animals

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ebchuqin, Eerdundagula; Yokota, Naoto; Yamada, Lixy

    Highlights: • GCS1 is a sperm transmembrane protein that is essential for gamete fusion in flowering plants. • The GCS1 gene is present not only in angiosperms but also in unicellular organisms and animals. • NvGCS1 gene is expressed in the testis and GCS1 protein exists in sperm of a sea anemone. • Anti-GCS1 antibodies inhibited the fertilization, showing the participation in fertilization. - Abstract: It has been reported that GCS1 (Generative Cell Specific 1) is a transmembrane protein that is exclusively expressed in sperm cells and is essential for gamete fusion in flowering plants. The GCS1 gene is presentmore » not only in angiosperms but also in unicellular organisms and animals, implying the occurrence of a common or ancestral mechanism of GCS1-mediated gamete fusion. In order to elucidate the common mechanism, we investigated the role of GCS1 in animal fertilization using a sea anemone (Cnidaria), Nematostella vectensis. Although the existence of the GCS1 gene in N. vectensis has been reported, the expression of GCS1 in sperm and the role of GCS1 in fertilization are not known. In this study, we showed that the GCS1 gene is expressed in the testis and that GCS1 protein exists in sperm by in situ hybridization and proteomic analysis, respectively. Then we made four peptide antibodies against the N-terminal extracellular region of NvGCS1. These antibodies specifically reacted to NvGCS1 among sperm proteins on the basis of Western analysis and potently inhibited fertilization in a concentration-dependent manner. These results indicate that sperm GCS1 plays a pivotal role in fertilization, most probably in sperm–egg fusion, in a starlet sea anemone, suggesting a common gamete-fusion mechanism shared by eukaryotic organisms.« less

  6. Eukaryotic genomes may exhibit up to 10 generic classes of gene promoters.

    PubMed

    Gagniuc, Paul; Ionescu-Tirgoviste, Constantin

    2012-09-28

    The main function of gene promoters appears to be the integration of different gene products in their biological pathways in order to maintain homeostasis. Generally, promoters have been classified in two major classes, namely TATA and CpG. Nevertheless, many genes using the same combinatorial formation of transcription factors have different gene expression patterns. Accordingly, we tried to ask ourselves some fundamental questions: Why certain genes have an overall predisposition for higher gene expression levels than others? What causes such a predisposition? Is there a structural relationship of these sequences in different tissues? Is there a strong phylogenetic relationship between promoters of closely related species? In order to gain valuable insights into different promoter regions, we obtained a series of image-based patterns which allowed us to identify 10 generic classes of promoters. A comprehensive analysis was undertaken for promoter sequences from Arabidopsis thaliana, Drosophila melanogaster, Homo sapiens and Oryza sativa, and a more extensive analysis of tissue-specific promoters in humans. We observed a clear preference for these species to use certain classes of promoters for specific biological processes. Moreover, in humans, we found that different tissues use distinct classes of promoters, reflecting an emerging promoter network. Depending on the tissue type, comparisons made between these classes of promoters reveal a complementarity between their patterns whereas some other classes of promoters have been observed to occur in competition. Furthermore, we also noticed the existence of some transitional states between these classes of promoters that may explain certain evolutionary mechanisms, which suggest a possible predisposition for specific levels of gene expression and perhaps for a different number of factors responsible for triggering gene expression. Our conclusions are based on comprehensive data from three different databases and a new computer model whose core is using Kappa index of coincidence. To fully understand the connections between gene promoters and gene expression, we analyzed thousands of promoter sequences using our Kappa Index of Coincidence method and a specialized Optical Character Recognition (OCR) neural network. Under our criteria, 10 classes of promoters were detected. In addition, the existence of "transitional" promoters suggests that there is an evolutionary weighted continuum between classes, depending perhaps upon changes in their gene products.

  7. Genome-wide identification and expression analysis of the apple ASR gene family in response to Alternaria alternata f. sp. mali.

    PubMed

    Huang, Kaihui; Zhong, Yan; Li, Yingjun; Zheng, Dan; Cheng, Zong-Ming

    2016-10-01

    The ABA/water stress/ripening-induced (ASR) gene family exists universally in higher plants, and many ASR genes are up-regulated during periods of environmental stress and fruit ripening. Although a considerable amount of research has been performed investigating ASR gene response to abiotic stresses, relatively little is known about their roles in response to biotic stresses. In this report, we identified five ASR genes in apple (Malus × domestica) and explored their phylogenetic relationship, duplication events, and selective pressure. Five apple ASR genes (Md-ASR) were divided into two clades based on phylogenetic analysis. Species-specific duplication was detected in M. domestica ASR genes. Leaves of 'Golden delicious' and 'Starking' were infected with Alternaria alternata f. sp. mali, which causes apple blotch disease, and examined for the expression of the ASR genes in lesion areas during the first 72 h after inoculation. Md-ASR genes showed different expression patterns at different sampling times in 'Golden delicious' and 'Starking'. The activities of stress-related enzymes, peroxidase (POD), superoxide dismutase (SOD), catalase (CAT), phenylalanine ammonia lyase (PAL), and polyphenoloxidase (PPO), and the content of malondialdehyde (MDA) were also measured in different stages of disease development in two cultivars. The ASR gene expression patterns and theses physiological indexes for disease resistance suggested that Md-ASR genes are involved in biotic stress responses in apple.

  8. Genome-wide identification, phylogeny, and gonadal expression of fox genes in Nile tilapia, Oreochromis niloticus.

    PubMed

    Yuan, Jing; Tao, Wenjing; Cheng, Yunying; Huang, Baofeng; Wang, Deshou

    2014-08-01

    The fox genes play important roles in various biological processes, including sexual development. In the present study, we isolated 65 fox genes, belonging to 18 subfamilies named A-R, from Nile tilapia through genome-wide screening. Twenty-four of them have two or three (foxm1) copies. Furthermore, 16, 25, 68, and 45 fox members were isolated from nematodes, protochordates, teleosts, and tetrapods, respectively. Phylogenetic analyses indicated fox gene family had undergone three expansions parallel to the three rounds of genome duplication during evolution. We also analyzed the clustered fox genes and found that apparent linkage duplication existed in teleosts, which further supported fish-specific genome duplication hypothesis. In addition, species- and lineage-specific duplication is another reason for fox gene family expansion. Based on the four pairs of XX and XY gonadal transcriptome data from four critical developmental stages, we analyzed the expression profile of all fox genes and identified sexually dimorphic fox genes at each stage. All fox genes were detected in gonads, with 15 of them at the background expression level (total read per kb per million reads, RPKM < 10), 29 at moderate expression level (10 < total RPKM < 100), and 21 at high expression level (total RPKM > 100). There are 27, 24, 28, and 9 sexually dimorphic fox genes at 5, 30, 90, and 180 days after hatching (dah), respectively. foxq1a, foxf1, foxr1, and foxr1 were identified as the most differentially expressed genes at each stage. foxl2 was characterized as XX-dominant gene, while foxd5, foxi3, foxn3, foxj1a, foxj3b, and foxo6b were characterized as XY-dominant genes. qPCR and in situ hybridization of foxh1 and foxj1a were performed to confirm the expression profiles and to validate the transcriptome data. Our results suggest that fox genes might play important roles in sex determination and gonadal development in teleosts.

  9. Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression.

    PubMed

    Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

    2006-08-08

    The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (P(gpdh-1)::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using P(gpdh-1)::GFP expression as a phenotype, we screened approximately 16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function.

  10. Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression

    PubMed Central

    Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

    2006-01-01

    The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (Pgpdh-1::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using Pgpdh-1::GFP expression as a phenotype, we screened ≈16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function. PMID:16880390

  11. Spermatogenesis Drives Rapid Gene Creation and Masculinization of the X Chromosome in Stalk-Eyed Flies (Diopsidae).

    PubMed

    Baker, Richard H; Narechania, Apurva; DeSalle, Rob; Johns, Philip M; Reinhardt, Josephine A; Wilkinson, Gerald S

    2016-03-26

    Throughout their evolutionary history, genomes acquire new genetic material that facilitates phenotypic innovation and diversification. Developmental processes associated with reproduction are particularly likely to involve novel genes. Abundant gene creation impacts the evolution of chromosomal gene content and general regulatory mechanisms such as dosage compensation. Numerous studies in model organisms have found complex and, at times contradictory, relationships among these genomic attributes highlighting the need to examine these patterns in other systems characterized by abundant sexual selection. Therefore, we examined the association among novel gene creation, tissue-specific gene expression, and chromosomal gene content within stalk-eyed flies. Flies in this family are characterized by strong sexual selection and the presence of a newly evolved X chromosome. We generated RNA-seq transcriptome data from the testes for three species within the family and from seven additional tissues in the highly dimorphic species,Teleopsis dalmanni Analysis of dipteran gene orthology reveals dramatic testes-specific gene creation in stalk-eyed flies, involving numerous gene families that are highly conserved in other insect groups. Identification of X-linked genes for the three species indicates that the X chromosome arose prior to the diversification of the family. The most striking feature of this X chromosome is that it is highly masculinized, containing nearly twice as many testes-specific genes as expected based on its size. All the major processes that may drive differential sex chromosome gene content-creation of genes with male-specific expression, development of male-specific expression from pre-existing genes, and movement of genes with male-specific expression-are elevated on the X chromosome ofT. dalmanni This masculinization occurs despite evidence that testes expressed genes do not achieve the same levels of gene expression on the X chromosome as they do on the autosomes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. In situ hybridization analysis of the expression of futsch, tau, and MESK2 homologues in the brain of the European honeybee (Apis mellifera L.).

    PubMed

    Kaneko, Kumi; Hori, Sayaka; Morimoto, Mai M; Nakaoka, Takayoshi; Paul, Rajib Kumar; Fujiyuki, Tomoko; Shirai, Kenichi; Wakamoto, Akiko; Tsuboko, Satomi; Takeuchi, Hideaki; Kubo, Takeo

    2010-02-16

    The importance of visual sense in Hymenopteran social behavior is suggested by the existence of a Hymenopteran insect-specific neural circuit related to visual processing and the fact that worker honeybee brain changes morphologically according to its foraging experience. To analyze molecular and neural bases that underlie the visual abilities of the honeybees, we used a cDNA microarray to search for gene(s) expressed in a neural cell-type preferential manner in a visual center of the honeybee brain, the optic lobes (OLs). Expression analysis of candidate genes using in situ hybridization revealed two genes expressed in a neural cell-type preferential manner in the OLs. One is a homologue of Drosophila futsch, which encodes a microtubule-associated protein and is preferentially expressed in the monopolar cells in the lamina of the OLs. The gene for another microtubule-associated protein, tau, which functionally overlaps with futsch, was also preferentially expressed in the monopolar cells, strongly suggesting the functional importance of these two microtubule-associated proteins in monopolar cells. The other gene encoded a homologue of Misexpression Suppressor of Dominant-negative Kinase Suppressor of Ras 2 (MESK2), which might activate Ras/MAPK-signaling in Drosophila. MESK2 was expressed preferentially in a subclass of neurons located in the ventral region between the lamina and medulla neuropil in the OLs, suggesting that this subclass is a novel OL neuron type characterized by MESK2-expression. These three genes exhibited similar expression patterns in the worker, drone, and queen brains, suggesting that they function similarly irrespective of the honeybee sex or caste. Here we identified genes that are expressed in a monopolar cell (Amfutsch and Amtau) or ventral medulla-preferential manner (AmMESK2) in insect OLs. These genes may aid in visualizing neurites of monopolar cells and ventral medulla cells, as well as in analyzing the function of these neurons.

  13. TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes.

    PubMed

    Jung, Inuk; Jo, Kyuri; Kang, Hyejin; Ahn, Hongryul; Yu, Youngjae; Kim, Sun

    2017-12-01

    Identifying biologically meaningful gene expression patterns from time series gene expression data is important to understand the underlying biological mechanisms. To identify significantly perturbed gene sets between different phenotypes, analysis of time series transcriptome data requires consideration of time and sample dimensions. Thus, the analysis of such time series data seeks to search gene sets that exhibit similar or different expression patterns between two or more sample conditions, constituting the three-dimensional data, i.e. gene-time-condition. Computational complexity for analyzing such data is very high, compared to the already difficult NP-hard two dimensional biclustering algorithms. Because of this challenge, traditional time series clustering algorithms are designed to capture co-expressed genes with similar expression pattern in two sample conditions. We present a triclustering algorithm, TimesVector, specifically designed for clustering three-dimensional time series data to capture distinctively similar or different gene expression patterns between two or more sample conditions. TimesVector identifies clusters with distinctive expression patterns in three steps: (i) dimension reduction and clustering of time-condition concatenated vectors, (ii) post-processing clusters for detecting similar and distinct expression patterns and (iii) rescuing genes from unclassified clusters. Using four sets of time series gene expression data, generated by both microarray and high throughput sequencing platforms, we demonstrated that TimesVector successfully detected biologically meaningful clusters of high quality. TimesVector improved the clustering quality compared to existing triclustering tools and only TimesVector detected clusters with differential expression patterns across conditions successfully. The TimesVector software is available at http://biohealth.snu.ac.kr/software/TimesVector/. sunkim.bioinfo@snu.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. A versatile genetic tool for post-translational control of gene expression in Drosophila melanogaster

    PubMed Central

    Sethi, Sachin

    2017-01-01

    Several techniques have been developed to manipulate gene expression temporally in intact neural circuits. However, the applicability of current tools developed for in vivo studies in Drosophila is limited by their incompatibility with existing GAL4 lines and side effects on physiology and behavior. To circumvent these limitations, we adopted a strategy to reversibly regulate protein degradation with a small molecule by using a destabilizing domain (DD). We show that this system is effective across different tissues and developmental stages. We further show that this system can be used to control in vivo gene expression levels with low background, large dynamic range, and in a reversible manner without detectable side effects on the lifespan or behavior of the animal. Additionally, we engineered tools for chemically controlling gene expression (GAL80-DD) and recombination (FLP-DD). We demonstrate the applicability of this technology in manipulating neuronal activity and for high-efficiency sparse labeling of neuronal populations. PMID:29140243

  15. IAOseq: inferring abundance of overlapping genes using RNA-seq data.

    PubMed

    Sun, Hong; Yang, Shuang; Tun, Liangliang; Li, Yixue

    2015-01-01

    Overlapping transcription constitutes a common mechanism for regulating gene expression. A major limitation of the overlapping transcription assays is the lack of high throughput expression data. We developed a new tool (IAOseq) that is based on reads distributions along the transcribed regions to identify the expression levels of overlapping genes from standard RNA-seq data. Compared with five commonly used quantification methods, IAOseq showed better performance in the estimation accuracy of overlapping transcription levels. For the same strand overlapping transcription, currently existing high-throughput methods are rarely available to distinguish which strand was present in the original mRNA template. The IAOseq results showed that the commonly used methods gave an average of 1.6 fold overestimation of the expression levels of same strand overlapping genes. This work provides a useful tool for mining overlapping transcription levels from standard RNA-seq libraries. IAOseq could be used to help us understand the complex regulatory mechanism mediated by overlapping transcripts. IAOseq is freely available at http://lifecenter.sgst.cn/main/en/IAO_seq.jsp.

  16. GATA-dependent regulation of TPO-induced c-mpl gene expression during megakaryopoiesis.

    PubMed

    Sunohara, Masataka; Morikawa, Shigeru; Fuse, Akira; Sato, Iwao

    2014-01-01

    Thrombopoietin (TPO) and its receptor, c-Mpl, play the crucial role during megakaryocytopoiesis. Previously, we have shown that the promoter activity of c-mpl induced by TPO is modulated by transcription through a PKC-dependent pathway and that GATA(-77) is involved as a positive regulatory element in TPO-induced c-mpl gene expression in the megakaryoblastic CMK cells. In this research, to examine participating possibility of GATA promoter element in TPO- induced c-mpl gene expression through a PKC-independent pathway, the promoter activity of site-directed mutagenesis and the effect of potein kinase C modulator were measured by a transient transfection assay system. Together with our previous results on the TPO-induced c-mpl promoter, this study indicates destruction of -77GATA in c-mpl promoter decreased the activity by 47.3% under existence of GF109203. These results suggest that GATA promoter element plays significant role in TPO-induced c-mpl gene expression through a PKC-independent pathway.

  17. Massive-scale gene co-expression network construction and robustness testing using random matrix theory.

    PubMed

    Gibson, Scott M; Ficklin, Stephen P; Isaacson, Sven; Luo, Feng; Feltus, Frank A; Smith, Melissa C

    2013-01-01

    The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.

  18. Establishment of a novel collagenase perfusion method to isolate rat pancreatic stellate cells and investigation of their gene expression of TGF-beta1, type I collagen, and CTGF in primary culture or freshly isolated cells.

    PubMed

    Shinji, Toshiyuki; Ujike, Kozo; Ochi, Koji; Kusano, Nobuchika; Kikui, Tetsuya; Matsumura, Naoki; Emori, Yasuyuki; Seno, Toshinobu; Koide, Norio

    2002-08-01

    In studies of the pathogenesis of pancreatic fibrosis, pancreatic stellate cells (PSCs) have recently gained attention. In the present study, we established a new collagenase perfusion method through thoracic aorta cannulation to isolate PSCs, and we studied gene expression of TGF-beta1, type I collagen, and connective tissue growth factor using primary cultured PSCs. Our method facilitated PSC isolation, and by our new method, 4.3 +/- 1.2 x 10(6) PSCs were obtained from a rat. In comparing the expression of these genes with that of hepatic stellate cells (HSCs), we observed a similar pattern, although PSCs expressed type I collagen gene earlier than did HSCs. These results suggest that PSCs may play an important role in fibrosis of the pancreas, as HSCs do in liver fibrosis; in addition, PSCs may exist in a preactivated state or may be more easily activated than are HSCs. We also isolated the PSCs from a WBN/Kob rat, the spontaneous pancreatitis rat, and compared the gene expression with that from a normal rat.

  19. Distinct iris gene expression profiles of primary angle closure glaucoma and primary open angle glaucoma and their interaction with ocular biometric parameters.

    PubMed

    Seet, Li-Fong; Narayanaswamy, Arun; Finger, Sharon N; Htoon, Hla M; Nongpiur, Monisha E; Toh, Li Zhen; Ho, Henrietta; Perera, Shamira A; Wong, Tina T

    2016-11-01

    This study aimed to evaluate differences in iris gene expression profiles between primary angle closure glaucoma (PACG) and primary open angle glaucoma (POAG) and their interaction with biometric characteristics. Prospective study. Thirty-five subjects with PACG and thirty-three subjects with POAG who required trabeculectomy were enrolled at the Singapore National Eye Centre, Singapore. Iris specimens, obtained by iridectomy, were analysed by real-time polymerase chain reaction for expression of type I collagen, vascular endothelial growth factor (VEGF)-A, -B and -C, as well as VEGF receptors (VEGFRs) 1 and 2. Anterior segment optical coherence tomography (ASOCT) imaging for biometric parameters, including anterior chamber depth (ACD), anterior chamber volume (ACV) and lens vault (LV), was also performed pre-operatively. Relative mRNA levels between PACG and POAG irises, biometric measurements, discriminant analyses using genes and biometric parameters. COL1A1, VEGFB, VEGFC and VEGFR2 mRNA expression was higher in PACG compared to POAG irises. LV, ACD and ACV were significantly different between the two subgroups. Discriminant analyses based on gene expression, biometric parameters or a combination of both gene expression and biometrics (LV and ACV), correctly classified 94.1%, 85.3% and 94.1% of the original PACG and POAG cases, respectively. The discriminant function combining genes and biometrics demonstrated the highest accuracy in cross-validated classification of the two glaucoma subtypes. Distinct iris gene expression supports the pathophysiological differences that exist between PACG and POAG. Biometric parameters can combine with iris gene expression to more accurately define PACG from POAG. © 2016 The Authors. Clinical & Experimental Ophthalmology published by John Wiley & Sons Australia, Ltd on behalf of Royal Australian and New Zealand College of Ophthalmologists.

  20. Genetic divergence in the transcriptional engram of chronic alcohol abuse: A laser-capture RNA-seq study of the mouse mesocorticolimbic system.

    PubMed

    Mulligan, Megan K; Mozhui, Khyobeni; Pandey, Ashutosh K; Smith, Maren L; Gong, Suzhen; Ingels, Jesse; Miles, Michael F; Lopez, Marcelo F; Lu, Lu; Williams, Robert W

    2017-02-01

    Genetic factors that influence the transition from initial drinking to dependence remain enigmatic. Recent studies have leveraged chronic intermittent ethanol (CIE) paradigms to measure changes in brain gene expression in a single strain at 0, 8, 72 h, and even 7 days following CIE. We extend these findings using LCM RNA-seq to profile expression in 11 brain regions in two inbred strains - C57BL/6J (B6) and DBA/2J (D2) - 72 h following multiple cycles of ethanol self-administration and CIE. Linear models identified differential expression based on treatment, region, strain, or interactions with treatment. Nearly 40% of genes showed a robust effect (FDR < 0.01) of region, and hippocampus CA1, cortex, bed nucleus stria terminalis, and nucleus accumbens core had the highest number of differentially expressed genes after treatment. Another 8% of differentially expressed genes demonstrated a robust effect of strain. As expected, based on similar studies in B6, treatment had a much smaller impact on expression; only 72 genes (p < 0.01) are modulated by treatment (independent of region or strain). Strikingly, many more genes (415) show a strain-specific and largely opposite response to treatment and are enriched in processes related to RNA metabolism, transcription factor activity, and mitochondrial function. Over 3 times as many changes in gene expression were detected in D2 compared to B6, and weighted gene co-expression network analysis (WGCNA) module comparison identified more modules enriched for treatment effects in D2. Substantial strain differences exist in the temporal pattern of transcriptional neuroadaptation to CIE, and these may drive individual differences in risk of addiction following excessive alcohol consumption. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Application of the Gini correlation coefficient to infer regulatory relationships in transcriptome analysis.

    PubMed

    Ma, Chuang; Wang, Xiangfeng

    2012-09-01

    One of the computational challenges in plant systems biology is to accurately infer transcriptional regulation relationships based on correlation analyses of gene expression patterns. Despite several correlation methods that are applied in biology to analyze microarray data, concerns regarding the compatibility of these methods with the gene expression data profiled by high-throughput RNA transcriptome sequencing (RNA-Seq) technology have been raised. These concerns are mainly due to the fact that the distribution of read counts in RNA-Seq experiments is different from that of fluorescence intensities in microarray experiments. Therefore, a comprehensive evaluation of the existing correlation methods and, if necessary, introduction of novel methods into biology is appropriate. In this study, we compared four existing correlation methods used in microarray analysis and one novel method called the Gini correlation coefficient on previously published microarray-based and sequencing-based gene expression data in Arabidopsis (Arabidopsis thaliana) and maize (Zea mays). The comparisons were performed on more than 11,000 regulatory relationships in Arabidopsis, including 8,929 pairs of transcription factors and target genes. Our analyses pinpointed the strengths and weaknesses of each method and indicated that the Gini correlation can compensate for the shortcomings of the Pearson correlation, the Spearman correlation, the Kendall correlation, and the Tukey's biweight correlation. The Gini correlation method, with the other four evaluated methods in this study, was implemented as an R package named rsgcc that can be utilized as an alternative option for biologists to perform clustering analyses of gene expression patterns or transcriptional network analyses.

  2. Gene expression profiles associated with depression in patients with chronic hepatitis C (CH-C)

    PubMed Central

    Birerdinc, Aybike; Afendy, Arian; Stepanova, Maria; Younossi, Issah; Baranova, Ancha; Younossi, Zobair M

    2012-01-01

    The standard treatment for CH-C, pegylated interferon-α and ribavirin (PEG-IFN + RBV), is associated with depression. Recent studies have proposed a new role for cytokines in the pathogenesis of depression. We aimed to assess differential gene expression related to depression in CH-C patients treated with PEG-IFN + RBV. We included 67 CH-C patients being treated with PEG-IFN+RBV. Of the entire study cohort, 22% had pre-existing depression, while another 37% developed new depression in course of the treatment. Pretreatment blood samples were collected into PAXgene™ RNA tubes, the RNAs extracted from peripheral blood mononuclear cells (PBMCs) were used for one step RT-PCR to profile 160 mRNAs. Differentially expressed genes were separated into up- and down-regulated genes according to presence or absence of depression at baseline (pre-existing depression) or following the initiation of treatment (treatment-related depression). The mRNA expression profile associated with any depression and with treatment-related depression included four and six genes, respectively. Our data demonstrate a significant down-regulation of TGF-β1 and the shift of Th1-Th2 cytokine balance in the depression associated with IFN-based treatment of HCV infection. We propose that TGF-β1 plays an important role in the imbalance of Th1/Th2 in patients with CH-C and depression. With further validation, TGF-β1 and other components of Th1/Th2 regulation pathway may provide a future marker for CH-C patients predisposed to depression. PMID:23139898

  3. Gene expression profiles associated with depression in patients with chronic hepatitis C (CH-C).

    PubMed

    Birerdinc, Aybike; Afendy, Arian; Stepanova, Maria; Younossi, Issah; Baranova, Ancha; Younossi, Zobair M

    2012-09-01

    The standard treatment for CH-C, pegylated interferon-α and ribavirin (PEG-IFN + RBV), is associated with depression. Recent studies have proposed a new role for cytokines in the pathogenesis of depression. We aimed to assess differential gene expression related to depression in CH-C patients treated with PEG-IFN + RBV. We included 67 CH-C patients being treated with PEG-IFN+RBV. Of the entire study cohort, 22% had pre-existing depression, while another 37% developed new depression in course of the treatment. Pretreatment blood samples were collected into PAXgene™ RNA tubes, the RNAs extracted from peripheral blood mononuclear cells (PBMCs) were used for one step RT-PCR to profile 160 mRNAs. Differentially expressed genes were separated into up- and down-regulated genes according to presence or absence of depression at baseline (pre-existing depression) or following the initiation of treatment (treatment-related depression). The mRNA expression profile associated with any depression and with treatment-related depression included four and six genes, respectively. Our data demonstrate a significant down-regulation of TGF-β1 and the shift of Th1-Th2 cytokine balance in the depression associated with IFN-based treatment of HCV infection. We propose that TGF-β1 plays an important role in the imbalance of Th1/Th2 in patients with CH-C and depression. With further validation, TGF-β1 and other components of Th1/Th2 regulation pathway may provide a future marker for CH-C patients predisposed to depression.

  4. Application of the Gini Correlation Coefficient to Infer Regulatory Relationships in Transcriptome Analysis[W][OA

    PubMed Central

    Ma, Chuang; Wang, Xiangfeng

    2012-01-01

    One of the computational challenges in plant systems biology is to accurately infer transcriptional regulation relationships based on correlation analyses of gene expression patterns. Despite several correlation methods that are applied in biology to analyze microarray data, concerns regarding the compatibility of these methods with the gene expression data profiled by high-throughput RNA transcriptome sequencing (RNA-Seq) technology have been raised. These concerns are mainly due to the fact that the distribution of read counts in RNA-Seq experiments is different from that of fluorescence intensities in microarray experiments. Therefore, a comprehensive evaluation of the existing correlation methods and, if necessary, introduction of novel methods into biology is appropriate. In this study, we compared four existing correlation methods used in microarray analysis and one novel method called the Gini correlation coefficient on previously published microarray-based and sequencing-based gene expression data in Arabidopsis (Arabidopsis thaliana) and maize (Zea mays). The comparisons were performed on more than 11,000 regulatory relationships in Arabidopsis, including 8,929 pairs of transcription factors and target genes. Our analyses pinpointed the strengths and weaknesses of each method and indicated that the Gini correlation can compensate for the shortcomings of the Pearson correlation, the Spearman correlation, the Kendall correlation, and the Tukey’s biweight correlation. The Gini correlation method, with the other four evaluated methods in this study, was implemented as an R package named rsgcc that can be utilized as an alternative option for biologists to perform clustering analyses of gene expression patterns or transcriptional network analyses. PMID:22797655

  5. MIDAS: Mining differentially activated subpaths of KEGG pathways from multi-class RNA-seq data.

    PubMed

    Lee, Sangseon; Park, Youngjune; Kim, Sun

    2017-07-15

    Pathway based analysis of high throughput transcriptome data is a widely used approach to investigate biological mechanisms. Since a pathway consists of multiple functions, the recent approach is to determine condition specific sub-pathways or subpaths. However, there are several challenges. First, few existing methods utilize explicit gene expression information from RNA-seq. More importantly, subpath activity is usually an average of statistical scores, e.g., correlations, of edges in a candidate subpath, which fails to reflect gene expression quantity information. In addition, none of existing methods can handle multiple phenotypes. To address these technical problems, we designed and implemented an algorithm, MIDAS, that determines condition specific subpaths, each of which has different activities across multiple phenotypes. MIDAS utilizes gene expression quantity information fully and the network centrality information to determine condition specific subpaths. To test performance of our tool, we used TCGA breast cancer RNA-seq gene expression profiles with five molecular subtypes. 36 differentially activate subpaths were determined. The utility of our method, MIDAS, was demonstrated in four ways. All 36 subpaths are well supported by the literature information. Subsequently, we showed that these subpaths had a good discriminant power for five cancer subtype classification and also had a prognostic power in terms of survival analysis. Finally, in a performance comparison of MIDAS to a recent subpath prediction method, PATHOME, our method identified more subpaths and much more genes that are well supported by the literature information. http://biohealth.snu.ac.kr/software/MIDAS/. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  6. Contribution of transposable elements in the plant's genome.

    PubMed

    Sahebi, Mahbod; Hanafi, Mohamed M; van Wijnen, Andre J; Rice, David; Rafii, M Y; Azizi, Parisa; Osman, Mohamad; Taheri, Sima; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat; Noor, Yusuf Muhammad

    2018-07-30

    Plants maintain extensive growth flexibility under different environmental conditions, allowing them to continuously and rapidly adapt to alterations in their environment. A large portion of many plant genomes consists of transposable elements (TEs) that create new genetic variations within plant species. Different types of mutations may be created by TEs in plants. Many TEs can avoid the host's defense mechanisms and survive alterations in transposition activity, internal sequence and target site. Thus, plant genomes are expected to utilize a variety of mechanisms to tolerate TEs that are near or within genes. TEs affect the expression of not only nearby genes but also unlinked inserted genes. TEs can create new promoters, leading to novel expression patterns or alternative coding regions to generate alternate transcripts in plant species. TEs can also provide novel cis-acting regulatory elements that act as enhancers or inserts within original enhancers that are required for transcription. Thus, the regulation of plant gene expression is strongly managed by the insertion of TEs into nearby genes. TEs can also lead to chromatin modifications and thereby affect gene expression in plants. TEs are able to generate new genes and modify existing gene structures by duplicating, mobilizing and recombining gene fragments. They can also facilitate cellular functions by sharing their transposase-coding regions. Hence, TE insertions can not only act as simple mutagens but can also alter the elementary functions of the plant genome. Here, we review recent discoveries concerning the contribution of TEs to gene expression in plant genomes and discuss the different mechanisms by which TEs can affect plant gene expression and reduce host defense mechanisms. Copyright © 2018 Elsevier B.V. All rights reserved.

  7. Statistical approach for selection of biologically informative genes.

    PubMed

    Das, Samarendra; Rai, Anil; Mishra, D C; Rai, Shesh N

    2018-05-20

    Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques have been proposed so far are either based on relevancy or redundancy measure. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biological relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R Package, i.e. BootMRMR has been developed and available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes from high dimensional expression data for breeding and system biology studies. Published by Elsevier B.V.

  8. Transcriptome Profiling of Buffalograss Challenged with the Leaf Spot Pathogen Curvularia inaequalis.

    PubMed

    Amaradasa, Bimal S; Amundsen, Keenan

    2016-01-01

    Buffalograss (Bouteloua dactyloides) is a low maintenance U. S. native turfgrass species with exceptional drought, heat, and cold tolerance. Leaf spot caused by Curvularia inaequalis negatively impacts buffalograss visual quality. Two leaf spot susceptible and two resistant buffalograss lines were challenged with C. inaequalis. Samples were collected from treated and untreated leaves when susceptible lines showed symptoms. Transcriptome sequencing was done and differentially expressed genes were identified. Approximately 27 million raw sequencing reads were produced per sample. More than 86% of the sequencing reads mapped to an existing buffalograss reference transcriptome. De novo assembly of unmapped reads was merged with the existing reference to produce a more complete transcriptome. There were 461 differentially expressed transcripts between the resistant and susceptible lines when challenged with the pathogen and 1552 in its absence. Previously characterized defense-related genes were identified among the differentially expressed transcripts. Twenty one resistant line transcripts were similar to genes regulating pattern triggered immunity and 20 transcripts were similar to genes regulating effector triggered immunity. There were also nine up-regulated transcripts in resistance lines which showed potential to initiate systemic acquired resistance (SAR) and three transcripts encoding pathogenesis-related proteins which are downstream products of SAR. This is the first study characterizing changes in the buffalograss transcriptome when challenged with C. inaequalis.

  9. Clustering of time-course gene expression profiles using normal mixture models with autoregressive random effects

    PubMed Central

    2012-01-01

    Background Time-course gene expression data such as yeast cell cycle data may be periodically expressed. To cluster such data, currently used Fourier series approximations of periodic gene expressions have been found not to be sufficiently adequate to model the complexity of the time-course data, partly due to their ignoring the dependence between the expression measurements over time and the correlation among gene expression profiles. We further investigate the advantages and limitations of available models in the literature and propose a new mixture model with autoregressive random effects of the first order for the clustering of time-course gene-expression profiles. Some simulations and real examples are given to demonstrate the usefulness of the proposed models. Results We illustrate the applicability of our new model using synthetic and real time-course datasets. We show that our model outperforms existing models to provide more reliable and robust clustering of time-course data. Our model provides superior results when genetic profiles are correlated. It also gives comparable results when the correlation between the gene profiles is weak. In the applications to real time-course data, relevant clusters of coregulated genes are obtained, which are supported by gene-function annotation databases. Conclusions Our new model under our extension of the EMMIX-WIRE procedure is more reliable and robust for clustering time-course data because it adopts a random effects model that allows for the correlation among observations at different time points. It postulates gene-specific random effects with an autocorrelation variance structure that models coregulation within the clusters. The developed R package is flexible in its specification of the random effects through user-input parameters that enables improved modelling and consequent clustering of time-course data. PMID:23151154

  10. Correlating Gene-specific DNA Methylation Changes with Expression and Transcriptional Activity of Astrocytic KCNJ10 (Kir4.1)

    PubMed Central

    Nwaobi, Sinifunanya E.; Olsen, Michelle L.

    2015-01-01

    DNA methylation serves to regulate gene expression through the covalent attachment of a methyl group onto the C5 position of a cytosine in a cytosine-guanine dinucleotide. While DNA methylation provides long-lasting and stable changes in gene expression, patterns and levels of DNA methylation are also subject to change based on a variety of signals and stimuli. As such, DNA methylation functions as a powerful and dynamic regulator of gene expression. The study of neuroepigenetics has revealed a variety of physiological and pathological states that are associated with both global and gene-specific changes in DNA methylation. Specifically, striking correlations between changes in gene expression and DNA methylation exist in neuropsychiatric and neurodegenerative disorders, during synaptic plasticity, and following CNS injury. However, as the field of neuroepigenetics continues to expand its understanding of the role of DNA methylation in CNS physiology, delineating causal relationships in regards to changes in gene expression and DNA methylation are essential. Moreover, in regards to the larger field of neuroscience, the presence of vast region and cell-specific differences requires techniques that address these variances when studying the transcriptome, proteome, and epigenome. Here we describe FACS sorting of cortical astrocytes that allows for subsequent examination of a both RNA transcription and DNA methylation. Furthermore, we detail a technique to examine DNA methylation, methylation sensitive high resolution melt analysis (MS-HRMA) as well as a luciferase promoter assay. Through the use of these combined techniques one is able to not only explore correlative changes between DNA methylation and gene expression, but also directly assess if changes in the DNA methylation status of a given gene region are sufficient to affect transcriptional activity. PMID:26436772

  11. Endothelin-1 expression is strongly repressed by AU-rich elements in the 3′-untranslated region of the gene

    PubMed Central

    2004-01-01

    The regulation of the synthesis of the endothelial-derived vasoconstrictor ET-1 (endothelin-1) is a complex process that occurs mainly at the mRNA level. Transcription of the gene accounts for an important part of the regulation of expression, as already described for different modulators such as the cytokine TGF-β (transforming growth factor-β). However, very little is known about mechanisms governing ET-1 expression at the post-transcriptional level. The aim of the present study was to investigate the regulation of the ET-1 expression at this level. Since the 3′-UTR (3′-untranslated region) of mRNAs commonly contains genetic determinants for the post-transcriptional control of gene expression, we focused on the potential role of the 3′-UTR of ET-1 mRNA. Experiments performed with luciferase reporter constructs containing the 3′-UTR showed that this region exerts a potent destabilizing effect. Deletional analyses allowed us to locate this activity within a region at positions 924–1127. Some (but not all) of the AREs (AU-rich elements) present in this region were found to be essential for this mRNA-destabilizing activity. We also present evidence that cytosolic proteins from endothelial cells interact specifically with these RNA elements, and that a close correlation exists between the ability of the AREs to destabilize ET-1 mRNA and the binding of proteins to these elements. Our results are compatible with the existence of a strong repressional control of ET-1 expression mediated by destabilization of the mRNA exerted through the interaction of specific cytosolic proteins with AREs present in the 3′-UTR of the gene. PMID:15595926

  12. Apoptosis-Related Gene Expression in an Adult Cohort with Crimean-Congo Hemorrhagic Fever.

    PubMed

    Guler, Nil; Eroglu, Cafer; Yilmaz, Hava; Karadag, Adil; Alacam, Hasan; Sunbul, Mustafa; Fletcher, Tom E; Leblebicioglu, Hakan

    2016-01-01

    Crimean-Congo Hemorrhagic Fever (CCHF) is a life threatening acute viral infection characterized by fever, bleeding, leukopenia and thrombocytopenia. It is a major emerging infectious diseases threat, but its pathogenesis remains poorly understood and few data exist for the role of apoptosis in acute infection. We aimed to assess apoptotic gene expression in leukocytes in a cross-sectional cohort study of adults with CCHF. Twenty participants with CCHF and 10 healthy controls were recruited at a tertiary CCHF unit in Turkey; at admission baseline blood tests were collected and total RNA was isolated. The RealTime ready Human Apoptosis Panel was used for real-time PCR, detecting differences in gene expression. Participants had CCHF severity grading scores (SGS) with low risk score (10 out of 20) and intermediate or high risk scores (10 out of 20) for mortality. Five of 20 participants had a fatal outcome. Gene expression analysis showed modulation of pro-apoptotic and anti-apoptotic genes that facilitate apoptosis in the CCHF patient group. Dominant extrinsic pathway activation, mostly related with TNF family members was observed. Severe and fatal cases suggest additional intrinsic pathway activation. The clinical significance of relative gene expression is not clear, and larger longitudinal studies with simultaneous measurement of host and viral factors are recommended.

  13. The novel product of a five-exon stargazin-related gene abolishes CaV2.2 calcium channel expression

    PubMed Central

    Moss, Fraser J.; Viard, Patricia; Davies, Anthony; Bertaso, Federica; Page, Karen M.; Graham, Alex; Cantí, Carles; Plumpton, Mary; Plumpton, Christopher; Clare, Jeffrey J.; Dolphin, Annette C.

    2002-01-01

    We have cloned and characterized a new member of the voltage-dependent Ca2+ channel γ subunit family, with a novel gene structure and striking properties. Unlike the genes of other potential γ subunits identified by their homology to the stargazin gene, CACNG7 is a five-, and not four-exon gene whose mRNA encodes a protein we have designated γ7. Expression of human γ7 has been localized specifically to brain. N-type current through CaV2.2 channels was almost abolished when co-expressed transiently with γ7 in either Xenopus oocytes or COS-7 cells. Furthermore, immunocytochemistry and western blots show that γ7 has this effect by causing a large reduction in expression of CaV2.2 rather than by interfering with trafficking or biophysical properties of the channel. No effect of transiently expressed γ7 was observed on pre-existing endogenous N-type calcium channels in sympathetic neurones. Low homology to the stargazin-like γ subunits, different gene structure and the unique functional properties of γ7 imply that it represents a distinct subdivision of the family of proteins identified by their structural and sequence homology to stargazin. PMID:11927536

  14. Genomic Comparison of the P-ATPase Gene Family in Four Cotton Species and Their Expression Patterns in Gossypium hirsutum.

    PubMed

    Chen, Wen; Si, Guo-Yang; Zhao, Gang; Abdullah, Muhammad; Guo, Ning; Li, Da-Hui; Sun, Xu; Cai, Yong-Ping; Lin, Yi; Gao, Jun-Shan

    2018-05-05

    Plant P-type H⁺-ATPase (P-ATPase) is a membrane protein existing in the plasma membrane that plays an important role in the transmembrane transport of plant cells. To understand the variety and quantity of P-ATPase proteins in different cotton species, we combined four databases from two diploid cotton species ( Gossypium raimondii and G. arboreum ) and two tetraploid cotton species ( G. hirsutum and G. barbadense ) to screen the P-ATPase gene family and resolved the evolutionary relationships between the former cotton species. We identified 53, 51, 99 and 98 P-ATPase genes from G. arboretum, G. raimondii , G. barbadense and G. hirsutum , respectively. The structural and phylogenetic analyses revealed that the gene structure was consistent between P-ATPase genes, with a close evolutionary relationship. The expression analysis of P-ATPase genes showed that many P-ATPase genes were highly expressed in various tissues and at different fiber developmental stages in G. hirsutum , suggesting that they have potential functions during growth and fiber development in cotton.

  15. An Adaptive Genetic Association Test Using Double Kernel Machines.

    PubMed

    Zhan, Xiang; Epstein, Michael P; Ghosh, Debashis

    2015-10-01

    Recently, gene set-based approaches have become very popular in gene expression profiling studies for assessing how genetic variants are related to disease outcomes. Since most genes are not differentially expressed, existing pathway tests considering all genes within a pathway suffer from considerable noise and power loss. Moreover, for a differentially expressed pathway, it is of interest to select important genes that drive the effect of the pathway. In this article, we propose an adaptive association test using double kernel machines (DKM), which can both select important genes within the pathway as well as test for the overall genetic pathway effect. This DKM procedure first uses the garrote kernel machines (GKM) test for the purposes of subset selection and then the least squares kernel machine (LSKM) test for testing the effect of the subset of genes. An appealing feature of the kernel machine framework is that it can provide a flexible and unified method for multi-dimensional modeling of the genetic pathway effect allowing for both parametric and nonparametric components. This DKM approach is illustrated with application to simulated data as well as to data from a neuroimaging genetics study.

  16. Impact of Cigarette Smoke on the Human and Mouse Lungs: A Gene-Expression Comparison Study

    PubMed Central

    Morissette, Mathieu C.; Lamontagne, Maxime; Bérubé, Jean-Christophe; Gaschler, Gordon; Williams, Andrew; Yauk, Carole; Couture, Christian; Laviolette, Michel; Hogg, James C.; Timens, Wim; Halappanavar, Sabina; Stampfli, Martin R.; Bossé, Yohan

    2014-01-01

    Cigarette smoke is well known for its adverse effects on human health, especially on the lungs. Basic research is essential to identify the mechanisms involved in the development of cigarette smoke-related diseases, but translation of new findings from pre-clinical models to the clinic remains difficult. In the present study, we aimed at comparing the gene expression signature between the lungs of human smokers and mice exposed to cigarette smoke to identify the similarities and differences. Using human and mouse whole-genome gene expression arrays, changes in gene expression, signaling pathways and biological functions were assessed. We found that genes significantly modulated by cigarette smoke in humans were enriched for genes modulated by cigarette smoke in mice, suggesting a similar response of both species. Sixteen smoking-induced genes were in common between humans and mice including six newly reported to be modulated by cigarette smoke. In addition, we identified a new conserved pulmonary response to cigarette smoke in the induction of phospholipid metabolism/degradation pathways. Finally, the majority of biological functions modulated by cigarette smoke in humans were also affected in mice. Altogether, the present study provides information on similarities and differences in lung gene expression response to cigarette smoke that exist between human and mouse. Our results foster the idea that animal models should be used to study the involvement of pathways rather than single genes in human diseases. PMID:24663285

  17. Transcriptome database resource and gene expression atlas for the rose

    PubMed Central

    2012-01-01

    Background For centuries roses have been selected based on a number of traits. Little information exists on the genetic and molecular basis that contributes to these traits, mainly because information on expressed genes for this economically important ornamental plant is scarce. Results Here, we used a combination of Illumina and 454 sequencing technologies to generate information on Rosa sp. transcripts using RNA from various tissues and in response to biotic and abiotic stresses. A total of 80714 transcript clusters were identified and 76611 peptides have been predicted among which 20997 have been clustered into 13900 protein families. BLASTp hits in closely related Rosaceae species revealed that about half of the predicted peptides in the strawberry and peach genomes have orthologs in Rosa dataset. Digital expression was obtained using RNA samples from organs at different development stages and under different stress conditions. qPCR validated the digital expression data for a selection of 23 genes with high or low expression levels. Comparative gene expression analyses between the different tissues and organs allowed the identification of clusters that are highly enriched in given tissues or under particular conditions, demonstrating the usefulness of the digital gene expression analysis. A web interface ROSAseq was created that allows data interrogation by BLAST, subsequent analysis of DNA clusters and access to thorough transcript annotation including best BLAST matches on Fragaria vesca, Prunus persica and Arabidopsis. The rose peptides dataset was used to create the ROSAcyc resource pathway database that allows access to the putative genes and enzymatic pathways. Conclusions The study provides useful information on Rosa expressed genes, with thorough annotation and an overview of expression patterns for transcripts with good accuracy. PMID:23164410

  18. Epigenetic silencing of a foreign gene in nuclear transformants of Chlamydomonas.

    PubMed Central

    Cerutti, H; Johnson, A M; Gillham, N W; Boynton, J E

    1997-01-01

    The unstable expression of introduced genes poses a serious problem for the application of transgenic technology in plants. In transformants of the unicellular green alga Chlamydomonas reinhardtii, expression of a eubacterial aadA gene, conferring spectinomycin resistance, is transcriptionally suppressed by a reversible epigenetic mechanism(s). Variations in the size and frequency of colonies surviving on different concentrations of spectinomycin as well as the levels of transcriptional activity of the introduced transgene(s) suggest the existence of intermediate expression states in genetically identical cells. Gene silencing does not correlate with methylation of the integrated DNA and does not involve large alterations in its chromatin structure, as revealed by digestion with restriction endonucleases and DNase I. Transgene repression is enhanced by lower temperatures, similar to position effect variegation in Drosophila. By analogy to epigenetic phenomena in several eukaryotes, our results suggest a possible role for (hetero)chromatic chromosomal domains in transcriptional inactivation. PMID:9212467

  19. The vertebrate phylotypic stage and an early bilaterian-related stage in mouse embryogenesis defined by genomic information.

    PubMed

    Irie, Naoki; Sehara-Fujisawa, Atsuko

    2007-01-12

    Embryos of taxonomically different vertebrates are thought to pass through a stage in which they resemble one another morphologically. This "vertebrate phylotypic stage" may represent the basic vertebrate body plan that was established in the common ancestor of vertebrates. However, much controversy remains about when the phylotypic stage appears, and whether it even exists. To overcome the limitations of studies based on morphological comparison, we explored a comprehensive quantitative method for defining the constrained stage using expressed sequence tag (EST) data, gene ontologies (GO), and available genomes of various animals. If strong developmental constraints occur during the phylotypic stage of vertebrate embryos, then genes conserved among vertebrates would be highly expressed at this stage. We established a novel method for evaluating the ancestral nature of mouse embryonic stages that does not depend on comparative morphology. The numerical "ancestor index" revealed that the mouse indeed has a highly conserved embryonic period at embryonic day 8.0-8.5, the time of appearance of the pharyngeal arch and somites. During this period, the mouse prominently expresses GO-determined developmental genes shared among vertebrates. Similar analyses revealed the existence of a bilaterian-related period, during which GO-determined developmental genes shared among bilaterians are markedly expressed at the cleavage-to-gastrulation period. The genes associated with the phylotypic stage identified by our method are essential in embryogenesis. Our results demonstrate that the mid-embryonic stage of the mouse is indeed highly constrained, supporting the existence of the phylotypic stage. Furthermore, this candidate stage is preceded by a putative bilaterian ancestor-related period. These results not only support the developmental hourglass model, but also highlight the hierarchical aspect of embryogenesis proposed by von Baer. Identification of conserved stages and tissues by this method in various animals would be a powerful tool to examine the phylotypic stage hypothesis, and to understand which kinds of developmental events and gene sets are evolutionarily constrained and how they limit the possible variations of animal basic body plans.

  20. Identification and expression analysis of the genes involved in serotonin biosynthesis and transduction in the field cricket Gryllus bimaculatus.

    PubMed

    Watanabe, T; Sadamoto, Hitoshi; Aonuma, H

    2011-10-01

    Serotonin (5-HT) modulates various aspects of behaviours such as aggressive behaviour and circadian behaviour in the cricket. To elucidate the molecular basis of the cricket 5-HT system, we identified 5-HT-related genes in the field cricket Gryllus bimaculatus DeGeer. Complementary DNA of tryptophan hydroxylase and phenylalanine-tryptophan hydroxylase, which convert tryptophan into 5-hydroxy-L-tryptophan (5-HTP), and that of aromatic L-amino acid decarboxylase, which converts 5-HTP into 5-HT, were isolated from a cricket brain cDNA library. In addition, four 5-HT receptor genes (5-HT(1A) , 5-HT(1B) , 5-HT(2α) , and 5-HT(7) ) were identified. Expression analysis of the tryptophan hydroxylase gene TRH and phenylalanine-tryptophan hydroxylase gene TPH, which are selectively involved in neuronal and peripheral 5-HT synthesis in Drosophila, suggested that two 5-HT synthesis pathways co-exist in the cricket neuronal tissues. The four 5-HT receptor genes were expressed in various tissues at differential expression levels, suggesting that the 5-HT system is widely distributed in the cricket. © 2011 The Authors. Insect Molecular Biology © 2011 The Royal Entomological Society.

  1. Post-transcriptional inducible gene regulation by natural antisense RNA.

    PubMed

    Nishizawa, Mikio; Ikeya, Yukinobu; Okumura, Tadayoshi; Kimura, Tominori

    2015-01-01

    Accumulating data indicate the existence of natural antisense transcripts (asRNAs), frequently transcribed from eukaryotic genes and do not encode proteins in many cases. However, their importance has been overlooked due to their heterogeneity, low expression level, and unknown function. Genes induced in responses to various stimuli are transcriptionally regulated by the activation of a gene promoter and post-transcriptionally regulated by controlling mRNA stability and translatability. A low-copy-number asRNA may post-transcriptionally regulate gene expression with cis-controlling elements on the mRNA. The asRNA itself may act as regulatory RNA in concert with trans-acting factors, including various RNA-binding proteins that bind to cis-controlling elements, microRNAs, and drugs. A novel mechanism that regulates mRNA stability includes the interaction of asRNA with mRNA by hybridization to loops in secondary structures. Furthermore, recent studies have shown that the functional network of mRNAs, asRNAs, and microRNAs finely tunes the levels of mRNA expression. The post-transcriptional mechanisms via these RNA-RNA interactions may play pivotal roles to regulate inducible gene expression and present the possibility of the involvement of asRNAs in various diseases.

  2. Simulation Modeling to Compare High-Throughput, Low-Iteration Optimization Strategies for Metabolic Engineering

    PubMed Central

    Heinsch, Stephen C.; Das, Siba R.; Smanski, Michael J.

    2018-01-01

    Increasing the final titer of a multi-gene metabolic pathway can be viewed as a multivariate optimization problem. While numerous multivariate optimization algorithms exist, few are specifically designed to accommodate the constraints posed by genetic engineering workflows. We present a strategy for optimizing expression levels across an arbitrary number of genes that requires few design-build-test iterations. We compare the performance of several optimization algorithms on a series of simulated expression landscapes. We show that optimal experimental design parameters depend on the degree of landscape ruggedness. This work provides a theoretical framework for designing and executing numerical optimization on multi-gene systems. PMID:29535690

  3. Presence and Expression of Microbial Genes Regulating Soil Nitrogen Dynamics Along the Tanana River Successional Sequence

    NASA Astrophysics Data System (ADS)

    Boone, R. D.; Rogers, S. L.

    2004-12-01

    We report on work to assess the functional gene sequences for soil microbiota that control nitrogen cycle pathways along the successional sequence (willow, alder, poplar, white spruce, black spruce) on the Tanana River floodplain, Interior Alaska. Microbial DNA and mRNA were extracted from soils (0-10 cm depth) for amoA (ammonium monooxygenase), nifH (nitrogenase reductase), napA (nitrate reductase), and nirS and nirK (nitrite reductase) genes. Gene presence was determined by amplification of a conserved sequence of each gene employing sequence specific oligonucleotide primers and Polymerase Chain Reaction (PCR). Expression of the genes was measured via nested reverse transcriptase PCR amplification of the extracted mRNA. Amplified PCR products were visualized on agarose electrophoresis gels. All five successional stages show evidence for the presence and expression of microbial genes that regulate N fixation (free-living), nitrification, and nitrate reduction. We detected (1) nifH, napA, and nirK presence and amoA expression (mRNA production) for all five successional stages and (2) nirS and amoA presence and nifH, nirK, and napA expression for early successional stages (willow, alder, poplar). The results highlight that the existing body of previous process-level work has not sufficiently considered the microbial potential for a nitrate economy and free-living N fixation along the complete floodplain successional sequence.

  4. The bromodomain protein LEX-1 acts with TAM-1 to modulate gene expression in C. elegans.

    PubMed

    Tseng, Rong-Jeng; Armstrong, Kristin R; Wang, Xiaodong; Chamberlin, Helen M

    2007-11-01

    In many organisms, repetitive DNA serves as a trigger for gene silencing. However, some gene expression is observed from repetitive genomic regions such as heterochromatin, suggesting mechanisms exist to modulate the silencing effects. From a genetic screen in C. elegans, we have identified mutations in two genes important for expression of repetitive sequences: lex-1 and tam-1. Here we show that lex-1 encodes a protein containing an ATPase domain and a bromodomain. LEX-1 is similar to the yeast Yta7 protein, which maintains boundaries between silenced and active chromatin. tam-1 has previously been shown to encode a RING finger/B-box protein that modulates gene expression from repetitive DNA. We find that lex-1, like tam-1, acts as a class B synthetic multivulva (synMuv) gene. However, since lex-1 and tam-1 mutants have normal P granule localization, it suggests they act through a mechanism distinct from other class B synMuvs. We observe intragenic (interallelic) complementation with lex-1 and a genetic interaction between lex-1 and tam-1, data consistent with the idea that the gene products function in the same biological process, perhaps as part of a protein complex. We propose that LEX-1 and TAM-1 function together to influence chromatin structure and to promote expression from repetitive sequences.

  5. Ambient pH Controls Glycogen Levels by Regulating Glycogen Synthase Gene Expression in Neurospora crassa. New Insights into the pH Signaling Pathway

    PubMed Central

    Cupertino, Fernanda Barbosa; Freitas, Fernanda Zanolli; de Paula, Renato Magalhães; Bertolini, Maria Célia

    2012-01-01

    Glycogen is a polysaccharide widely distributed in microorganisms and animal cells and its metabolism is under intricate regulation. Its accumulation in a specific situation results from the balance between glycogen synthase and glycogen phosphorylase activities that control synthesis and degradation, respectively. These enzymes are highly regulated at transcriptional and post-translational levels. The existence of a DNA motif for the Aspergillus nidulans pH responsive transcription factor PacC in the promoter of the gene encoding glycogen synthase (gsn) in Neurospora crassa prompted us to investigate whether this transcription factor regulates glycogen accumulation. Transcription factors such as PacC in A. nidulans and Rim101p in Saccharomyces cerevisiae play a role in the signaling pathway that mediates adaptation to ambient pH by inducing the expression of alkaline genes and repressing acidic genes. We showed here that at pH 7.8 pacC was over-expressed and gsn was down-regulated in wild-type N. crassa coinciding with low glycogen accumulation. In the pacCKO strain the glycogen levels and gsn expression at alkaline pH were, respectively, similar to and higher than the wild-type strain at normal pH (5.8). These results characterize gsn as an acidic gene and suggest a regulatory role for PACC in gsn expression. The truncated recombinant protein, containing the DNA-binding domain specifically bound to a gsn DNA fragment containing the PacC motif. DNA-protein complexes were observed with extracts from cells grown at normal and alkaline pH and confirmed by ChIP-PCR analysis. The PACC present in these extracts showed equal molecular mass, indicating that the protein is already processed at normal pH, in contrast to A. nidulans. Together, these results show that the pH signaling pathway controls glycogen accumulation by regulating gsn expression and suggest the existence of a different mechanism for PACC activation in N. crassa. PMID:22952943

  6. Diurnal and developmental differences in gene expression between adult dispersing and flightless morphs of the wing polymorphic cricket, Gryllus firmus: Implications for life-history evolution.

    PubMed

    Zera, Anthony J; Vellichirammal, Neetha Nanoth; Brisson, Jennifer A

    2018-04-12

    The functional basis of life history adaptation is a key topic of research in life history evolution. Studies of wing-polymorphism in the cricket Gryllus firmus have played a prominent role in this field. However, prior in-depth investigations of morph specialization have primarily focused on a single hormone, juvenile hormone, and a single aspect of intermediary metabolism, the fatty-acid biosynthetic component of lipid metabolism. Moreover, the role of diurnal variation in life history adaptation in G. firmus has been understudied, as is the case for organisms in general. Here, we identify genes whose expression differs consistently between the morphs independent of time-of-day during early adulthood, as well as genes that exhibit a strong pattern of morph-specific diurnal expression. We find strong, consistent, morph-specific differences in the expression of genes involved in endocrine regulation, carbohydrate and lipid metabolism, and immunity - in particular, in the expression of an insulin-like-peptide precursor gene and genes involved in triglyceride production. We also find that the flight-capable morph exhibited a substantially greater number of genes exhibiting diurnal change in gene expression compared with the flightless morph, correlated with the greater circadian change in the hemolymph juvenile titer in the dispersing morph. In fact, diurnal differences in expression within the dispersing morph at different times of the day were significantly greater in magnitude than differences between dispersing and flightless morphs at the same time-of-day. These results provide important baseline information regarding the potential role of variable gene expression on life history specialization in morphs of G. firmus, and the first information on genetically-variable, diurnal change in gene expression, associated with a key life history polymorphism. These results also suggest the existence of prominent morph-specific circadian differences in gene expression in G. firmus, possibly caused by the morph-specific circadian rhythm in the juvenile hormone titer. Copyright © 2018 Elsevier Ltd. All rights reserved.

  7. Gene Expression Profiling of Liver Cancer Stem Cells by RNA-Sequencing

    PubMed Central

    Lam, Chi Tat; Ng, Michael N. P.; Yu, Wan Ching; Lau, Joyce; Wan, Timothy; Wang, Xiaoqi; Yan, Zhixiang; Liu, Hang; Fan, Sheung Tat

    2012-01-01

    Background Accumulating evidence supports that tumor growth and cancer relapse are driven by cancer stem cells. Our previous work has demonstrated the existence of CD90+ liver cancer stem cells (CSCs) in hepatocellular carcinoma (HCC). Nevertheless, the characteristics of these cells are still poorly understood. In this study, we employed a more sensitive RNA-sequencing (RNA-Seq) to compare the gene expression profiling of CD90+ cells sorted from tumor (CD90+CSCs) with parallel non-tumorous liver tissues (CD90+NTSCs) and elucidate the roles of putative target genes in hepatocarcinogenesis. Methodology/Principal Findings CD90+ cells were sorted respectively from tumor and adjacent non-tumorous human liver tissues using fluorescence-activated cell sorting. The amplified RNAs of CD90+ cells from 3 HCC patients were subjected to RNA-Seq analysis. A differential gene expression profile was established between CD90+CSCs and CD90+NTSCs, and validated by quantitative real-time PCR (qRT-PCR) on the same set of amplified RNAs, and further confirmed in an independent cohort of 12 HCC patients. Five hundred genes were differentially expressed (119 up-regulated and 381 down-regulated genes) between CD90+CSCs and CD90+NTSCs. Gene ontology analysis indicated that the over-expressed genes in CD90+CSCs were associated with inflammation, drug resistance and lipid metabolism. Among the differentially expressed genes, glypican-3 (GPC3), a member of glypican family, was markedly elevated in CD90+CSCs compared to CD90+NTSCs. Immunohistochemistry demonstrated that GPC3 was highly expressed in forty-two human liver tumor tissues but absent in adjacent non-tumorous liver tissues. Flow cytometry indicated that GPC3 was highly expressed in liver CD90+CSCs and mature cancer cells in liver cancer cell lines and human liver tumor tissues. Furthermore, GPC3 expression was positively correlated with the number of CD90+CSCs in liver tumor tissues. Conclusions/Significance The identified genes, such as GPC3 that are distinctly expressed in liver CD90+CSCs, may be promising gene candidates for HCC therapy without inducing damages to normal liver stem cells. PMID:22606345

  8. Constitutive gene expression and specification of tissue identity in adult planarian biology

    PubMed Central

    Reddien, Peter W.

    2011-01-01

    Planarians are flatworms that constitutively maintain adult tissues through cell turnover and can regenerate entire organisms from tiny body fragments. In addition to requiring new cells (from neoblasts), these feats require mechanisms that specify tissue identity in the adult. Critical roles for Wnt and BMP signaling in regeneration and maintenance of the body axes have been uncovered, among other regulatory factors. Available data indicate that genes involved in positional identity regulation at key embryonic stages in other animals display persisting regionalized expression in adult planarians. These expression patterns suggest that a constitutively active gene expression map exists for maintenance of the planarian body. Planarians therefore present a fertile ground for identification of factors regulating regionalization of the metazoan body plan and for study of the attributes of these factors that can lead to maintenance and regeneration of adult tissues. PMID:21680047

  9. Effects of 4-chlorophenol wastewater treatment on sludge acute toxicity, microbial diversity and functional genes expression in an activated sludge process.

    PubMed

    Zhao, Jianguo; Li, Yahe; Li, Yu; Yu, Zeya; Chen, Xiurong

    2018-05-31

    In this study, the effects of 4-chlorophenol (4-CP) wastewater treatment on sludge acute toxicity of luminescent bacteria, microbial diversity and functional genes expression of Pseudomonas were explored. Results showed that in the entire operational process, the sludge acute toxicity acclimated by 4-CP in a sequencing batch bioreactor (SBR) was significantly higher than the control SBR without 4-CP. The dominant phyla in acclimated SBR were Proteobacteria and Firmicutes, which also existed in control SBR. Some identified genera in acclimated SBR were responsible for 4-CP degradation. At the stable operational stages, the functional genes expression of Pseudomonas in acclimated SBR was down-regulated at the end of SBR cycle, and their expression mechanisms needed further research. This study provides a theoretical support to comprehensively understand the sludge performance in industrial wastewater treatment. Copyright © 2018 Elsevier Ltd. All rights reserved.

  10. The expression of Longus type 4 pilus of enterotoxigenic Escherichia coli is regulated by LngR and LngS and by H-NS, CpxR and CRP global regulators.

    PubMed

    De la Cruz, Miguel A; Ruiz-Tagle, Alejandro; Ares, Miguel A; Pacheco, Sabino; Yáñez, Jorge A; Cedillo, Lilia; Torres, Javier; Girón, Jorge A

    2017-05-01

    Enterotoxigenic Escherichia coli produces a long type 4 pilus called Longus. The regulatory elements and the environmental signals controlling the expression of Longus-encoding genes are unknown. We identified two genes lngR and lngS in the Longus operon, whose predicted products share homology with transcriptional regulators. Isogenic lngR and lngS mutants were considerably affected in transcription of lngA pilin gene. The expression of lngA, lngR and lngS genes was optimally expressed at 37°C at pH 7.5. The presence of glucose and sodium chloride had a positive effect on Longus expression. The presence of divalent ions, particularly calcium, appears to be an important stimulus for Longus production. In addition, we studied H-NS, CpxR and CRP global regulators, on Longus expression. The response regulator CpxR appears to function as a positive regulator of lng genes as the cpxR mutant showed reduced levels of lngRSA expression. In contrast, H-NS and CRP function as negative regulators since expression of lngA was up-regulated in isogenic hns and crp mutants. H-NS and CRP were required for salt- and glucose-mediated regulation of Longus. Our data suggest the existence of a complex regulatory network controlling Longus expression, involving both local and global regulators in response to different environmental signals. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

  11. Relative IGF-1 and IGF-2 gene expression in maternal and fetal tissues from diabetic swine

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wolverton, C.K.; Leaman, D.W.; White, M.E.

    1990-02-26

    Fourteen pregnant, crossbred gilts were utilized in this study. Seven gilts were injected with alloxan (50 mg/kg) at day 75 of gestation to induce diabetes. Gilts underwent caesarean section on day 105 of gestation. Samples were collected from maternal skeletal muscle, adipose tissue, uterus and endometrium; and from fetal skeletal muscle, adipose tissue, placenta, liver, lung, kidney, heart, brain and spleen. Tissues were frozen in liquid nitrogen for later analysis of IGF-1 and IGF-2 gene expression. Samples were pooled and total RNA was isolated using the guanidine isothiocynate method. Total mRNA was analyzed by dot blot hybridization. Blots were probedmore » with {sup 32}P-cDNA for porcine IGF-1 and rat IGF-2. IGF-1 gene expression in maternal tissues was unaffected by diabetes. Maternal diabetes increased IGF-2 mRNA in maternal adipose tissue but exhibited no effect in muscle or uterus. Expression of IGF-2 by maternal endometrium was decreased by diabetes. Maternal diabetes induced an increase in IGF-1 gene expression in muscle and placenta while causing an increase in IGF-2 expression in fetal liver and placenta. IGF-2 mRNA was lower in lung from fetuses of diabetic mothers than in controls. These results suggest that maternal diabetes alters IGF-1 and IGF-2 gene expression in specific tissues and differential regulation of these genes appears to exist in the mother and developing fetus.« less

  12. Genome-wide DNA methylation profiling integrated with gene expression profiling identifies PAX9 as a novel prognostic marker in chronic lymphocytic leukemia.

    PubMed

    Rani, Lata; Mathur, Nitin; Gupta, Ritu; Gogia, Ajay; Kaur, Gurvinder; Dhanjal, Jaspreet Kaur; Sundar, Durai; Kumar, Lalit; Sharma, Atul

    2017-01-01

    In chronic lymphocytic leukemia (CLL), epigenomic and genomic studies have expanded the existing knowledge about the disease biology and led to the identification of potential biomarkers relevant for implementation of personalized medicine. In this study, an attempt has been made to examine and integrate the global DNA methylation changes with gene expression profile and their impact on clinical outcome in early stage CLL patients. The integration of DNA methylation profile ( n  = 14) with the gene expression profile ( n  = 21) revealed 142 genes as hypermethylated-downregulated and; 62 genes as hypomethylated-upregulated in early stage CLL patients compared to CD19+ B-cells from healthy individuals. The mRNA expression levels of 17 genes identified to be differentially methylated and/or differentially expressed was further examined in early stage CLL patients ( n  = 93) by quantitative real time PCR (RQ-PCR). Significant differences were observed in the mRNA expression of MEIS1 , PMEPA1 , SOX7 , SPRY1 , CDK6 , TBX2 , and SPRY2 genes in CLL cells as compared to B-cells from healthy individuals. The analysis in the IGHV mutation based categories (Unmutated = 39, Mutated = 54) revealed significantly higher mRNA expression of CRY1 and PAX9 genes in the IGHV unmutated subgroup ( p  < 0.001). The relative risk of treatment initiation was significantly higher among patients with high expression of CRY1 (RR = 1.91, p  = 0.005) or PAX9 (RR = 1.87, p  = 0.001). High expression of CRY1 (HR: 3.53, p  < 0.001) or PAX9 (HR: 3.14, p  < 0.001) gene was significantly associated with shorter time to first treatment. The high expression of PAX9 gene (HR: 3.29, 95% CI 1.172-9.272, p  = 0.016) was also predictive of shorter overall survival in CLL. The DNA methylation changes associated with mRNA expression of CRY1 and PAX9 genes allow risk stratification of early stage CLL patients. This comprehensive analysis supports the concept that the epigenetic changes along with the altered expression of genes have the potential to predict clinical outcome in early stage CLL patients.

  13. Infraspecific DNA methylation polymorphism in cotton (Gossypium hirsutum L.).

    PubMed

    Keyte, Anna L; Percifield, Ryan; Liu, Bao; Wendel, Jonathan F

    2006-01-01

    Cytosine methylation is important in the epigenetic regulation of gene expression and development in plants and has been implicated in silencing duplicate genes after polyploid formation in several plant groups. Relatively little information exists, however, on levels and patterns of methylation polymorphism (MP) at homologous loci within species. Here we explored the levels and patterns of methylation-polymorphism diversity at CCGG sites within allotetraploid cotton, Gossypium hirsutum, using a methylation-sensitive amplified fragment length polymorphism screen and a selected set of 20 G. hirsutum accessions for which we have information on genetic polymorphism levels and relationships. Methylation and MP exist at high levels within G. hirsutum: of 150 HpaII/MspI sites surveyed, 48 were methylated at the inner cytosine (32%) and 32 of these were polymorphic (67%). Both these values are higher than comparable measures of genetic diversity using restriction fragment length polymorphisms. The high percentage of methylation-polymorphic sites and potential relationship to gene expression underscore the potential significance of MP within and among populations. We speculate that biased correlation of methylation-polymorphic sites and genes in cotton may be a consequence of polyploidy and the attendant doubling of all genes.

  14. Gene expression profiling in rat kidney after intratracheal exposure to cadmium-doped nanoparticles

    NASA Astrophysics Data System (ADS)

    Coccini, Teresa; Roda, Elisa; Fabbri, Marco; Sacco, Maria Grazia; Gribaldo, Laura; Manzo, Luigi

    2012-08-01

    While nephrotoxicity of cadmium is well documented, very limited information exists on renal effects of exposure to cadmium-containing nanomaterials. In this work, "omics" methodologies have been used to assess the action of cadmium-containing silica nanoparticles (Cd-SiNPs) in the kidney of Sprague-Dawley rats exposed intratracheally. Groups of animals received a single dose of Cd-SiNPs (1 mg/rat), CdCl2 (400 μg/rat) or 0.1 ml saline (control). Renal gene expression was evaluated 7 and 30 days post exposure by DNA microarray technology using the Agilent Whole Rat Genome Microarray 4x44K. Gene modulating effects were observed in kidney at both time periods after treatment with Cd-SiNPs. The number of differentially expressed genes being 139 and 153 at the post exposure days 7 and 30, respectively. Renal gene expression changes were also observed in the kidney of CdCl2-treated rats with a total of 253 and 70 probes modulated at 7 and 30 days, respectively. Analysis of renal gene expression profiles at day 7 indicated in both Cd-SiNP and CdCl2 groups downregulation of several cluster genes linked to immune function, oxidative stress, and inflammation processes. Differing from day 7, the majority of cluster gene categories modified by nanoparticles in kidney 30 days after dosing were genes implicated in cell regulation and apoptosis. Modest renal gene expression changes were observed at day 30 in rats treated with CdCl2. These results indicate that kidney may be a susceptible target for subtle long-lasting molecular alterations produced by cadmium nanoparticles locally instilled in the lung.

  15. Targeted and genome-scale methylomics reveals gene body signatures in human cell lines

    PubMed Central

    Ball, Madeleine Price; Li, Jin Billy; Gao, Yuan; Lee, Je-Hyuk; LeProust, Emily; Park, In-Hyun; Xie, Bin; Daley, George Q.; Church, George M.

    2012-01-01

    Cytosine methylation, an epigenetic modification of DNA, is a target of growing interest for developing high throughput profiling technologies. Here we introduce two new, complementary techniques for cytosine methylation profiling utilizing next generation sequencing technology: bisulfite padlock probes (BSPPs) and methyl sensitive cut counting (MSCC). In the first method, we designed a set of ~10,000 BSPPs distributed over the ENCODE pilot project regions to take advantage of existing expression and chromatin immunoprecipitation data. We observed a pattern of low promoter methylation coupled with high gene body methylation in highly expressed genes. Using the second method, MSCC, we gathered genome-scale data for 1.4 million HpaII sites and confirmed that gene body methylation in highly expressed genes is a consistent phenomenon over the entire genome. Our observations highlight the usefulness of techniques which are not inherently or intentionally biased in favor of only profiling particular subsets like CpG islands or promoter regions. PMID:19329998

  16. Sustained expression of MCP-1 by low wall shear stress loading concomitant with turbulent flow on endothelial cells of intracranial aneurysm.

    PubMed

    Aoki, Tomohiro; Yamamoto, Kimiko; Fukuda, Miyuki; Shimogonya, Yuji; Fukuda, Shunichi; Narumiya, Shuh

    2016-05-09

    Enlargement of a pre-existing intracranial aneurysm is a well-established risk factor of rupture. Excessive low wall shear stress concomitant with turbulent flow in the dome of an aneurysm may contribute to progression and rupture. However, how stress conditions regulate enlargement of a pre-existing aneurysm remains to be elucidated. Wall shear stress was calculated with 3D-computational fluid dynamics simulation using three cases of unruptured intracranial aneurysm. The resulting value, 0.017 Pa at the dome, was much lower than that in the parent artery. We loaded wall shear stress corresponding to the value and also turbulent flow to the primary culture of endothelial cells. We then obtained gene expression profiles by RNA sequence analysis. RNA sequence analysis detected hundreds of differentially expressed genes among groups. Gene ontology and pathway analysis identified signaling related with cell division/proliferation as overrepresented in the low wall shear stress-loaded group, which was further augmented by the addition of turbulent flow. Moreover, expression of some chemoattractants for inflammatory cells, including MCP-1, was upregulated under low wall shear stress with concomitant turbulent flow. We further examined the temporal sequence of expressions of factors identified in an in vitro study using a rat model. No proliferative cells were detected, but MCP-1 expression was induced and sustained in the endothelial cell layer. Low wall shear stress concomitant with turbulent flow contributes to sustained expression of MCP-1 in endothelial cells and presumably plays a role in facilitating macrophage infiltration and exacerbating inflammation, which leads to enlargement or rupture.

  17. The impact of rare variation on gene expression across tissues.

    PubMed

    Li, Xin; Kim, Yungil; Tsang, Emily K; Davis, Joe R; Damani, Farhan N; Chiang, Colby; Hess, Gaelen T; Zappala, Zachary; Strober, Benjamin J; Scott, Alexandra J; Li, Amy; Ganna, Andrea; Bassik, Michael C; Merker, Jason D; Hall, Ira M; Battle, Alexis; Montgomery, Stephen B

    2017-10-11

    Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.

  18. A structured sparse regression method for estimating isoform expression level from multi-sample RNA-seq data.

    PubMed

    Zhang, L; Liu, X J

    2016-06-03

    With the rapid development of next-generation high-throughput sequencing technology, RNA-seq has become a standard and important technique for transcriptome analysis. For multi-sample RNA-seq data, the existing expression estimation methods usually deal with each single-RNA-seq sample, and ignore that the read distributions are consistent across multiple samples. In the current study, we propose a structured sparse regression method, SSRSeq, to estimate isoform expression using multi-sample RNA-seq data. SSRSeq uses a non-parameter model to capture the general tendency of non-uniformity read distribution for all genes across multiple samples. Additionally, our method adds a structured sparse regularization, which not only incorporates the sparse specificity between a gene and its corresponding isoform expression levels, but also reduces the effects of noisy reads, especially for lowly expressed genes and isoforms. Four real datasets were used to evaluate our method on isoform expression estimation. Compared with other popular methods, SSRSeq reduced the variance between multiple samples, and produced more accurate isoform expression estimations, and thus more meaningful biological interpretations.

  19. Meta-Analysis of Multiple Sclerosis Microarray Data Reveals Dysregulation in RNA Splicing Regulatory Genes.

    PubMed

    Paraboschi, Elvezia Maria; Cardamone, Giulia; Rimoldi, Valeria; Gemmati, Donato; Spreafico, Marta; Duga, Stefano; Soldà, Giulia; Asselta, Rosanna

    2015-09-30

    Abnormalities in RNA metabolism and alternative splicing (AS) are emerging as important players in complex disease phenotypes. In particular, accumulating evidence suggests the existence of pathogenic links between multiple sclerosis (MS) and altered AS, including functional studies showing that an imbalance in alternatively-spliced isoforms may contribute to disease etiology. Here, we tested whether the altered expression of AS-related genes represents a MS-specific signature. A comprehensive comparative analysis of gene expression profiles of publicly-available microarray datasets (190 MS cases, 182 controls), followed by gene-ontology enrichment analysis, highlighted a significant enrichment for differentially-expressed genes involved in RNA metabolism/AS. In detail, a total of 17 genes were found to be differentially expressed in MS in multiple datasets, with CELF1 being dysregulated in five out of seven studies. We confirmed CELF1 downregulation in MS (p=0.0015) by real-time RT-PCRs on RNA extracted from blood cells of 30 cases and 30 controls. As a proof of concept, we experimentally verified the unbalance in alternatively-spliced isoforms in MS of the NFAT5 gene, a putative CELF1 target. In conclusion, for the first time we provide evidence of a consistent dysregulation of splicing-related genes in MS and we discuss its possible implications in modulating specific AS events in MS susceptibility genes.

  20. Transcriptional network inference from functional similarity and expression data: a global supervised approach.

    PubMed

    Ambroise, Jérôme; Robert, Annie; Macq, Benoit; Gala, Jean-Luc

    2012-01-06

    An important challenge in system biology is the inference of biological networks from postgenomic data. Among these biological networks, a gene transcriptional regulatory network focuses on interactions existing between transcription factors (TFs) and and their corresponding target genes. A large number of reverse engineering algorithms were proposed to infer such networks from gene expression profiles, but most current methods have relatively low predictive performances. In this paper, we introduce the novel TNIFSED method (Transcriptional Network Inference from Functional Similarity and Expression Data), that infers a transcriptional network from the integration of correlations and partial correlations of gene expression profiles and gene functional similarities through a supervised classifier. In the current work, TNIFSED was applied to predict the transcriptional network in Escherichia coli and in Saccharomyces cerevisiae, using datasets of 445 and 170 affymetrix arrays, respectively. Using the area under the curve of the receiver operating characteristics and the F-measure as indicators, we showed the predictive performance of TNIFSED to be better than unsupervised state-of-the-art methods. TNIFSED performed slightly worse than the supervised SIRENE algorithm for the target genes identification of the TF having a wide range of yet identified target genes but better for TF having only few identified target genes. Our results indicate that TNIFSED is complementary to the SIRENE algorithm, and particularly suitable to discover target genes of "orphan" TFs.

  1. Bmi1 represses Ink4a/Arf and Hox genes to regulate stem cells in the rodent incisor

    PubMed Central

    Biehs, Brian; Hu, Jimmy Kuang-Hsien; Strauli, Nicolas B.; Sangiorgi, Eugenio; Jung, Heekyung; Heber, Ralf-Peter; Ho, Sunita; Goodwin, Alice F.; Dasen, Jeremy S.; Capecchi, Mario R.; Klein, Ophir D.

    2013-01-01

    The polycomb group gene Bmi1 is required for maintenance of adult stem cells in many organs1, 2. Inactivation of Bmi1 leads to impaired stem cell self-renewal due to deregulated gene expression. One critical target of BMI1 is Ink4a/Arf, which encodes the cell cycle inhibitors p16ink4a and p19Arf3. However, deletion of Ink4a/Arf only partially rescues Bmi1 null phenotypes4, indicating that other important targets of BMI1 exist. Here, using the continuously-growing mouse incisor as a model system, we report that Bmi1 is expressed by incisor stem cells and that deletion of Bmi1 resulted in fewer stem cells, perturbed gene expression, and defective enamel production. Transcriptional profiling revealed that Hox expression is normally repressed by BMI1 in the adult, and functional assays demonstrated that BMI1-mediated repression of Hox genes preserves the undifferentiated state of stem cells. As Hox gene upregulation has also been reported in other systems when Bmi1 is inactivated1, 2, 5–7, our findings point to a general mechanism whereby BMI1-mediated repression of Hox genes is required for the maintenance of adult stem cells and for prevention of inappropriate differentiation. PMID:23728424

  2. Expression Profiling of Transcriptome and Its Associated Disease Risk in Yang Deficiency Constitution of Healthy Subjects

    PubMed Central

    Yu, Ruoxi; Yang, Yin; Han, Yuanyuan; Hou, Pengwei; Li, Yingshuai; Li, Siqi

    2016-01-01

    Objectives. Differences among healthy subjects and associated disease risks are of substantial interest in clinical medicine. According to the theory of “constitution-disease correlation” in traditional Chinese medicine, we try to find out if there is any connection between intolerance of cold in Yang deficiency constitution and molecular evidence and if there is any gene expression basis in specific disorders. Methods. Peripheral blood mononuclear cells were collected from Chinese Han individuals with Yang deficiency constitution (n = 20) and balanced constitution (n = 8) (aged 18–28) and global gene expression profiles were determined between them using the Affymetrix HG-U133 Plus 2.0 array. Results. The results showed that when the fold change was ≥1.2 and q ≤ 0.05, 909 genes were upregulated in the Yang deficiency constitution, while 1189 genes were downregulated. According to our research differential genes found in Yang deficiency constitution were usually related to lower immunity, metabolic disorders, and cancer tendency. Conclusion. Gene expression disturbance exists in Yang deficiency constitution, which corresponds to the concept of constitution and gene classification. It also suggests people with Yang deficiency constitution are susceptible to autoimmune diseases, enteritis, arthritis, metabolism disorders, and cancer, which provides molecular evidence for the theory of “constitution-disease correlation.” PMID:28484499

  3. Transcriptome-wide analysis supports environmental adaptations of two Pinus pinaster populations from contrasting habitats.

    PubMed

    Cañas, Rafael A; Feito, Isabel; Fuente-Maqueda, José Francisco; Ávila, Concepción; Majada, Juan; Cánovas, Francisco M

    2015-11-06

    Maritime pine (Pinus pinaster Aiton) grows in a range of different climates in the southwestern Mediterranean region and the existence of a variety of latitudinal ecotypes or provenances is well established. In this study, we have conducted a deep analysis of the transcriptome in needles from two P. pinaster provenances, Leiria (Portugal) and Tamrabta (Morocco), which were grown in northern Spain under the same conditions. An oligonucleotide microarray (PINARRAY3) and RNA-Seq were used for whole-transcriptome analyses, and we found that 90.95% of the data were concordant between the two platforms. Furthermore, the two methods identified very similar percentages of differentially expressed genes with values of 5.5% for PINARRAY3 and 5.7% for RNA-Seq. In total, 6,023 transcripts were shared and 88 differentially expressed genes overlapped in the two platforms. Among the differentially expressed genes, all transport related genes except aquaporins were expressed at higher levels in Tamrabta than in Leiria. In contrast, genes involved in secondary metabolism were expressed at higher levels in Tamrabta, and photosynthesis-related genes were expressed more highly in Leiria. The genes involved in light sensing in plants were well represented in the differentially expressed groups of genes. In addition, increased levels of hormones such as abscisic acid, gibberellins, jasmonic and salicylic acid were observed in Leiria. Both transcriptome platforms have proven to be useful resources, showing complementary and reliable results. The results presented here highlight the different abilities of the two maritime pine populations to sense environmental conditions and reveal one type of regulation that can be ascribed to different genetic and epigenetic backgrounds.

  4. Interleukin-1β modulates smooth muscle cell phenotype to a distinct inflammatory state relative to PDGF-DD via NF-κB-dependent mechanisms.

    PubMed

    Alexander, Matthew R; Murgai, Meera; Moehle, Christopher W; Owens, Gary K

    2012-04-02

    Smooth muscle cell (SMC) phenotypic modulation in atherosclerosis and in response to PDGF in vitro involves repression of differentiation marker genes and increases in SMC proliferation, migration, and matrix synthesis. However, SMCs within atherosclerotic plaques can also express a number of proinflammatory genes, and in cultured SMCs the inflammatory cytokine IL-1β represses SMC marker gene expression and induces inflammatory gene expression. Studies herein tested the hypothesis that IL-1β modulates SMC phenotype to a distinct inflammatory state relative to PDGF-DD. Genome-wide gene expression analysis of IL-1β- or PDGF-DD-treated SMCs revealed that although both stimuli repressed SMC differentiation marker gene expression, IL-1β distinctly induced expression of proinflammatory genes, while PDGF-DD primarily induced genes involved in cell proliferation. Promoters of inflammatory genes distinctly induced by IL-1β exhibited over-representation of NF-κB binding sites, and NF-κB inhibition in SMCs reduced IL-1β-induced upregulation of proinflammatory genes as well as repression of SMC differentiation marker genes. Interestingly, PDGF-DD-induced SMC marker gene repression was not NF-κB dependent. Finally, immunofluorescent staining of mouse atherosclerotic lesions revealed the presence of cells positive for the marker of an IL-1β-stimulated inflammatory SMC, chemokine (C-C motif) ligand 20 (CCL20), but not the PDGF-DD-induced gene, regulator of G protein signaling 17 (RGS17). Results demonstrate that IL-1β- but not PDGF-DD-induced phenotypic modulation of SMC is characterized by NF-κB-dependent activation of proinflammatory genes, suggesting the existence of a distinct inflammatory SMC phenotype. In addition, studies provide evidence for the possible utility of CCL20 and RGS17 as markers of inflammatory and proliferative state SMCs within atherosclerotic plaques in vivo.

  5. The puroindoline b-2 variants are expressed at low levels relative to the puroindoline D1 genes in wheat seeds

    USDA-ARS?s Scientific Manuscript database

    Grain hardness in wheat is largely controlled by the Hardness locus. This locus contains the Puroindoline a and b genes, which were thought to exist as single copy genes on chromosome 5D. In fact, four additional copies of Pinb have been reported, termed Pinb-2v-1 – Pinb-2v4, which map to the grou...

  6. Gene expression profiles of Arabidopsis Cvi seeds during dormancy cycling indicate a common underlying dormancy control mechanism.

    PubMed

    Cadman, Cassandra S C; Toorop, Peter E; Hilhorst, Henk W M; Finch-Savage, William E

    2006-06-01

    Physiologically dormant seeds, like those of Arabidopsis, will cycle through dormant states as seasons change until the environment is favourable for seedling establishment. This phenomenon is widespread in the plant kingdom, but has not been studied at the molecular level. Full-genome microarrays were used for a global transcript analysis of Arabidopsis thaliana (accession Cvi) seeds in a range of dormant and dry after-ripened states during cycling. Principal component analysis of the expression patterns observed showed that they differed in newly imbibed primary dormant seeds, as commonly used in experimental studies, compared with those in the maintained primary and secondary dormant states that exist during cycling. Dormant and after-ripened seeds appear to have equally active although distinct gene expression programmes, dormant seeds having greatly reduced gene expression associated with protein synthesis, potentially controlling the completion of germination. A core set of 442 genes were identified that had higher expression in all dormant states compared with after-ripened states. Abscisic acid (ABA) responsive elements were significantly over-represented in this set of genes the expression of which was enhanced when multiple copies of the elements were present. ABA regulation of dormancy was further supported by expression patterns of key genes in ABA synthesis/catabolism, and dormancy loss in the presence of fluridone. The data support an ABA-gibberelic acid hormone balance mechanism controlling cycling through dormant states that depends on synthetic and catabolic pathways of both hormones. Many of the most highly expressed genes in dormant states were stress-related even in the absence of abiotic stress, indicating that ABA, stress and dormancy responses overlap significantly at the transcriptome level.

  7. Gene expression-based detection of radiation exposure in mice after treatment with granulocyte colony-stimulating factor and lipopolysaccharide.

    PubMed

    Tucker, James D; Grever, William E; Joiner, Michael C; Konski, Andre A; Thomas, Robert A; Smolinski, Joseph M; Divine, George W; Auner, Gregory W

    2012-02-01

    In a large-scale nuclear incident, many thousands of people may be exposed to a wide range of radiation doses. Rapid biological dosimetry will be required on an individualized basis to estimate the exposures and to make treatment decisions. To ameliorate the adverse effects of exposure, victims may be treated with one or more cytokine growth factors, including granulocyte colony-stimulating factor (G-CSF), which has therapeutic efficacy for treating radiation-induced bone marrow ablation by stimulating granulopoiesis. The existence of infections and the administration of G-CSF each may confound the ability to achieve reliable dosimetry by gene expression analysis. In this study, C57BL/6 mice were used to determine the extent to which G-CSF and lipopolysaccharide (LPS, which simulates infection by gram-negative bacteria) alter the expression of genes that are either radiation-responsive or non-responsive, i.e., show potential for use as endogenous controls. Mice were acutely exposed to (60)Co γ rays at either 0 Gy or 6 Gy. Two hours later the animals were injected with either 0.1 mg/kg of G-CSF or 0.3 mg/kg of LPS. Expression levels of 96 different gene targets were evaluated in peripheral blood after an additional 4 or 24 h using real-time quantitative PCR. The results indicate that the expression levels of some genes are altered by LPS, but altered expression after G-CSF treatment was generally not observed. The expression levels of many genes therefore retain utility for biological dosimetry or as endogenous controls. These data suggest that PCR-based quantitative gene expression analyses may have utility in radiation biodosimetry in humans even in the presence of an infection or after treatment with G-CSF.

  8. XBP-1 Regulates a Subset of Endoplasmic Reticulum Resident Chaperone Genes in the Unfolded Protein Response

    PubMed Central

    Lee, Ann-Hwee; Iwakoshi, Neal N.; Glimcher, Laurie H.

    2003-01-01

    The mammalian unfolded protein response (UPR) protects the cell against the stress of misfolded proteins in the endoplasmic reticulum (ER). We have investigated here the contribution of the UPR transcription factors XBP-1, ATF6α, and ATF6β to UPR target gene expression. Gene profiling of cell lines lacking these factors yielded several XBP-1-dependent UPR target genes, all of which appear to act in the ER. These included the DnaJ/Hsp40-like genes, p58IPK, ERdj4, and HEDJ, as well as EDEM, protein disulfide isomerase-P5, and ribosome-associated membrane protein 4 (RAMP4), whereas expression of BiP was only modestly dependent on XBP-1. Surprisingly, given previous reports that enforced expression of ATF6α induced a subset of UPR target genes, cells deficient in ATF6α, ATF6β, or both had minimal defects in upregulating UPR target genes by gene profiling analysis, suggesting the presence of compensatory mechanism(s) for ATF6 in the UPR. Since cells lacking both XBP-1 and ATF6α had significantly impaired induction of select UPR target genes and ERSE reporter activation, XBP-1 and ATF6α may serve partially redundant functions. No UPR target genes that required ATF6β were identified, nor, in contrast to XBP-1 and ATF6α, did the activity of the UPRE or ERSE promoters require ATF6β, suggesting a minor role for it during the UPR. Collectively, these results suggest that the IRE1/XBP-1 pathway is required for efficient protein folding, maturation, and degradation in the ER and imply the existence of subsets of UPR target genes as defined by their dependence on XBP-1. Further, our observations suggest the existence of additional, as-yet-unknown, key regulators of the UPR. PMID:14559994

  9. Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex

    PubMed Central

    Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo

    2005-01-01

    Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430

  10. Transcriptional Response of Honey Bee Larvae Infected with the Bacterial Pathogen Paenibacillus larvae

    PubMed Central

    Cornman, Robert Scott; Lopez, Dawn; Evans, Jay D.

    2013-01-01

    American foulbrood disease of honey bees is caused by the bacterium Paenibacillus larvae. Infection occurs per os in larvae and systemic infection requires a breaching of the host peritrophic matrix and midgut epithelium. Genetic variation exists for both bacterial virulence and host resistance, and a general immunity is achieved by larvae as they age, the basis of which has not been identified. To quickly identify a pool of candidate genes responsive to P. larvae infection, we sequenced transcripts from larvae inoculated with P. larvae at 12 hours post-emergence and incubated for 72 hours, and compared expression levels to a control cohort. We identified 75 genes with significantly higher expression and six genes with significantly lower expression. In addition to several antimicrobial peptides, two genes encoding peritrophic-matrix domains were also up-regulated. Extracellular matrix proteins, proteases/protease inhibitors, and members of the Osiris gene family were prevalent among differentially regulated genes. However, analysis of Drosophila homologs of differentially expressed genes revealed spatial and temporal patterns consistent with developmental asynchrony as a likely confounder of our results. We therefore used qPCR to measure the consistency of gene expression changes for a subset of differentially expressed genes. A replicate experiment sampled at both 48 and 72 hours post infection allowed further discrimination of genes likely to be involved in host response. The consistently responsive genes in our test set included a hymenopteran-specific protein tyrosine kinase, a hymenopteran specific serine endopeptidase, a cytochrome P450 (CYP9Q1), and a homolog of trynity, a zona pellucida domain protein. Of the known honey bee antimicrobial peptides, apidaecin was responsive at both time-points studied whereas hymenoptaecin was more consistent in its level of change between biological replicates and had the greatest increase in expression by RNA-seq analysis. PMID:23762370

  11. Identification of Candidate B-Lymphoma Genes by Cross-Species Gene Expression Profiling

    PubMed Central

    Tompkins, Van S.; Han, Seong-Su; Olivier, Alicia; Syrbu, Sergei; Bair, Thomas; Button, Anna; Jacobus, Laura; Wang, Zebin; Lifton, Samuel; Raychaudhuri, Pradip; Morse, Herbert C.; Weiner, George; Link, Brian; Smith, Brian J.; Janz, Siegfried

    2013-01-01

    Comparative genome-wide expression profiling of malignant tumor counterparts across the human-mouse species barrier has a successful track record as a gene discovery tool in liver, breast, lung, prostate and other cancers, but has been largely neglected in studies on neoplasms of mature B-lymphocytes such as diffuse large B cell lymphoma (DLBCL) and Burkitt lymphoma (BL). We used global gene expression profiles of DLBCL-like tumors that arose spontaneously in Myc-transgenic C57BL/6 mice as a phylogenetically conserved filter for analyzing the human DLBCL transcriptome. The human and mouse lymphomas were found to have 60 concordantly deregulated genes in common, including 8 genes that Cox hazard regression analysis associated with overall survival in a published landmark dataset of DLBCL. Genetic network analysis of the 60 genes followed by biological validation studies indicate FOXM1 as a candidate DLBCL and BL gene, supporting a number of studies contending that FOXM1 is a therapeutic target in mature B cell tumors. Our findings demonstrate the value of the “mouse filter” for genomic studies of human B-lineage neoplasms for which a vast knowledge base already exists. PMID:24130802

  12. Retransformation of marker-free potato for enhanced resistance against fungal pathogens by pyramiding chitinase and wasabi defensin genes.

    PubMed

    Khan, Raham Sher; Darwish, Nader Ahmed; Khattak, Bushra; Ntui, Valentine Otang; Kong, Kynet; Shimomae, Kazuki; Nakamura, Ikuo; Mii, Masahiro

    2014-09-01

    Multi-auto-transformation vector system has been one of the strategies to produce marker-free transgenic plants without using selective chemicals and plant growth regulators and thus facilitating transgene stacking. In the study reported here, retransformation was carried out in marker-free transgenic potato CV. May Queen containing ChiC gene (isolated from Streptomyces griseus strain HUT 6037) with wasabi defensin (WD) gene (isolated from Wasabia japonica) to pyramid the two disease resistant genes. Molecular analyses of the developed shoots confirmed the existence of both the genes of interest (ChiC and WD) in transgenic plants. Co-expression of the genes was confirmed by RT-PCR, northern blot, and western blot analyses. Disease resistance assay of in vitro plants showed that the transgenic lines co-expressing both the ChiC and WD genes had higher resistance against the fungal pathogens, Fusarium oxysporum (Fusarium wilt) and Alternaria solani (early blight) compared to the non-transformed control and the transgenic lines expressing either of the ChiC or WD genes. The disease resistance potential of the transgenic plants could be increased by transgene stacking or multiple transformations.

  13. Mitochondria, oligodendrocytes and inflammation in bipolar disorder: evidence from transcriptome studies points to intriguing parallels with multiple sclerosis

    PubMed Central

    Konradi, Christine; Sillivan, Stephanie E.; Clay, Hayley B.

    2011-01-01

    Gene expression studies of bipolar disorder (BPD) have shown changes in transcriptome profiles in multiple brain regions. Here we summarize the most consistent findings in the scientific literature, and compare them to data from schizophrenia (SZ) and major depressive disorder (MDD). The transcriptome profiles of all three disorders overlap, making the existence of a BPD-specific profile unlikely. Three groups of functionally related genes are consistently expressed at altered levels in BPD, SZ and MDD. Genes involved in energy metabolism and mitochondrial function are downregulated, genes involved in immune response and inflammation are upregulated, and genes expressed in oligodendrocytes are downregulated. Experimental paradigms for multiple sclerosis demonstrate a tight link between energy metabolism, inflammation and demyelination. These studies also show variabilities in the extent of oligodendrocyte stress, which can vary from a downregulation of oligodendrocyte genes, such as observed in psychiatric disorders, to cell death and brain lesions seen in multiple sclerosis. We conclude that experimental models of multiple sclerosis could be of interest for the research of BPD, SZ and MDD. PMID:21310238

  14. Efficient expression systems for cysteine proteases of malaria parasites

    PubMed Central

    Sarduy, Emir Salas; de los A. Chávez Planes, María

    2013-01-01

    Papain-like cysteine proteases of malaria parasites are considered important chemotherapeutic targets or valuable models for the evaluation of drug candidates. Consequently, many of these enzymes have been cloned and expressed in Escherichia coli for their biochemical characterization. However, their expression has been problematic, showing low yield and leading to the formation of insoluble aggregates. Given that highly-productive expression systems are required for the high-throughput evaluation of inhibitors, we analyzed the existing expression systems to identify the causes of such apparent issues. We found that significant divergences in codon and nucleotide composition from host genes are the most probable cause of expression failure, and propose several strategies to overcome these limitations. Finally we predict that yeast hosts Saccharomyces cerevisiae and Pichia pastoris may be better suited than E. coli for the efficient expression of plasmodial genes, presumably leading to soluble and active products reproducing structural and functional characteristics of the natural enzymes. PMID:23018863

  15. Development of novel types of plastid transformation vectors and evaluation of factors controlling expression.

    PubMed

    Herz, Stefan; Füssl, Monika; Steiger, Sandra; Koop, Hans-Ulrich

    2005-12-01

    Two new vector types for plastid transformation were developed and uidA reporter gene expression was compared to standard transformation vectors. The first vector type does not contain any plastid promoter, instead it relies on extension of existing plastid operons and was therefore named "operon-extension" vector. When a strongly expressed plastid operon like psbA was extended by the reporter gene with this vector type, the expression level was superior to that of a standard vector under control of the 16S rRNA promoter. Different insertion sites, promoters and 5'-UTRs were analysed for their effect on reporter gene expression with standard and operon-extension vectors. The 5'-UTR of phage 7 gene 10 in combination with a modified N-terminus was found to yield the highest expression levels. Expression levels were also strongly dependent on external factors like plant or leaf age or light intensity. In the second vector type, named "split" plastid transformation vector, modules of the expression cassette were distributed on two separate vectors. Upon co-transformation of plastids with these vectors, the complete expression cassette became inserted into the plastome. This result can be explained by successive co-integration of the split vectors and final loop-out recombination of the duplicated sequences. The split vector concept was validated with different vector pairs.

  16. ABC gene expression profiles have clinical importance and possibly form a new hallmark of cancer.

    PubMed

    Dvorak, Pavel; Pesta, Martin; Soucek, Pavel

    2017-05-01

    Adenosine triphosphate-binding cassette proteins constitute a large family of active transporters through extracellular and intracellular membranes. Increased drug efflux based on adenosine triphosphate-binding cassette protein activity is related to the development of cancer cell chemoresistance. Several articles have focused on adenosine triphosphate-binding cassette gene expression profiles (signatures), based on the expression of all 49 human adenosine triphosphate-binding cassette genes, in individual tumor types and reported connections to established clinicopathological features. The aim of this study was to test our theory about the existence of adenosine triphosphate-binding cassette gene expression profiles common to multiple types of tumors, which may modify tumor progression and provide clinically relevant information. Such general adenosine triphosphate-binding cassette profiles could constitute a new attribute of carcinogenesis. Our combined cohort consisted of tissues from 151 cancer patients-breast, colorectal, and pancreatic carcinomas. Standard protocols for RNA isolation and quantitative real-time polymerase chain reaction were followed. Gene expression data from individual tumor types as well as a merged tumor dataset were analyzed by bioinformatics tools. Several general adenosine triphosphate-binding cassette profiles, with differences in gene functions, were established and shown to have significant relations to clinicopathological features such as tumor size, histological grade, or clinical stage. Genes ABCC7, A3, A8, A12, and C8 prevailed among the most upregulated or downregulated ones. In conclusion, the results supported our theory about general adenosine triphosphate-binding cassette gene expression profiles and their importance for cancer on clinical as well as research levels. The presence of ABCC7 (official symbol CFTR) among the genes with key roles in the profiles supports the emerging evidence about its crucial role in various cancers. Graphical abstract.

  17. Bacterial evolution through the selective loss of beneficial Genes. Trade-offs in expression involving two loci.

    PubMed Central

    Zinser, Erik R; Schneider, Dominique; Blot, Michel; Kolter, Roberto

    2003-01-01

    The loss of preexisting genes or gene activities during evolution is a major mechanism of ecological specialization. Evolutionary processes that can account for gene loss or inactivation have so far been restricted to one of two mechanisms: direct selection for the loss of gene activities that are disadvantageous under the conditions of selection (i.e., antagonistic pleiotropy) and selection-independent genetic drift of neutral (or nearly neutral) mutations (i.e., mutation accumulation). In this study we demonstrate with an evolved strain of Escherichia coli that a third, distinct mechanism exists by which gene activities can be lost. This selection-dependent mechanism involves the expropriation of one gene's upstream regulatory element by a second gene via a homologous recombination event. Resulting from this genetic exchange is the activation of the second gene and a concomitant inactivation of the first gene. This gene-for-gene expression tradeoff provides a net fitness gain, even if the forfeited activity of the first gene can play a positive role in fitness under the conditions of selection. PMID:12930738

  18. RNA expression of genes involved in cytarabine metabolism and transport predicts cytarabine response in acute myeloid leukemia.

    PubMed

    Abraham, Ajay; Varatharajan, Savitha; Karathedath, Sreeja; Philip, Chepsy; Lakshmi, Kavitha M; Jayavelu, Ashok Kumar; Mohanan, Ezhilpavai; Janet, Nancy Beryl; Srivastava, Vivi M; Shaji, Ramachandran V; Zhang, Wei; Abraham, Aby; Viswabandya, Auro; George, Biju; Chandy, Mammen; Srivastava, Alok; Mathews, Vikram; Balasubramanian, Poonkuzhali

    2015-07-01

    Variation in terms of outcome and toxic side effects of treatment exists among acute myeloid leukemia (AML) patients on chemotherapy with cytarabine (Ara-C) and daunorubicin (Dnr). Candidate Ara-C metabolizing gene expression in primary AML cells is proposed to account for this variation. Ex vivo Ara-C sensitivity was determined in primary AML samples using MTT assay. mRNA expression of candidate Ara-C metabolizing genes were evaluated by RQPCR analysis. Global gene expression profiling was carried out for identifying differentially expressed genes between exvivo Ara-C sensitive and resistant samples. Wide interindividual variations in ex vivo Ara-C cytotoxicity were observed among samples from patients with AML and were stratified into sensitive, intermediately sensitive and resistant, based on IC50 values obtained by MTT assay. RNA expression of deoxycytidine kinase (DCK), human equilibrative nucleoside transporter-1 (ENT1) and ribonucleotide reductase M1 (RRM1) were significantly higher and cytidine deaminase (CDA) was significantly lower in ex vivo Ara-C sensitive samples. Higher DCK and RRM1 expression in AML patient's blast correlated with better DFS. Ara-C resistance index (RI), a mathematically derived quotient was proposed based on candidate gene expression pattern. Ara-C ex vivo sensitive samples were found to have significantly lower RI compared with resistant as well as samples from patients presenting with relapse. Patients with low RI supposedly highly sensitive to Ara-C were found to have higher incidence of induction death (p = 0.002; RR: 4.35 [95% CI: 1.69-11.22]). Global gene expression profiling undertaken to find out additional contributors of Ara-C resistance identified many apoptosis as well as metabolic pathway genes to be differentially expressed between Ara-C resistant and sensitive samples. This study highlights the importance of evaluating expression of candidate Ara-C metabolizing genes in predicting ex vivo drug response as well as treatment outcome. RI could be a predictor of ex vivo Ara-C response irrespective of cytogenetic and molecular risk groups and a potential biomarker for AML treatment outcome and toxicity. Original submitted 22 December 2014; Revision submitted 9 April 2015.

  19. MX2 Gene Expression Tends to be Downregulated in Subjects with HLA-DQB1*0602

    PubMed Central

    Tanaka, Susumu; Honda, Yutaka; Honda, Makoto

    2008-01-01

    Objective: There is a close association between narcolepsy and the human leukocyte antigen (HLA)-DQB1*0602. The detailed influence and function of this specific HLA allele with regard to narcolepsy have not yet been elucidated. Our previous report identified the myxovirus resistance 2 (MX2) gene as a narcolepsy-specific dysregulated gene; however, the report had a limitation—the control groups were not HLA matched. In this study, we examined the possibility of an association between MX2 expression and HLA haplotypes. Designs: The expression levels of the MX2 gene in 3 groups (24 narcolepsy with cataplexy patients; 24 age-, sex-, and HLA-DQB1 genotype-matched controls; and 24 age- and sex-matched controls without the HLA-DQB1*0602 allele) were measured by quantitative real-time RT-PCR. Results: The expression level of the MX2 gene tended to be downregulated in subjects carrying HLA-DQB1*0602, compared with that of the control subjects without this allele. There was no difference in the MX2 expression level between the narcolepsy subjects and the HLA-DQB1 genotype-matched control subjects. Conclusion: Our previous finding—the narcolepsy-specific reduction of MX2 gene expression—was not replicated in this follow-up study. The expression level of the MX2 gene in white blood cells was found to be lower in subjects with the HLA-DQB1*0602 than in subjects without this allele, suggesting that there exists a relationship between the HLA-DQB1*0602 allele and MX2 gene expression. This might be a possible explanation for the strong HLA association observed in narcolepsy. Citation: Tanaka S; Honda Y; Honda M. MX2 gene expression tends to be downregulated in subjects with HLA-DQB1*0602. SLEEP 2008;31(5):749-751. PMID:18517045

  20. Geometry of the Gene Expression Space of Individual Cells

    PubMed Central

    Korem, Yael; Szekely, Pablo; Hart, Yuval; Sheftel, Hila; Hausser, Jean; Mayo, Avi; Rothenberg, Michael E.; Kalisky, Tomer; Alon, Uri

    2015-01-01

    There is a revolution in the ability to analyze gene expression of single cells in a tissue. To understand this data we must comprehend how cells are distributed in a high-dimensional gene expression space. One open question is whether cell types form discrete clusters or whether gene expression forms a continuum of states. If such a continuum exists, what is its geometry? Recent theory on evolutionary trade-offs suggests that cells that need to perform multiple tasks are arranged in a polygon or polyhedron (line, triangle, tetrahedron and so on, generally called polytopes) in gene expression space, whose vertices are the expression profiles optimal for each task. Here, we analyze single-cell data from human and mouse tissues profiled using a variety of single-cell technologies. We fit the data to shapes with different numbers of vertices, compute their statistical significance, and infer their tasks. We find cases in which single cells fill out a continuum of expression states within a polyhedron. This occurs in intestinal progenitor cells, which fill out a tetrahedron in gene expression space. The four vertices of this tetrahedron are each enriched with genes for a specific task related to stemness and early differentiation. A polyhedral continuum of states is also found in spleen dendritic cells, known to perform multiple immune tasks: cells fill out a tetrahedron whose vertices correspond to key tasks related to maturation, pathogen sensing and communication with lymphocytes. A mixture of continuum-like distributions and discrete clusters is found in other cell types, including bone marrow and differentiated intestinal crypt cells. This approach can be used to understand the geometry and biological tasks of a wide range of single-cell datasets. The present results suggest that the concept of cell type may be expanded. In addition to discreet clusters in gene-expression space, we suggest a new possibility: a continuum of states within a polyhedron, in which the vertices represent specialists at key tasks. PMID:26161936

  1. Ancient human miRNAs are more likely to have broad functions and disease associations than young miRNAs.

    PubMed

    Patel, Vir D; Capra, John A

    2017-08-31

    microRNAs (miRNAs) are essential to the regulation of gene expression in eukaryotes, and improper expression of miRNAs contributes to hundreds of diseases. Despite the essential functions of miRNAs, the evolutionary dynamics of how they are integrated into existing gene regulatory and functional networks is not well understood. Knowledge of the origin and evolutionary history a gene has proven informative about its functions and disease associations; we hypothesize that incorporating the evolutionary origins of miRNAs into analyses will help resolve differences in their functional dynamics and how they influence disease. We computed the phylogenetic age of miRNAs across 146 species and quantified the relationship between human miRNA age and several functional attributes. Older miRNAs are significantly more likely to be associated with disease than younger miRNAs, and the number of associated diseases increases with age. As has been observed for genes, the miRNAs associated with different diseases have different age profiles. For example, human miRNAs implicated in cancer are enriched for origins near the dawn of animal multicellularity. Consistent with the increasing contribution of miRNAs to disease with age, older miRNAs target more genes than younger miRNAs, and older miRNAs are expressed in significantly more tissues. Furthermore, miRNAs of all ages exhibit a strong preference to target older genes; 93% of validated miRNA gene targets were in existence at the origin of the targeting miRNA. Finally, we find that human miRNAs in evolutionarily related families are more similar in their targets and expression profiles than unrelated miRNAs. Considering the evolutionary origin and history of a miRNA provides useful context for the analysis of its function. Consistent with recent work in Drosophila, our results support a model in which miRNAs increase their expression and functional regulatory interactions over evolutionary time, and thus older miRNAs have increased potential to cause disease. We anticipate that these patterns hold across mammalian species; however, comprehensively evaluating them will require refining miRNA annotations across species and collecting functional data in non-human systems.

  2. Knowledge-guided gene prioritization reveals new insights into the mechanisms of chemoresistance.

    PubMed

    Emad, Amin; Cairns, Junmei; Kalari, Krishna R; Wang, Liewei; Sinha, Saurabh

    2017-08-11

    Identification of genes whose basal mRNA expression predicts the sensitivity of tumor cells to cytotoxic treatments can play an important role in individualized cancer medicine. It enables detailed characterization of the mechanism of action of drugs. Furthermore, screening the expression of these genes in the tumor tissue may suggest the best course of chemotherapy or a combination of drugs to overcome drug resistance. We developed a computational method called ProGENI to identify genes most associated with the variation of drug response across different individuals, based on gene expression data. In contrast to existing methods, ProGENI also utilizes prior knowledge of protein-protein and genetic interactions, using random walk techniques. Analysis of two relatively new and large datasets including gene expression data on hundreds of cell lines and their cytotoxic responses to a large compendium of drugs reveals a significant improvement in prediction of drug sensitivity using genes identified by ProGENI compared to other methods. Our siRNA knockdown experiments on ProGENI-identified genes confirmed the role of many new genes in sensitivity to three chemotherapy drugs: cisplatin, docetaxel, and doxorubicin. Based on such experiments and extensive literature survey, we demonstrate that about 73% of our top predicted genes modulate drug response in selected cancer cell lines. In addition, global analysis of genes associated with groups of drugs uncovered pathways of cytotoxic response shared by each group. Our results suggest that knowledge-guided prioritization of genes using ProGENI gives new insight into mechanisms of drug resistance and identifies genes that may be targeted to overcome this phenomenon.

  3. Distinct profiles of expressed sequence tags during intestinal regeneration in the sea cucumber Holothuria glaberrima

    PubMed Central

    Rojas-Cartagena, Carmencita; Ortíz-Pineda, Pablo; Ramírez-Gómez, Francisco; Suárez-Castillo, Edna C.; Matos-Cruz, Vanessa; Rodríguez, Carlos; Ortíz-Zuazaga, Humberto; García-Arrarás, José E.

    2010-01-01

    Repair and regeneration are key processes for tissue maintenance, and their disruption may lead to disease states. Little is known about the molecular mechanisms that underline the repair and regeneration of the digestive tract. The sea cucumber Holothuria glaberrima represents an excellent model to dissect and characterize the molecular events during intestinal regeneration. To study the gene expression profile, cDNA libraries were constructed from normal, 3-day, and 7-day regenerating intestines of H. glaberrima. Clones were randomly sequenced and queried against the nonredundant protein database at the National Center for Biotechnology Information. RT-PCR analyses were made of several genes to determine their expression profile during intestinal regeneration. A total of 5,173 sequences from three cDNA libraries were obtained. About 46.2, 35.6, and 26.2% of the sequences for the normal, 3-days, and 7-days cDNA libraries, respectively, shared significant similarity with known sequences in the protein database of GenBank but only present 10% of similarity among them. Analysis of the libraries in terms of functional processes, protein domains, and most common sequences suggests that a differential expression profile is taking place during the regeneration process. Further examination of the expressed sequence tag dataset revealed that 12 putative genes are differentially expressed at significant level (R > 6). Experimental validation by RT-PCR analysis reveals that at least three genes (unknown C-4677-1, melanotransferrin, and centaurin) present a differential expression during regeneration. These findings strongly suggest that the gene expression profile varies among regeneration stages and provide evidence for the existence of differential gene expression. PMID:17579180

  4. Complete TCRα gene locus control region activity in T cells derived in vitro from embryonic stem cells

    PubMed Central

    Lahiji, Armin; Kučerová-Levisohn, Martina; Lovett, Jordana; Holmes, Roxanne; Zúñiga-Pflücker, Juan Carlos; Ortiz, Benjamin D.

    2013-01-01

    Locus Control Regions (LCR) are cis-acting gene regulatory elements with the unique, integration site-independent ability to transfer the characteristics of their locus-of-origin’s gene expression pattern to a linked transgene in mice. LCR activities have been discovered in numerous T cell lineage expressed gene loci. These elements can be adapted to the design of stem cell gene therapy vectors that direct robust therapeutic gene expression to the T cell progeny of engineered stem cells. Currently, transgenic mice provide the only experimental approach that wholly supports all the critical aspects of LCR activity. Herein we report manifestation of all key features of mouse T cell receptor (TCR)-α gene LCR function in T cells derived in vitro from mouse embryonic stem cells (ESC). High level, copy number-related TCRα LCR-linked reporter gene expression levels are cell type-restricted in this system, and upregulated during the expected stage transition of T cell development. We further report that de novo introduction of TCRα LCR linked transgenes into existing T cell lines yields incomplete LCR activity. Together, these data indicate that establishing full TCRα LCR activity requires critical molecular events occurring prior to final T-lineage determination. This study additionally validates a novel, tractable and more rapid approach for the study of LCR activity in T cells, and its translation to therapeutic genetic engineering. PMID:23720809

  5. DeepSAGE Based Differential Gene Expression Analysis under Cold and Freeze Stress in Seabuckthorn (Hippophae rhamnoides L.)

    PubMed Central

    Chaudhary, Saurabh; Sharma, Prakash C.

    2015-01-01

    Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants. PMID:25803684

  6. DeepSAGE based differential gene expression analysis under cold and freeze stress in seabuckthorn (Hippophae rhamnoides L.).

    PubMed

    Chaudhary, Saurabh; Sharma, Prakash C

    2015-01-01

    Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants.

  7. Integrative DNA methylation and gene expression analysis to assess the universality of the CpG island methylator phenotype.

    PubMed

    Moarii, Matahi; Reyal, Fabien; Vert, Jean-Philippe

    2015-10-13

    The CpG island methylator phenotype (CIMP) was first characterized in colorectal cancer but since has been extensively studied in several other tumor types such as breast, bladder, lung, and gastric. CIMP is of clinical importance as it has been reported to be associated with prognosis or response to treatment. However, the identification of a universal molecular basis to define CIMP across tumors has remained elusive. We perform a genome-wide methylation analysis of over 2000 tumor samples from 5 cancer sites to assess the existence of a CIMP with common molecular basis across cancers. We then show that the CIMP phenotype is associated with specific gene expression variations. However, we do not find a common genetic signature in all tissues associated with CIMP. Our results suggest the existence of a universal epigenetic and transcriptomic signature that defines the CIMP across several tumor types but does not indicate the existence of a common genetic signature of CIMP.

  8. DrImpute: imputing dropout events in single cell RNA sequencing data.

    PubMed

    Gong, Wuming; Kwak, Il-Youp; Pota, Pruthvi; Koyano-Nakagawa, Naoko; Garry, Daniel J

    2018-06-08

    The single cell RNA sequencing (scRNA-seq) technique begin a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature of the gene expression pattern, there is a high chance of missing nonzero entries as zero, which are called dropout events. We develop DrImpute to impute dropout events in scRNA-seq data. We show that DrImpute has significantly better performance on the separation of the dropout zeros from true zeros than existing imputation algorithms. We also demonstrate that DrImpute can significantly improve the performance of existing tools for clustering, visualization and lineage reconstruction of nine published scRNA-seq datasets. DrImpute can serve as a very useful addition to the currently existing statistical tools for single cell RNA-seq analysis. DrImpute is implemented in R and is available at https://github.com/gongx030/DrImpute .

  9. Genomics of Natural Populations: How Differentially Expressed Genes Shape the Evolution of Chromosomal Inversions in Drosophila pseudoobscura

    PubMed Central

    Fuller, Zachary L.; Haynes, Gwilym D.; Richards, Stephen; Schaeffer, Stephen W.

    2016-01-01

    Chromosomal rearrangements can shape the structure of genetic variation in the genome directly through alteration of genes at breakpoints or indirectly by holding combinations of genetic variants together due to reduced recombination. The third chromosome of Drosophila pseudoobscura is a model system to test hypotheses about how rearrangements are established in populations because its third chromosome is polymorphic for >30 gene arrangements that were generated by a series of overlapping inversion mutations. Circumstantial evidence has suggested that these gene arrangements are selected. Despite the expected homogenizing effects of extensive gene flow, the frequencies of arrangements form gradients or clines in nature, which have been stable since the system was first described >80 years ago. Furthermore, multiple arrangements exist at appreciable frequencies across several ecological niches providing the opportunity for heterokaryotypes to form. In this study, we tested whether genes are differentially expressed among chromosome arrangements in first instar larvae, adult females and males. In addition, we asked whether transcriptional patterns in heterokaryotypes are dominant, semidominant, overdominant, or underdominant. We find evidence for a significant abundance of differentially expressed genes across the inverted regions of the third chromosome, including an enrichment of genes involved in sensory perception for males. We find the majority of loci show additivity in heterokaryotypes. Our results suggest that multiple genes have expression differences among arrangements that were either captured by the original inversion mutation or accumulated after it reached polymorphic frequencies, providing a potential source of genetic variation for selection to act upon. These data suggest that the inversions are favored because of their indirect effect of recombination suppression that has held different combinations of differentially expressed genes together in the various gene arrangement backgrounds. PMID:27401754

  10. Functional Analyses of NSF1 in Wine Yeast Using Interconnected Correlation Clustering and Molecular Analyses

    PubMed Central

    Bessonov, Kyrylo; Walkey, Christopher J.; Shelp, Barry J.; van Vuuren, Hennie J. J.; Chiu, David; van der Merwe, George

    2013-01-01

    Analyzing time-course expression data captured in microarray datasets is a complex undertaking as the vast and complex data space is represented by a relatively low number of samples as compared to thousands of available genes. Here, we developed the Interdependent Correlation Clustering (ICC) method to analyze relationships that exist among genes conditioned on the expression of a specific target gene in microarray data. Based on Correlation Clustering, the ICC method analyzes a large set of correlation values related to gene expression profiles extracted from given microarray datasets. ICC can be applied to any microarray dataset and any target gene. We applied this method to microarray data generated from wine fermentations and selected NSF1, which encodes a C2H2 zinc finger-type transcription factor, as the target gene. The validity of the method was verified by accurate identifications of the previously known functional roles of NSF1. In addition, we identified and verified potential new functions for this gene; specifically, NSF1 is a negative regulator for the expression of sulfur metabolism genes, the nuclear localization of Nsf1 protein (Nsf1p) is controlled in a sulfur-dependent manner, and the transcription of NSF1 is regulated by Met4p, an important transcriptional activator of sulfur metabolism genes. The inter-disciplinary approach adopted here highlighted the accuracy and relevancy of the ICC method in mining for novel gene functions using complex microarray datasets with a limited number of samples. PMID:24130853

  11. Massive-Scale Gene Co-Expression Network Construction and Robustness Testing Using Random Matrix Theory

    PubMed Central

    Isaacson, Sven; Luo, Feng; Feltus, Frank A.; Smith, Melissa C.

    2013-01-01

    The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust. PMID:23409071

  12. Defining the limits of physiological plasticity: how gene expression can assess and predict the consequences of ocean change

    PubMed Central

    Evans, Tyler G.; Hofmann, Gretchen E.

    2012-01-01

    Anthropogenic stressors, such as climate change, are driving fundamental shifts in the abiotic characteristics of marine ecosystems. As the environmental aspects of our world's oceans deviate from evolved norms, of major concern is whether extant marine species possess the capacity to cope with such rapid change. In what many scientists consider the post-genomic era, tools that exploit the availability of DNA sequence information are being increasingly recognized as relevant to questions surrounding ocean change and marine conservation. In this review, we highlight the application of high-throughput gene-expression profiling, primarily transcriptomics, to the field of marine conservation physiology. Through the use of case studies, we illustrate how gene expression can be used to standardize metrics of sub-lethal stress, track organism condition in natural environments and bypass phylogenetic barriers that hinder the application of other physiological techniques to conservation. When coupled with fine-scale monitoring of environmental variables, gene-expression profiling provides a powerful approach to conservation capable of informing diverse issues related to ocean change, from coral bleaching to the spread of invasive species. Integrating novel approaches capable of improving existing conservation strategies, including gene-expression profiling, will be critical to ensuring the ecological and economic health of the global ocean. PMID:22566679

  13. Comprehensive gene expression analysis of rice aleurone cells: probing the existence of an alternative gibberellin receptor.

    PubMed

    Yano, Kenji; Aya, Koichiro; Hirano, Ko; Ordonio, Reynante Lacsamana; Ueguchi-Tanaka, Miyako; Matsuoka, Makoto

    2015-02-01

    Current gibberellin (GA) research indicates that GA must be perceived in plant nuclei by its cognate receptor, GIBBERELLIN INSENSITIVE DWARF1 (GID1). Recognition of GA by GID1 relieves the repression mediated by the DELLA protein, a model known as the GID1-DELLA GA perception system. There have been reports of potential GA-binding proteins in the plasma membrane that perceive GA and induce α-amylase expression in cereal aleurone cells, which is mechanistically different from the GID1-DELLA system. Therefore, we examined the expression of the rice (Oryza sativa) α-amylase genes in rice mutants impaired in the GA receptor (gid1) and the DELLA repressor (slender rice1; slr1) and confirmed their lack of response to GA in gid1 mutants and constitutive expression in slr1 mutants. We also examined the expression of GA-regulated genes by genome-wide microarray and quantitative reverse transcription-polymerase chain reaction analyses and confirmed that all GA-regulated genes are modulated by the GID1-DELLA system. Furthermore, we studied the regulatory network involved in GA signaling by using a set of mutants defective in genes involved in GA perception and gene expression, namely gid1, slr1, gid2 (a GA-related F-box protein mutant), and gamyb (a GA-related trans-acting factor mutant). Almost all GA up-regulated genes were regulated by the four named GA-signaling components. On the other hand, GA down-regulated genes showed different expression patterns with respect to GID2 and GAMYB (e.g. a considerable number of genes are not controlled by GAMYB or GID2 and GAMYB). Based on these observations, we present a comprehensive discussion of the intricate network of GA-regulated genes in rice aleurone cells. © 2015 American Society of Plant Biologists. All Rights Reserved.

  14. Gene Expression (mRNA) Markers for Differentiating between Malignant and Benign Follicular Thyroid Tumours

    PubMed Central

    Wojtas, Bartosz; Pfeifer, Aleksandra; Oczko-Wojciechowska, Malgorzata; Krajewska, Jolanta; Czarniecka, Agnieszka; Kukulska, Aleksandra; Eszlinger, Markus; Musholt, Thomas; Stokowy, Tomasz; Swierniak, Michal; Stobiecka, Ewa; Chmielik, Ewa; Rusinek, Dagmara; Tyszkiewicz, Tomasz; Halczok, Monika; Hauptmann, Steffen; Lange, Dariusz; Jarzab, Michal; Paschke, Ralf; Jarzab, Barbara

    2017-01-01

    Distinguishing between follicular thyroid cancer (FTC) and follicular thyroid adenoma (FTA) constitutes a long-standing diagnostic problem resulting in equivocal histopathological diagnoses. There is therefore a need for additional molecular markers. To identify molecular differences between FTC and FTA, we analyzed the gene expression microarray data of 52 follicular neoplasms. We also performed a meta-analysis involving 14 studies employing high throughput methods (365 follicular neoplasms analyzed). Based on these two analyses, we selected 18 genes differentially expressed between FTA and FTC. We validated them by quantitative real-time polymerase chain reaction (qRT-PCR) in an independent set of 71 follicular neoplasms from formaldehyde-fixed paraffin embedded (FFPE) tissue material. We confirmed differential expression for 7 genes (CPQ, PLVAP, TFF3, ACVRL1, ZFYVE21, FAM189A2, and CLEC3B). Finally, we created a classifier that distinguished between FTC and FTA with an accuracy of 78%, sensitivity of 76%, and specificity of 80%, based on the expression of 4 genes (CPQ, PLVAP, TFF3, ACVRL1). In our study, we have demonstrated that meta-analysis is a valuable method for selecting possible molecular markers. Based on our results, we conclude that there might exist a plausible limit of gene classifier accuracy of approximately 80%, when follicular tumors are discriminated based on formalin-fixed postoperative material. PMID:28574441

  15. Gene Expression (mRNA) Markers for Differentiating between Malignant and Benign Follicular Thyroid Tumours.

    PubMed

    Wojtas, Bartosz; Pfeifer, Aleksandra; Oczko-Wojciechowska, Malgorzata; Krajewska, Jolanta; Czarniecka, Agnieszka; Kukulska, Aleksandra; Eszlinger, Markus; Musholt, Thomas; Stokowy, Tomasz; Swierniak, Michal; Stobiecka, Ewa; Chmielik, Ewa; Rusinek, Dagmara; Tyszkiewicz, Tomasz; Halczok, Monika; Hauptmann, Steffen; Lange, Dariusz; Jarzab, Michal; Paschke, Ralf; Jarzab, Barbara

    2017-06-02

    Distinguishing between follicular thyroid cancer (FTC) and follicular thyroid adenoma (FTA) constitutes a long-standing diagnostic problem resulting in equivocal histopathological diagnoses. There is therefore a need for additional molecular markers. To identify molecular differences between FTC and FTA, we analyzed the gene expression microarray data of 52 follicular neoplasms. We also performed a meta-analysis involving 14 studies employing high throughput methods (365 follicular neoplasms analyzed). Based on these two analyses, we selected 18 genes differentially expressed between FTA and FTC. We validated them by quantitative real-time polymerase chain reaction (qRT-PCR) in an independent set of 71 follicular neoplasms from formaldehyde-fixed paraffin embedded (FFPE) tissue material. We confirmed differential expression for 7 genes ( CPQ , PLVAP , TFF3 , ACVRL1 , ZFYVE21 , FAM189A2 , and CLEC3B ). Finally, we created a classifier that distinguished between FTC and FTA with an accuracy of 78%, sensitivity of 76%, and specificity of 80%, based on the expression of 4 genes ( CPQ , PLVAP , TFF3 , ACVRL1 ). In our study, we have demonstrated that meta-analysis is a valuable method for selecting possible molecular markers. Based on our results, we conclude that there might exist a plausible limit of gene classifier accuracy of approximately 80%, when follicular tumors are discriminated based on formalin-fixed postoperative material.

  16. Genome-wide screening of indicator genes for assessing the potential carcinogenic risk of Nanjing city drinking water.

    PubMed

    Zhang, Rui; Cheng, Shupei; Li, Aimin; Sun, Jie; Zhang, Yan; Zhang, Xuxiang

    2011-07-01

    Effects of all pollutants existing in the Nanjing city drinking water (DWNC) on mouse gene transcription levels were measured to assess the DWNC carcinogenic risks and to identify candidate indicator genes for assessing and early warning the cancer risks. Transcriptional expression levels of 14,000 hepatic genes for the treatment group mice (Mus musculus, ICR) fed with DWNC for 90 days were detected using the GeneChip(®) Mouse Genome 430A 2.0 array. The analysis indicated that the transcriptional levels of 294 genes were up-regulated and 542 ones were down-regulated. Of these genes, 12 ones identified to be involved in at least five different types of cancers were further analyzed. An interrogation by Kyoto Encyclopedia of Genes and Genomes (KEGG) revealed that three (including ITGAV, CCND1 and SMAD2) of the 12 genes were mapped to pathway in cancer. Gene Ontology (GO) function annotation also showed that they were associated with the functional categories of cell cycle regulation, adhesion, apoptosis, signal transduction and so on which are closely implicated in tumorigenesis and progression. The correlations between the aberrant expressions of them and the genesis and progression of cancers have been further documented by a number of scientific researches. These results might demonstrate that the potential toxicity and carcinogenic risks were associated with DWNC. Moreover, ITGAV, CCND1 and SMAD2 were identified as the most likely candidate indicator genes for the assessment of the combined carcinogenic risk of all pollutants existing in DWNC.

  17. Salivary Immunoglobulin Gene Expression in Patients with Caries

    PubMed Central

    Santín, Gema Regina Guadarrama; Salgado, Angel Visoso; Bastida, Norma Margarita Montiel; Gómez, Isaías de la Rosa; Benítez, Jonnathan Guadalupe Santillán; Zerón, Hugo Mendieta

    2017-01-01

    BACKGROUND: Immunoglobulins mediate the host’s humoral immune response are expressed in saliva. AIM: To quantify the FcαR, FcγRIIB, and FcαμR gene expression in the saliva of Mexican patients with caries in mixed and permanent dentition. SUBJECTS AND METHODS: This was a comparative cross-sectional study. mRNA was isolated from 200 μL of saliva following the RNA III Tissue Fresh-frozen protocol of the MagNA Pure LC Instrument 2.0 (Roche Diagnostics GmbH, Nederland BV) and the FcαR, FcαμR and FcγRIIB were quantified through TaqMan Assays. RESULTS: One hundred individuals, 50 with mixed dentition and 50 with permanent dentition, were included in the study. Statistically, it was found a significant difference (p = 0.025) in the IgG (FcγRIIB) expression between the studied groups. CONCLUSION: Although we confirmed the existence of FcαR, FcγRIIB and FcαμR gene expression in saliva, only a significant difference in the expression of FcγRIIB between the mixed dentition and permanent dentition was found. PMID:28507635

  18. Human growth is associated with distinct patterns of gene expression in evolutionarily conserved networks

    PubMed Central

    2013-01-01

    Background A co-ordinated tissue-independent gene expression profile associated with growth is present in rodent models and this is hypothesised to extend to all mammals. Growth in humans has similarities to other mammals but the return to active long bone growth in the pubertal growth spurt is a distinctly human growth event. The aim of this study was to describe gene expression and biological pathways associated with stages of growth in children and to assess tissue-independent expression patterns in relation to human growth. Results We conducted gene expression analysis on a library of datasets from normal children with age annotation, collated from the NCBI Gene Expression Omnibus (GEO) and EBI Arrayexpress databases. A primary data set was generated using cells of lymphoid origin from normal children; the expression of 688 genes (ANOVA false discovery rate modified p-value, q < 0.1) was associated with age, and subsets of these genes formed clusters that correlated with the phases of growth – infancy, childhood, puberty and final height. Network analysis on these clusters identified evolutionarily conserved growth pathways (NOTCH, VEGF, TGFB, WNT and glucocorticoid receptor – Hyper-geometric test, q < 0.05). The greatest degree of network ‘connectivity’ and hence functional significance was present in infancy (Wilcoxon test, p < 0.05), which then decreased through to adulthood. These observations were confirmed in a separate validation data set from lymphoid tissue. Similar biological pathways were observed to be associated with development-related gene expression in other tissues (conjunctival epithelia, temporal lobe brain tissue and bone marrow) suggesting the existence of a tissue-independent genetic program for human growth and maturation. Conclusions Similar evolutionarily conserved pathways have been associated with gene expression and child growth in multiple tissues. These expression profiles associate with the developmental phases of growth including the return to active long bone growth in puberty, a distinctly human event. These observations also have direct medical relevance to pathological changes that induce disease in children. Taking into account development-dependent gene expression profiles for normal children will be key to the appropriate selection of genes and pathways as potential biomarkers of disease or as drug targets. PMID:23941278

  19. DEXTER: Disease-Expression Relation Extraction from Text.

    PubMed

    Gupta, Samir; Dingerdissen, Hayley; Ross, Karen E; Hu, Yu; Wu, Cathy H; Mazumder, Raja; Vijay-Shanker, K

    2018-01-01

    Gene expression levels affect biological processes and play a key role in many diseases. Characterizing expression profiles is useful for clinical research, and diagnostics and prognostics of diseases. There are currently several high-quality databases that capture gene expression information, obtained mostly from large-scale studies, such as microarray and next-generation sequencing technologies, in the context of disease. The scientific literature is another rich source of information on gene expression-disease relationships that not only have been captured from large-scale studies but have also been observed in thousands of small-scale studies. Expression information obtained from literature through manual curation can extend expression databases. While many of the existing databases include information from literature, they are limited by the time-consuming nature of manual curation and have difficulty keeping up with the explosion of publications in the biomedical field. In this work, we describe an automated text-mining tool, Disease-Expression Relation Extraction from Text (DEXTER) to extract information from literature on gene and microRNA expression in the context of disease. One of the motivations in developing DEXTER was to extend the BioXpress database, a cancer-focused gene expression database that includes data derived from large-scale experiments and manual curation of publications. The literature-based portion of BioXpress lags behind significantly compared to expression information obtained from large-scale studies and can benefit from our text-mined results. We have conducted two different evaluations to measure the accuracy of our text-mining tool and achieved average F-scores of 88.51 and 81.81% for the two evaluations, respectively. Also, to demonstrate the ability to extract rich expression information in different disease-related scenarios, we used DEXTER to extract information on differential expression information for 2024 genes in lung cancer, 115 glycosyltransferases in 62 cancers and 826 microRNA in 171 cancers. All extractions using DEXTER are integrated in the literature-based portion of BioXpress.Database URL: http://biotm.cis.udel.edu/DEXTER.

  20. Comparison of alternative approaches for analysing multi-level RNA-seq data

    PubMed Central

    Mohorianu, Irina; Bretman, Amanda; Smith, Damian T.; Fowler, Emily K.; Dalmay, Tamas

    2017-01-01

    RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigorous quantification of genes/transcripts to allow a reliable prediction of differential expression (DE), despite variation in levels of noise and inherent biases in sequencing data. This can be especially challenging for datasets in which gene expression differences are subtle, as in the behavioural transcriptomics test dataset from D. melanogaster that we used here. We investigated the power of existing approaches for quality checking mRNA-seq data and explored additional, quantitative quality checks. To accommodate nested, multi-level experimental designs, we incorporated sample layout into our analyses. We employed a subsampling without replacement-based normalization and an identification of DE that accounted for the hierarchy and amplitude of effect sizes within samples, then evaluated the resulting differential expression call in comparison to existing approaches. In a final step to test for broader applicability, we applied our approaches to a published set of H. sapiens mRNA-seq samples, The dataset-tailored methods improved sample comparability and delivered a robust prediction of subtle gene expression changes. The proposed approaches have the potential to improve key steps in the analysis of RNA-seq data by incorporating the structure and characteristics of biological experiments. PMID:28792517

  1. Cat odor exposure induces distinct changes in the exploratory behavior and Wfs1 gene expression in C57Bl/6 and 129Sv mice.

    PubMed

    Raud, Sirli; Sütt, Silva; Plaas, Mario; Luuk, Hendrik; Innos, Jürgen; Philips, Mari-Anne; Kõks, Sulev; Vasar, Eero

    2007-10-16

    129Sv and C57Bl/6 (Bl6) strains are two most widely used inbred mice strains for generation of transgenic animals. The present study confirms the existence of substantial differences in the behavior of these two mice strains. The exploratory behavior of Bl6 mice in a novel environment was significantly higher compared to 129Sv mice. The exposure of mice to cat odor-induced an anxiety-like state in Bl6, but not in 129Sv mice. The levels of Wfs1 gene expression did not differ in the prefrontal cortex, mesolimbic area and temporal lobe of experimentally naive Bl6 and 129Sv mice. However, after cat odor exposure the expression of Wfs1 gene was significantly lower in the mesolimbic area and temporal lobe of Bl6 mice compared to 129Sv strain. Dynamics of Wfs1 gene expression and exploratory behavior suggest that the down-regulation of Wfs1 gene in Bl6 mice might be related to the increased anxiety. Further studies are needed to test the robustness and possible causal relationship of this finding.

  2. Kinetics of nif Gene Expression in a Nitrogen-Fixing Bacterium

    PubMed Central

    Poza-Carrión, César; Jiménez-Vicente, Emilio; Navarro-Rodríguez, Mónica; Echavarri-Erasun, Carlos

    2014-01-01

    Nitrogen fixation is a tightly regulated trait. Switching from N2 fixation-repressing conditions to the N2-fixing state is carefully controlled in diazotrophic bacteria mainly because of the high energy demand that it imposes. By using quantitative real-time PCR and quantitative immunoblotting, we show here how nitrogen fixation (nif) gene expression develops in Azotobacter vinelandii upon derepression. Transient expression of the transcriptional activator-encoding gene, nifA, was followed by subsequent, longer-duration waves of expression of the nitrogenase biosynthetic and structural genes. Importantly, expression timing, expression levels, and NifA dependence varied greatly among the nif operons. Moreover, the exact concentrations of Nif proteins and their changes over time were determined for the first time. Nif protein concentrations were exquisitely balanced, with FeMo cofactor biosynthetic proteins accumulating at levels 50- to 100-fold lower than those of the structural proteins. Mutants lacking nitrogenase structural genes or impaired in FeMo cofactor biosynthesis showed overenhanced responses to derepression that were proportional to the degree of nitrogenase activity impairment, consistent with the existence of at least two negative-feedback regulatory mechanisms. The first such mechanism responded to the levels of fixed nitrogen, whereas the second mechanism appeared to respond to the levels of the mature NifDK component. Altogether, these findings provide a framework to engineer N2 fixation in nondiazotrophs. PMID:24244007

  3. Kinetics of Nif gene expression in a nitrogen-fixing bacterium.

    PubMed

    Poza-Carrión, César; Jiménez-Vicente, Emilio; Navarro-Rodríguez, Mónica; Echavarri-Erasun, Carlos; Rubio, Luis M

    2014-02-01

    Nitrogen fixation is a tightly regulated trait. Switching from N2 fixation-repressing conditions to the N2-fixing state is carefully controlled in diazotrophic bacteria mainly because of the high energy demand that it imposes. By using quantitative real-time PCR and quantitative immunoblotting, we show here how nitrogen fixation (nif) gene expression develops in Azotobacter vinelandii upon derepression. Transient expression of the transcriptional activator-encoding gene, nifA, was followed by subsequent, longer-duration waves of expression of the nitrogenase biosynthetic and structural genes. Importantly, expression timing, expression levels, and NifA dependence varied greatly among the nif operons. Moreover, the exact concentrations of Nif proteins and their changes over time were determined for the first time. Nif protein concentrations were exquisitely balanced, with FeMo cofactor biosynthetic proteins accumulating at levels 50- to 100-fold lower than those of the structural proteins. Mutants lacking nitrogenase structural genes or impaired in FeMo cofactor biosynthesis showed overenhanced responses to derepression that were proportional to the degree of nitrogenase activity impairment, consistent with the existence of at least two negative-feedback regulatory mechanisms. The first such mechanism responded to the levels of fixed nitrogen, whereas the second mechanism appeared to respond to the levels of the mature NifDK component. Altogether, these findings provide a framework to engineer N2 fixation in nondiazotrophs.

  4. Transcript expression profiling for adventitious roots of Panax ginseng Meyer.

    PubMed

    Subramaniyam, Sathiyamoorthy; Mathiyalagan, Ramya; Natarajan, Sathishkumar; Kim, Yu-Jin; Jang, Moon-Gi; Park, Jun-Hyung; Yang, Deok Chun

    2014-08-01

    Panax ginseng Meyer is one of the major medicinal plants in oriental countries belonging to the Araliaceae family which are the primary source for ginsenosides. However, very few genes were characterized for ginsenoside pathway, due to the limited genome information. Through this study, we obtained a comprehensive transcriptome from adventitious roots, which were treated with methyl jasmonic acids for different time points (control, 2h, 6h, 12h, and 24h) and sequenced by RNA 454 pyrosequencing technology. Reference transcriptome 39,304,529 (0.04GB) was obtained from 5,724,987,880 bases (5.7GB) of 22 libraries by de novo assembly and 35,266 (58.5%) transcripts were annotated with biological schemas (GO and KEGG). The digital gene expression patterns were obtained from in vitro grown adventitious root sequences which mapped to reference, from that, 3813 (6.3%) unique transcripts were involved in ≥2 fold up and downregulations. Finally, candidates for ginsenoside pathway genes were predicted from observed expression patterns. Among them, 30 transcription factors, 20 cytochromes, and 11 glycosyl transferases were predicted as ginsenoside candidates. These data can remarkably expand the existing transcriptome resources of Panax, especially to predict existence of gene networks in P. ginseng. The entity of the data provides a valuable platform to reveal more on secondary metabolism and abiotic stresses from P. ginseng in vitro grown adventitious roots. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. A Gene Expression Signature Associated with Overall Survival in Patients with Hepatocellular Carcinoma Suggests a New Treatment Strategy.

    PubMed

    Gillet, Jean-Pierre; Andersen, Jesper B; Madigan, James P; Varma, Sudhir; Bagni, Rachel K; Powell, Katie; Burgan, William E; Wu, Chung-Pu; Calcagno, Anna Maria; Ambudkar, Suresh V; Thorgeirsson, Snorri S; Gottesman, Michael M

    2016-02-01

    Despite improvements in the management of liver cancer, the survival rate for patients with hepatocellular carcinoma (HCC) remains dismal. The survival benefit of systemic chemotherapy for the treatment of liver cancer is only marginal. Although the reasons for treatment failure are multifactorial, intrinsic resistance to chemotherapy plays a primary role. Here, we analyzed the expression of 377 multidrug resistance (MDR)-associated genes in two independent cohorts of patients with advanced HCC, with the aim of finding ways to improve survival in this poor-prognosis cancer. Taqman-based quantitative polymerase chain reaction revealed a 45-gene signature that predicts overall survival (OS) in patients with HCC. Using the Connectivity Map Tool, we were able to identify drugs that converted the gene expression profiles of HCC cell lines from ones matching patients with poor OS to profiles associated with good OS. We found three compounds that convert the gene expression profiles of three HCC cell lines to gene expression profiles associated with good OS. These compounds increase histone acetylation, which correlates with the synergistic sensitization of those MDR tumor cells to conventional chemotherapeutic agents, including cisplatin, sorafenib, and 5-fluorouracil. Our results indicate that it is possible to modulate gene expression profiles in HCC cell lines to those associated with better outcome. This approach also increases sensitization of HCC cells toward conventional chemotherapeutic agents. This work suggests new treatment strategies for a disease for which few therapeutic options exist. U.S. Government work not protected by U.S. copyright.

  6. Lex-SVM: exploring the potential of exon expression profiling for disease classification.

    PubMed

    Yuan, Xiongying; Zhao, Yi; Liu, Changning; Bu, Dongbo

    2011-04-01

    Exon expression profiling technologies, including exon arrays and RNA-Seq, measure the abundance of every exon in a gene. Compared with gene expression profiling technologies like 3' array, exon expression profiling technologies could detect alterations in both transcription and alternative splicing, therefore they are expected to be more sensitive in diagnosis. However, exon expression profiling also brings higher dimension, more redundancy, and significant correlation among features. Ignoring the correlation structure among exons of a gene, a popular classification method like L1-SVM selects exons individually from each gene and thus is vulnerable to noise. To overcome this limitation, we present in this paper a new variant of SVM named Lex-SVM to incorporate correlation structure among exons and known splicing patterns to promote classification performance. Specifically, we construct a new norm, ex-norm, including our prior knowledge on exon correlation structure to regularize the coefficients of a linear SVM. Lex-SVM can be solved efficiently using standard linear programming techniques. The advantage of Lex-SVM is that it can select features group-wisely, force features in a subgroup to take equal weihts and exclude the features that contradict the majority in the subgroup. Experimental results suggest that on exon expression profile, Lex-SVM is more accurate than existing methods. Lex-SVM also generates a more compact model and selects genes more consistently in cross-validation. Unlike L1-SVM selecting only one exon in a gene, Lex-SVM assigns equal weights to as many exons in a gene as possible, lending itself easier for further interpretation.

  7. Analysis of gene expression in a developmental context emphasizes distinct biological leitmotifs in human cancers

    PubMed Central

    Naxerova, Kamila; Bult, Carol J; Peaston, Anne; Fancher, Karen; Knowles, Barbara B; Kasif, Simon; Kohane, Isaac S

    2008-01-01

    Background In recent years, the molecular underpinnings of the long-observed resemblance between neoplastic and immature tissue have begun to emerge. Genome-wide transcriptional profiling has revealed similar gene expression signatures in several tumor types and early developmental stages of their tissue of origin. However, it remains unclear whether such a relationship is a universal feature of malignancy, whether heterogeneities exist in the developmental component of different tumor types and to which degree the resemblance between cancer and development is a tissue-specific phenomenon. Results We defined a developmental landscape by summarizing the main features of ten developmental time courses and projected gene expression from a variety of human tumor types onto this landscape. This comparison demonstrates a clear imprint of developmental gene expression in a wide range of tumors and with respect to different, even non-cognate developmental backgrounds. Our analysis reveals three classes of cancers with developmentally distinct transcriptional patterns. We characterize the biological processes dominating these classes and validate the class distinction with respect to a new time series of murine embryonic lung development. Finally, we identify a set of genes that are upregulated in most cancers and we show that this signature is active in early development. Conclusion This systematic and quantitative overview of the relationship between the neoplastic and developmental transcriptome spanning dozens of tissues provides a reliable outline of global trends in cancer gene expression, reveals potentially clinically relevant differences in the gene expression of different cancer types and represents a reference framework for interpretation of smaller-scale functional studies. PMID:18611264

  8. The plant energy-dissipating mitochondrial systems: depicting the genomic structure and the expression profiles of the gene families of uncoupling protein and alternative oxidase in monocots and dicots.

    PubMed

    Borecky, Jirí; Nogueira, Fábio T S; de Oliveira, Kívia A P; Maia, Ivan G; Vercesi, Aníbal E; Arruda, Paulo

    2006-01-01

    The simultaneous existence of alternative oxidases and uncoupling proteins in plants has raised the question as to why plants need two energy-dissipating systems with apparently similar physiological functions. A probably complete plant uncoupling protein gene family is described and the expression profiles of this family compared with the multigene family of alternative oxidases in Arabidopsis thaliana and sugarcane (Saccharum sp.) employed as dicot and monocot models, respectively. In total, six uncoupling protein genes, AtPUMP1-6, were recognized within the Arabidopsis genome and five (SsPUMP1-5) in a sugarcane EST database. The recombinant AtPUMP5 protein displayed similar biochemical properties as AtPUMP1. Sugarcane possessed four Arabidopsis AOx1-type orthologues (SsAOx1a-1d); no sugarcane orthologue corresponding to Arabidopsis AOx2-type genes was identified. Phylogenetic and expression analyses suggested that AtAOx1d does not belong to the AOx1-type family but forms a new (AOx3-type) family. Tissue-enriched expression profiling revealed that uncoupling protein genes were expressed more ubiquitously than the alternative oxidase genes. Distinct expression patterns among gene family members were observed between monocots and dicots and during chilling stress. These findings suggest that the members of each energy-dissipating system are subject to different cell or tissue/organ transcriptional regulation. As a result, plants may respond more flexibly to adverse biotic and abiotic conditions, in which oxidative stress is involved.

  9. Identifying Stress Transcription Factors Using Gene Expression and TF-Gene Association Data

    PubMed Central

    Wu, Wei-Sheng; Chen, Bor-Sen

    2007-01-01

    Unicellular organisms such as yeasts have evolved to survive environmental stresses by rapidly reorganizing the genomic expression program to meet the challenges of harsh environments. The complex adaptation mechanisms to stress remain to be elucidated. In this study, we developed Stress Transcription Factor Identification Algorithm (STFIA), which integrates gene expression and TF-gene association data to identify the stress transcription factors (TFs) of six kinds of stresses. We identified some general stress TFs that are in response to various stresses, and some specific stress TFs that are in response to one specific stress. The biological significance of our findings is validated by the literature. We found that a small number of TFs may be sufficient to control a wide variety of expression patterns in yeast under different stresses. Two implications can be inferred from this observation. First, the adaptation mechanisms to different stresses may have a bow-tie structure. Second, there may exist extensive regulatory cross-talk among different stress responses. In conclusion, this study proposes a network of the regulators of stress responses and their mechanism of action. PMID:20066130

  10. Supervised group Lasso with applications to microarray data analysis

    PubMed Central

    Ma, Shuangge; Song, Xiao; Huang, Jian

    2007-01-01

    Background A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods. PMID:17316436

  11. Increased asthma and adipose tissue inflammatory gene expression with obesity and Inuit migration to a western country.

    PubMed

    Backer, Vibeke; Baines, Katherine J; Powell, Heather; Porsbjerg, Celeste; Gibson, Peter G

    2016-02-01

    An overlap between obesity and asthma exists, and inflammatory cells in adipose tissue could drive the development of asthma. Comparison of adipose tissue gene expression among Inuit living in Greenland to those in Denmark provides an opportunity to assess how changes in adipose tissue inflammation can be modified by migration and diet. To examine mast cell and inflammatory markers in adipose tissue and the association with asthma. Two Inuit populations were recruited, one living in Greenland and another in Denmark. All underwent adipose subcutaneous biopsy, followed by clinical assessment of asthma, and measurement of AHR. Adipose tissue biopsies were homogenised, RNA extracted, and PCR was performed to determine the relative gene expression of mast cell (tryptase, chymase, CPA3) and inflammatory markers (IL-6, IL-1β, and CD163). Of the 1059 Greenlandic Inuit participants, 556 were living in Greenland and 6.4% had asthma. Asthma was increased in Denmark (9%) compared to Greenland (3.6%, p < 0.0001) and associated with increased adipose tissue IL-6 gene expression and increased BMI. There was no association between asthma and adipose tissue mast cell gene expression. Pro-inflammatory gene expression (IL-6, IL-1β) was higher in those living in Denmark, and with increasing BMI and dietary changes. The anti-inflammatory (M2) macrophage marker, CD163, was higher in Greenland-dwelling Inuit (p < 0.01). No association was found between gene expression of mast cell markers in adipose tissue and asthma. Among Greenlandic Inuit, adipose tissue inflammation is also increased in those who migrate to Denmark, possibly as a result of dietary changes. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Shoot to root communication is necessary to control the expression of iron-acquisition genes in Strategy I plants.

    PubMed

    García, María J; Romera, Francisco J; Stacey, Minviluz G; Stacey, Gary; Villar, Eduardo; Alcántara, Esteban; Pérez-Vicente, Rafael

    2013-01-01

    Previous research showed that auxin, ethylene, and nitric oxide (NO) can activate the expression of iron (Fe)-acquisition genes in the roots of Strategy I plants grown with low levels of Fe, but not in plants grown with high levels of Fe. However, it is still an open question as to how Fe acts as an inhibitor and which pool of Fe (e.g., root, phloem, etc.) in the plant acts as the key regulator for gene expression control. To further clarify this, we studied the effect of the foliar application of Fe on the expression of Fe-acquisition genes in several Strategy I plants, including wild-type cultivars of Arabidopsis [Arabidopsis thaliana (L.) Heynh], pea [Pisum sativum L.], tomato [Solanum lycopersicon Mill.], and cucumber [Cucumis sativus L.], as well as mutants showing constitutive expression of Fe-acquisition genes when grown under Fe-sufficient conditions [Arabidopsis opt3-2 and frd3-3, pea dgl and brz, and tomato chln (chloronerva)]. The results showed that the foliar application of Fe blocked the expression of Fe-acquisition genes in the wild-type cultivars and in the frd3-3, brz, and chln mutants, but not in the opt3-2 and dgl mutants, probably affected in the transport of a Fe-related repressive signal in the phloem. Moreover, the addition of either ACC (ethylene precursor) or GSNO (NO donor) to Fe-deficient plants up-regulated the expression of Fe-acquisition genes, but this effect did not occur in Fe-deficient plants sprayed with foliar Fe, again suggesting the existence of a Fe-related repressive signal moving from leaves to roots.

  13. The aquaglyceroporin AQP9 contributes to the sex-specific effects of in utero arsenic exposure on placental gene expression.

    PubMed

    Winterbottom, Emily F; Koestler, Devin C; Fei, Dennis Liang; Wika, Eric; Capobianco, Anthony J; Marsit, Carmen J; Karagas, Margaret R; Robbins, David J

    2017-06-14

    Sex-specific factors play a major role in human health and disease, including responses to environmental stresses such as toxicant exposure. Increasing evidence suggests that such sex differences also exist during fetal development. In a previous report using the resources of the New Hampshire Birth Cohort Study (NHBCS), we found that low-to-moderate in utero exposure to arsenic, a highly toxic and widespread pollutant, was associated with altered expression of several key developmental genes in the fetal portion of the placenta. These associations were sex-dependent, suggesting that in utero arsenic exposure differentially impacts male and female fetuses. In the present study, we investigated the molecular basis for these sex-specific responses to arsenic. Using NanoString technology, we further analyzed the fetal placenta samples from the NHBCS for the expression of genes encoding arsenic transporters and metabolic enzymes. Multivariable linear regression analysis was used to examine their relationship with arsenic exposure and with key developmental genes, after stratification by fetal sex. We found that maternal arsenic exposure was strongly associated with expression of the AQP9 gene, encoding an aquaglyceroporin transporter, in female but not male fetal placenta. Moreover, AQP9 expression associated with that of a subset of female-specific arsenic-responsive genes. Our results suggest that AQP9 is upregulated in response to arsenic exposure in female, but not male, fetal placenta. Based on these results and prior studies, increased AQP9 expression may lead to increased arsenic transport in the female fetal placenta, which in turn may alter the expression patterns of key developmental genes that we have previously shown to be associated with arsenic exposure. Thus, this study suggests that AQP9 may play a role in the sex-specific effects of in utero arsenic exposure.

  14. The anti-Müllerian hormone (AMH) induces forkhead box L2 (FOXL2) expression in primary culture of human granulosa cells in vitro.

    PubMed

    Sacchi, Sandro; Marinaro, Federica; Xella, Susanna; Marsella, Tiziana; Tagliasacchi, Daniela; La Marca, Antonio

    2017-09-01

    Anti-Müllerian hormone (AMH) and forkhead box L2 (FOXL2) are two pivotal genes expressed in human granulosa cells (hGCs) where both genes share similar inhibitory functions on activation and follicular growth in order to preserve the ovarian follicle reserve. Furthermore, AMH and FOXL2 contribute to inhibit steroidogenesis, decreasing or preventing the activation of gonadotrophin-dependent aromatase CYP19A1 cytochrome P450 family 19 subfamily A member 1 (CYP19A1). The purpose of this study is to evaluate the role of AMH in regulating the expression of FOXL2. Primary cultures of hGCs were treated with increasing concentrations of recombinant human AMH (rhAMH; range 10-100 ng/ml) for 3 h. Negative controls were performed using corresponding amounts of AMH vehicle. Total RNA or proteins were purified and quantified by spectrophotometry. FOXL2 and CYP19A1 gene expression, normalized by reference gene ribosomal protein S7 (RpS7), was evaluated by RT-qPCR. Each reaction was repeated in triplicate. Statistical analysis was performed. Extracted proteins were analyzed by immunoblot using anti-FOXL2 and anti-β-actin as primary antibodies. rhAMH treatments tested did not modulate the basal expression of aromatase CYP19A1 gene. rhAMH (50 ng/ml) was able to increase FOXL2 gene expression and its intracellular content. This study demonstrated the existence of an AMH-FOXL2 relationship in hGCs. AMH is capable of increasing both gene and protein expression of FOXL2. Because FOXL2 induces AMH transcription, these ovarian factors could be finely regulated by a positive feedback loop mechanism to preserve the ovarian follicle reserve.

  15. Gene expression profiling in multiple myeloma--reporting of entities, risk, and targets in clinical routine.

    PubMed

    Meissner, Tobias; Seckinger, Anja; Rème, Thierry; Hielscher, Thomas; Möhler, Thomas; Neben, Kai; Goldschmidt, Hartmut; Klein, Bernard; Hose, Dirk

    2011-12-01

    Multiple myeloma is an incurable malignant plasma cell disease characterized by survival ranging from several months to more than 15 years. Assessment of risk and underlying molecular heterogeneity can be excellently done by gene expression profiling (GEP), but its way into clinical routine is hampered by the lack of an appropriate reporting tool and the integration with other prognostic factors into a single "meta" risk stratification. The GEP-report (GEP-R) was built as an open-source software developed in R for gene expression reporting in clinical practice using Affymetrix microarrays. GEP-R processes new samples by applying a documentation-by-value strategy to the raw data to be able to assign thresholds and grouping algorithms defined on a reference cohort of 262 patients with multiple myeloma. Furthermore, we integrated expression-based and conventional prognostic factors within one risk stratification (HM-metascore). The GEP-R comprises (i) quality control, (ii) sample identity control, (iii) biologic classification, (iv) risk stratification, and (v) assessment of target genes. The resulting HM-metascore is defined as the sum over the weighted factors gene expression-based risk-assessment (UAMS-, IFM-score), proliferation, International Staging System (ISS) stage, t(4;14), and expression of prognostic target genes (AURKA, IGF1R) for which clinical grade inhibitors exist. The HM-score delineates three significantly different groups of 13.1%, 72.1%, and 14.7% of patients with a 6-year survival rate of 89.3%, 60.6%, and 18.6%, respectively. GEP reporting allows prospective assessment of risk and target gene expression and integration of current prognostic factors in clinical routine, being customizable about novel parameters or other cancer entities. ©2011 AACR.

  16. Overexpression of genes involved in miRNA biogenesis in medullary thyroid carcinomas with RET mutation.

    PubMed

    Puppin, Cinzia; Durante, Cosimo; Sponziello, Marialuisa; Verrienti, Antonella; Pecce, Valeria; Lavarone, Elisa; Baldan, Federica; Campese, Antonio Francesco; Boichard, Amelie; Lacroix, Ludovic; Russo, Diego; Filetti, Sebastiano; Damante, Giuseppe

    2014-11-01

    Abnormal expression of non-coding micro RNA (miRNA) has been described in medullary thyroid carcinoma (MTC). Expression of genes encoding factors involved in miRNA biogenesis results often deregulated in human cancer and correlates with aggressive clinical behavior. In this study, expression of four genes involved in miRNA biogenesis (DICER, DROSHA, DCGR8, and XPO5) was investigated in 54 specimens of MTC. Among them, 33 and 13 harbored RET and RAS mutations, respectively. DICER, DGCR8, and XPO5 mRNA levels were significantly overexpressed in MTC harboring RET mutations, in particular, in the presence of RET634 mutation. When MTCs with RET and RAS mutations were compared, only DGCR8 displayed a significant difference, while MTCs with RAS mutations did not show significant differences with respect to non-mutated tumors. We then attempted to correlate expression of miRNA biogenesis genes with tumor aggressiveness. According to the TNM status, MTCs were divided in two groups and compared (N0 M0 vs. N1 and/or M1): for all four genes no significant difference was detected. Cell line experiments, in which expression of a RET mutation is silenced by siRNA, suggest the existence of a causal relationship between RET mutation and overexpression of DICER, DGCR8, and XPO5 genes. These findings demonstrate that RET- but not RAS-driven tumorigenic alterations include abnormalities in the expression of some important genes involved in miRNA biogenesis that could represent new potential markers for targeted therapies in the treatment of RET-mutated MTCs aimed to restore the normal miRNA expression profile.

  17. Highly tissue specific expression of Sphinx supports its male courtship related role in Drosophila melanogaster.

    PubMed

    Chen, Ying; Dai, Hongzheng; Chen, Sidi; Zhang, Luoying; Long, Manyuan

    2011-04-26

    Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5' flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes.

  18. Highly Tissue Specific Expression of Sphinx Supports Its Male Courtship Related Role in Drosophila melanogaster

    PubMed Central

    Chen, Sidi; Zhang, Luoying; Long, Manyuan

    2011-01-01

    Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5′ flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes. PMID:21541324

  19. Expression of Listeria monocytogenes key virulence genes during growth in liquid medium, on rocket and melon at 4, 10 and 30 °C.

    PubMed

    Hadjilouka, Agni; Molfeta, Christina; Panagiotopoulou, Olga; Paramithiotis, Spiros; Mataragas, Marios; Drosinos, Eleftherios H

    2016-05-01

    The aim of the present study was to assess the expression of key virulence genes, during growth of a Listeria monocytogenes isolate in liquid medium, on melon and rocket at different temperatures and time. For that purpose, BHI broth, rocket and melon were inoculated at 7.0-7.5 log CFU mL(-1) or g(-1)and stored at 4, 10 and 30 °C. Sampling took place upon inoculation and after 0.5, 6 and 24 h of incubation. The RNA was stabilized and the expression of hly, plcA, plcB, sigB, inlA, inlB, inlC, inlJ, lmo2672 and lmo2470 was assessed by RT-qPCR. The results obtained were summarized into two observations; the first one referring to the interactive effect of incubation temperature and type of substrate and the second one to the effect of time on gene expression. Regarding the latter, nearly all genes were regulated upon inoculation and exhibited differential expression in the subsequent sampling times indicating the existence of additional regulatory mechanisms yet to be explored. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Identification of Genes Uniquely Expressed in the Germ-Line Tissues of the Jewel Wasp Nasonia vitripennis

    PubMed Central

    Ferree, Patrick M.; Fang, Christopher; Mastrodimos, Mariah; Hay, Bruce A.; Amrhein, Henry; Akbari, Omar S.

    2015-01-01

    The jewel wasp Nasonia vitripennis is a rising model organism for the study of haplo-diploid reproduction characteristic of hymenopteran insects, which include all wasps, bees, and ants. We performed transcriptional profiling of the ovary, the female soma, and the male soma of N. vitripennis to complement a previously existing transcriptome of the wasp testis. These data were deposited into an open-access genome browser for visualization of transcripts relative to their gene models. We used these data to identify the assemblies of genes uniquely expressed in the germ-line tissues. We found that 156 protein-coding genes are expressed exclusively in the wasp testis compared with only 22 in the ovary. Of the testis-specific genes, eight are candidates for male-specific DNA packaging proteins known as protamines. We found very similar expression patterns of centrosome associated genes in the testis and ovary, arguing that de novo centrosome formation, a key process for development of unfertilized eggs into males, likely does not rely on large-scale transcriptional differences between these tissues. In contrast, a number of meiosis-related genes show a bias toward testis-specific expression, despite the lack of true meiosis in N. vitripennis males. These patterns may reflect an unexpected complexity of male gamete production in the haploid males of this organism. Broadly, these data add to the growing number of genomic and genetic tools available in N. vitripennis for addressing important biological questions in this rising insect model organism. PMID:26464360

  1. Alien/CSN2 gene expression is regulated by thyroid hormone in rat brain.

    PubMed

    Tenbaum, Stephan P; Juenemann, Stefan; Schlitt, Thomas; Bernal, Juan; Renkawitz, Rainer; Muñoz, Alberto; Baniahmad, Aria

    2003-02-01

    Alien has been described as a corepressor for the thyroid hormone receptor (TR). Corepressors are coregulators that mediate gene silencing of DNA-bound transcriptional repressors. We describe here that Alien gene expression in vivo is regulated by thyroid hormone both in the rat brain and in cultured cells. In situ hybridization revealed that Alien is widely expressed in the mouse embryo and also throughout the rat brain. Hypothyroid animals exhibit lower expression of both Alien mRNAs and protein levels as compared with normal animals. Accordingly, we show that Alien gene is inducible after thyroid hormone treatment both in vivo and in cell culture. In cultured cells, the hormonal induction is mediated by either TRalpha or TRbeta, while cells lacking detectable amounts of functional TR lack hormonal induction of Alien. We have detected two Alien-specific mRNAs by Northern experiments and two Alien-specific proteins in vivo and in cell lines by Western analysis, one of the two forms representing the CSN2 subunit of the COP9 signalosome. Interestingly, both Alien mRNAs and both detected proteins are regulated by thyroid hormone in vivo and in cell lines. Furthermore, we provide evidence for the existence of at least two Alien genes in rodents. Taken together, we conclude that Alien gene expression is under control of TR and thyroid hormone. This suggests a negative feedback mechanism between TR and its own corepressor. Thus, the reduction of corepressor levels may represent a control mechanism of TR-mediated gene silencing.

  2. An Adaptive Genetic Association Test Using Double Kernel Machines

    PubMed Central

    Zhan, Xiang; Epstein, Michael P.; Ghosh, Debashis

    2014-01-01

    Recently, gene set-based approaches have become very popular in gene expression profiling studies for assessing how genetic variants are related to disease outcomes. Since most genes are not differentially expressed, existing pathway tests considering all genes within a pathway suffer from considerable noise and power loss. Moreover, for a differentially expressed pathway, it is of interest to select important genes that drive the effect of the pathway. In this article, we propose an adaptive association test using double kernel machines (DKM), which can both select important genes within the pathway as well as test for the overall genetic pathway effect. This DKM procedure first uses the garrote kernel machines (GKM) test for the purposes of subset selection and then the least squares kernel machine (LSKM) test for testing the effect of the subset of genes. An appealing feature of the kernel machine framework is that it can provide a flexible and unified method for multi-dimensional modeling of the genetic pathway effect allowing for both parametric and nonparametric components. This DKM approach is illustrated with application to simulated data as well as to data from a neuroimaging genetics study. PMID:26640602

  3. Rax Homeoprotein Regulates Photoreceptor Cell Maturation and Survival in Association with Crx in the Postnatal Mouse Retina

    PubMed Central

    Irie, Shoichi; Sanuki, Rikako; Muranishi, Yuki; Kato, Kimiko; Chaya, Taro

    2015-01-01

    The Rax homeobox gene plays essential roles in multiple processes of vertebrate retina development. Many vertebrate species possess Rax and Rax2 genes, and different functions have been suggested. In contrast, mice contain a single Rax gene, and its functional roles in late retinal development are still unclear. To clarify mouse Rax function in postnatal photoreceptor development and maintenance, we generated conditional knockout mice in which Rax in maturing or mature photoreceptor cells was inactivated by tamoxifen treatment (Rax iCKO mice). When Rax was inactivated in postnatal Rax iCKO mice, developing photoreceptor cells showed a significant decrease in the level of the expression of rod and cone photoreceptor genes and mature adult photoreceptors exhibited a specific decrease in cone cell numbers. In luciferase assays, we found that Rax and Crx cooperatively transactivate Rhodopsin and cone opsin promoters and that an optimum Rax expression level to transactivate photoreceptor gene expression exists. Furthermore, Rax and Crx colocalized in maturing photoreceptor cells, and their coimmunoprecipitation was observed in cultured cells. Taken together, these results suggest that Rax plays essential roles in the maturation of both cones and rods and in the survival of cones by regulating photoreceptor gene expression with Crx in the postnatal mouse retina. PMID:25986607

  4. Upregulation of the ESR1 Gene and ESR Ratio (ESR1/ESR2) is Associated with a Worse Prognosis in Papillary Thyroid Carcinoma: The Impact of the Estrogen Receptor α/β Expression on Clinical Outcomes in Papillary Thyroid Carcinoma Patients.

    PubMed

    Yi, Jin Wook; Kim, Su-Jin; Kim, Jong Kyu; Seong, Chan Yong; Yu, Hyeong Won; Chai, Young Jun; Choi, June Young; Lee, Kyu Eun

    2017-11-01

    A gender disparity exists with respect to the incidence of papillary thyroid cancer (PTC), suggesting that sex hormones such as estrogen play a role in PTC development and progression. In this study, we compared estrogen receptor gene expression patterns in PTCs to determine the clinical significance of estrogen gene expression in PTC. We analyzed ESR1 and ESR2 messenger RNA expression counts using data from The Cancer Genome Atlas (TCGA). To validate the results of TCGA analysis, we analyzed microarray data (GSE 54958) from the Gene Expression Omnibus. ESR1 gene expression and ESR ratio (ESR1/ESR2) were significantly higher in PTC tissues than in paired normal thyroid tissues (mean 659.427 vs. 264.045 for ESR1, 92.017 vs. 19.064 for ESR ratio). Among female patients, ESR1 expression and ESR ratio were negatively correlated with increased age. ESR1 expression and ESR ratio were higher in patients with classic PTC, lymphovascular invasion, BRAF V600E mutation, and radioiodine therapy. Classification analysis demonstrated that higher ESR1 expression and a higher ESR ratio faced a worse overall survival (hazard ratio 6.348 for ESR1, 4.031 for ESR ratio). Validation microarray analysis demonstrated that ESR1 expression and ESR ratio were higher in tumor tissues, classic PTC, and BRAF V600E . Higher ESR1 expression and a higher ESR ratio were associated with aggressive prognostic factors and worse overall survival in female PTC patients. Our results suggest that ESR1 and ESR ratio can be used as prognostic markers to predict female patient survival and have potential as a therapeutic target.

  5. Methylation patterns in marginal zone lymphoma.

    PubMed

    Arribas, Alberto J; Bertoni, Francesco

    Promoter DNA methylation is a major regulator of gene expression and transcription. The identification of methylation changes is important for understanding disease pathogenesis, for identifying prognostic markers and can drive novel therapeutic approaches. In this review we summarize the current knowledge regarding DNA methylation in MALT lymphoma, splenic marginal zone lymphoma, nodal marginal zone lymphoma. Despite important differences in the study design for different publications and the existence of a sole large and genome-wide methylation study for splenic marginal zone lymphoma, it is clear that DNA methylation plays an important role in marginal zone lymphomas, in which it contributes to the inactivation of tumor suppressors but also to the expression of genes sustaining tumor cell survival and proliferation. Existing preclinical data provide the rationale to target the methylation machinery in these disorders. Copyright © 2016 Elsevier Ltd. All rights reserved.

  6. The JNK-like MAPK KGB-1 of Caenorhabditis elegans promotes reproduction, lifespan, and gene expressions for protein biosynthesis and germline homeostasis but interferes with hyperosmotic stress tolerance.

    PubMed

    Gerke, Peter; Keshet, Alex; Mertenskötter, Ansgar; Paul, Rüdiger J

    2014-01-01

    This study focused on the role of the JNK-like MAPK (mitogen-activated protein kinase) KGB-1 (kinase, GLH-binding 1) for osmoprotection and other vital functions. We mapped KGB-1 expression patterns and determined lifespan, reproduction and survival rates as well as changes in body volume, motility, and GPDH (glycerol-3-phosphate dehydrogenase) activity for glycerol production in wildtype (WT), different signaling mutants (including a kgb-1 deletion mutant, kgb-1∆) and RNAi-treated worms under control and hyperosmotic conditions. KGB-1-mediated gene expressions were studied, for instance, by RNA Sequencing, with the resulting transcriptome data analyzed using orthology-based approaches. Surprisingly, mutation/RNAi of kgb-1 and fos-1 (gene for an AP-1, activator protein 1, element) significantly promoted hyperosmotic resistance, even though hyperosmotic GPDH activity was higher in WT than in kgb-1∆. KGB-1 and moderate hyperosmolarity promoted and severe hyperosmolarity repressed kgb-1, fos-1, and jun-1 (gene for another AP-1 element) expression. Transcriptome profiling revealed, for instance, down-regulated genes for protein biosynthesis and up-regulated genes for membrane transporters in kgb-1∆ and up-regulated genes for GPDH-1 or detoxification in WT, with the latter indicating cellular damage and less effective osmoprotection in WT. KGB-1 promotes reproduction and lifespan and fosters gene expressions for AP-1 elements, protein biosynthesis, and balanced gametogenesis, but inhibits expressions for membrane transporters perhaps in order to control energy consumption. Reduced protein biosyntheses and enhanced membrane transports in kgb-1∆ most likely contribute to the high hyperosmotic tolerance of the mutant by easing the burden of the existing chaperone machinery and promoting regulatory volume increases upon hyperosmotic stress.

  7. Hybridization between Yellowstone cutthroat trout and rainbow trout alters the expression of muscle growth-related genes and their relationships with growth patterns

    USGS Publications Warehouse

    Ostberg, Carl O.; Chase, Dorothy M.; Hauser, Lorenz

    2015-01-01

    Hybridization creates novel gene combinations that may generate important evolutionary novelty, but may also reduce existing adaptation by interrupting inherent biological processes, such as genotype-environment interactions. Hybridization often causes substantial change in patterns of gene expression, which, in turn, may cause phenotypic change. Rainbow trout (Oncorhynchus mykiss) and cutthroat trout (O. clarkii) produce viable hybrids in the wild, and introgressive hybridization with introduced rainbow trout is a major conservation concern for native cutthroat trout. The two species differ in body shape, which is likely an evolutionary adaptation to their native environments, and their hybrids tend to show intermediate morphology. The characterization of gene expression patterns may provide insights on the genetic basis of hybrid and parental morphologies, as well as on the ecological performance of hybrids in the wild. Here, we evaluated the expression of eight growth-related genes (MSTN-1a, MSTN-1b, MyoD1a, MyoD1b, MRF-4, IGF-1, IGF-2, and CAST-L) and the relationship of these genes with growth traits (length, weight, and condition factor) in six line crosses: both parental species, both reciprocal F1 hybrids, and both first-generation backcrosses (F1 x rainbow trout and F1 x cutthroat trout). Four of these genes were differentially expressed among rainbow, cutthroat, and their hybrids. Transcript abundance was significantly correlated with growth traits across the parent species, but not across hybrids. Our findings suggest that rainbow and cutthroat trout exhibit differences in muscle growth regulation, that transcriptional networks may be modified by hybridization, and that hybridization disrupts intrinsic relationships between gene expression and growth patterns that may be functionally important for phenotypic adaptations.

  8. Identification of gene expression profiling associated with erlotinib-related skin toxicity in pancreatic adenocarcinoma patients

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Caba, Octavio, E-mail: ocaba@ujaen.es

    Erlotinib is an epidermal growth factor receptor (EGFR) tyrosine kinase inhibitor that showed activity against pancreatic ductal adenocarcinoma (PDAC). The drug's most frequently reported side effect as a result of EGFR inhibition is skin rash (SR), a symptom which has been associated with a better therapeutic response to the drug. Gene expression profiling can be used as a tool to predict which patients will develop this important cutaneous manifestation. The aim of the present study was to identify which genes may influence the appearance of SR in PDAC patients. The study included 34 PDAC patients treated with erlotinib: 21 patientsmore » developed any grade of SR, while 13 patients did not (controls). Before administering any chemotherapy regimen and the development of SR, we collected RNA from peripheral blood samples of all patients and studied the differential gene expression pattern using the Illumina microarray platform HumanHT-12 v4 Expression BeadChip. Seven genes (FAM46C, IFITM3, GMPR, DENND6B, SELENBP1, NOL10, and SIAH2), involved in different pathways including regulatory, migratory, and signalling processes, were downregulated in PDAC patients with SR. Our results suggest the existence of a gene expression profiling significantly correlated with erlotinib-induced SR in PDAC that could be used as prognostic indicator in this patients. - Highlights: • Skin rash (SR) is the most characteristic side effect of erlotinib in PDAC patients. • Erlotinib-induced SR has been associated with a better clinical outcome. • Gene expression profiling was used to determine who will develop this manifestation. • 7 genes involved in different pathways were downregulated in PDAC patients with SR. • Our profile correlated with erlotinib-induced SR in PDAC could be used for prognosis.« less

  9. QUADrATiC: scalable gene expression connectivity mapping for repurposing FDA-approved therapeutics.

    PubMed

    O'Reilly, Paul G; Wen, Qing; Bankhead, Peter; Dunne, Philip D; McArt, Darragh G; McPherson, Suzanne; Hamilton, Peter W; Mills, Ken I; Zhang, Shu-Dong

    2016-05-04

    Gene expression connectivity mapping has proven to be a powerful and flexible tool for research. Its application has been shown in a broad range of research topics, most commonly as a means of identifying potential small molecule compounds, which may be further investigated as candidates for repurposing to treat diseases. The public release of voluminous data from the Library of Integrated Cellular Signatures (LINCS) programme further enhanced the utilities and potentials of gene expression connectivity mapping in biomedicine. We describe QUADrATiC ( http://go.qub.ac.uk/QUADrATiC ), a user-friendly tool for the exploration of gene expression connectivity on the subset of the LINCS data set corresponding to FDA-approved small molecule compounds. It enables the identification of compounds for repurposing therapeutic potentials. The software is designed to cope with the increased volume of data over existing tools, by taking advantage of multicore computing architectures to provide a scalable solution, which may be installed and operated on a range of computers, from laptops to servers. This scalability is provided by the use of the modern concurrent programming paradigm provided by the Akka framework. The QUADrATiC Graphical User Interface (GUI) has been developed using advanced Javascript frameworks, providing novel visualization capabilities for further analysis of connections. There is also a web services interface, allowing integration with other programs or scripts. QUADrATiC has been shown to provide an improvement over existing connectivity map software, in terms of scope (based on the LINCS data set), applicability (using FDA-approved compounds), usability and speed. It offers potential to biological researchers to analyze transcriptional data and generate potential therapeutics for focussed study in the lab. QUADrATiC represents a step change in the process of investigating gene expression connectivity and provides more biologically-relevant results than previous alternative solutions.

  10. Expression and copy number gains of the RET gene in 631 early and mid stage non‐small cell lung cancer cases

    PubMed Central

    Tan, Ling; Hu, Yerong; Tao, Yongguang; Wang, Bin; Xiao, Jun; Tang, Zhenjie; Lu, Ting

    2018-01-01

    Background To identify whether RET is a potential target for NSCLC treatment, we examined the status of the RET gene in 631 early and mid stage NSCLC cases from south central China. Methods RET expression was identified by Western blot. RET‐positive expression samples were verified by immunohistochemistry. RET gene mutation, copy number variation, and rearrangement were analyzed by DNA Sanger sequencing, TaqMan copy number assays, and reverse transcription‐PCR. ALK and ROS1 expression levels were tested by Western blot and EGFR mutation using Sanger sequencing. Results The RET‐positive rate was 2.5% (16/631). RET‐positive expression was related to poorer tumor differentiation (P < 0.05). In the 16 RET‐positive samples, only two samples of moderately and poorly differentiated lung adenocarcinomas displayed RET rearrangement, both in RET‐KIF5B fusion partners. Neither ALK nor ROS1 translocation was found. The EGFR mutation rate in RET‐positive samples was significantly lower than in RET‐negative samples (P < 0.05). Conclusion RET‐positive expression in early and mid stage NSCLC cases from south central China is relatively low and is related to poorer tumor differentiation. RET gene alterations (copy number gain and rearrangement) exist in all RET‐positive samples. RET‐positive expression is a relatively independent factor in NSCLC patients, which indicates that the RET gene may be a novel target site for personalized treatment of NSCLC. PMID:29473341

  11. Alterations in Bronchial Airway miRNA Expression for Lung Cancer Detection.

    PubMed

    Pavel, Ana B; Campbell, Joshua D; Liu, Gang; Elashoff, David; Dubinett, Steven; Smith, Kate; Whitney, Duncan; Lenburg, Marc E; Spira, Avrum

    2017-11-01

    We have previously shown that gene expression alterations in normal-appearing bronchial epithelial cells can serve as a lung cancer detection biomarker in smokers. Given that miRNAs regulate airway gene expression responses to smoking, we evaluated whether miRNA expression is also altered in the bronchial epithelium of smokers with lung cancer. Using epithelial brushings from the mainstem bronchus of patients undergoing bronchoscopy for suspected lung cancer (as part of the AEGIS-1/2 clinical trials), we profiled miRNA expression via small-RNA sequencing from 347 current and former smokers for which gene expression data were also available. Patients were followed for one year postbronchoscopy until a final diagnosis of lung cancer ( n = 194) or benign disease ( n = 153) was made. Following removal of 6 low-quality samples, we used 138 patients (AEGIS-1) as a discovery set to identify four miRNAs (miR-146a-5p, miR-324-5p, miR-223-3p, and miR-223-5p) that were downregulated in the bronchial airway of lung cancer patients (ANOVA P < 0.002, FDR < 0.2). The expression of these miRNAs is significantly more negatively correlated with the expression of their mRNA targets than with the expression of other nontarget genes (K-S P < 0.05). Furthermore, these mRNA targets are enriched among genes whose expression is elevated in cancer patients (GSEA FDR < 0.001). Finally, we found that the addition of miR-146a-5p to an existing mRNA biomarker for lung cancer significantly improves its performance (AUC) in the 203 samples (AEGIS-1/2) serving an independent test set (DeLong P < 0.05). Our findings suggest that there are miRNAs whose expression is altered in the cytologically normal bronchial epithelium of smokers with lung cancer, and that they may regulate cancer-associated gene expression differences. Cancer Prev Res; 10(11); 651-9. ©2017 AACR . ©2017 American Association for Cancer Research.

  12. Expression of interest: transcriptomics and the designation of conservation units.

    PubMed

    Hansen, Michael M

    2010-05-01

    An important task within conservation genetics consists in defining intraspecific conservation units. Most conceptual frameworks involve two steps: (i) identifying demographically independent units, and (ii) evaluating their degree of adaptive divergence. Whereas a plethora of methods are available for delineating genetic population structure, assessment of functional genetic divergence remains a challenge. In this issue, Tymchuk et al. (2010) study Atlantic salmon (Salmo salar) populations using both microsatellite markers and analysis of global gene expression. They show that important gene expression differences exist that can be interpreted in the context of different ecological conditions experienced by the populations, along with the populations' histories. This demonstrates an important potential role of transcriptomics for designating conservation units.

  13. A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification.

    PubMed

    Xie, Xin-Ping; Xie, Yu-Feng; Wang, Hong-Qiang

    2017-08-23

    Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity.

  14. Alterations of physiology and gene expression due to long-term magnesium-deficiency differ between leaves and roots of Citrus reticulata.

    PubMed

    Jin, Xiao-Lin; Ma, Cui-Lan; Yang, Lin-Tong; Chen, Li-Song

    2016-07-01

    Seedlings of Ponkan (Citrus reticulata) were irrigated with nutrient solution containing 0 (Mg-deficiency) or 1mM MgSO4 (control) every two day for 16 weeks. Thereafter, we examined magnesium (Mg)-deficiency-induced changes in leaf and root gas exchange, total soluble proteins and gene expression. Mg-deficiency lowered leaf CO2 assimilation, and increased leaf dark respiration. However, Mg-deficient roots had lower respiration. Total soluble protein level was not significantly altered by Mg-deficiency in roots, but was lower in Mg-deficient leaves than in controls. Using cDNA-AFLP, we obtained 70 and 71 differentially expressed genes from leaves and roots. These genes mainly functioned in signal transduction, stress response, carbohydrate and energy metabolism, cell transport, cell wall and cytoskeleton metabolism, nucleic acid, and protein metabolisms. Lipid metabolism (Ca(2+) signals)-related Mg-deficiency-responsive genes were isolated only from roots (leaves). Although little difference existed in the number of Mg-deficiency-responsive genes between them both, most of these genes only presented in Mg-deficient leaves or roots, and only four genes were shared by them both. Our data clearly demonstrated that Mg-deficiency-induced alterations of physiology and gene expression greatly differed between leaves and roots. In addition, we focused our discussion on the causes for photosynthetic decline in Mg-deficient leaves and the responses of roots to Mg-deficiency. Copyright © 2016 Elsevier GmbH. All rights reserved.

  15. Database of cattle candidate genes and genetic markers for milk production and mastitis

    PubMed Central

    Ogorevc, J; Kunej, T; Razpet, A; Dovc, P

    2009-01-01

    A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288

  16. BioVLAB-mCpG-SNP-EXPRESS: A system for multi-level and multi-perspective analysis and exploration of DNA methylation, sequence variation (SNPs), and gene expression from multi-omics data.

    PubMed

    Chae, Heejoon; Lee, Sangseon; Seo, Seokjun; Jung, Daekyoung; Chang, Hyeonsook; Nephew, Kenneth P; Kim, Sun

    2016-12-01

    Measuring gene expression, DNA sequence variation, and DNA methylation status is routinely done using high throughput sequencing technologies. To analyze such multi-omics data and explore relationships, reliable bioinformatics systems are much needed. Existing systems are either for exploring curated data or for processing omics data in the form of a library such as R. Thus scientists have much difficulty in investigating relationships among gene expression, DNA sequence variation, and DNA methylation using multi-omics data. In this study, we report a system called BioVLAB-mCpG-SNP-EXPRESS for the integrated analysis of DNA methylation, sequence variation (SNPs), and gene expression for distinguishing cellular phenotypes at the pairwise and multiple phenotype levels. The system can be deployed on either the Amazon cloud or a publicly available high-performance computing node, and the data analysis and exploration of the analysis result can be conveniently done using a web-based interface. In order to alleviate analysis complexity, all the process are fully automated, and graphical workflow system is integrated to represent real-time analysis progression. The BioVLAB-mCpG-SNP-EXPRESS system works in three stages. First, it processes and analyzes multi-omics data as input in the form of the raw data, i.e., FastQ files. Second, various integrated analyses such as methylation vs. gene expression and mutation vs. methylation are performed. Finally, the analysis result can be explored in a number of ways through a web interface for the multi-level, multi-perspective exploration. Multi-level interpretation can be done by either gene, gene set, pathway or network level and multi-perspective exploration can be explored from either gene expression, DNA methylation, sequence variation, or their relationship perspective. The utility of the system is demonstrated by performing analysis of phenotypically distinct 30 breast cancer cell line data set. BioVLAB-mCpG-SNP-EXPRESS is available at http://biohealth.snu.ac.kr/software/biovlab_mcpg_snp_express/. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Finding genes discriminating smokers from non-smokers by applying a growing self-organizing clustering method to large airway epithelium cell microarray data.

    PubMed

    Shahdoust, Maryam; Hajizadeh, Ebrahim; Mozdarani, Hossein; Chehrei, Ali

    2013-01-01

    Cigarette smoking is the major risk factor for development of lung cancer. Identification of effects of tobacco on airway gene expression may provide insight into the causes. This research aimed to compare gene expression of large airway epithelium cells in normal smokers (n=13) and non-smokers (n=9) in order to find genes which discriminate the two groups and assess cigarette smoking effects on large airway epithelium cells. Genes discriminating smokers from non-smokers were identified by applying a neural network clustering method, growing self-organizing maps (GSOM), to microarray data according to class discrimination scores. An index was computed based on differentiation between each mean of gene expression in the two groups. This clustering approach provided the possibility of comparing thousands of genes simultaneously. The applied approach compared the mean of 7,129 genes in smokers and non-smokers simultaneously and classified the genes of large airway epithelium cells which had differently expressed in smokers comparing with non-smokers. Seven genes were identified which had the highest different expression in smokers compared with the non-smokers group: NQO1, H19, ALDH3A1, AKR1C1, ABHD2, GPX2 and ADH7. Most (NQO1, ALDH3A1, AKR1C1, H19 and GPX2) are known to be clinically notable in lung cancer studies. Furthermore, statistical discriminate analysis showed that these genes could classify samples in smokers and non-smokers correctly with 100% accuracy. With the performed GSOM map, other nodes with high average discriminate scores included genes with alterations strongly related to the lung cancer such as AKR1C3, CYP1B1, UCHL1 and AKR1B10. This clustering by comparing expression of thousands of genes at the same time revealed alteration in normal smokers. Most of the identified genes were strongly relevant to lung cancer in the existing literature. The genes may be utilized to identify smokers with increased risk for lung cancer. A large sample study is now recommended to determine relations between the genes ABHD2 and ADH7 and smoking.

  18. Evolution and Distribution of Teleost myomiRNAs: Functionally Diversified myomiRs in Teleosts.

    PubMed

    Siddique, Bhuiyan Sharmin; Kinoshita, Shigeharu; Wongkarangkana, Chaninya; Asakawa, Shuichi; Watabe, Shugo

    2016-06-01

    Myosin heavy chain (MYH) genes belong to a multigene family, and the regulated expression of each member determines the physiological and contractile muscle properties. Among these, MYH6, MYH7, and MYH14 occupy unique positions in the mammalian MYH gene family because of their specific expression in slow/cardiac muscles and the existence of intronic micro(mi) RNAs. MYH6, MYH7, and MYH14 encode miR-208a, miR-208b, and miR-499, respectively. These MYH encoded miRNAs are designated as myomiRs because of their muscle-specific expression and functions. In mammals, myomiRs and host MYHs form a transcription network involved in muscle fiber-type specification; thus, genomic positions and expression patterns of them are well conserved. However, our previous studies revealed divergent distribution and expression of MYH14/miR-499 among teleosts, suggesting the unique evolution of myomiRs and host MYHs in teleosts. Here, we examined distribution and expression of myomiRs and host MYHs in various teleost species. The major cardiac MYH isoforms in teleosts are an intronless gene, atrial myosin heavy chain (amhc), and ventricular myosin heavy chain (vmhc) gene that encodes an intronic miRNA, miR-736. Phylogenetic analysis revealed that vmhc/miR-736 is a teleost-specific myomiR that differed from tetrapoda MYH6/MYH7/miR-208s. Teleost genomes also contain species-specific orthologs in addition to vmhc and amhc, indicating complex gene duplication and gene loss events during teleost evolution. In medaka and torafugu, miR-499 was highly expressed in slow/cardiac muscles whereas the expression of miR-736 was quite low and not muscle specific. These results suggest functional diversification of myomiRs in teleost with the diversification of host MYHs.

  19. [Prokaryotic expression and immunogenicity analysis of the chimeric HBcAg containing APP beta cleavage site peptide and Aβ(1-15);].

    PubMed

    Feng, Gai-feng; Wang, Jun-yang; Jin, Hui; Wang, Wei-xi; Qian, Yi-hua; Yang, Wei-na; Wang, Quan-ying; Yang, Guang-xiao

    2011-11-01

    To construct the recombinant prokaryotic expression plasmid pET/c-ABCSP-Aβ(15-c);, and evaluate the immunogenicity of the fusion protein expressed in E.coli. The gene fragment HBc88-144 was amplified by PCR and subcloned to pUC19. The APP beta cleavage site peptide(ABCSP) and Aβ(1-15); gene(ABCSP-Aβ(15);) was amplified by PCR and inserted downstream of HBc1-71 in pGEMEX/c1-71. After restriction enzyme digestion, c1-17-ABCSP-Aβ(15); were connected with HBc88-144, yielding the recombinant gene c-ABCSP-Aβ(15-c);. c-ABCSP-Aβ(15-c); gene was subcloned into pET-28a(+).The fusion protein expressed in transformed E.coli BL21 was induced with IPTG and analyzed by SDS-PAGE. The virus-like particles (VLP) formed by fusion protein was observed with Transmission Electron Microscope (TEM). 4 Kunming (KM) mice received intraperitoneal injection (i.p) of fusion protein VLP. The antibody was detected by indirect ELISA. The recombinant gene was confirmed by restriction enzyme digestion and DNA sequencing. After IPTG induction, fusion protein was expressed and mainly existed in the sediment of the bacterial lysate. The expression level was 40% of all the proteins in the sediment. The fusion protein could form VLP. After 5 times of immunization, the titer of anti-ABCSP and anti-Aβantibody in sera of KM mice reached up to 1:5 000 and 1:10 000 respectively, while the anti-HBc antibody was undetectable. Recombinant c-ABCSP-Aβ(15-c); gene can be expressed in E.coli. The expressed protein could form VLP and has a strong immunogenicity. This study lays the foundation for the study of AD genetic engineering vaccine.

  20. Genome-wide analysis and expression profile of the bZIP transcription factor gene family in grapevine (Vitis vinifera)

    PubMed Central

    2014-01-01

    Background Basic leucine zipper (bZIP) transcription factor gene family is one of the largest and most diverse families in plants. Current studies have shown that the bZIP proteins regulate numerous growth and developmental processes and biotic and abiotic stress responses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant bZIP family members remains very limited. Results We identified 55 bZIP transcription factor-encoding genes in the grapevine (Vitis vinifera) genome, and divided them into 10 groups according to the phylogenetic relationship with those in Arabidopsis. The chromosome distribution and the collinearity analyses suggest that expansion of the grapevine bZIP (VvbZIP) transcription factor family was greatly contributed by the segment/chromosomal duplications, which may be associated with the grapevine genome fusion events. Nine intron/exon structural patterns within the bZIP domain and the additional conserved motifs were identified among all VvbZIP proteins, and showed a high group-specificity. The predicted specificities on DNA-binding domains indicated that some highly conserved amino acid residues exist across each major group in the tree of land plant life. The expression patterns of VvbZIP genes across the grapevine gene expression atlas, based on microarray technology, suggest that VvbZIP genes are involved in grapevine organ development, especially seed development. Expression analysis based on qRT-PCR indicated that VvbZIP genes are extensively involved in drought- and heat-responses, with possibly different mechanisms. Conclusions The genome-wide identification, chromosome organization, gene structures, evolutionary and expression analyses of grapevine bZIP genes provide an overall insight of this gene family and their potential involvement in growth, development and stress responses. This will facilitate further research on the bZIP gene family regarding their evolutionary history and biological functions. PMID:24725365

  1. Medicago truncatula contains a second gene encoding a plastid located glutamine synthetase exclusively expressed in developing seeds.

    PubMed

    Seabra, Ana R; Vieira, Cristina P; Cullimore, Julie V; Carvalho, Helena G

    2010-08-19

    Nitrogen is a crucial nutrient that is both essential and rate limiting for plant growth and seed production. Glutamine synthetase (GS), occupies a central position in nitrogen assimilation and recycling, justifying the extensive number of studies that have been dedicated to this enzyme from several plant sources. All plants species studied to date have been reported as containing a single, nuclear gene encoding a plastid located GS isoenzyme per haploid genome. This study reports the existence of a second nuclear gene encoding a plastid located GS in Medicago truncatula. This study characterizes a new, second gene encoding a plastid located glutamine synthetase (GS2) in M. truncatula. The gene encodes a functional GS isoenzyme with unique kinetic properties, which is exclusively expressed in developing seeds. Based on molecular data and the assumption of a molecular clock, it is estimated that the gene arose from a duplication event that occurred about 10 My ago, after legume speciation and that duplicated sequences are also present in closely related species of the Vicioide subclade. Expression analysis by RT-PCR and western blot indicate that the gene is exclusively expressed in developing seeds and its expression is related to seed filling, suggesting a specific function of the enzyme associated to legume seed metabolism. Interestingly, the gene was found to be subjected to alternative splicing over the first intron, leading to the formation of two transcripts with similar open reading frames but varying 5' UTR lengths, due to retention of the first intron. To our knowledge, this is the first report of alternative splicing on a plant GS gene. This study shows that Medicago truncatula contains an additional GS gene encoding a plastid located isoenzyme, which is functional and exclusively expressed during seed development. Legumes produce protein-rich seeds requiring high amounts of nitrogen, we postulate that this gene duplication represents a functional innovation of plastid located GS related to storage protein accumulation exclusive to legume seed metabolism.

  2. De novo transcriptome assembly and quantification reveal differentially expressed genes between soft-seed and hard-seed pomegranate (Punica granatum L.).

    PubMed

    Xue, Hui; Cao, Shangyin; Li, Haoxian; Zhang, Jie; Niu, Juan; Chen, Lina; Zhang, Fuhong; Zhao, Diguang

    2017-01-01

    Pomegranate (Punica granatum L.) belongs to Punicaceae, and is valued for its social, ecological, economic, and aesthetic values, as well as more recently for its health benefits. The 'Tunisia' variety has softer seeds and big arils that are easily swallowed. It is a widely popular fruit; however, the molecular mechanisms of the formation of hard and soft seeds is not yet clear. We conducted a de novo assembly of the seed transcriptome in P. granatum L. and revealed differential gene expression between the soft-seed and hard-seed pomegranate varieties. A total of 35.1 Gb of data were acquired in this study, including 280,881,106 raw reads. Additionally, de novo transcriptome assembly generated 132,287 transcripts and 105,743 representative unigenes; approximately 13,805 unigenes (37.7%) were longer than 1,000 bp. Using bioinformatics annotation libraries, a total of 76,806 unigenes were annotated and, among the high-quality reads, 72.63% had at least one significant match to an existing gene model. Gene expression and differentially expressed genes were analyzed. The seed formation of the two pomegranate cultivars involves lignin biosynthesis and metabolism, including some genes encoding laccase and peroxidase, WRKY, MYB, and NAC transcription factors. In the hard-seed pomegranate, lignin-related genes and cellulose synthesis-related genes were highly expressed; in soft-seed pomegranates, expression of genes related to flavonoids and programmed cell death was slightly higher. We validated selection of the identified genes using qRT-PCR. This is the first transcriptome analysis of P. granatum L. This transcription sequencing greatly enriched the pomegranate molecular database, and the high-quality SSRs generated in this study will aid the gene cloning from pomegranate in the future. It provides important insights into the molecular mechanisms underlying the formation of soft seeds in pomegranate.

  3. De novo transcriptome assembly and quantification reveal differentially expressed genes between soft-seed and hard-seed pomegranate (Punica granatum L.)

    PubMed Central

    Xue, Hui; Cao, Shangyin; Li, Haoxian; Zhang, Jie; Niu, Juan; Chen, Lina; Zhang, Fuhong; Zhao, Diguang

    2017-01-01

    Pomegranate (Punica granatum L.) belongs to Punicaceae, and is valued for its social, ecological, economic, and aesthetic values, as well as more recently for its health benefits. The ‘Tunisia’ variety has softer seeds and big arils that are easily swallowed. It is a widely popular fruit; however, the molecular mechanisms of the formation of hard and soft seeds is not yet clear. We conducted a de novo assembly of the seed transcriptome in P. granatum L. and revealed differential gene expression between the soft-seed and hard-seed pomegranate varieties. A total of 35.1 Gb of data were acquired in this study, including 280,881,106 raw reads. Additionally, de novo transcriptome assembly generated 132,287 transcripts and 105,743 representative unigenes; approximately 13,805 unigenes (37.7%) were longer than 1,000 bp. Using bioinformatics annotation libraries, a total of 76,806 unigenes were annotated and, among the high-quality reads, 72.63% had at least one significant match to an existing gene model. Gene expression and differentially expressed genes were analyzed. The seed formation of the two pomegranate cultivars involves lignin biosynthesis and metabolism, including some genes encoding laccase and peroxidase, WRKY, MYB, and NAC transcription factors. In the hard-seed pomegranate, lignin-related genes and cellulose synthesis-related genes were highly expressed; in soft-seed pomegranates, expression of genes related to flavonoids and programmed cell death was slightly higher. We validated selection of the identified genes using qRT-PCR. This is the first transcriptome analysis of P. granatum L. This transcription sequencing greatly enriched the pomegranate molecular database, and the high-quality SSRs generated in this study will aid the gene cloning from pomegranate in the future. It provides important insights into the molecular mechanisms underlying the formation of soft seeds in pomegranate. PMID:28594931

  4. A Novel Method to Predict Highly Expressed Genes Based on Radius Clustering and Relative Synonymous Codon Usage.

    PubMed

    Tran, Tuan-Anh; Vo, Nam Tri; Nguyen, Hoang Duc; Pham, Bao The

    2015-12-01

    Recombinant proteins play an important role in many aspects of life and have generated a huge income, notably in the industrial enzyme business. A gene is introduced into a vector and expressed in a host organism-for example, E. coli-to obtain a high productivity of target protein. However, transferred genes from particular organisms are not usually compatible with the host's expression system because of various reasons, for example, codon usage bias, GC content, repetitive sequences, and secondary structure. The solution is developing programs to optimize for designing a nucleotide sequence whose origin is from peptide sequences using properties of highly expressed genes (HEGs) of the host organism. Existing data of HEGs determined by practical and computer-based methods do not satisfy for qualifying and quantifying. Therefore, the demand for developing a new HEG prediction method is critical. We proposed a new method for predicting HEGs and criteria to evaluate gene optimization. Codon usage bias was weighted by amplifying the difference between HEGs and non-highly expressed genes (non-HEGs). The number of predicted HEGs is 5% of the genome. In comparison with Puigbò's method, the result is twice as good as Puigbò's one, in kernel ratio and kernel sensitivity. Concerning transcription/translation factor proteins (TF), the proposed method gives low TF sensitivity, while Puigbò's method gives moderate one. In summary, the results indicated that the proposed method can be a good optional applying method to predict optimized genes for particular organisms, and we generated an HEG database for further researches in gene design.

  5. Persistent Alterations of Gene Expression Profiling of Human Peripheral Blood Mononuclear Cells From Smokers

    PubMed Central

    Weng, Daniel Y.; Chen, Jinguo; Taslim, Cenny; Hsu, Ping-Ching; Marian, Catalin; David, Sean P.; Loffredo, Christopher A.; Shields, Peter G.

    2016-01-01

    The number of validated biomarkers of tobacco smoke exposure is limited, and none exist for tobacco-related cancer. Additional biomarkers for smoke, effects on cellular systems in vivo are needed to improve early detection of lung cancer, and to assist the Food and Drug Administration in regulating exposures to tobacco products. We assessed the effects of smoking on the gene expression using human cell cultures and blood from a cross-sectional study. We profiled global transcriptional changes in cultured smokers’ peripheral blood mononuclear cells (PBMCs) treated with cigarette smoke condensate (CSC) in vitro (n = 7) and from well-characterized smokers’ blood (n = 36). ANOVA with adjustment for covariates and Pearson correlation were used for statistical analysis in this study. CSC in vitro altered the expression of 1 178 genes (177 genes with > 1.5-fold-change) at P < 0.05. In vivo, PBMCs of heavy and light smokers differed for 614 genes (29 with > 1.5-fold-change) at P < 0.05 (309 remaining significant after adjustment for age, race, and gender). Forty-one genes were persistently altered both in vitro and in vivo, 22 having the same expression pattern reported for non-small cell lung cancer. Our data provides evidence that persistent alterations of gene expression in vitro and in vivo may relate to carcinogenic effects of cigarette smoke, and the identified genes may serve as potential biomarkers for cancer. The use of an in vitro model to corroborate results from human studies provides a novel way to understand human exposure and effect. PMID:26294040

  6. Integrated microarray and ChIP analysis identifies multiple Foxa2 dependent target genes in the notochord.

    PubMed

    Tamplin, Owen J; Cox, Brian J; Rossant, Janet

    2011-12-15

    The node and notochord are key tissues required for patterning of the vertebrate body plan. Understanding the gene regulatory network that drives their formation and function is therefore important. Foxa2 is a key transcription factor at the top of this genetic hierarchy and finding its targets will help us to better understand node and notochord development. We performed an extensive microarray-based gene expression screen using sorted embryonic notochord cells to identify early notochord-enriched genes. We validated their specificity to the node and notochord by whole mount in situ hybridization. This provides the largest available resource of notochord-expressed genes, and therefore candidate Foxa2 target genes in the notochord. Using existing Foxa2 ChIP-seq data from adult liver, we were able to identify a set of genes expressed in the notochord that had associated regions of Foxa2-bound chromatin. Given that Foxa2 is a pioneer transcription factor, we reasoned that these sites might represent notochord-specific enhancers. Candidate Foxa2-bound regions were tested for notochord specific enhancer function in a zebrafish reporter assay and 7 novel notochord enhancers were identified. Importantly, sequence conservation or predictive models could not have readily identified these regions. Mutation of putative Foxa2 binding elements in two of these novel enhancers abrogated reporter expression and confirmed their Foxa2 dependence. The combination of highly specific gene expression profiling and genome-wide ChIP analysis is a powerful means of understanding developmental pathways, even for small cell populations such as the notochord. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. Analysis of gene network robustness based on saturated fixed point attractors

    PubMed Central

    2014-01-01

    The analysis of gene network robustness to noise and mutation is important for fundamental and practical reasons. Robustness refers to the stability of the equilibrium expression state of a gene network to variations of the initial expression state and network topology. Numerical simulation of these variations is commonly used for the assessment of robustness. Since there exists a great number of possible gene network topologies and initial states, even millions of simulations may be still too small to give reliable results. When the initial and equilibrium expression states are restricted to being saturated (i.e., their elements can only take values 1 or −1 corresponding to maximum activation and maximum repression of genes), an analytical gene network robustness assessment is possible. We present this analytical treatment based on determination of the saturated fixed point attractors for sigmoidal function models. The analysis can determine (a) for a given network, which and how many saturated equilibrium states exist and which and how many saturated initial states converge to each of these saturated equilibrium states and (b) for a given saturated equilibrium state or a given pair of saturated equilibrium and initial states, which and how many gene networks, referred to as viable, share this saturated equilibrium state or the pair of saturated equilibrium and initial states. We also show that the viable networks sharing a given saturated equilibrium state must follow certain patterns. These capabilities of the analytical treatment make it possible to properly define and accurately determine robustness to noise and mutation for gene networks. Previous network research conclusions drawn from performing millions of simulations follow directly from the results of our analytical treatment. Furthermore, the analytical results provide criteria for the identification of model validity and suggest modified models of gene network dynamics. The yeast cell-cycle network is used as an illustration of the practical application of this analytical treatment. PMID:24650364

  8. Comparison and evaluation of gene therapy and epigenetic approaches for wound healing.

    PubMed

    Cutroneo, K R; Chiu, J F

    2000-01-01

    During the past decade considerable evidence has mounted concerning the importance of growth factors in the wound healing process both for cell replication and for stimulating reparative cells to synthesize and secrete extracellular matrix components. During normal wound healing the growth factor concentration has to be maintained at a certain level. If the growth factor concentration is too low, normal healing fails to occur. Whereas if the growth factor concentration is too high due to either over-expression of the growth factor or too much growth factor being applied to the wound, aberrant wound healing will occur. One approach for controlling the amount of growth factor at the wound site during normal healing is through gene therapy and the titration of gene dosage. However if a narrow window exists between the beneficial therapeutic effect and toxic effects with increasing gene dosage, an agent may be necessary to give in combination with gene therapy to regulate the over-expression of growth factor. In addition to genetic approaches to regulate wound healing, epigenetic approaches also exist. Antisense oligodeoxynucleotides have been shown to regulate wound repair in certain model systems and to determine the protein(s) necessary for normal wound healing. A novel approach to regulate the activity of collagen genes, thereby affecting fibrosis, is to use a sense oligodeoxynucleotide having the same sequence of the cis element which regulates the promoter activity of a particular collagen gene. This exogenous oligodeoxynucleotide will compete with the cis element in the collagen gene for the trans-acting factor which regulates promoter activity. These epigenetic approaches afford the opportunity to regulate over-expression of growth factor and therefore preclude the potential toxic effects of gene therapy. Both genetic and epigenetic approaches for regulating the wound healing process, either normal or aberrant wound healing, have certain advantages and disadvantages which are discussed in the present article.

  9. Expression studies of the PIS-regulated genes suggest different mechanisms of sex determination within mammals.

    PubMed

    Pannetier, M; Servel, N; Cocquet, J; Besnard, N; Cotinot, C; Pailhoux, E

    2003-01-01

    In mammals, the Y-located SRY gene is known to induce testis formation from the indifferent gonad. A related gene, SOX9, also plays a critical role in testis differentiation in mammals, in birds and reptiles. It is now assumed that SRY acts upstream of SOX9 in the sex determination cascade, but the regulatory link which should exist between these two genes remains unknown. Studies on XX sex reversal in polled goats (PIS mutation: Polled Intersex Syndrome) have led to the discovery of a female-specific locus crucial for ovarian differentiation. This genomic region is composed of at least two genes, FOXL2 and PISRT1, which share a common transcriptional regulatory region, PIS. In this review, we present the expression pattern of these PIS-regulated genes in mice. The FOXL2 expression profile of mice is similar to that described in goats in accordance with a conserved role of this ovarian differentiating gene in mammals. On the contrary, the PISRT1 expression profile is different between mice and goats, suggesting different mechanisms of the primary switch in the testis determination process within mammals. A model based on two different modes of SOX9 regulation in mice and other mammals is proposed in order to integrate our results into the current scheme of gonad differentiation. Copyright 2003 S. Karger AG, Basel

  10. Partial least squares based identification of Duchenne muscular dystrophy specific genes.

    PubMed

    An, Hui-bo; Zheng, Hua-cheng; Zhang, Li; Ma, Lin; Liu, Zheng-yan

    2013-11-01

    Large-scale parallel gene expression analysis has provided a greater ease for investigating the underlying mechanisms of Duchenne muscular dystrophy (DMD). Previous studies typically implemented variance/regression analysis, which would be fundamentally flawed when unaccounted sources of variability in the arrays existed. Here we aim to identify genes that contribute to the pathology of DMD using partial least squares (PLS) based analysis. We carried out PLS-based analysis with two datasets downloaded from the Gene Expression Omnibus (GEO) database to identify genes contributing to the pathology of DMD. Except for the genes related to inflammation, muscle regeneration and extracellular matrix (ECM) modeling, we found some genes with high fold change, which have not been identified by previous studies, such as SRPX, GPNMB, SAT1, and LYZ. In addition, downregulation of the fatty acid metabolism pathway was found, which may be related to the progressive muscle wasting process. Our results provide a better understanding for the downstream mechanisms of DMD.

  11. Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

    PubMed

    Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

    2014-10-01

    MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.

  12. Profiling and bioinformatic analysis of circular RNA expression regulated by c-Myc.

    PubMed

    Gou, Qiheng; Wu, Ke; Zhou, Jian-Kang; Xie, Yuxin; Liu, Lunxu; Peng, Yong

    2017-09-22

    The c-Myc transcription factor is involved in cell proliferation, cell cycle and apoptosis by activating or repressing transcription of multiple genes. Circular RNAs (circRNAs) are widely expressed non-coding RNAs participating in the regulation of gene expression. Using a high-throughput microarray assay, we showed that Myc regulates the expression of certain circRNAs. A total of 309 up- and 252 down-regulated circRNAs were identified. Among them, randomly selected 8 circRNAs were confirmed by real-time PCR. Subsequently, Myc-binding sites were found to generally exist in the promoter regions of differentially expressed circRNAs. Based on miRNA sponge mechanism, we constructed circRNAs/miRNAs network regulated by Myc, suggesting that circRNAs may widely regulate protein expression through miRNA sponge mechanism. Lastly, we took advantage of Gene Ontology and KEGG analyses to point out that Myc-regulated circRNAs could impact cell proliferation through affecting Ras signaling pathway and pathways in cancer. Our study for the first time demonstrated that Myc transcription factor regulates the expression of circRNAs, adding a novel component of the Myc tumorigenic program and opening a window to investigate the function of certain circRNAs in tumorigenesis.

  13. AP1 Keeps Chromatin Poised for Action | Center for Cancer Research

    Cancer.gov

    The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins

  14. Dynamics of Agglutinin-Like Sequence (ALS) Protein Localization on the Surface of Candida Albicans

    ERIC Educational Resources Information Center

    Coleman, David Andrew

    2009-01-01

    The ALS gene family encodes large cell-surface glycoproteins associated with "C. albicans" pathogenesis. Als proteins are thought to act as adhesin molecules binding to host tissues. Wide variation in expression levels among the ALS genes exists and is related to cell morphology and environmental conditions. "ALS1," "ALS3," and "ALS4" are three of…

  15. The super elongation complex (SEC) and MLL in development and disease

    PubMed Central

    Smith, Edwin; Lin, Chengqi; Shilatifard, Ali

    2011-01-01

    Transcriptional regulation at the level of elongation is vital for the control of gene expression and metazoan development. The mixed lineage leukemia (MLL) protein and its Drosophila homolog, Trithorax, which exist within COMPASS (complex of proteins associated with Set1)-like complexes, are master regulators of development. They are required for proper homeotic gene expression, in part through methylation of histone H3 on Lys 4. In humans, the MLL gene is involved in a large number of chromosomal translocations that create chimeric proteins, fusing the N terminus of MLL to several proteins that share little sequence similarity. Several frequent translocation partners of MLL were found recently to coexist in a super elongation complex (SEC) that includes known transcription elongation factors such as eleven-nineteen lysine-rich leukemia (ELL) and P-TEFb. Importantly, the SEC is required for HOX gene expression in leukemic cells, suggesting that chromosomal translocations involving MLL could lead to the overexpression of HOX and other genes through the involvement of the SEC. Here, we review the normal developmental roles of MLL and the SEC, and how MLL fusion proteins can mediate leukemogenesis. PMID:21460034

  16. A detailed gene expression study of the Miscanthus genus reveals changes in the transcriptome associated with the rejuvenation of spring rhizomes.

    PubMed

    Barling, Adam; Swaminathan, Kankshita; Mitros, Therese; James, Brandon T; Morris, Juliette; Ngamboma, Ornella; Hall, Megan C; Kirkpatrick, Jessica; Alabady, Magdy; Spence, Ashley K; Hudson, Matthew E; Rokhsar, Daniel S; Moose, Stephen P

    2013-12-09

    The Miscanthus genus of perennial C4 grasses contains promising biofuel crops for temperate climates. However, few genomic resources exist for Miscanthus, which limits understanding of its interesting biology and future genetic improvement. A comprehensive catalog of expressed sequences were generated from a variety of Miscanthus species and tissue types, with an emphasis on characterizing gene expression changes in spring compared to fall rhizomes. Illumina short read sequencing technology was used to produce transcriptome sequences from different tissues and organs during distinct developmental stages for multiple Miscanthus species, including Miscanthus sinensis, Miscanthus sacchariflorus, and their interspecific hybrid Miscanthus × giganteus. More than fifty billion base-pairs of Miscanthus transcript sequence were produced. Overall, 26,230 Sorghum gene models (i.e., ~ 96% of predicted Sorghum genes) had at least five Miscanthus reads mapped to them, suggesting that a large portion of the Miscanthus transcriptome is represented in this dataset. The Miscanthus × giganteus data was used to identify genes preferentially expressed in a single tissue, such as the spring rhizome, using Sorghum bicolor as a reference. Quantitative real-time PCR was used to verify examples of preferential expression predicted via RNA-Seq. Contiguous consensus transcript sequences were assembled for each species and annotated using InterProScan. Sequences from the assembled transcriptome were used to amplify genomic segments from a doubled haploid Miscanthus sinensis and from Miscanthus × giganteus to further disentangle the allelic and paralogous variations in genes. This large expressed sequence tag collection creates a valuable resource for the study of Miscanthus biology by providing detailed gene sequence information and tissue preferred expression patterns. We have successfully generated a database of transcriptome assemblies and demonstrated its use in the study of genes of interest. Analysis of gene expression profiles revealed biological pathways that exhibit altered regulation in spring compared to fall rhizomes, which are consistent with their different physiological functions. The expression profiles of the subterranean rhizome provides a better understanding of the biological activities of the underground stem structures that are essentials for perenniality and the storage or remobilization of carbon and nutrient resources.

  17. The 'warrior gene' and the Mãori people: the responsibility of the geneticists.

    PubMed

    Perbal, Laurence

    2013-09-01

    The 'gene of' is a teleosemantic expression that conveys a simplistic and linear relationship between a gene and a phenotype. Throughout the 20th century, geneticists studied these genes of traits. The studies were often polemical when they concerned human traits: the 'crime gene', 'poverty gene', 'IQ gene', 'gay gene' or 'gene of alcoholism'. Quite recently, a controversy occurred in 2006 in New Zealand that started with the claim that a 'warrior gene' exists in the Mãori community. This claim came from a geneticist working on the MAOA gene. This article is interested in the responsibility of that researcher regarding the origin of the controversy. Several errors were made: overestimation of results, abusive use of the 'gene of' kind of expression, poor communication with the media and a lack of scientific culture. The issues of the debate were not taken into account sufficiently, either from the political, social, ethical or even the genetic points of view. After more than 100 years of debates around 'genes of' all kinds (here, the 'warrior gene'), geneticists may not hide themselves behind the media when a controversy occurs. Responsibilities have to be assumed. © 2012 John Wiley & Sons Ltd.

  18. Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins.

    PubMed

    Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro

    2008-01-03

    The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.

  19. Altering the selection capabilities of common cloning vectors via restriction enzyme mediated gene disruption

    PubMed Central

    2013-01-01

    Background The cloning of gene sequences forms the basis for many molecular biological studies. One important step in the cloning process is the isolation of bacterial transformants carrying vector DNA. This involves a vector-encoded selectable marker gene, which in most cases, confers resistance to an antibiotic. However, there are a number of circumstances in which a different selectable marker is required or may be preferable. Such situations can include restrictions to host strain choice, two phase cloning experiments and mutagenesis experiments, issues that result in additional unnecessary cloning steps, in which the DNA needs to be subcloned into a vector with a suitable selectable marker. Results We have used restriction enzyme mediated gene disruption to modify the selectable marker gene of a given vector by cloning a different selectable marker gene into the original marker present in that vector. Cloning a new selectable marker into a pre-existing marker was found to change the selection phenotype conferred by that vector, which we were able to demonstrate using multiple commonly used vectors and multiple resistance markers. This methodology was also successfully applied not only to cloning vectors, but also to expression vectors while keeping the expression characteristics of the vector unaltered. Conclusions Changing the selectable marker of a given vector has a number of advantages and applications. This rapid and efficient method could be used for co-expression of recombinant proteins, optimisation of two phase cloning procedures, as well as multiple genetic manipulations within the same host strain without the need to remove a pre-existing selectable marker in a previously genetically modified strain. PMID:23497512

  20. Global Genetic Response in a Cancer Cell: Self-Organized Coherent Expression Dynamics

    PubMed Central

    Tsuchiya, Masa; Hashimoto, Midori; Takenaka, Yoshiko; Motoike, Ikuko N.; Yoshikawa, Kenichi

    2014-01-01

    Understanding the basic mechanism of the spatio-temporal self-control of genome-wide gene expression engaged with the complex epigenetic molecular assembly is one of major challenges in current biological science. In this study, the genome-wide dynamical profile of gene expression was analyzed for MCF-7 breast cancer cells induced by two distinct ErbB receptor ligands: epidermal growth factor (EGF) and heregulin (HRG), which drive cell proliferation and differentiation, respectively. We focused our attention to elucidate how global genetic responses emerge and to decipher what is an underlying principle for dynamic self-control of genome-wide gene expression. The whole mRNA expression was classified into about a hundred groups according to the root mean square fluctuation (rmsf). These expression groups showed characteristic time-dependent correlations, indicating the existence of collective behaviors on the ensemble of genes with respect to mRNA expression and also to temporal changes in expression. All-or-none responses were observed for HRG and EGF (biphasic statistics) at around 10–20 min. The emergence of time-dependent collective behaviors of expression occurred through bifurcation of a coherent expression state (CES). In the ensemble of mRNA expression, the self-organized CESs reveals distinct characteristic expression domains for biphasic statistics, which exhibits notably the presence of criticality in the expression profile as a route for genomic transition. In time-dependent changes in the expression domains, the dynamics of CES reveals that the temporal development of the characteristic domains is characterized as autonomous bistable switch, which exhibits dynamic criticality (the temporal development of criticality) in the genome-wide coherent expression dynamics. It is expected that elucidation of the biophysical origin for such critical behavior sheds light on the underlying mechanism of the control of whole genome. PMID:24831017

  1. Novel Bioengineered Cassava Expressing an Archaeal Starch Degradation System and a Bacterial ADP-Glucose Pyrophosphorylase for Starch Self-Digestibility and Yield Increase

    PubMed Central

    Ligaba-Osena, Ayalew; Jones, Jenna; Donkor, Emmanuel; Chandrayan, Sanjeev; Pole, Farris; Wu, Chang-Hao; Vieille, Claire; Adams, Michael W. W.; Hankoua, Bertrand B.

    2018-01-01

    To address national and global low-carbon fuel targets, there is great interest in alternative plant species such as cassava (Manihot esculenta), which are high-yielding, resilient, and are easily converted to fuels using the existing technology. In this study the genes encoding hyperthermophilic archaeal starch-hydrolyzing enzymes, α-amylase and amylopullulanase from Pyrococcus furiosus and glucoamylase from Sulfolobus solfataricus, together with the gene encoding a modified ADP-glucose pyrophosphorylase (glgC) from Escherichia coli, were simultaneously expressed in cassava roots to enhance starch accumulation and its subsequent hydrolysis to sugar. A total of 13 multigene expressing transgenic lines were generated and characterized phenotypically and genotypically. Gene expression analysis using quantitative RT-PCR showed that the microbial genes are expressed in the transgenic roots. Multigene-expressing transgenic lines produced up to 60% more storage root yield than the non-transgenic control, likely due to glgC expression. Total protein extracted from the transgenic roots showed up to 10-fold higher starch-degrading activity in vitro than the protein extracted from the non-transgenic control. Interestingly, transgenic tubers released threefold more glucose than the non-transgenic control when incubated at 85°C for 21-h without exogenous application of thermostable enzymes, suggesting that the archaeal enzymes produced in planta maintain their activity and thermostability. PMID:29541080

  2. Novel Bioengineered Cassava Expressing an Archaeal Starch Degradation System and a Bacterial ADP-Glucose Pyrophosphorylase for Starch Self-Digestibility and Yield Increase.

    PubMed

    Ligaba-Osena, Ayalew; Jones, Jenna; Donkor, Emmanuel; Chandrayan, Sanjeev; Pole, Farris; Wu, Chang-Hao; Vieille, Claire; Adams, Michael W W; Hankoua, Bertrand B

    2018-01-01

    To address national and global low-carbon fuel targets, there is great interest in alternative plant species such as cassava ( Manihot esculenta ), which are high-yielding, resilient, and are easily converted to fuels using the existing technology. In this study the genes encoding hyperthermophilic archaeal starch-hydrolyzing enzymes, α-amylase and amylopullulanase from Pyrococcus furiosus and glucoamylase from Sulfolobus solfataricus , together with the gene encoding a modified ADP-glucose pyrophosphorylase ( glgC ) from Escherichia coli , were simultaneously expressed in cassava roots to enhance starch accumulation and its subsequent hydrolysis to sugar. A total of 13 multigene expressing transgenic lines were generated and characterized phenotypically and genotypically. Gene expression analysis using quantitative RT-PCR showed that the microbial genes are expressed in the transgenic roots. Multigene-expressing transgenic lines produced up to 60% more storage root yield than the non-transgenic control, likely due to glgC expression. Total protein extracted from the transgenic roots showed up to 10-fold higher starch-degrading activity in vitro than the protein extracted from the non-transgenic control. Interestingly, transgenic tubers released threefold more glucose than the non-transgenic control when incubated at 85°C for 21-h without exogenous application of thermostable enzymes, suggesting that the archaeal enzymes produced in planta maintain their activity and thermostability.

  3. Non-target Effects of Green Fluorescent Protein (GFP)-derived Double-Stranded RNA (dsRNA-GFP) Used in Honey Bee RNA Interference (RNAi) Assays

    PubMed Central

    Nunes, Francis M. F.; Aleixo, Aline C.; Barchuk, Angel R.; Bomtorin, Ana D.; Grozinger, Christina M.; Simões, Zilá L. P.

    2013-01-01

    RNA interference has been frequently applied to modulate gene function in organisms where the production and maintenance of mutants is challenging, as in our model of study, the honey bee, Apis mellifera. A green fluorescent protein (GFP)-derived double-stranded RNA (dsRNA-GFP) is currently commonly used as control in honey bee RNAi experiments, since its gene does not exist in the A. mellifera genome. Although dsRNA-GFP is not expected to trigger RNAi responses in treated bees, undesirable effects on gene expression, pigmentation or developmental timing are often observed. Here, we performed three independent experiments using microarrays to examine the effect of dsRNA-GFP treatment (introduced by feeding) on global gene expression patterns in developing worker bees. Our data revealed that the expression of nearly 1,400 genes was altered in response to dsRNA-GFP, representing around 10% of known honey bee genes. Expression changes appear to be the result of both direct off-target effects and indirect downstream secondary effects; indeed, there were several instances of sequence similarity between putative siRNAs generated from the dsRNA-GFP construct and genes whose expression levels were altered. In general, the affected genes are involved in important developmental and metabolic processes associated with RNA processing and transport, hormone metabolism, immunity, response to external stimulus and to stress. These results suggest that multiple dsRNA controls should be employed in RNAi studies in honey bees. Furthermore, any RNAi studies involving these genes affected by dsRNA-GFP in our studies should use a different dsRNA control. PMID:26466797

  4. A gene expression analysis of cell wall biosynthetic genes in Malus × domestica infected by ‘Candidatus Phytoplasma mali’

    PubMed Central

    Guerriero, Gea; Giorno, Filomena; Ciccotti, Anna Maria; Schmidt, Silvia; Baric, Sanja

    2016-01-01

    Apple proliferation (AP) represents a serious threat to several fruit-growing areas and is responsible for great economic losses. Several studies have highlighted the key role played by the cell wall in response to pathogen attack. The existence of a cell wall integrity signaling pathway which senses perturbations in the cell wall architecture upon abiotic/biotic stresses and activates specific defence responses has been widely demonstrated in plants. More recently a role played by cell wall-related genes has also been reported in plants infected by phytoplasmas. With the aim of shedding light on the cell wall response to AP disease in the economically relevant fruit-tree Malus × domestica Borkh., we investigated the expression of the cellulose (CesA) and callose synthase (CalS) genes in different organs (i.e., leaves, roots and branch phloem) of healthy and infected symptomatic outdoor-grown trees, sampled over the course of two time points (i.e., spring and autumn 2011), as well as in in vitro micropropagated control and infected plantlets. A strong up-regulation in the expression of cell wall biosynthetic genes was recorded in roots from infected trees. Secondary cell wall CesAs showed up-regulation in the phloem tissue from branches of infected plants, while either a down-regulation of some genes or no major changes were observed in the leaves. Micropropagated plantlets also showed an increase in cell wall-related genes and constitute a useful system for a general assessment of gene expression analysis upon phytoplasma infection. Finally, we also report the presence of several ‘knot’-like structures along the roots of infected apple trees and discuss the occurrence of this interesting phenotype in relation to the gene expression results and the modalities of phytoplasma diffusion. PMID:23086810

  5. Transcript Assembly and Quantification by RNA-Seq Reveals Differentially Expressed Genes between Soft-Endocarp and Hard-Endocarp Hawthorns

    PubMed Central

    Zhang, Feng; Liu, Zhongchi; Li, Xiaoming; Li, Wenran; Ma, Yue; Li, He; Liu, Yuexue; Zhang, Zhihong

    2013-01-01

    Hawthorn (Crataegus spp.) is an important pome with a long history as a fruit, an ornamental, and a source of medicine. Fruits of hawthorn are marked by hard stony endocarps, but a hawthorn germplasm with soft and thin endocarp was found in Liaoning province of China. To elucidate the molecular mechanism underlying the soft endocarp of hawthorn, we conducted a de novo assembly of the fruit transcriptome of Crataegus pinnatifida and compared gene expression profiles between the soft-endocarp and the hard-endocarp hawthorn varieties. De novo assembly yielded 52,673 putative unigenes, 20.4% of which are longer than 1,000 bp. Among the high-quality unique sequences, 35,979 (68.3%) had at least one significant match to an existing gene model. A total of 1,218 genes, represented 2.31% total putative unigenes, were differentially expressed between the soft-endocarp hawthorn and the hard-endocarp hawthorn. Among these differentially expressed genes, a number of lignin biosynthetic pathway genes were down-regulated while almost all the flavonoid biosynthetic pathway genes were strongly up-regulated, concomitant with the formation of soft endocarp. In addition, we have identified some MYB and NAC transcription factors that could potentially control lignin and flavonoid biosynthesis. The altered expression levels of the genes encoding lignin biosynthetic enzymes, MYB and NAC transcription factors were confirmed by quantitative RT-PCR. This is the first transcriptome analysis of Crataegus genus. The high quality ESTs generated in this study will aid future gene cloning from hawthorn. Our study provides important insights into the molecular mechanisms underlying soft endocarp formation in hawthorn. PMID:24039819

  6. Non-Target Effects of Green Fluorescent Protein (GFP)-Derived Double-Stranded RNA (dsRNA-GFP) Used in Honey Bee RNA Interference (RNAi) Assays.

    PubMed

    Nunes, Francis M F; Aleixo, Aline C; Barchuk, Angel R; Bomtorin, Ana D; Grozinger, Christina M; Simões, Zilá L P

    2013-01-04

    RNA interference has been frequently applied to modulate gene function in organisms where the production and maintenance of mutants is challenging, as in our model of study, the honey bee, Apis mellifera. A green fluorescent protein (GFP)-derived double-stranded RNA (dsRNA-GFP) is currently commonly used as control in honey bee RNAi experiments, since its gene does not exist in the A. mellifera genome. Although dsRNA-GFP is not expected to trigger RNAi responses in treated bees, undesirable effects on gene expression, pigmentation or developmental timing are often observed. Here, we performed three independent experiments using microarrays to examine the effect of dsRNA-GFP treatment (introduced by feeding) on global gene expression patterns in developing worker bees. Our data revealed that the expression of nearly 1,400 genes was altered in response to dsRNA-GFP, representing around 10% of known honey bee genes. Expression changes appear to be the result of both direct off-target effects and indirect downstream secondary effects; indeed, there were several instances of sequence similarity between putative siRNAs generated from the dsRNA-GFP construct and genes whose expression levels were altered. In general, the affected genes are involved in important developmental and metabolic processes associated with RNA processing and transport, hormone metabolism, immunity, response to external stimulus and to stress. These results suggest that multiple dsRNA controls should be employed in RNAi studies in honey bees. Furthermore, any RNAi studies involving these genes affected by dsRNA-GFP in our studies should use a different dsRNA control.

  7. CHRFAM7A: a human-specific α7-nicotinic acetylcholine receptor gene shows differential responsiveness of human intestinal epithelial cells to LPS

    PubMed Central

    Dang, Xitong; Eliceiri, Brian P.; Baird, Andrew; Costantini, Todd W.

    2015-01-01

    The human genome contains a unique, distinct, and human-specific α7-nicotinic acetylcholine receptor (α7nAChR) gene [CHRNA7 (gene-encoding α7-nicotinic acetylcholine receptor)] called CHRFAM7A (gene-encoding dup-α7-nicotinic acetylcholine receptor) on a locus of chromosome 15 associated with mental illness, including schizophrenia. Located 5′ upstream from the “wild-type” CHRNA7 gene that is found in other vertebrates, we demonstrate CHRFAM7A expression in a broad range of epithelial cells and sequenced the CHRFAM7A transcript found in normal human fetal small intestine epithelial (FHs) cells to prove its identity. We then compared its expression to CHRNA7 in 11 gut epithelial cell lines, showed that there is a differential response to LPS when compared to CHRNA7, and characterized the CHRFAM7A promoter. We report that both CHRFAM7A and CHRNA7 gene expression are widely distributed in human epithelial cell lines but that the levels of CHRFAM7A gene expression vary up to 5000-fold between different gut epithelial cells. A 3-hour treatment of epithelial cells with 100 ng/ml LPS increased CHRFAM7A gene expression by almost 1000-fold but had little effect on CHRNA7 gene expression. Mapping the regulatory elements responsible for CHRFAM7A gene expression identifies a 1 kb sequence in the UTR of the CHRFAM7A gene that is modulated by LPS. Taken together, these data establish the presence, identity, and differential regulation of the human-specific CHRFAM7A gene in human gut epithelial cells. In light of the fact that CHRFAM7A expression is reported to modulate ligand binding to, and alter the activity of, the wild-type α7nAChR ligand-gated pentameric ion channel, the findings point to the existence of a species-specific α7nAChR response that might regulate gut epithelial function in a human-specific fashion.—Dang, X., Eliceiri, B. P., Baird, A., Costantini, T. W. CHRFAM7A: a human-specific α7-nicotinic acetylcholine receptor gene shows differential responsiveness of human intestinal epithelial cells to LPS. PMID:25681457

  8. Gene delivery for cancer therapy.

    PubMed

    Zhang, Teng

    2014-01-01

    Gene therapy has potential in the treatment of human cancers. However, its clinical implication has only achieved little success due to the lack of an efficient gene delivery system. A major hurdle in the current available approaches is in the ability to transduce target tissues at very high efficiencies that ultimately lead to therapeutic levels of transgene expression. This review outlines the characteristics and utilities of several available gene delivery systems, including their advantages and drawbacks in the context of cancer treatment. A perspective of existing challenges and future directions is also included.

  9. Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases

    PubMed Central

    Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David

    2012-01-01

    Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10−10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10−4, 1.8×10−4, and 2.2×10−4 respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391

  10. Topographical mapping of α- and β-keratins on developing chicken skin integuments: Functional interaction and evolutionary perspectives

    PubMed Central

    Wu, Ping; Ng, Chen Siang; Yan, Jie; Lai, Yung-Chih; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Chen, Jiun-Jie; Luo, Weiqi; Widelitz, Randall B.; Li, Wen-Hsiung; Chuong, Cheng-Ming

    2015-01-01

    Avian integumentary organs include feathers, scales, claws, and beaks. They cover the body surface and play various functions to help adapt birds to diverse environments. These keratinized structures are mainly composed of corneous materials made of α-keratins, which exist in all vertebrates, and β-keratins, which only exist in birds and reptiles. Here, members of the keratin gene families were used to study how gene family evolution contributes to novelty and adaptation, focusing on tissue morphogenesis. Using chicken as a model, we applied RNA-seq and in situ hybridization to map α- and β-keratin genes in various skin appendages at embryonic developmental stages. The data demonstrate that temporal and spatial α- and β-keratin expression is involved in establishing the diversity of skin appendage phenotypes. Embryonic feathers express a higher proportion of β-keratin genes than other skin regions. In feather filament morphogenesis, β-keratins show intricate complexity in diverse substructures of feather branches. To explore functional interactions, we used a retrovirus transgenic system to ectopically express mutant α- or antisense β-keratin forms. α- and β-keratins show mutual dependence and mutations in either keratin type results in disrupted keratin networks and failure to form proper feather branches. Our data suggest that combinations of α- and β-keratin genes contribute to the morphological and structural diversity of different avian skin appendages, with feather-β-keratins conferring more possible composites in building intrafeather architecture complexity, setting up a platform of morphological evolution of functional forms in feathers. PMID:26598683

  11. Identification of diagnostic markers in colorectal cancer via integrative epigenomics and genomics data

    PubMed Central

    KOK-SIN, TEOW; MOKHTAR, NORFILZA MOHD; HASSAN, NUR ZARINA ALI; SAGAP, ISMAIL; ROSE, ISA MOHAMED; HARUN, ROSLAN; JAMAL, RAHMAN

    2015-01-01

    Apart from genetic mutations, epigenetic alteration is a common phenomenon that contributes to neoplastic transformation in colorectal cancer. Transcriptional silencing of tumor-suppressor genes without changes in the DNA sequence is explained by the existence of promoter hypermethylation. To test this hypothesis, we integrated the epigenome and transcriptome data from a similar set of colorectal tissue samples. Methylation profiling was performed using the Illumina InfiniumHumanMethylation27 BeadChip on 55 paired cancer and adjacent normal epithelial cells. Fifteen of the 55 paired tissues were used for gene expression profiling using the Affymetrix GeneChip Human Gene 1.0 ST array. Validation was carried out on 150 colorectal tissues using the methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA) technique. PCA and supervised hierarchical clustering in the two microarray datasets showed good separation between cancer and normal samples. Significant genes from the two analyses were obtained based on a ≥2-fold change and a false discovery rate (FDR) P-value of <0.05. We identified 1,081 differentially hypermethylated CpG sites and 36 hypomethylated CpG sites. We also found 709 upregulated and 699 downregulated genes from the gene expression profiling. A comparison of the two datasets revealed 32 overlapping genes with 27 being hypermethylated with downregulated expression and 4 hypermethylated with upregulated expression. One gene was found to be hypomethylated and downregulated. The most enriched molecular pathway identified was cell adhesion molecules that involved 4 overlapped genes, JAM2, NCAM1, ITGA8 and CNTN1. In the present study, we successfully identified a group of genes that showed methylation and gene expression changes in well-defined colorectal cancer tissues with high purity. The integrated analysis gives additional insight regarding the regulation of colorectal cancer-associated genes and their underlying mechanisms that contribute to colorectal carcinogenesis. PMID:25997610

  12. Nuclear envelope and genome interactions in cell fate

    PubMed Central

    Talamas, Jessica A.; Capelson, Maya

    2015-01-01

    The eukaryotic cell nucleus houses an organism’s genome and is the location within the cell where all signaling induced and development-driven gene expression programs are ultimately specified. The genome is enclosed and separated from the cytoplasm by the nuclear envelope (NE), a double-lipid membrane bilayer, which contains a large variety of trans-membrane and associated protein complexes. In recent years, research regarding multiple aspects of the cell nucleus points to a highly dynamic and coordinated concert of efforts between chromatin and the NE in regulation of gene expression. Details of how this concert is orchestrated and how it directs cell differentiation and disease are coming to light at a rapid pace. Here we review existing and emerging concepts of how interactions between the genome and the NE may contribute to tissue specific gene expression programs to determine cell fate. PMID:25852741

  13. Analysis of the Prefoldin Gene Family in 14 Plant Species

    PubMed Central

    Cao, Jun

    2016-01-01

    Prefoldin is a hexameric molecular chaperone complex present in all eukaryotes and archaea. The evolution of this gene family in plants is unknown. Here, I identified 140 prefoldin genes in 14 plant species. These prefoldin proteins were divided into nine groups through phylogenetic analysis. Highly conserved gene organization and motif distribution exist in each prefoldin group, implying their functional conservation. I also observed the segmental duplication of maize prefoldin gene family. Moreover, a few functional divergence sites were identified within each group pairs. Functional network analyses identified 78 co-expressed genes, and most of them were involved in carrying, binding and kinase activity. Divergent expression profiles of the maize prefoldin genes were further investigated in different tissues and development periods and under auxin and some abiotic stresses. I also found a few cis-elements responding to abiotic stress and phytohormone in the upstream sequences of the maize prefoldin genes. The results provided a foundation for exploring the characterization of the prefoldin genes in plants and will offer insights for additional functional studies. PMID:27014333

  14. Phytoremediation of chromium using Salix species: cloning ESTs and candidate genes involved in the Cr response.

    PubMed

    Quaggiotti, Silvia; Barcaccia, Gianni; Schiavon, Michela; Nicolé, Silvia; Galla, Giulio; Rossignolo, Virginia; Soattin, Marica; Malagoli, Mario

    2007-11-01

    In this research a differential display based on the detection of cDNA-AFLP markers was used to identify candidate genes potentially involved in the regulation of the response to chromium in four different willow species (Salix alba, Salix eleagnos, Salix fragilis and Salix matsudana) chosen on the basis of their suitability in phytoremediation techniques. Our approach enabled the assay of a large set of mRNA-related fragments and increased the reliability of amplification-based transcriptome analysis. The vast majority of transcript-derived fragments were shared among samples within species and thus attributable to constitutively expressed genes. However, a number of differentially expressed mRNAs were scored in each species and a total of 68 transcripts displaying an altered expression in response to Cr were isolated and sequenced. Public database querying revealed that 44.1% and 4.4% of the cloned ESTs score significant similarity with genes encoding proteins having known or putative function, or with genes coding for unknown proteins, respectively, whereas the remaining 51.5% did not retrieve any homology. Semi-quantitative RT-PCR analysis of seven candidate genes fully confirmed the expression patterns obtained by cDNA-AFLP. Our results indicate the existence of common mechanisms of gene regulation in response to Cr, pathogen attack and senescence-mediated programmed cell death, and suggest a role for the genes isolated in the cross-talk of the signaling pathways governing the adaptation to biotic and abiotic stresses.

  15. Identification of Differentially Expressed Thyroid Hormone Responsive Genes from the Brain of the Mexican Axolotl (Ambystoma mexicanum) ✧

    PubMed Central

    Huggins, P; Johnson, CK; Schoergendorfer, A; Putta, S; Bathke, AC; Stromberg, AJ; Voss, SR

    2011-01-01

    The Mexican axolotl (Ambystoma mexicanum) presents an excellent model to investigate mechanisms of brain development that are conserved among vertebrates. In particular, metamorphic changes of the brain can be induced in free-living aquatic juveniles and adults by simply adding thyroid hormone (T4) to rearing water. Whole brains were sampled from juvenile A. mexicanum that were exposed to 0, 8, and 18 days of 50 nM T4, and these were used to isolate RNA and make normalized cDNA libraries for 454 DNA sequencing. A total of 1,875,732 high quality cDNA reads were assembled with existing ESTs to obtain 5,884 new contigs for human RefSeq protein models, and to develop a custom Affymetrix gene expression array (Amby_002) with approximately 20,000 probe sets. The Amby_002 array was used to identify 303 transcripts that differed statistically (p < 0.05, fold change > 1.5) as a function of days of T4 treatment. Further statistical analyses showed that Amby_002 performed concordantly in comparison to an existing, small format expression array. This study introduces a new A. mexicanum microarray resource for the community and the first lists of T4-responsive genes from the brain of a salamander amphibian. PMID:21457787

  16. Identification of differentially expressed thyroid hormone responsive genes from the brain of the Mexican Axolotl (Ambystoma mexicanum).

    PubMed

    Huggins, P; Johnson, C K; Schoergendorfer, A; Putta, S; Bathke, A C; Stromberg, A J; Voss, S R

    2012-01-01

    The Mexican axolotl (Ambystoma mexicanum) presents an excellent model to investigate mechanisms of brain development that are conserved among vertebrates. In particular, metamorphic changes of the brain can be induced in free-living aquatic juveniles and adults by simply adding thyroid hormone (T4) to rearing water. Whole brains were sampled from juvenile A. mexicanum that were exposed to 0, 8, and 18 days of 50 nM T4, and these were used to isolate RNA and make normalized cDNA libraries for 454 DNA sequencing. A total of 1,875,732 high quality cDNA reads were assembled with existing ESTs to obtain 5884 new contigs for human RefSeq protein models, and to develop a custom Affymetrix gene expression array (Amby_002) with approximately 20,000 probe sets. The Amby_002 array was used to identify 303 transcripts that differed statistically (p<0.05, fold change >1.5) as a function of days of T4 treatment. Further statistical analyses showed that Amby_002 performed concordantly in comparison to an existing, small format expression array. This study introduces a new A. mexicanum microarray resource for the community and the first lists of T4-responsive genes from the brain of a salamander amphibian. Copyright © 2011 Elsevier Inc. All rights reserved.

  17. Integrating genome-wide association studies and gene expression data highlights dysregulated multiple sclerosis risk pathways.

    PubMed

    Liu, Guiyou; Zhang, Fang; Jiang, Yongshuai; Hu, Yang; Gong, Zhongying; Liu, Shoufeng; Chen, Xiuju; Jiang, Qinghua; Hao, Junwei

    2017-02-01

    Much effort has been expended on identifying the genetic determinants of multiple sclerosis (MS). Existing large-scale genome-wide association study (GWAS) datasets provide strong support for using pathway and network-based analysis methods to investigate the mechanisms underlying MS. However, no shared genetic pathways have been identified to date. We hypothesize that shared genetic pathways may indeed exist in different MS-GWAS datasets. Here, we report results from a three-stage analysis of GWAS and expression datasets. In stage 1, we conducted multiple pathway analyses of two MS-GWAS datasets. In stage 2, we performed a candidate pathway analysis of the large-scale MS-GWAS dataset. In stage 3, we performed a pathway analysis using the dysregulated MS gene list from seven human MS case-control expression datasets. In stage 1, we identified 15 shared pathways. In stage 2, we successfully replicated 14 of these 15 significant pathways. In stage 3, we found that dysregulated MS genes were significantly enriched in 10 of 15 MS risk pathways identified in stages 1 and 2. We report shared genetic pathways in different MS-GWAS datasets and highlight some new MS risk pathways. Our findings provide new insights on the genetic determinants of MS.

  18. Genome-Wide Analysis of SREBP1 Activity around the Clock Reveals Its Combined Dependency on Nutrient and Circadian Signals

    PubMed Central

    Naldi, Aurélien; Baruchet, Michaël; Canella, Donatella; Le Martelot, Gwendal; Guex, Nicolas; Desvergne, Béatrice; Delorenzi, Mauro; Deplancke, Bart; Desvergne, Béatrice; Guex, Nicolas; Herr, Winship; Naef, Felix; Rougemont, Jacques; Schibler, Ueli; Deplancke, Bart; Guex, Nicolas; Herr, Winship; Guex, Nicolas; Andersin, Teemu; Cousin, Pascal; Gilardi, Federica; Gos, Pascal; Martelot, Gwendal Le; Lammers, Fabienne; Canella, Donatella; Gilardi, Federica; Raghav, Sunil; Fabbretti, Roberto; Fortier, Arnaud; Long, Li; Vlegel, Volker; Xenarios, Ioannis; Migliavacca, Eugenia; Praz, Viviane; Guex, Nicolas; Naef, Felix; Rougemont, Jacques; David, Fabrice; Jarosz, Yohan; Kuznetsov, Dmitry; Liechti, Robin; Martin, Olivier; Delafontaine, Julien; Sinclair, Lucas; Cajan, Julia; Krier, Irina; Leleu, Marion; Migliavacca, Eugenia; Molina, Nacho; Naldi, Aurélien; Rey, Guillaume; Symul, Laura; Guex, Nicolas; Naef, Felix; Rougemont, Jacques; Bernasconi, David; Delorenzi, Mauro; Andersin, Teemu; Canella, Donatella; Gilardi, Federica; Martelot, Gwendal Le; Lammers, Fabienne; Baruchet, Michaël; Raghav, Sunil

    2014-01-01

    In mammals, the circadian clock allows them to anticipate and adapt physiology around the 24 hours. Conversely, metabolism and food consumption regulate the internal clock, pointing the existence of an intricate relationship between nutrient state and circadian homeostasis that is far from being understood. The Sterol Regulatory Element Binding Protein 1 (SREBP1) is a key regulator of lipid homeostasis. Hepatic SREBP1 function is influenced by the nutrient-response cycle, but also by the circadian machinery. To systematically understand how the interplay of circadian clock and nutrient-driven rhythm regulates SREBP1 activity, we evaluated the genome-wide binding of SREBP1 to its targets throughout the day in C57BL/6 mice. The recruitment of SREBP1 to the DNA showed a highly circadian behaviour, with a maximum during the fed status. However, the temporal expression of SREBP1 targets was not always synchronized with its binding pattern. In particular, different expression phases were observed for SREBP1 target genes depending on their function, suggesting the involvement of other transcription factors in their regulation. Binding sites for Hepatocyte Nuclear Factor 4 (HNF4) were specifically enriched in the close proximity of SREBP1 peaks of genes, whose expression was shifted by about 8 hours with respect to SREBP1 binding. Thus, the cross-talk between hepatic HNF4 and SREBP1 may underlie the expression timing of this subgroup of SREBP1 targets. Interestingly, the proper temporal expression profile of these genes was dramatically changed in Bmal1 −/− mice upon time-restricted feeding, for which a rhythmic, but slightly delayed, binding of SREBP1 was maintained. Collectively, our results show that besides the nutrient-driven regulation of SREBP1 nuclear translocation, a second layer of modulation of SREBP1 transcriptional activity, strongly dependent from the circadian clock, exists. This system allows us to fine tune the expression timing of SREBP1 target genes, thus helping to temporally separate the different physiological processes in which these genes are involved. PMID:24603613

  19. Cloning and functional expression of a gene encoding a P1 type nucleoside transporter from Trypanosoma brucei.

    PubMed

    Sanchez, M A; Ullman, B; Landfear, S M; Carter, N S

    1999-10-15

    Nucleoside transporters are likely to play a central role in the biochemistry of the parasite Trypanosoma brucei, since these protozoa are unable to synthesize purines de novo and must salvage them from their hosts. Furthermore, nucleoside transporters have been implicated in the uptake of antiparasitic and experimental drugs in these and other parasites. We have cloned the gene for a T. brucei nucleoside transporter, TbNT2, and shown that this permease is related in sequence to mammalian equilibrative nucleoside transporters. Expression of the TbNT2 gene in Xenopus oocytes reveals that the permease transports adenosine, inosine, and guanosine and hence has the substrate specificity of the P1 type nucleoside transporters that have been previously characterized by uptake assays in intact parasites. TbNT2 mRNA is expressed in bloodstream form (mammalian host stage) parasites but not in procyclic form (insect stage) parasites, indicating that the gene is developmentally regulated during the parasite life cycle. Genomic Southern blots suggest that there are multiple genes related in sequence to TbNT2, implying the existence of a family of nucleoside transporter genes in these parasites.

  20. A Novel ‘Gene Insertion/Marker Out’ (GIMO) Method for Transgene Expression and Gene Complementation in Rodent Malaria Parasites

    PubMed Central

    Sajid, Mohammed; Chevalley-Maurel, Séverine; Ramesar, Jai; Klop, Onny; Franke-Fayard, Blandine M. D.; Janse, Chris J.; Khan, Shahid M.

    2011-01-01

    Research on the biology of malaria parasites has greatly benefited from the application of reverse genetic technologies, in particular through the analysis of gene deletion mutants and studies on transgenic parasites that express heterologous or mutated proteins. However, transfection in Plasmodium is limited by the paucity of drug-selectable markers that hampers subsequent genetic modification of the same mutant. We report the development of a novel ‘gene insertion/marker out’ (GIMO) method for two rodent malaria parasites, which uses negative selection to rapidly generate transgenic mutants ready for subsequent modifications. We have created reference mother lines for both P. berghei ANKA and P. yoelii 17XNL that serve as recipient parasites for GIMO-transfection. Compared to existing protocols GIMO-transfection greatly simplifies and speeds up the generation of mutants expressing heterologous proteins, free of drug-resistance genes, and requires far fewer laboratory animals. In addition we demonstrate that GIMO-transfection is also a simple and fast method for genetic complementation of mutants with a gene deletion or mutation. The implementation of GIMO-transfection procedures should greatly enhance Plasmodium reverse-genetic research. PMID:22216235

  1. Tunable Control of an Escherichia coli Expression System for the Overproduction of Membrane Proteins by Titrated Expression of a Mutant lac Repressor.

    PubMed

    Kim, Seong Keun; Lee, Dae-Hee; Kim, Oh Cheol; Kim, Jihyun F; Yoon, Sung Ho

    2017-09-15

    Most inducible expression systems suffer from growth defects, leaky basal induction, and inhomogeneous expression levels within a host cell population. These difficulties are most prominent with the overproduction of membrane proteins that are toxic to host cells. Here, we developed an Escherichia coli inducible expression system for membrane protein production based on titrated expression of a mutant lac repressor (mLacI). Performance of the mLacI inducible system was evaluated in conjunction with commonly used lac operator-based expression vectors using a T7 or tac promoter. Remarkably, expression of a target gene can be titrated by the dose-dependent addition of l-rhamnose, and the expression levels were homogeneous in the cell population. The developed system was successfully applied to overexpress three membrane proteins that were otherwise difficult to produce in E. coli. This gene expression control system can be easily applied to a broad range of existing protein expression systems and should be useful in constructing genetic circuits that require precise output signals.

  2. Pre-Bilaterian Origins of the Hox Cluster and the Hox Code: Evidence from the Sea Anemone, Nematostella vectensis

    PubMed Central

    Ryan, Joseph F.; Mazza, Maureen E.; Pang, Kevin; Matus, David Q.; Baxevanis, Andreas D.; Martindale, Mark Q.; Finnerty, John R.

    2007-01-01

    Background Hox genes were critical to many morphological innovations of bilaterian animals. However, early Hox evolution remains obscure. Phylogenetic, developmental, and genomic analyses on the cnidarian sea anemone Nematostella vectensis challenge recent claims that the Hox code is a bilaterian invention and that no “true” Hox genes exist in the phylum Cnidaria. Methodology/Principal Findings Phylogenetic analyses of 18 Hox-related genes from Nematostella identify putative Hox1, Hox2, and Hox9+ genes. Statistical comparisons among competing hypotheses bolster these findings, including an explicit consideration of the gene losses implied by alternate topologies. In situ hybridization studies of 20 Hox-related genes reveal that multiple Hox genes are expressed in distinct regions along the primary body axis, supporting the existence of a pre-bilaterian Hox code. Additionally, several Hox genes are expressed in nested domains along the secondary body axis, suggesting a role in “dorsoventral” patterning. Conclusions/Significance A cluster of anterior and posterior Hox genes, as well as ParaHox cluster of genes evolved prior to the cnidarian-bilaterian split. There is evidence to suggest that these clusters were formed from a series of tandem gene duplication events and played a role in patterning both the primary and secondary body axes in a bilaterally symmetrical common ancestor. Cnidarians and bilaterians shared a common ancestor some 570 to 700 million years ago, and as such, are derived from a common body plan. Our work reveals several conserved genetic components that are found in both of these diverse lineages. This finding is consistent with the hypothesis that a set of developmental rules established in the common ancestor of cnidarians and bilaterians is still at work today. PMID:17252055

  3. Structural, evolutionary and genetic analysis of the histidine biosynthetic "core" in the genus Burkholderia.

    PubMed

    Papaleo, Maria Cristiana; Russo, Edda; Fondi, Marco; Emiliani, Giovanni; Frandi, Antonio; Brilli, Matteo; Pastorelli, Roberta; Fani, Renato

    2009-12-01

    In this work a detailed analysis of the structure, the expression and the organization of his genes belonging to the core of histidine biosynthesis (hisBHAF) in 40 newly determined and 13 available sequences of Burkholderia strains was carried out. Data obtained revealed a strong conservation of the structure and organization of these genes through the entire genus. The phylogenetic analysis showed the monophyletic origin of this gene cluster and indicated that it did not undergo horizontal gene transfer events. The analysis of the intergenic regions, based on the substitution rate, entropy plot and bendability suggested the existence of a putative transcription promoter upstream of hisB, that was supported by the genetic analysis that showed that this cluster was able to complement Escherichia colihisA, hisB, and hisF mutations. Moreover, a preliminary transcriptional analysis and the analysis of microarray data revealed that the expression of the his core was constitutive. These findings are in agreement with the fact that the entire Burkholderiahis operon is heterogeneous, in that it contains "alien" genes apparently not involved in histidine biosynthesis. Besides, they also support the idea that the proteobacterial his operon was piece-wisely assembled, i.e. through accretion of smaller units containing only some of the genes (eventually together with their own promoters) involved in this biosynthetic route. The correlation existing between the structure, organization and regulation of his "core" genes and the function(s) they perform in cellular metabolism is discussed.

  4. THEMIS and PTPRK in celiac intestinal mucosa: coexpression in disease and after in vitro gliadin challenge

    PubMed Central

    Bondar, Constanza; Plaza-Izurieta, Leticia; Fernandez-Jimenez, Nora; Irastorza, Iñaki; Withoff, Sebo; Wijmenga, Cisca; Chirdo, Fernando; Bilbao, Jose Ramon

    2014-01-01

    Celiac disease (CD) is an immune mediated, polygenic disorder, where HLA-DQ2/DQ8 alleles contribute around 35% to genetic risk, but several other genes are also involved. Genome-wide association studies (GWASs) and the more recent immunochip genotyping projects have fine-mapped 39 regions of genetic susceptibility to the disease, most of which harbor candidate genes that could participate in this disease process. We focused our attention to the GWAS peak on chr6: 127.99–128.38 Mb, a region including two genes, thymocyte-expressed molecule involved in selection (THEMIS) and protein tyrosine phosphatase, receptor type, kappa (PTPRK), both of which have immune-related functions. The aim of this work was to evaluate the expression levels of these two genes in duodenal mucosa of active and treated CD patients and in controls, and to determine whether SNPs (rs802734, rs55743914, rs72975916, rs10484718 and rs9491896) associated with CD have any influence on gene expression. THEMIS showed higher expression in active CD compared with treated patients and controls, whereas PTPRK showed lower expression. Our study confirmed the association of this region with CD in our population, but only the genotype of rs802734 showed some influence in the expression of THEMIS. On the other hand, we found a significant positive correlation between THEMIS and PTPRK mRNA levels in CD patients but not in controls. Our results suggest a possible role for both candidate genes in CD pathogenesis and the existence of complex, regulatory relationships that reside in the vast non-coding, functional intergenic regions of the genome. Further investigation is needed to clarify the impact of the disease-associated SNPs on gene function. PMID:23820479

  5. Molecular characterization and expression analysis of ubiquitin-activating enzyme E1 gene in Citrus reticulata.

    PubMed

    Miao, Hong-Xia; Qin, Yong-Hua; Ye, Zi-Xing; Hu, Gui-Bing

    2013-01-25

    Ubiquitin-activating enzyme E1 (UBE1) catalyzes the first step in the ubiquitination reaction, which targets a protein for degradation via a proteasome pathway. UBE1 plays an important role in metabolic processes. In this study, full-length cDNA and DNA sequences of UBE1 gene, designated CrUBE1, were obtained from 'Wuzishatangju' (self-incompatible, SI) and 'Shatangju' (self-compatible, SC) mandarins. 5 amino acids and 8 bases were different in cDNA and DNA sequences of CrUBE1 between 'Wuzishatangju' and 'Shatangju', respectively. Southern blot analysis showed that there existed only one copy of the CrUBE1 gene in genome of 'Wuzishatangju' and 'Shatangju'. The temporal and spatial expression characteristics of the CrUBE1 gene were investigated using semi-quantitative RT-PCR (SqPCR) and quantitative real-time PCR (qPCR). The expression level of the CrUBE1 gene in anthers of 'Shatangju' was approximately 10-fold higher than in anthers of 'Wuzishatangju'. The highest expression level of CrUBE1 was detected in pistils at 7days after self-pollination of 'Wuzishatangju', which was approximately 5-fold higher than at 0 h. To obtain CrUBE1 protein, the full-length cDNA of CrUBE1 genes from 'Wuzishatangju' and 'Shatangju' were successfully expressed in Pichia pastoris. Pollen germination frequency of 'Wuzishatangju' was significantly inhibited with increasing of CrUBE1 protein concentrations from 'Wuzishatangju'. Copyright © 2012 Elsevier B.V. All rights reserved.

  6. contamDE: differential expression analysis of RNA-seq data for contaminated tumor samples.

    PubMed

    Shen, Qi; Hu, Jiyuan; Jiang, Ning; Hu, Xiaohua; Luo, Zewei; Zhang, Hong

    2016-03-01

    Accurate detection of differentially expressed genes between tumor and normal samples is a primary approach of cancer-related biomarker identification. Due to the infiltration of tumor surrounding normal cells, the expression data derived from tumor samples would always be contaminated with normal cells. Ignoring such cellular contamination would deflate the power of detecting DE genes and further confound the biological interpretation of the analysis results. For the time being, there does not exists any differential expression analysis approach for RNA-seq data in literature that can properly account for the contamination of tumor samples. Without appealing to any extra information, we develop a new method 'contamDE' based on a novel statistical model that associates RNA-seq expression levels with cell types. It is demonstrated through simulation studies that contamDE could be much more powerful than the existing methods that ignore the contamination. In the application to two cancer studies, contamDE uniquely found several potential therapy and prognostic biomarkers of prostate cancer and non-small cell lung cancer. An R package contamDE is freely available at http://homepage.fudan.edu.cn/zhangh/softwares/ zhanghfd@fudan.edu.cn Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. Presence and Functionality of Mating Type Genes in the Supposedly Asexual Filamentous Fungus Aspergillus oryzae

    PubMed Central

    Wada, Ryuta; Maruyama, Jun-ichi; Yamaguchi, Haruka; Yamamoto, Nanase; Wagu, Yutaka; Paoletti, Mathieu; Archer, David B.; Dyer, Paul S.

    2012-01-01

    The potential for sexual reproduction in Aspergillus oryzae was assessed by investigating the presence and functionality of MAT genes. Previous genome studies had identified a MAT1-1 gene in the reference strain RIB40. We now report the existence of a complementary MAT1-2 gene and the sequencing of an idiomorphic region from A. oryzae strain AO6. This allowed the development of a PCR diagnostic assay, which detected isolates of the MAT1-1 and MAT1-2 genotypes among 180 strains assayed, including industrial tane-koji isolates. Strains used for sake and miso production showed a near-1:1 ratio of the MAT1-1 and MAT1-2 mating types, whereas strains used for soy sauce production showed a significant bias toward the MAT1-2 mating type. MAT1-1 and MAT1-2 isogenic strains were then created by genetic manipulation of the resident idiomorph, and gene expression was compared by DNA microarray and quantitative real-time PCR (qRT-PCR) methodologies under conditions in which MAT genes were expressed. Thirty-three genes were found to be upregulated more than 10-fold in either the MAT1-1 host strain or the MAT1-2 gene replacement strain relative to each other, showing that both the MAT1-1 and MAT1-2 genes functionally regulate gene expression in A. oryzae in a mating type-dependent manner, the first such report for a supposedly asexual fungus. MAT1-1 expression specifically upregulated an α-pheromone precursor gene, but the functions of most of the genes affected were unknown. The results are consistent with a heterothallic breeding system in A. oryzae, and prospects for the discovery of a sexual cycle are discussed. PMID:22327593

  8. AP1 Keeps Chromatin Poised for Action | Center for Cancer Research

    Cancer.gov

    The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins called chromatin that compacts the DNA in the nucleus, strongly restricting access to DNA sequences. As a result, regulatory factors only interact with a small subset of their potential binding elements in a given cell to regulate genes. How factors recognize and select sites in chromatin across the genome is not well understood -- but several discoveries in CCR’s Laboratory of Receptor Biology and Gene Expression (LRBGE) have shed light on the mechanisms that direct factors to DNA.

  9. The oxytocin receptor gene (OXTR) localizes to human chromosome 3p25 by fluorescence in situ hybridization and PCR analysis of somatic cell hybrids

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Simmons, C.F. Jr.; Clancy, T.E.; Quan, R.

    1995-04-10

    The human oxytocin receptor regulates parturition and myometrial contractility, breast milk let-down, and reproductive behaviors in the mammalian central nervous system. Kimura et al. recently identified a human oxytocin receptor cDNA by means of expression cloning from a human myometrial cDNA library. To elucidate further the molecular mechanisms that regulate oxytocin receptor gene expression and to define the expected Mendelian inheritance of possible human disease states, we must determine the number of genes, their localization, and their organization and structure. We summarize below our data indicating that the human oxytocin receptor gene is localized to 3p25 and exists as amore » single copy in the haploid genome. 9 refs., 2 figs.« less

  10. Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi

    We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less

  11. Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

    DOE PAGES

    Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi; ...

    2014-08-19

    We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less

  12. Circadian Clock Gene Expression in the Coral Favia fragum over Diel and Lunar Reproductive Cycles

    PubMed Central

    Hoadley, Kenneth D.; Szmant, Alina M.; Pyott, Sonja J.

    2011-01-01

    Natural light cycles synchronize behavioral and physiological cycles over varying time periods in both plants and animals. Many scleractinian corals exhibit diel cycles of polyp expansion and contraction entrained by diel sunlight patterns, and monthly cycles of spawning or planulation that correspond to lunar moonlight cycles. The molecular mechanisms for regulating such cycles are poorly understood. In this study, we identified four molecular clock genes (cry1, cry2, clock and cycle) in the scleractinian coral, Favia fragum, and investigated patterns of gene expression hypothesized to be involved in the corals' diel polyp behavior and lunar reproductive cycles. Using quantitative PCR, we measured fluctuations in expression of these clock genes over both diel and monthly spawning timeframes. Additionally, we assayed gene expression and polyp expansion-contraction behavior in experimental corals in normal light:dark (control) or constant dark treatments. Well-defined and reproducible diel patterns in cry1, cry2, and clock expression were observed in both field-collected and the experimental colonies maintained under control light:dark conditions, but no pattern was observed for cycle. Colonies in the control light:dark treatment also displayed diel rhythms of tentacle expansion and contraction. Experimental colonies in the constant dark treatment lost diel patterns in cry1, cry2, and clock expression and displayed a diminished and less synchronous pattern of tentacle expansion and contraction. We observed no pattern in cry1, cry2, clock, or cycle expression correlated with monthly spawning events suggesting these genes are not involved in the entrainment of reproductive cycles to lunar light cycles in F. fragum. Our results suggest a molecular clock mechanism, potentially similar to that in described in fruit flies, exists within F. fragum. PMID:21573070

  13. A quantitative validated model reveals two phases of transcriptional regulation for the gap gene giant in Drosophila.

    PubMed

    Hoermann, Astrid; Cicin-Sain, Damjan; Jaeger, Johannes

    2016-03-15

    Understanding eukaryotic transcriptional regulation and its role in development and pattern formation is one of the big challenges in biology today. Most attempts at tackling this problem either focus on the molecular details of transcription factor binding, or aim at genome-wide prediction of expression patterns from sequence through bioinformatics and mathematical modelling. Here we bridge the gap between these two complementary approaches by providing an integrative model of cis-regulatory elements governing the expression of the gap gene giant (gt) in the blastoderm embryo of Drosophila melanogaster. We use a reverse-engineering method, where mathematical models are fit to quantitative spatio-temporal reporter gene expression data to infer the regulatory mechanisms underlying gt expression in its anterior and posterior domains. These models are validated through prediction of gene expression in mutant backgrounds. A detailed analysis of our data and models reveals that gt is regulated by domain-specific CREs at early stages, while a late element drives expression in both the anterior and the posterior domains. Initial gt expression depends exclusively on inputs from maternal factors. Later, gap gene cross-repression and gt auto-activation become increasingly important. We show that auto-regulation creates a positive feedback, which mediates the transition from early to late stages of regulation. We confirm the existence and role of gt auto-activation through targeted mutagenesis of Gt transcription factor binding sites. In summary, our analysis provides a comprehensive picture of spatio-temporal gene regulation by different interacting enhancer elements for an important developmental regulator. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  14. Fyn-Dependent Gene Networks in Acute Ethanol Sensitivity

    PubMed Central

    Farris, Sean P.; Miles, Michael F.

    2013-01-01

    Studies in humans and animal models document that acute behavioral responses to ethanol are predisposing factor for the risk of long-term drinking behavior. Prior microarray data from our laboratory document strain- and brain region-specific variation in gene expression profile responses to acute ethanol that may be underlying regulators of ethanol behavioral phenotypes. The non-receptor tyrosine kinase Fyn has previously been mechanistically implicated in the sedative-hypnotic response to acute ethanol. To further understand how Fyn may modulate ethanol behaviors, we used whole-genome expression profiling. We characterized basal and acute ethanol-evoked (3 g/kg) gene expression patterns in nucleus accumbens (NAC), prefrontal cortex (PFC), and ventral midbrain (VMB) of control and Fyn knockout mice. Bioinformatics analysis identified a set of Fyn-related gene networks differently regulated by acute ethanol across the three brain regions. In particular, our analysis suggested a coordinate basal decrease in myelin-associated gene expression within NAC and PFC as an underlying factor in sensitivity of Fyn null animals to ethanol sedation. An in silico analysis across the BXD recombinant inbred (RI) strains of mice identified a significant correlation between Fyn expression and a previously published ethanol loss-of-righting-reflex (LORR) phenotype. By combining PFC gene expression correlates to Fyn and LORR across multiple genomic datasets, we identified robust Fyn-centric gene networks related to LORR. Our results thus suggest that multiple system-wide changes exist within specific brain regions of Fyn knockout mice, and that distinct Fyn-dependent expression networks within PFC may be important determinates of the LORR due to acute ethanol. These results add to the interpretation of acute ethanol behavioral sensitivity in Fyn kinase null animals, and identify Fyn-centric gene networks influencing variance in ethanol LORR. Such networks may also inform future design of pharmacotherapies for the treatment and prevention of alcohol use disorders. PMID:24312422

  15. Yin yang 1 and adipogenic gene network expression in longissimus muscle of beef cattle in response to nutritional management.

    PubMed

    Moisá, Sonia J; Shike, Daniel W; Meteer, William T; Keisler, Duane; Faulkner, Dan B; Loor, Juan J

    2013-01-01

    Among 36 differentially-expressed genes during growth in longissimus muscle (LM) of Angus steers, Yin Yang 1 (YY1) had the most relationships with other genes including some associated with adipocyte differentiation. The objective of this study was to examine the effect of nutritional management on mRNA expression of YY1 along with its targets genes PPARG, GTF2B, KAT2B, IGFBP5 and STAT5B. Longissimus from Angus and Angus × Simmental steers (7 total/treatment) on early weaning plus high-starch (EWS), normal weaning plus starch creep feeding (NWS), or normal weaning without starch creep feeding (NWN) was biopsied at 0, 96, and 240 days on treatments. Results suggest that YY1 does not exert control of adipogenesis in LM, and its expression is not sensitive to weaning age. Among the YY1-related genes, EWS led to greater IGFBP5 during growing and finishing phases. Pro-adipogenic transcriptional regulation was detected in EWS due to greater PPARG and VDR at 96 and 240 d vs. 0 d. GTF2B and KAT2B expression was lower in response to NWS and EWS than NWN, and was most pronounced at 240 d. The increase in PPARG and GTF2B expression between 96 and 240 d underscored the existence of a molecular programming mechanism that was sensitive to age and dietary starch. Such response partly explains the greater carcass fat deposition observed in response to NWS.

  16. The Evolution of Human Cells in Terms of Protein Innovation

    PubMed Central

    Sardar, Adam J.; Oates, Matt E.; Fang, Hai; Forrest, Alistair R.R.; Kawaji, Hideya; Gough, Julian; Rackham, Owen J.L.

    2014-01-01

    Humans are composed of hundreds of cell types. As the genomic DNA of each somatic cell is identical, cell type is determined by what is expressed and when. Until recently, little has been reported about the determinants of human cell identity, particularly from the joint perspective of gene evolution and expression. Here, we chart the evolutionary past of all documented human cell types via the collective histories of proteins, the principal product of gene expression. FANTOM5 data provide cell-type–specific digital expression of human protein-coding genes and the SUPERFAMILY resource is used to provide protein domain annotation. The evolutionary epoch in which each protein was created is inferred by comparison with domain annotation of all other completely sequenced genomes. Studying the distribution across epochs of genes expressed in each cell type reveals insights into human cellular evolution in terms of protein innovation. For each cell type, its history of protein innovation is charted based on the genes it expresses. Combining the histories of all cell types enables us to create a timeline of cell evolution. This timeline identifies the possibility that our common ancestor Coelomata (cavity-forming animals) provided the innovation required for the innate immune system, whereas cells which now form the brain of human have followed a trajectory of continually accumulating novel proteins since Opisthokonta (boundary of animals and fungi). We conclude that exaptation of existing domain architectures into new contexts is the dominant source of cell-type–specific domain architectures. PMID:24692656

  17. Differential gene expression responses distinguish contact and respiratory sensitizers and nonsensitizing irritants in the local lymph node assay.

    PubMed

    Adenuga, David; Woolhiser, Michael R; Gollapudi, B Bhaskar; Boverhof, Darrell R

    2012-04-01

    Genomic approaches have the potential to enhance the specificity and predictive accuracy of existing toxicology endpoints, including those for chemical sensitization. The present study was conducted to determine whether gene expression responses can distinguish contact sensitizers (1-chloro-2,4-dinitrobenzene [DNCB] and hexyl cinnamic aldehyde [HCA]), respiratory sensitizers (ortho-phthalaldehyde and trimellitic anhydride [TMA]), and nonsensitizing irritants (methyl salicylate [MS] and nonanoic acid [NA]) in the local lymph node assay (LLNA). Female Balb/c mice received doses of each chemical as per the standard LLNA dosing regimen on days 1, 2, and 3. Auricular lymph nodes were analyzed for tritiated thymidine ((3)HTdR) incorporation on day 6 and for gene expression responses on days 6 and 10. All chemicals induced dose-dependent increases in stimulation index, which correlated strongly with the number of differentially expressed genes. A majority of genes modulated by the irritants were similarly altered by the sensitizers, consistent with the irritating effects of the sensitizers. However, a select number of responses involved with immune-specific functions, such as dendritic cell activation, were unique to the sensitizers and may offer the ability to distinguish sensitizers from irritants. Genes for the mast cell proteases 1 and 8, Lgals7, Tim2, Aicda, Il4, and Akr1c18 were more strongly regulated by respiratory sensitizers compared with contact sensitizers and may represent potential biomarkers for discriminating between contact and respiratory sensitizers. Collectively, these data suggest that gene expression responses may serve as useful biomarkers to distinguish between respiratory and contact sensitizers and nonsensitizing irritants in the LLNA.

  18. Expression of three topologically distinct membrane proteins elicits unique stress response pathways in the yeast Saccharomyces cerevisiae.

    PubMed

    Buck, Teresa M; Jordan, Rick; Lyons-Weiler, James; Adelman, Joshua L; Needham, Patrick G; Kleyman, Thomas R; Brodsky, Jeffrey L

    2015-06-01

    Misfolded membrane proteins are retained in the endoplasmic reticulum (ER) and are subject to ER-associated degradation, which clears the secretory pathway of potentially toxic species. While the transcriptional response to environmental stressors has been extensively studied, limited data exist describing the cellular response to misfolded membrane proteins. To this end, we expressed and then compared the transcriptional profiles elicited by the synthesis of three ER retained, misfolded ion channels: The α-subunit of the epithelial sodium channel, ENaC, the cystic fibrosis transmembrane conductance regulator, CFTR, and an inwardly rectifying potassium channel, Kir2.1, which vary in their mass, membrane topologies, and quaternary structures. To examine transcriptional profiles in a null background, the proteins were expressed in yeast, which was previously used to examine the degradation requirements for each substrate. Surprisingly, the proteins failed to induce a canonical unfolded protein response or heat shock response, although messages encoding several cytosolic and ER lumenal protein folding factors rose when αENaC or CFTR was expressed. In contrast, the levels of these genes were unaltered by Kir2.1 expression; instead, the yeast iron regulon was activated. Nevertheless, a significant number of genes that respond to various environmental stressors were upregulated by all three substrates, and compared with previous microarray data we deduced the existence of a group of genes that reflect a novel misfolded membrane protein response. These data indicate that aberrant proteins in the ER elicit profound yet unique cellular responses. Copyright © 2015 the American Physiological Society.

  19. A computational method for drug repositioning using publicly available gene expression data.

    PubMed

    Shabana, K M; Abdul Nazeer, K A; Pradhan, Meeta; Palakal, Mathew

    2015-01-01

    The identification of new therapeutic uses of existing drugs, or drug repositioning, offers the possibility of faster drug development, reduced risk, lesser cost and shorter paths to approval. The advent of high throughput microarray technology has enabled comprehensive monitoring of transcriptional response associated with various disease states and drug treatments. This data can be used to characterize disease and drug effects and thereby give a measure of the association between a given drug and a disease. Several computational methods have been proposed in the literature that make use of publicly available transcriptional data to reposition drugs against diseases. In this work, we carry out a data mining process using publicly available gene expression data sets associated with a few diseases and drugs, to identify the existing drugs that can be used to treat genes causing lung cancer and breast cancer. Three strong candidates for repurposing have been identified- Letrozole and GDC-0941 against lung cancer, and Ribavirin against breast cancer. Letrozole and GDC-0941 are drugs currently used in breast cancer treatment and Ribavirin is used in the treatment of Hepatitis C.

  20. A statistical approach to identify, monitor, and manage incomplete curated data sets.

    PubMed

    Howe, Douglas G

    2018-04-02

    Many biological knowledge bases gather data through expert curation of published literature. High data volume, selective partial curation, delays in access, and publication of data prior to the ability to curate it can result in incomplete curation of published data. Knowing which data sets are incomplete and how incomplete they are remains a challenge. Awareness that a data set may be incomplete is important for proper interpretation, to avoiding flawed hypothesis generation, and can justify further exploration of published literature for additional relevant data. Computational methods to assess data set completeness are needed. One such method is presented here. In this work, a multivariate linear regression model was used to identify genes in the Zebrafish Information Network (ZFIN) Database having incomplete curated gene expression data sets. Starting with 36,655 gene records from ZFIN, data aggregation, cleansing, and filtering reduced the set to 9870 gene records suitable for training and testing the model to predict the number of expression experiments per gene. Feature engineering and selection identified the following predictive variables: the number of journal publications; the number of journal publications already attributed for gene expression annotation; the percent of journal publications already attributed for expression data; the gene symbol; and the number of transgenic constructs associated with each gene. Twenty-five percent of the gene records (2483 genes) were used to train the model. The remaining 7387 genes were used to test the model. One hundred and twenty-two and 165 of the 7387 tested genes were identified as missing expression annotations based on their residuals being outside the model lower or upper 95% confidence interval respectively. The model had precision of 0.97 and recall of 0.71 at the negative 95% confidence interval and precision of 0.76 and recall of 0.73 at the positive 95% confidence interval. This method can be used to identify data sets that are incompletely curated, as demonstrated using the gene expression data set from ZFIN. This information can help both database resources and data consumers gauge when it may be useful to look further for published data to augment the existing expertly curated information.

  1. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs

    PubMed Central

    Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv

    2010-01-01

    RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462

  2. A computational approach to identify cellular heterogeneity and tissue-specific gene regulatory networks.

    PubMed

    Jambusaria, Ankit; Klomp, Jeff; Hong, Zhigang; Rafii, Shahin; Dai, Yang; Malik, Asrar B; Rehman, Jalees

    2018-06-07

    The heterogeneity of cells across tissue types represents a major challenge for studying biological mechanisms as well as for therapeutic targeting of distinct tissues. Computational prediction of tissue-specific gene regulatory networks may provide important insights into the mechanisms underlying the cellular heterogeneity of cells in distinct organs and tissues. Using three pathway analysis techniques, gene set enrichment analysis (GSEA), parametric analysis of gene set enrichment (PGSEA), alongside our novel model (HeteroPath), which assesses heterogeneously upregulated and downregulated genes within the context of pathways, we generated distinct tissue-specific gene regulatory networks. We analyzed gene expression data derived from freshly isolated heart, brain, and lung endothelial cells and populations of neurons in the hippocampus, cingulate cortex, and amygdala. In both datasets, we found that HeteroPath segregated the distinct cellular populations by identifying regulatory pathways that were not identified by GSEA or PGSEA. Using simulated datasets, HeteroPath demonstrated robustness that was comparable to what was seen using existing gene set enrichment methods. Furthermore, we generated tissue-specific gene regulatory networks involved in vascular heterogeneity and neuronal heterogeneity by performing motif enrichment of the heterogeneous genes identified by HeteroPath and linking the enriched motifs to regulatory transcription factors in the ENCODE database. HeteroPath assesses contextual bidirectional gene expression within pathways and thus allows for transcriptomic assessment of cellular heterogeneity. Unraveling tissue-specific heterogeneity of gene expression can lead to a better understanding of the molecular underpinnings of tissue-specific phenotypes.

  3. Emergent Self-Organized Criticality in Gene Expression Dynamics: Temporal Development of Global Phase Transition Revealed in a Cancer Cell Line

    PubMed Central

    Tsuchiya, Masa; Giuliani, Alessandro; Hashimoto, Midori; Erenpreisa, Jekaterina; Yoshikawa, Kenichi

    2015-01-01

    Background The underlying mechanism of dynamic control of the genome-wide expression is a fundamental issue in bioscience. We addressed it in terms of phase transition by a systemic approach based on both density analysis and characteristics of temporal fluctuation for the time-course mRNA expression in differentiating MCF-7 breast cancer cells. Methodology In a recent work, we suggested criticality as an essential aspect of dynamic control of genome-wide gene expression. Criticality was evident by a unimodal-bimodal transition through flattened unimodal expression profile. The flatness on the transition suggests the existence of a critical transition at which up- and down-regulated expression is balanced. Mean field (averaging) behavior of mRNAs based on the temporal expression changes reveals a sandpile type of transition in the flattened profile. Furthermore, around the transition, a self-similar unimodal-bimodal transition of the whole expression occurs in the density profile of an ensemble of mRNA expression. These singular and scaling behaviors identify the transition as the expression phase transition driven by self-organized criticality (SOC). Principal Findings Emergent properties of SOC through a mean field approach are revealed: i) SOC, as a form of genomic phase transition, consolidates distinct critical states of expression, ii) Coupling of coherent stochastic oscillations between critical states on different time-scales gives rise to SOC, and iii) Specific gene clusters (barcode genes) ranging in size from kbp to Mbp reveal similar SOC to genome-wide mRNA expression and ON-OFF synchronization to critical states. This suggests that the cooperative gene regulation of topological genome sub-units is mediated by the coherent phase transitions of megadomain-scaled conformations between compact and swollen chromatin states. Conclusion and Significance In summary, our study provides not only a systemic method to demonstrate SOC in whole-genome expression, but also introduces novel, physically grounded concepts for a breakthrough in the study of biological regulation. PMID:26067993

  4. Conceptualizing adverse outcome pathways for ...

    EPA Pesticide Factsheets

    Cyclooxygenase (COX) inhibition is of concern in fish because COX inhibitors (e.g., ibuprofen) are ubiquitous in aquatic systems/fish tissues, and can disrupt synthesis of prostaglandins that modulate a variety of essential biological functions (e.g., reproduction). This study utilized newly generated high content (transcriptomic and metabolomic) empirical data in combination with existing high throughput (ACTOR, epa.gov) toxicity data to facilitate development of adverse outcome pathways (AOPs) for molecular initiating event (MIE) of COX inhibition. We examined effects of a waterborne, 96h exposure to three COX inhibitors (indomethacin (IN; 100 µg/L), ibuprofen (IB; 200 µg/L) and celecoxib (CX; 20 µg/L) on the liver metabolome and ovarian gene expression (using oligonucleotide microarray 4 x15K platform) in sexually mature fathead minnows (n=8). Differentially expressed genes were identified (t-test, p < 0.01), and functional analyses performed to determine enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (p < 0.05). Principal component analysis indicated that liver metabolomics profiles of IN, IB and CX were not significantly different from control or one another. When compared to control, exposure to IB and CX resulted in differential expression of comparable numbers of genes (IB = 433, CX= 545). In contrast, 2558 genes were differentially expressed in IN-treated fish. KEGG pathway analyses show that IN had extensive effects on oocyte meios

  5. Spatial gradients of protein-level time delays set the pace of the traveling segmentation clock waves

    PubMed Central

    Ay, Ahmet; Holland, Jack; Sperlea, Adriana; Devakanmalai, Gnanapackiam Sheela; Knierer, Stephan; Sangervasi, Sebastian; Stevenson, Angel; Özbudak, Ertuğrul M.

    2014-01-01

    The vertebrate segmentation clock is a gene expression oscillator controlling rhythmic segmentation of the vertebral column during embryonic development. The period of oscillations becomes longer as cells are displaced along the posterior to anterior axis, which results in traveling waves of clock gene expression sweeping in the unsegmented tissue. Although various hypotheses necessitating the inclusion of additional regulatory genes into the core clock network at different spatial locations have been proposed, the mechanism underlying traveling waves has remained elusive. Here, we combined molecular-level computational modeling and quantitative experimentation to solve this puzzle. Our model predicts the existence of an increasing gradient of gene expression time delays along the posterior to anterior direction to recapitulate spatiotemporal profiles of the traveling segmentation clock waves in different genetic backgrounds in zebrafish. We validated this prediction by measuring an increased time delay of oscillatory Her1 protein production along the unsegmented tissue. Our results refuted the need for spatial expansion of the core feedback loop to explain the occurrence of traveling waves. Spatial regulation of gene expression time delays is a novel way of creating dynamic patterns; this is the first report demonstrating such a control mechanism in any tissue and future investigations will explore the presence of analogous examples in other biological systems. PMID:25336742

  6. Interaction of Osmotic Stress, Temperature, and Abscisic Acid in the Regulation of Gene Expression in Arabidopsis

    PubMed Central

    Xiong, Liming; Ishitani, Manabu; Zhu, Jian-Kang

    1999-01-01

    The impact of simultaneous environmental stresses on plants and how they respond to combined stresses compared with single stresses is largely unclear. By using a transgene (RD29A-LUC) consisting of the firefly luciferase coding sequence (LUC) driven by the stress-responsive RD29A promoter, we investigated the interactive effects of temperature, osmotic stress, and the phytohormone abscisic acid (ABA) in the regulation of gene expression in Arabidopsis seedlings. Results indicated that both positive and negative interactions exist among the studied stress factors in regulating gene expression. At a normal growth temperature (22°C), osmotic stress and ABA act synergistically to induce the transgene expression. Low temperature inhibits the response to osmotic stress or to combined treatment of osmotic stress and ABA, whereas low temperature and ABA treatments are additive in inducing transgene expression. Although high temperature alone does not activate the transgene, it significantly amplifies the effects of ABA and osmotic stress. The effect of multiple stresses in the regulation of RD29A-LUC expression in signal transduction mutants was also studied. The results are discussed in the context of cold and osmotic stress signal transduction pathways. PMID:9880362

  7. Dissecting Transcriptional Heterogeneity in Pluripotency: Single Cell Analysis of Mouse Embryonic Stem Cells.

    PubMed

    Guedes, Ana M V; Henrique, Domingos; Abranches, Elsa

    2016-01-01

    Mouse Embryonic Stem cells (mESCs) show heterogeneous and dynamic expression of important pluripotency regulatory factors. Single-cell analysis has revealed the existence of cell-to-cell variability in the expression of individual genes in mESCs. Understanding how these heterogeneities are regulated and what their functional consequences are is crucial to obtain a more comprehensive view of the pluripotent state.In this chapter we describe how to analyze transcriptional heterogeneity by monitoring gene expression of Nanog, Oct4, and Sox2, using single-molecule RNA FISH in single mESCs grown in different cell culture medium. We describe in detail all the steps involved in the protocol, from RNA detection to image acquisition and processing, as well as exploratory data analysis.

  8. Genome-wide gene expression and RNA half-life measurements allow predictions of regulation and metabolic behavior in Methanosarcina acetivorans

    DOE PAGES

    Peterson, Joseph R.; Thor, ShengShee; Kohler, Lars; ...

    2016-11-16

    Here, while a few studies on the variations in mRNA expression and half-lives measured under different growth conditions have been used to predict patterns of regulation in bacterial organisms, the extent to which this information can also play a role in defining metabolic phenotypes has yet to be examined systematically. Here we present the first comprehensive study for a model methanogen. As a result, we use expression and half-life data for the methanogen Methanosarcina acetivorans growing on fast- and slow-growth substrates to examine the regulation of its genes. Unlike Escherichia coli where only small shifts in half-lives were observed, wemore » found that most mRNA have significantly longer half-lives for slow growth on acetate compared to fast growth on methanol or trimethylamine. Interestingly, half-life shifts are not uniform across functional classes of enzymes, suggesting the existence of a selective stabilization mechanism for mRNAs. Using the transcriptomics data we determined whether transcription or degradation rate controls the change in transcript abundance. Degradation was found to control abundance for about half of the metabolic genes underscoring its role in regulating metabolism. Genes involved in half of the metabolic reactions were found to be differentially expressed among the substrates suggesting the existence of drastically different metabolic phenotypes that extend beyond just the methanogenesis pathways. By integrating expression data with an updated metabolic model of the organism (iST807) significant differences in pathway flux and production of metabolites were predicted for the three growth substrates. In conclusion, this study provides the first global picture of differential expression and half-lives for a class II methanogen, as well as provides the first evidence in a single organism that drastic genome-wide shifts in RNA half-lives can be modulated by growth substrate. We determined which genes in each metabolic pathway control the flux and classified them as regulated by transcription (e.g. transcription factor) or degradation (e.g. post-transcriptional modification). We found that more than half of genes in metabolism were controlled by degradation. Our results suggest that M. acetivorans employs extensive post-transcriptional regulation to optimize key metabolic steps, and more generally that degradation could play a much greater role in optimizing an organism’s metabolism than previously thought.« less

  9. Genome-wide gene expression and RNA half-life measurements allow predictions of regulation and metabolic behavior in Methanosarcina acetivorans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peterson, Joseph R.; Thor, ShengShee; Kohler, Lars

    Here, while a few studies on the variations in mRNA expression and half-lives measured under different growth conditions have been used to predict patterns of regulation in bacterial organisms, the extent to which this information can also play a role in defining metabolic phenotypes has yet to be examined systematically. Here we present the first comprehensive study for a model methanogen. As a result, we use expression and half-life data for the methanogen Methanosarcina acetivorans growing on fast- and slow-growth substrates to examine the regulation of its genes. Unlike Escherichia coli where only small shifts in half-lives were observed, wemore » found that most mRNA have significantly longer half-lives for slow growth on acetate compared to fast growth on methanol or trimethylamine. Interestingly, half-life shifts are not uniform across functional classes of enzymes, suggesting the existence of a selective stabilization mechanism for mRNAs. Using the transcriptomics data we determined whether transcription or degradation rate controls the change in transcript abundance. Degradation was found to control abundance for about half of the metabolic genes underscoring its role in regulating metabolism. Genes involved in half of the metabolic reactions were found to be differentially expressed among the substrates suggesting the existence of drastically different metabolic phenotypes that extend beyond just the methanogenesis pathways. By integrating expression data with an updated metabolic model of the organism (iST807) significant differences in pathway flux and production of metabolites were predicted for the three growth substrates. In conclusion, this study provides the first global picture of differential expression and half-lives for a class II methanogen, as well as provides the first evidence in a single organism that drastic genome-wide shifts in RNA half-lives can be modulated by growth substrate. We determined which genes in each metabolic pathway control the flux and classified them as regulated by transcription (e.g. transcription factor) or degradation (e.g. post-transcriptional modification). We found that more than half of genes in metabolism were controlled by degradation. Our results suggest that M. acetivorans employs extensive post-transcriptional regulation to optimize key metabolic steps, and more generally that degradation could play a much greater role in optimizing an organism’s metabolism than previously thought.« less

  10. Gene expression information improves reliability of receptor status in breast cancer patients

    PubMed Central

    Kenn, Michael; Schlangen, Karin; Castillo-Tong, Dan Cacsire; Singer, Christian F.; Cibena, Michael; Koelbl, Heinz; Schreiner, Wolfgang

    2017-01-01

    Immunohistochemical (IHC) determination of receptor status in breast cancer patients is frequently inaccurate. Since it directs the choice of systemic therapy, it is essential to increase its reliability. We increase the validity of IHC receptor expression by additionally considering gene expression (GE) measurements. Crisp therapeutic decisions are based on IHC estimates, even if they are borderline reliable. We further improve decision quality by a responsibility function, defining a critical domain for gene expression. Refined normalization is devised to file any newly diagnosed patient into existing data bases. Our approach renders receptor estimates more reliable by identifying patients with questionable receptor status. The approach is also more efficient since the rate of conclusive samples is increased. We have curated and evaluated gene expression data, together with clinical information, from 2880 breast cancer patients. Combining IHC with gene expression information yields a method more reliable and also more efficient as compared to common practice up to now. Several types of possibly suboptimal treatment allocations, based on IHC receptor status alone, are enumerated. A ‘therapy allocation check’ identifies patients possibly miss-classified. Estrogen: false negative 8%, false positive 6%. Progesterone: false negative 14%, false positive 11%. HER2: false negative 2%, false positive 50%. Possible implications are discussed. We propose an ‘expression look-up-plot’, allowing for a significant potential to improve the quality of precision medicine. Methods are developed and exemplified here for breast cancer patients, but they may readily be transferred to diagnostic data relevant for therapeutic decisions in other fields of oncology. PMID:29100391

  11. Identification of the Neuromuscular Junction Transcriptome of Extraocular Muscle by Laser Capture Microdissection

    PubMed Central

    Ketterer, Caroline; Zeiger, Ulrike; Budak, Murat T.; Rubinstein, Neal A.; Khurana, Tejvir S.

    2010-01-01

    Purpose. To examine and characterize the profile of genes expressed at the synapses or neuromuscular junctions (NMJs) of extraocular muscles (EOMs) compared with those expressed at the tibialis anterior (TA). Methods. Adult rat eyeballs with rectus EOMs attached and TAs were dissected, snap frozen, serially sectioned, and stained for acetylcholinesterase (AChE) to identify the NMJs. Approximately 6000 NMJs for rectus EOM (EOMsyn), 6000 NMJs for TA (TAsyn), equal amounts of NMJ-free fiber regions (EOMfib, TAfib), and underlying myonuclei and RNAs were captured by laser capture microdissection (LCM). RNA was processed for microarray-based expression profiling. Expression profiles and interaction lists were generated for genes differentially expressed at synaptic and nonsynaptic regions of EOM (EOMsyn versus EOMfib) and TA (TAsyn versus TAfib). Profiles were validated by using real-time quantitative polymerase chain reaction (qPCR). Results. The regional transcriptomes associated with NMJs of EOMs and TAs were identified. Two hundred seventy-five genes were preferentially expressed in EOMsyn (compared with EOMfib), 230 in TAsyn (compared with TAfib), and 288 additional transcripts expressed in both synapses. Identified genes included novel genes as well as well-known, evolutionarily conserved synaptic markers (e.g., nicotinic acetylcholine receptor (AChR) alpha (Chrna) and epsilon (Chrne) subunits and nestin (Nes). Conclusions. Transcriptome level differences exist between EOM synaptic regions and TA synaptic regions. The definition of the synaptic transcriptome provides insight into the mechanism of formation and functioning of the unique synapses of EOM and their differential involvement in diseases noted in the EOM allotype. PMID:20393109

  12. The Bioinformatic Analysis of the Dysregulated Genes and MicroRNAs in Entorhinal Cortex, Hippocampus, and Blood for Alzheimer's Disease

    PubMed Central

    Pang, Xiaocong; Zhao, Ying; Wang, Jinhua; Zhou, Qimeng; Xu, Lvjie; Kang, De

    2017-01-01

    Aim The incidence of Alzheimer's disease (AD) has been increasing in recent years, but there exists no cure and the pathological mechanisms are not fully understood. This study aimed to find out the pathogenesis of learning and memory impairment, new biomarkers, potential therapeutic targets, and drugs for AD. Methods We downloaded the microarray data of entorhinal cortex (EC) and hippocampus (HIP) of AD and controls from Gene Expression Omnibus (GEO) database, and then the differentially expressed genes (DEGs) in EC and HIP regions were analyzed for functional and pathway enrichment. Furthermore, we utilized the DEGs to construct coexpression networks to identify hub genes and discover the small molecules which were capable of reversing the gene expression profile of AD. Finally, we also analyzed microarray and RNA-seq dataset of blood samples to find the biomarkers related to gene expression in brain. Results We found some functional hub genes, such as ErbB2, ErbB4, OCT3, MIF, CDK13, and GPI. According to GO and KEGG pathway enrichment, several pathways were significantly dysregulated in EC and HIP. CTSD and VCAM1 were dysregulated significantly in blood, EC, and HIP, which were potential biomarkers for AD. Target genes of four microRNAs had similar GO_terms distribution with DEGs in EC and HIP. In addtion, small molecules were screened out for AD treatment. Conclusion These biological pathways and DEGs or hub genes will be useful to elucidate AD pathogenesis and identify novel biomarkers or drug targets for developing improved diagnostics and therapeutics against AD. PMID:29359159

  13. The role of Cdx2 as a lineage specific transcriptional repressor for pluripotent network during the first developmental cell lineage segregation.

    PubMed

    Huang, Daosheng; Guo, Guoji; Yuan, Ping; Ralston, Amy; Sun, Lingang; Huss, Mikael; Mistri, Tapan; Pinello, Luca; Ng, Huck Hui; Yuan, Guocheng; Ji, Junfeng; Rossant, Janet; Robson, Paul; Han, Xiaoping

    2017-12-07

    The first cellular differentiation event in mouse development leads to the formation of the blastocyst consisting of the inner cell mass (ICM) and trophectoderm (TE). The transcription factor CDX2 is required for proper TE specification, where it promotes expression of TE genes, and represses expression of Pou5f1 (OCT4). However its downstream network in the developing embryo is not fully characterized. Here, we performed high-throughput single embryo qPCR analysis in Cdx2 null embryos to identify CDX2-regulated targets in vivo. To identify genes likely to be regulated by CDX2 directly, we performed CDX2 ChIP-Seq on trophoblast stem (TS) cells. In addition, we examined the dynamics of gene expression changes using inducible CDX2 embryonic stem (ES) cells, so that we could predict which CDX2-bound genes are activated or repressed by CDX2 binding. By integrating these data with observations of chromatin modifications, we identify putative novel regulatory elements that repress gene expression in a lineage-specific manner. Interestingly, we found CDX2 binding sites within regulatory elements of key pluripotent genes such as Pou5f1 and Nanog, pointing to the existence of a novel mechanism by which CDX2 maintains repression of OCT4 in trophoblast. Our study proposes a general mechanism in regulating lineage segregation during mammalian development.

  14. Gene Repression in Haloarchaea Using the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas I-B System.

    PubMed

    Stachler, Aris-Edda; Marchfelder, Anita

    2016-07-15

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system is used by bacteria and archaea to fend off foreign genetic elements. Since its discovery it has been developed into numerous applications like genome editing and regulation of transcription in eukaryotes and bacteria. For archaea currently no tools for transcriptional repression exist. Because molecular biology analyses in archaea become more and more widespread such a tool is vital for investigating the biological function of essential genes in archaea. Here we use the model archaeon Haloferax volcanii to demonstrate that its endogenous CRISPR-Cas system I-B can be harnessed to repress gene expression in archaea. Deletion of cas3 and cas6b genes results in efficient repression of transcription. crRNAs targeting the promoter region reduced transcript levels down to 8%. crRNAs targeting the reading frame have only slight impact on transcription. crRNAs that target the coding strand repress expression only down to 88%, whereas crRNAs targeting the template strand repress expression down to 8%. Repression of an essential gene results in reduction of transcription levels down to 22%. Targeting efficiencies can be enhanced by expressing a catalytically inactive Cas3 mutant. Genes can be targeted on plasmids or on the chromosome, they can be monocistronic or part of a polycistronic operon. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  15. Gene Repression in Haloarchaea Using the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas I-B System*

    PubMed Central

    Stachler, Aris-Edda; Marchfelder, Anita

    2016-01-01

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system is used by bacteria and archaea to fend off foreign genetic elements. Since its discovery it has been developed into numerous applications like genome editing and regulation of transcription in eukaryotes and bacteria. For archaea currently no tools for transcriptional repression exist. Because molecular biology analyses in archaea become more and more widespread such a tool is vital for investigating the biological function of essential genes in archaea. Here we use the model archaeon Haloferax volcanii to demonstrate that its endogenous CRISPR-Cas system I-B can be harnessed to repress gene expression in archaea. Deletion of cas3 and cas6b genes results in efficient repression of transcription. crRNAs targeting the promoter region reduced transcript levels down to 8%. crRNAs targeting the reading frame have only slight impact on transcription. crRNAs that target the coding strand repress expression only down to 88%, whereas crRNAs targeting the template strand repress expression down to 8%. Repression of an essential gene results in reduction of transcription levels down to 22%. Targeting efficiencies can be enhanced by expressing a catalytically inactive Cas3 mutant. Genes can be targeted on plasmids or on the chromosome, they can be monocistronic or part of a polycistronic operon. PMID:27226589

  16. Rax Homeoprotein Regulates Photoreceptor Cell Maturation and Survival in Association with Crx in the Postnatal Mouse Retina.

    PubMed

    Irie, Shoichi; Sanuki, Rikako; Muranishi, Yuki; Kato, Kimiko; Chaya, Taro; Furukawa, Takahisa

    2015-08-01

    The Rax homeobox gene plays essential roles in multiple processes of vertebrate retina development. Many vertebrate species possess Rax and Rax2 genes, and different functions have been suggested. In contrast, mice contain a single Rax gene, and its functional roles in late retinal development are still unclear. To clarify mouse Rax function in postnatal photoreceptor development and maintenance, we generated conditional knockout mice in which Rax in maturing or mature photoreceptor cells was inactivated by tamoxifen treatment (Rax iCKO mice). When Rax was inactivated in postnatal Rax iCKO mice, developing photoreceptor cells showed a significant decrease in the level of the expression of rod and cone photoreceptor genes and mature adult photoreceptors exhibited a specific decrease in cone cell numbers. In luciferase assays, we found that Rax and Crx cooperatively transactivate Rhodopsin and cone opsin promoters and that an optimum Rax expression level to transactivate photoreceptor gene expression exists. Furthermore, Rax and Crx colocalized in maturing photoreceptor cells, and their coimmunoprecipitation was observed in cultured cells. Taken together, these results suggest that Rax plays essential roles in the maturation of both cones and rods and in the survival of cones by regulating photoreceptor gene expression with Crx in the postnatal mouse retina. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  17. Integrating Microarray Data and GRNs.

    PubMed

    Koumakis, L; Potamias, G; Tsiknakis, M; Zervakis, M; Moustakis, V

    2016-01-01

    With the completion of the Human Genome Project and the emergence of high-throughput technologies, a vast amount of molecular and biological data are being produced. Two of the most important and significant data sources come from microarray gene-expression experiments and respective databanks (e,g., Gene Expression Omnibus-GEO (http://www.ncbi.nlm.nih.gov/geo)), and from molecular pathways and Gene Regulatory Networks (GRNs) stored and curated in public (e.g., Kyoto Encyclopedia of Genes and Genomes-KEGG (http://www.genome.jp/kegg/pathway.html), Reactome (http://www.reactome.org/ReactomeGWT/entrypoint.html)) as well as in commercial repositories (e.g., Ingenuity IPA (http://www.ingenuity.com/products/ipa)). The association of these two sources aims to give new insight in disease understanding and reveal new molecular targets in the treatment of specific phenotypes.Three major research lines and respective efforts that try to utilize and combine data from both of these sources could be identified, namely: (1) de novo reconstruction of GRNs, (2) identification of Gene-signatures, and (3) identification of differentially expressed GRN functional paths (i.e., sub-GRN paths that distinguish between different phenotypes). In this chapter, we give an overview of the existing methods that support the different types of gene-expression and GRN integration with a focus on methodologies that aim to identify phenotype-discriminant GRNs or subnetworks, and we also present our methodology.

  18. Coffee cysteine proteinases and related inhibitors with high expression during grain maturation and germination

    PubMed Central

    2012-01-01

    Background Cysteine proteinases perform multiple functions in seeds, including participation in remodelling polypeptides and recycling amino acids during maturation and germination. Currently, few details exist concerning these genes and proteins in coffee. Furthermore, there is limited information on the cysteine proteinase inhibitors which influence the activities of these proteinases. Results Two cysteine proteinase (CP) and four cysteine proteinase inhibitor (CPI) gene sequences have been identified in coffee with significant expression during the maturation and germination of coffee grain. Detailed expression analysis of the cysteine proteinase genes CcCP1 and CcCP4 in Robusta using quantitative RT-PCR showed that these transcripts accumulate primarily during grain maturation and germination/post germination. The corresponding proteins were expressed in E. coli and purified, but only one, CcCP4, which has a KDDL/KDEL C-terminal sequence, was found to be active after a short acid treatment. QRT-PCR expression analysis of the four cysteine proteinase inhibitor genes in Robusta showed that CcCPI-1 is primarily expressed in developing and germinating grain and CcCPI-4 is very highly expressed during the late post germination period, as well as in mature, but not immature leaves. Transcripts corresponding to CcCPI-2 and CcCPI-3 were detected in most tissues examined at relatively similar, but generally low levels. Conclusions Several cysteine proteinase and cysteine proteinase inhibitor genes with strong, relatively specific expression during coffee grain maturation and germination are presented. The temporal expression of the CcCP1 gene suggests it is involved in modifying proteins during late grain maturation and germination. The expression pattern of CcCP4, and its close identity with KDEL containing CP proteins, implies this proteinase may play a role in protein and/or cell remodelling during late grain germination, and that it is likely to play a strong role in the programmed cell death associated with post-germination of the coffee grain. Expression analysis of the cysteine proteinase inhibitor genes suggests that CcCPI-1 could primarily be involved in modulating the activity of grain CP activity; while CcCPI-4 may play roles modulating grain CP activity and in the protection of the young coffee seedlings from insects and pathogens. CcCPI-2 and CcCPI-3, having lower and more widespread expression, could be more general "house-keeping" CPI genes. PMID:22380654

  19. Evolutionary characterization of pig interferon-inducible transmembrane gene family and member expression dynamics in tracheobronchial lymph nodes of pigs infected with swine respiratory disease viruses.

    PubMed

    Miller, Laura C; Jiang, Zhihua; Sang, Yongming; Harhay, Gregory P; Lager, Kelly M

    2014-06-15

    Studies have found that a cluster of duplicated gene loci encoding the interferon-inducible transmembrane proteins (IFITMs) family have antiviral activity against several viruses, including influenza A virus. The gene family has 5 and 7 members in humans and mice, respectively. Here, we confirm the current annotation of pig IFITM1, IFITM2, IFITM3, IFITM5, IFITM1L1 and IFITM1L4, manually annotated IFITM1L2, IFITM1L3, IFITM5L, IFITM3L1 and IFITM3L2, and provide expressed sequence tag (EST) and/or mRNA evidence, not contained with the NCBI Reference Sequence database (RefSeq), for the existence of IFITM6, IFITM7 and a new IFITM1-like (IFITM1LN) gene in pigs. Phylogenic analyses showed seven porcine IFITM genes with highly conserved human/mouse orthologs known to have anti-viral activity. Digital Gene Expression Tag Profiling (DGETP) of swine tracheobronchial lymph nodes (TBLN) of pigs infected with swine influenza virus (SIV), porcine pseudorabies virus, porcine reproductive and respiratory syndrome virus or porcine circovirus type 2 over 14 days post-inoculation (dpi) showed that gene expression abundance differs dramatically among pig IFITM family members, ranging from 0 to over 3000 tags per million. In particular, SIV up-regulated IFITM1 by 5.9 fold at 3 dpi. Bayesian framework further identified pig IFITM1 and IFITM3 as differentially expressed genes in the overall transcriptome analysis. In addition to being a component of protein complexes involved in homotypic adhesion, the IFITM1 is also associated with pathways related to regulation of cell proliferation and IFITM3 is involved in immune responses. Published by Elsevier B.V.

  20. Microarray labeling extension values: laboratory signatures for Affymetrix GeneChips

    PubMed Central

    Lee, Yun-Shien; Chen, Chun-Houh; Tsai, Chi-Neu; Tsai, Chia-Lung; Chao, Angel; Wang, Tzu-Hao

    2009-01-01

    Interlaboratory comparison of microarray data, even when using the same platform, imposes several challenges to scientists. RNA quality, RNA labeling efficiency, hybridization procedures and data-mining tools can all contribute variations in each laboratory. In Affymetrix GeneChips, about 11–20 different 25-mer oligonucleotides are used to measure the level of each transcript. Here, we report that ‘labeling extension values (LEVs)’, which are correlation coefficients between probe intensities and probe positions, are highly correlated with the gene expression levels (GEVs) on eukayotic Affymetrix microarray data. By analyzing LEVs and GEVs in the publicly available 2414 cel files of 20 Affymetrix microarray types covering 13 species, we found that correlations between LEVs and GEVs only exist in eukaryotic RNAs, but not in prokaryotic ones. Surprisingly, Affymetrix results of the same specimens that were analyzed in different laboratories could be clearly differentiated only by LEVs, leading to the identification of ‘laboratory signatures’. In the examined dataset, GSE10797, filtering out high-LEV genes did not compromise the discovery of biological processes that are constructed by differentially expressed genes. In conclusion, LEVs provide a new filtering parameter for microarray analysis of gene expression and it may improve the inter- and intralaboratory comparability of Affymetrix GeneChips data. PMID:19295132

  1. Brain gene expression changes elicited by peripheral vitellogenin knockdown in the honey bee.

    PubMed

    Wheeler, M M; Ament, S A; Rodriguez-Zas, S L; Robinson, G E

    2013-10-01

    Vitellogenin (Vg) is best known as a yolk protein precursor. Vg also functions to regulate behavioural maturation in adult honey bee workers, but the underlying molecular mechanisms by which it exerts this novel effect are largely unknown. We used abdominal vitellogenin (vg) knockdown with RNA interference (RNAi) and brain transcriptomic profiling to gain insights into how Vg influences honey bee behavioural maturation. We found that vg knockdown caused extensive gene expression changes in the bee brain, with much of this transcriptional response involving changes in central biological functions such as energy metabolism. vg knockdown targeted many of the same genes that show natural, maturation-related differences, but the direction of change for the genes in these two contrasts was not correlated. By contrast, vg knockdown targeted many of the same genes that are regulated by juvenile hormone (JH) and there was a significant correlation for the direction of change for the genes in these two contrasts. These results indicate that the tight coregulatory relationship that exists between JH and Vg in the regulation of honey bee behavioural maturation is manifest at the genomic level and suggest that these two physiological factors act through common pathways to regulate brain gene expression and behaviour. © 2013 Royal Entomological Society.

  2. Prediction and characterisation of a highly conserved, remote and cAMP responsive enhancer that regulates Msx1 gene expression in cardiac neural crest and outflow tract.

    PubMed

    Miller, Kerry Ann; Davidson, Scott; Liaros, Angela; Barrow, John; Lear, Marissa; Heine, Danielle; Hoppler, Stefan; MacKenzie, Alasdair

    2008-05-15

    Double knockouts of the Msx1 and Msx2 genes in the mouse result in severe cardiac outflow tract malformations similar to those frequently found in newborn infants. Despite the known role of the Msx genes in cardiac formation little is known of the regulatory systems (ligand receptor, signal transduction and protein-DNA interactions) that regulate the tissue-specific expression of the Msx genes in mammals during the formation of the outflow tract. In the present study we have used a combination of multi-species comparative genomics, mouse transgenic analysis and in-situ hybridisation to predict and validate the existence of a remote ultra-conserved enhancer that supports the expression of the Msx1 gene in migrating mouse cardiac neural crest and the outflow tract primordia. Furthermore, culturing of embryonic explants derived from transgenic lines with agonists of the PKC and PKA signal transduction systems demonstrates that this remote enhancer is influenced by PKA but not PKC dependent gene regulatory systems. These studies demonstrate the efficacy of combining comparative genomics and transgenic analyses and provide a platform for the study of the possible roles of Msx gene mis-regulation in the aetiology of congenital heart malformation.

  3. Gene Expression Analysis of Plum pox virus (Sharka) Susceptibility/Resistance in Apricot (Prunus armeniaca L.).

    PubMed

    Rubio, Manuel; Ballester, Ana Rosa; Olivares, Pedro Manuel; Castro de Moura, Manuel; Dicenta, Federico; Martínez-Gómez, Pedro

    2015-01-01

    RNA-Seq has proven to be a very powerful tool in the analysis of the Plum pox virus (PPV, sharka disease)/Prunus interaction. This technique is an important complementary tool to other means of studying genomics. In this work an analysis of gene expression of resistance/susceptibility to PPV in apricot is performed. RNA-Seq has been applied to analyse the gene expression changes induced by PPV infection in leaves from two full-sib apricot genotypes, "Rojo Pasión" and "Z506-7", resistant and susceptible to PPV, respectively. Transcriptomic analyses revealed the existence of more than 2,000 genes related to the pathogen response and resistance to PPV in apricot. These results showed that the response to infection by the virus in the susceptible genotype is associated with an induction of genes involved in pathogen resistance such as the allene oxide synthase, S-adenosylmethionine synthetase 2 and the major MLP-like protein 423. Over-expression of the Dicer protein 2a may indicate the suppression of a gene silencing mechanism of the plant by PPV HCPro and P1 PPV proteins. On the other hand, there were 164 genes involved in resistance mechanisms that have been identified in apricot, 49 of which are located in the PPVres region (scaffold 1 positions from 8,050,804 to 8,244,925), which is responsible for PPV resistance in apricot. Among these genes in apricot there are several MATH domain-containing genes, although other genes inside (Pleiotropic drug resistance 9 gene) or outside (CAP, Cysteine-rich secretory proteins, Antigen 5 and Pathogenesis-related 1 protein; and LEA, Late embryogenesis abundant protein) PPVres region could also be involved in the resistance.

  4. Gene Expression Analysis of Plum pox virus (Sharka) Susceptibility/Resistance in Apricot (Prunus armeniaca L.)

    PubMed Central

    Rubio, Manuel; Ballester, Ana Rosa; Olivares, Pedro Manuel; Castro de Moura, Manuel; Dicenta, Federico; Martínez-Gómez, Pedro

    2015-01-01

    RNA-Seq has proven to be a very powerful tool in the analysis of the Plum pox virus (PPV, sharka disease)/Prunus interaction. This technique is an important complementary tool to other means of studying genomics. In this work an analysis of gene expression of resistance/susceptibility to PPV in apricot is performed. RNA-Seq has been applied to analyse the gene expression changes induced by PPV infection in leaves from two full-sib apricot genotypes, “Rojo Pasión” and “Z506-7”, resistant and susceptible to PPV, respectively. Transcriptomic analyses revealed the existence of more than 2,000 genes related to the pathogen response and resistance to PPV in apricot. These results showed that the response to infection by the virus in the susceptible genotype is associated with an induction of genes involved in pathogen resistance such as the allene oxide synthase, S-adenosylmethionine synthetase 2 and the major MLP-like protein 423. Over-expression of the Dicer protein 2a may indicate the suppression of a gene silencing mechanism of the plant by PPV HCPro and P1 PPV proteins. On the other hand, there were 164 genes involved in resistance mechanisms that have been identified in apricot, 49 of which are located in the PPVres region (scaffold 1 positions from 8,050,804 to 8,244,925), which is responsible for PPV resistance in apricot. Among these genes in apricot there are several MATH domain-containing genes, although other genes inside (Pleiotropic drug resistance 9 gene) or outside (CAP, Cysteine-rich secretory proteins, Antigen 5 and Pathogenesis-related 1 protein; and LEA, Late embryogenesis abundant protein) PPVres region could also be involved in the resistance. PMID:26658051

  5. Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.

    PubMed

    Deeter, Anthony; Dalman, Mark; Haddad, Joseph; Duan, Zhong-Hui

    2017-01-01

    The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.

  6. Embryonic transcriptome and proteome analyses on hepatic lipid metabolism in chickens divergently selected for abdominal fat content.

    PubMed

    Na, Wei; Wu, Yuan-Yuan; Gong, Peng-Fei; Wu, Chun-Yan; Cheng, Bo-Han; Wang, Yu-Xiang; Wang, Ning; Du, Zhi-Qiang; Li, Hui

    2018-05-23

    In avian species, liver is the main site of de novo lipogenesis, and hepatic lipid metabolism relates closely to adipose fat deposition. Using our fat and lean chicken lines of striking differences in abdominal fat content, post-hatch lipid metabolism in both liver and adipose tissues has been studied extensively. However, whether molecular discrepancy for hepatic lipid metabolism exists in chicken embryos remains obscure. We performed transcriptome and proteome profiling on chicken livers at five embryonic stages (E7, E12, E14, E17 and E21) between the fat and lean chicken lines. At each stage, 521, 141, 882, 979 and 169 differentially expressed genes were found by the digital gene expression, respectively, which were significantly enriched in the metabolic, PPAR signaling and fatty acid metabolism pathways. Quantitative proteomics analysis found 20 differentially expressed proteins related to lipid metabolism, PPAR signaling, fat digestion and absorption, and oxidative phosphorylation pathways. Combined analysis showed that genes and proteins related to lipid transport (intestinal fatty acid-binding protein, nucleoside diphosphate kinase, and apolipoprotein A-I), lipid clearance (heat shock protein beta-1) and energy metabolism (NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10 and succinate dehydrogenase flavoprotein subunit) were significantly differentially expressed between the two lines. For hepatic lipid metabolism at embryonic stages, molecular differences related to lipid transport, lipid clearance and energy metabolism exist between the fat and lean chicken lines, which might contribute to the striking differences of abdominal fat deposition at post-hatch stages.

  7. Translating standards into practice - one Semantic Web API for Gene Expression.

    PubMed

    Deus, Helena F; Prud'hommeaux, Eric; Miller, Michael; Zhao, Jun; Malone, James; Adamusiak, Tomasz; McCusker, Jim; Das, Sudeshna; Rocca Serra, Philippe; Fox, Ronan; Marshall, M Scott

    2012-08-01

    Sharing and describing experimental results unambiguously with sufficient detail to enable replication of results is a fundamental tenet of scientific research. In today's cluttered world of "-omics" sciences, data standards and standardized use of terminologies and ontologies for biomedical informatics play an important role in reporting high-throughput experiment results in formats that can be interpreted by both researchers and analytical tools. Increasing adoption of Semantic Web and Linked Data technologies for the integration of heterogeneous and distributed health care and life sciences (HCLSs) datasets has made the reuse of standards even more pressing; dynamic semantic query federation can be used for integrative bioinformatics when ontologies and identifiers are reused across data instances. We present here a methodology to integrate the results and experimental context of three different representations of microarray-based transcriptomic experiments: the Gene Expression Atlas, the W3C BioRDF task force approach to reporting Provenance of Microarray Experiments, and the HSCI blood genomics project. Our approach does not attempt to improve the expressivity of existing standards for genomics but, instead, to enable integration of existing datasets published from microarray-based transcriptomic experiments. SPARQL Construct is used to create a posteriori mappings of concepts and properties and linking rules that match entities based on query constraints. We discuss how our integrative approach can encourage reuse of the Experimental Factor Ontology (EFO) and the Ontology for Biomedical Investigations (OBIs) for the reporting of experimental context and results of gene expression studies. Copyright © 2012 Elsevier Inc. All rights reserved.

  8. Evolutionary changes in lamin expression in the vertebrate lineage

    PubMed Central

    Stick, Reimer; Peter, Annette

    2017-01-01

    ABSTRACT The nuclear lamina is involved in fundamental nuclear functions and provides mechanical stability to the nucleus. Lamin filaments form a meshwork closely apposed to the inner nuclear membrane and a small fraction of lamins exist in the nuclear interior. Mutations in lamin genes cause severe hereditary diseases, the laminopathies. During vertebrate evolution the lamin protein family has expanded. While most vertebrate genomes contain 4 lamin genes, encoding the lamins A, B1, B2, and LIII, the majority of non-vertebrate genomes harbor only a single lamin gene. We have collected lamin gene and cDNA sequence information for representatives of the major vertebrate lineages. With the help of RNA-seq data we have determined relative lamin expression levels for representative tissues for species of 9 different gnathostome lineages. Here we report that the level of lamin A expression is low in cartilaginous fishes and ancient fishes and increases toward the mammals. Lamin B1 expression shows an inverse tendency to that of lamin A. Possible implications for the change in the lamin A to B ratio is discussed in the light of its role in nuclear mechanics. PMID:28430006

  9. Biomarkers of adult and developmental neurotoxicity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Slikker, William; Bowyer, John F.

    2005-08-07

    Neurotoxicity may be defined as any adverse effect on the structure or function of the central and/or peripheral nervous system by a biological, chemical, or physical agent. A multidisciplinary approach is necessary to assess adult and developmental neurotoxicity due to the complex and diverse functions of the nervous system. The overall strategy for understanding developmental neurotoxicity is based on two assumptions: (1) significant differences in the adult versus the developing nervous system susceptibility to neurotoxicity exist and they are often developmental stage dependent; (2) a multidisciplinary approach using neurobiological, including gene expression assays, neurophysiological, neuropathological, and behavioral function is necessarymore » for a precise assessment of neurotoxicity. Application of genomic approaches to developmental studies must use the same criteria for evaluating microarray studies as those in adults including consideration of reproducibility, statistical analysis, homogenous cell populations, and confirmation with non-array methods. A study using amphetamine to induce neurotoxicity supports the following: (1) gene expression data can help define neurotoxic mechanism(s) (2) gene expression changes can be useful biomarkers of effect, and (3) the site-selective nature of gene expression in the nervous system may mandate assessment of selective cell populations.« less

  10. Deciphering the associations between gene expression and copy number alteration using a sparse double Laplacian shrinkage approach

    PubMed Central

    Shi, Xingjie; Zhao, Qing; Huang, Jian; Xie, Yang; Ma, Shuangge

    2015-01-01

    Motivation: Both gene expression levels (GEs) and copy number alterations (CNAs) have important biological implications. GEs are partly regulated by CNAs, and much effort has been devoted to understanding their relations. The regulation analysis is challenging with one gene expression possibly regulated by multiple CNAs and one CNA potentially regulating the expressions of multiple genes. The correlations among GEs and among CNAs make the analysis even more complicated. The existing methods have limitations and cannot comprehensively describe the regulation. Results: A sparse double Laplacian shrinkage method is developed. It jointly models the effects of multiple CNAs on multiple GEs. Penalization is adopted to achieve sparsity and identify the regulation relationships. Network adjacency is computed to describe the interconnections among GEs and among CNAs. Two Laplacian shrinkage penalties are imposed to accommodate the network adjacency measures. Simulation shows that the proposed method outperforms the competing alternatives with more accurate marker identification. The Cancer Genome Atlas data are analysed to further demonstrate advantages of the proposed method. Availability and implementation: R code is available at http://works.bepress.com/shuangge/49/ Contact: shuangge.ma@yale.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26342102

  11. A two-step hierarchical hypothesis set testing framework, with applications to gene expression data on ordered categories

    PubMed Central

    2014-01-01

    Background In complex large-scale experiments, in addition to simultaneously considering a large number of features, multiple hypotheses are often being tested for each feature. This leads to a problem of multi-dimensional multiple testing. For example, in gene expression studies over ordered categories (such as time-course or dose-response experiments), interest is often in testing differential expression across several categories for each gene. In this paper, we consider a framework for testing multiple sets of hypothesis, which can be applied to a wide range of problems. Results We adopt the concept of the overall false discovery rate (OFDR) for controlling false discoveries on the hypothesis set level. Based on an existing procedure for identifying differentially expressed gene sets, we discuss a general two-step hierarchical hypothesis set testing procedure, which controls the overall false discovery rate under independence across hypothesis sets. In addition, we discuss the concept of the mixed-directional false discovery rate (mdFDR), and extend the general procedure to enable directional decisions for two-sided alternatives. We applied the framework to the case of microarray time-course/dose-response experiments, and proposed three procedures for testing differential expression and making multiple directional decisions for each gene. Simulation studies confirm the control of the OFDR and mdFDR by the proposed procedures under independence and positive correlations across genes. Simulation results also show that two of our new procedures achieve higher power than previous methods. Finally, the proposed methodology is applied to a microarray dose-response study, to identify 17 β-estradiol sensitive genes in breast cancer cells that are induced at low concentrations. Conclusions The framework we discuss provides a platform for multiple testing procedures covering situations involving two (or potentially more) sources of multiplicity. The framework is easy to use and adaptable to various practical settings that frequently occur in large-scale experiments. Procedures generated from the framework are shown to maintain control of the OFDR and mdFDR, quantities that are especially relevant in the case of multiple hypothesis set testing. The procedures work well in both simulations and real datasets, and are shown to have better power than existing methods. PMID:24731138

  12. CHESS (CgHExpreSS): a comprehensive analysis tool for the analysis of genomic alterations and their effects on the expression profile of the genome.

    PubMed

    Lee, Mikyung; Kim, Yangseok

    2009-12-16

    Genomic alterations frequently occur in many cancer patients and play important mechanistic roles in the pathogenesis of cancer. Furthermore, they can modify the expression level of genes due to altered copy number in the corresponding region of the chromosome. An accumulating body of evidence supports the possibility that strong genome-wide correlation exists between DNA content and gene expression. Therefore, more comprehensive analysis is needed to quantify the relationship between genomic alteration and gene expression. A well-designed bioinformatics tool is essential to perform this kind of integrative analysis. A few programs have already been introduced for integrative analysis. However, there are many limitations in their performance of comprehensive integrated analysis using published software because of limitations in implemented algorithms and visualization modules. To address this issue, we have implemented the Java-based program CHESS to allow integrative analysis of two experimental data sets: genomic alteration and genome-wide expression profile. CHESS is composed of a genomic alteration analysis module and an integrative analysis module. The genomic alteration analysis module detects genomic alteration by applying a threshold based method or SW-ARRAY algorithm and investigates whether the detected alteration is phenotype specific or not. On the other hand, the integrative analysis module measures the genomic alteration's influence on gene expression. It is divided into two separate parts. The first part calculates overall correlation between comparative genomic hybridization ratio and gene expression level by applying following three statistical methods: simple linear regression, Spearman rank correlation and Pearson's correlation. In the second part, CHESS detects the genes that are differentially expressed according to the genomic alteration pattern with three alternative statistical approaches: Student's t-test, Fisher's exact test and Chi square test. By successive operations of two modules, users can clarify how gene expression levels are affected by the phenotype specific genomic alterations. As CHESS was developed in both Java application and web environments, it can be run on a web browser or a local machine. It also supports all experimental platforms if a properly formatted text file is provided to include the chromosomal position of probes and their gene identifiers. CHESS is a user-friendly tool for investigating disease specific genomic alterations and quantitative relationships between those genomic alterations and genome-wide gene expression profiling.

  13. Identification of candidate genes involved in neuroblastoma progression by combining genomic and expression microarrays with survival data.

    PubMed

    Łastowska, M; Viprey, V; Santibanez-Koref, M; Wappler, I; Peters, H; Cullinane, C; Roberts, P; Hall, A G; Tweddle, D A; Pearson, A D J; Lewis, I; Burchill, S A; Jackson, M S

    2007-11-22

    Identifying genes, whose expression is consistently altered by chromosomal gains or losses, is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of critical genes within regions of loss or gain in many human cancers.

  14. Development and characterization of K562 cell clones expressing BCL11A-XL: Decreased hemoglobin production with fetal hemoglobin inducers and its rescue with mithramycin

    PubMed Central

    Finotti, Alessia; Gasparello, Jessica; Breveglieri, Giulia; Cosenza, Lucia Carmela; Montagner, Giulia; Bresciani, Alberto; Altamura, Sergio; Bianchi, Nicoletta; Martini, Elisa; Gallerani, Eleonora; Borgatti, Monica; Gambari, Roberto

    2015-01-01

    Induction of fetal hemoglobin (HbF) is considered a promising strategy in the treatment of β-thalassemia, in which production of adult hemoglobin (HbA) is impaired by mutations affecting the β-globin gene. Recent results indicate that B-cell lymphoma/leukemia 11A (BCL11A) is a major repressor of γ-globin gene expression. Therefore, disrupting the binding of the BCL11A transcriptional repressor complex to the γ-globin gene promoter provides a novel approach for inducing expression of the γ-globin genes. To develop a cellular screening system for the identification of BCL11A inhibitors, we produced K562 cell clones with integrated copies of a BCL11A-XL expressing vector. We characterized 12 K562 clones expressing different levels of BCL11A-XL and found that a clear inverse relationship does exist between the levels of BCL11A-XL and the extent of hemoglobinization induced by a panel of HbF inducers. Using mithramycin as an inducer, we found that this molecule was the only HbF inducer efficient in rescuing the ability to differentiate along the erythroid program, even in K562 cell clones expressing high levels of BCL11A-XL, suggesting that BCL11A-XL activity is counteracted by mithramycin. PMID:26342260

  15. Evolution and Expression of Tissue Globins in Ray-Finned Fishes.

    PubMed

    Gallagher, Michael D; Macqueen, Daniel J

    2017-01-01

    The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great-unrecognized diversity in the repertoire and expression of nonblood globins that has arisen during ray-finned fish evolution.

  16. Long non-coding RNA expression patterns in lung tissues of chronic cigarette smoke induced COPD mouse model.

    PubMed

    Zhang, Haiyun; Sun, Dejun; Li, Defu; Zheng, Zeguang; Xu, Jingyi; Liang, Xue; Zhang, Chenting; Wang, Sheng; Wang, Jian; Lu, Wenju

    2018-05-15

    Long non-coding RNAs (lncRNAs) have critical regulatory roles in protein-coding gene expression. Aberrant expression profiles of lncRNAs have been observed in various human diseases. In this study, we investigated transcriptome profiles in lung tissues of chronic cigarette smoke (CS)-induced COPD mouse model. We found that 109 lncRNAs and 260 mRNAs were significantly differential expressed in lungs of chronic CS-induced COPD mouse model compared with control animals. GO and KEGG analyses indicated that differentially expressed lncRNAs associated protein-coding genes were mainly involved in protein processing of endoplasmic reticulum pathway, and taurine and hypotaurine metabolism pathway. The combination of high throughput data analysis and the results of qRT-PCR validation in lungs of chronic CS-induced COPD mouse model, 16HBE cells with CSE treatment and PBMC from patients with COPD revealed that NR_102714 and its associated protein-coding gene UCHL1 might be involved in the development of COPD both in mouse and human. In conclusion, our study demonstrated that aberrant expression profiles of lncRNAs and mRNAs existed in lungs of chronic CS-induced COPD mouse model. From animal models perspective, these results might provide further clues to investigate biological functions of lncRNAs and their potential target protein-coding genes in the pathogenesis of COPD.

  17. A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns

    PubMed Central

    Mollah, Mohammad Manir Hossain; Jamal, Rahman; Mokhtar, Norfilza Mohd; Harun, Roslan; Mollah, Md. Nurul Haque

    2015-01-01

    Background Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE genes. However, most of these methods provide misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression. Results The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure of outlier detection. It assigns smaller weights (≥ 0) to outlying expressions and larger weights (≤ 1) to typical expressions. The distribution of the β-weights is used to calculate the cut-off point, which is compared to the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA. Conclusion Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and proposed) perform almost identically for m = 2 conditions in the absence of outliers. However, the robust BetaEB method and the proposed method exhibited considerably better performance than the other six methods in the presence of outliers. In this case, the BetaEB method exhibited slightly better performance than the proposed method for the small-sample cases, but the the proposed method exhibited much better performance than the BetaEB method for both the small- and large-sample cases in the presence of more than 50% outlying genes. The proposed method also exhibited better performance than the other methods for m > 2 conditions with multiple patterns of expression, where the BetaEB was not extended for this condition. Therefore, the proposed approach would be more suitable and reliable on average for the identification of DE genes between two or more conditions with multiple patterns of expression. PMID:26413858

  18. Role of cardiomyocyte circadian clock in myocardial metabolic adaptation

    USDA-ARS?s Scientific Manuscript database

    Marked circadian rhythmicities in cardiovascular physiology and pathophysiology exist. The cardiomyocyte circadian clock has recently been linked to circadian rhythms in myocardial gene expression, metabolism, and contractile function. For instance, the cardiomyocyte circadian clock is essential f...

  19. Replication-dependent histone genes are actively transcribed in differentiating and aging retinal neurons

    PubMed Central

    Banday, Abdul Rouf; Baumgartner, Marybeth; Al Seesi, Sahar; Karunakaran, Devi Krishna Priya; Venkatesh, Aditya; Congdon, Sean; Lemoine, Christopher; Kilcollins, Ashley M; Mandoiu, Ion; Punzo, Claudio; Kanadia, Rahul N

    2014-01-01

    In the mammalian genome, each histone family contains multiple replication-dependent paralogs, which are found in clusters where their transcription is thought to be coupled to the cell cycle. Here, we wanted to interrogate the transcriptional regulation of these paralogs during retinal development and aging. We employed deep sequencing, quantitative PCR, in situ hybridization (ISH), and microarray analysis, which revealed that replication-dependent histone genes were not only transcribed in progenitor cells but also in differentiating neurons. Specifically, by ISH analysis we found that different histone genes were actively transcribed in a subset of neurons between postnatal day 7 and 14. Interestingly, within a histone family, not all paralogs were transcribed at the same level during retinal development. For example, expression of Hist1h1b was higher embryonically, while that of Hist1h1c was higher postnatally. Finally, expression of replication-dependent histone genes was also observed in the aging retina. Moreover, transcription of replication-dependent histones was independent of rapamycin-mediated mTOR pathway inactivation. Overall, our data suggest the existence of variant nucleosomes produced by the differential expression of the replication-dependent histone genes across retinal development. Also, the expression of a subset of replication-dependent histone isotypes in senescent neurons warrants re-examining these genes as “replication-dependent.” Thus, our findings underscore the importance of understanding the transcriptional regulation of replication-dependent histone genes in the maintenance and functioning of neurons. PMID:25486194

  20. Global Transcriptomic Analysis of Targeted Silencing of Two Paralogous ACC Oxidase Genes in Banana

    PubMed Central

    Xia, Yan; Kuan, Chi; Chiu, Chien-Hsiang; Chen, Xiao-Jing; Do, Yi-Yin; Huang, Pung-Ling

    2016-01-01

    Among 18 1-aminocyclopropane-1-carboxylic acid (ACC) oxidase homologous genes existing in the banana genome there are two genes, Mh-ACO1 and Mh-ACO2, that participate in banana fruit ripening. To better understand the physiological functions of Mh-ACO1 and Mh-ACO2, two hairpin-type siRNA expression vectors targeting both the Mh-ACO1 and Mh-ACO2 were constructed and incorporated into the banana genome by Agrobacterium-mediated transformation. The generation of Mh-ACO1 and Mh-ACO2 RNAi transgenic banana plants was confirmed by Southern blot analysis. To gain insights into the functional diversity and complexity between Mh-ACO1 and Mh-ACO2, transcriptome sequencing of banana fruits using the Illumina next-generation sequencer was performed. A total of 32,093,976 reads, assembled into 88,031 unigenes for 123,617 transcripts were obtained. Significantly enriched Gene Oncology (GO) terms and the number of differentially expressed genes (DEGs) with GO annotation were ‘catalytic activity’ (1327, 56.4%), ‘heme binding’ (65, 2.76%), ‘tetrapyrrole binding’ (66, 2.81%), and ‘oxidoreductase activity’ (287, 12.21%). Real-time RT-PCR was further performed with mRNAs from both peel and pulp of banana fruits in Mh-ACO1 and Mh-ACO2 RNAi transgenic plants. The results showed that expression levels of genes related to ethylene signaling in ripening banana fruits were strongly influenced by the expression of genes associated with ethylene biosynthesis. PMID:27681726

  1. Genome-wide patterns of promoter sharing and co-expression in bovine skeletal muscle.

    PubMed

    Gu, Quan; Nagaraj, Shivashankar H; Hudson, Nicholas J; Dalrymple, Brian P; Reverter, Antonio

    2011-01-12

    Gene regulation by transcription factors (TF) is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs). We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs) and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. The pivotal implication of our research is two-fold: (1) there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2) this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate.

  2. Comparative study of MSX-2, DLX-5, and DLX-7 gene expression during early human tooth development.

    PubMed

    Davideau, J L; Demri, P; Hotton, D; Gu, T T; MacDougall, M; Sharpe, P; Forest, N; Berdal, A

    1999-12-01

    Msx and Dlx family transcription factors are key elements of craniofacial development and act in specific combinations with growth factors to control the position and shape of various skeletal structures in mice. In humans, the mutations of MSX and DLX genes are associated with specific syndromes, such as tooth agenesis, craniosynostosis, and tricho-dento-osseous syndrome. To establish some relationships between those reported human syndromes, previous experimental data in mice, and the expression patterns of MSX and DLX homeogenes in the human dentition, we investigated MSX-2, DLX-5, and DLX-7 expression patterns and compared them in orofacial tissues of 7.5- to 9-wk-old human embryos by using in situ hybridization. Our data showed that MSX-2 was strongly expressed in the progenitor cells of human orofacial skeletal structures, including mandible and maxilla bones, Meckel's cartilage, and tooth germs, as shown for DLX-5. DLX-7 expression was restricted to the vestibular lamina and, later on, to the vestibular part of dental epithelium. The comparison of MSX-2, DLX-5, and DLX-7 expression patterns during the early stages of development of different human tooth types showed the existence of spatially ordered sequences of homeogene expression along the vestibular/lingual axis of dental epithelium. The expression of MSX-2 in enamel knot, as well as the coincident expression of MSX-2, DLX-5, and DLX-7 in a restricted vestibular area of dental epithelium, suggests the existence of various organizing centers involved in the control of human tooth morphogenesis.

  3. ReadqPCR and NormqPCR: R packages for the reading, quality checking and normalisation of RT-qPCR quantification cycle (Cq) data.

    PubMed

    Perkins, James R; Dawes, John M; McMahon, Steve B; Bennett, David L H; Orengo, Christine; Kohl, Matthias

    2012-07-02

    Measuring gene transcription using real-time reverse transcription polymerase chain reaction (RT-qPCR) technology is a mainstay of molecular biology. Technologies now exist to measure the abundance of many transcripts in parallel. The selection of the optimal reference gene for the normalisation of this data is a recurring problem, and several algorithms have been developed in order to solve it. So far nothing in R exists to unite these methods, together with other functions to read in and normalise the data using the chosen reference gene(s). We have developed two R/Bioconductor packages, ReadqPCR and NormqPCR, intended for a user with some experience with high-throughput data analysis using R, who wishes to use R to analyse RT-qPCR data. We illustrate their potential use in a workflow analysing a generic RT-qPCR experiment, and apply this to a real dataset. Packages are available from http://www.bioconductor.org/packages/release/bioc/html/ReadqPCR.htmland http://www.bioconductor.org/packages/release/bioc/html/NormqPCR.html These packages increase the repetoire of RT-qPCR analysis tools available to the R user and allow them to (amongst other things) read their data into R, hold it in an ExpressionSet compatible R object, choose appropriate reference genes, normalise the data and look for differential expression between samples.

  4. Construction and analysis of gene-gene dynamics influence networks based on a Boolean model.

    PubMed

    Mazaya, Maulida; Trinh, Hung-Cuong; Kwon, Yung-Keun

    2017-12-21

    Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. Finally, we found biological evidences showing that the gene pairs which are not molecularly interacting but dynamically influential can be considered for novel gene-gene relationships. Taken together, construction and analysis of the GDI network can be a useful approach to identify novel gene-gene relationships in terms of the dynamical influence.

  5. Transcriptome analysis of Schistosoma mansoni larval development using serial analysis of gene expression (SAGE).

    PubMed

    Taft, A S; Vermeire, J J; Bernier, J; Birkeland, S R; Cipriano, M J; Papa, A R; McArthur, A G; Yoshino, T P

    2009-04-01

    Infection of the snail, Biomphalaria glabrata, by the free-swimming miracidial stage of the human blood fluke, Schistosoma mansoni, and its subsequent development to the parasitic sporocyst stage is critical to establishment of viable infections and continued human transmission. We performed a genome-wide expression analysis of the S. mansoni miracidia and developing sporocyst using Long Serial Analysis of Gene Expression (LongSAGE). Five cDNA libraries were constructed from miracidia and in vitro cultured 6- and 20-day-old sporocysts maintained in sporocyst medium (SM) or in SM conditioned by previous cultivation with cells of the B. glabrata embryonic (Bge) cell line. We generated 21 440 SAGE tags and mapped 13 381 to the S. mansoni gene predictions (v4.0e) either by estimating theoretical 3' UTR lengths or using existing 3' EST sequence data. Overall, 432 transcripts were found to be differentially expressed amongst all 5 libraries. In total, 172 tags were differentially expressed between miracidia and 6-day conditioned sporocysts and 152 were differentially expressed between miracidia and 6-day unconditioned sporocysts. In addition, 53 and 45 tags, respectively, were differentially expressed in 6-day and 20-day cultured sporocysts, due to the effects of exposure to Bge cell-conditioned medium.

  6. Chromosome doubling to overcome the chrysanthemum cross barrier based on insight from transcriptomic and proteomic analyses.

    PubMed

    Zhang, Fengjiao; Hua, Lichun; Fei, Jiangsong; Wang, Fan; Liao, Yuan; Fang, Weimin; Chen, Fadi; Teng, Nianjun

    2016-08-09

    Cross breeding is the most commonly used method in chrysanthemum (Chrysanthemum morifolium) breeding; however, cross barriers always exist in these combinations. Many studies have shown that paternal chromosome doubling can often overcome hybridization barriers during cross breeding, although the underlying mechanism has seldom been investigated. In this study, we performed two crosses: C. morifolium (pollen receptor) × diploid C. nankingense (pollen donor) and C. morifolium × tetraploid C. nankingense. Seeds were obtained only from the latter cross. RNA-Seq and isobaric tags for relative and absolute quantitation (iTRAQ) were used to investigate differentially expressed genes and proteins during key embryo development stages in the latter cross. A previously performed cross, C. morifolium × diploid C. nankingense, was compared to our results and revealed that transcription factors (i.e., the agamous-like MADS-box protein AGL80 and the leucine-rich repeat receptor protein kinase EXS), hormone-responsive genes (auxin-binding protein 1), genes and proteins related to metabolism (ATP-citrate synthase, citrate synthase and malate dehydrogenase) and other genes reported to contribute to embryo development (i.e., LEA, elongation factor and tubulin) had higher expression levels in the C. morifolium × tetraploid C. nankingense cross. In contrast, genes related to senescence and cell death were down-regulated in the C. morifolium × tetraploid C. nankingense cross. The data resources helped elucidate the gene and protein expression profiles and identify functional genes during different development stages. When the chromosomes from the male parent are doubled, the genes contributing to normal embryo developmentare more abundant. However, genes with negative functions were suppressed, suggesting that chromosome doubling may epigenetically inhibit the expression of these genes and allow the embryo to develop normally.

  7. The GmFAD7 gene family from soybean: identification of novel genes and tissue-specific conformations of the FAD7 enzyme involved in desaturase activity.

    PubMed

    Andreu, Vanesa; Lagunas, Beatriz; Collados, Raquel; Picorel, Rafael; Alfonso, Miguel

    2010-07-01

    The FAD7 gene encodes a omega3 fatty acid desaturase which catalyses the production of trienoic fatty acids (TAs) in plant chloroplasts. A novel GmFAD7 gene (named GmFAD7-2) has been identified in soybean, with high homology to the previously annotated GmFAD7 gene. Genomic sequencing analysis together with searches at the soybean genome database further confirmed that both GmFAD7 genes were located in two different loci within the soybean genome, suggesting that the soybean omega3 plastidial desaturase FAD7 is encoded by two different paralogous genes. Both GmFAD7-1 and GmFAD7-2 genes were expressed in all soybean tissues examined, displaying their highest mRNA accumulation in leaves. This expression profile contrasted with GmFAD3A and GmFAD3B mRNA accumulation, which was very low in this tissue. These results suggested a concerted control of plastidial and reticular omega3 desaturase gene expression in soybean mature leaves. Analysis of GmFAD7 protein distribution in different soybean tissues showed that, in mature leaves, two bands were detected, coincident with the higher expression level of both GmFAD7 genes and the highest 18:3 fatty acid accumulation. By contrast, in seeds, where FAD7 activity is low, specific GmFAD7 protein conformations were observed. These GmFAD7 protein conformations were affected in vitro by changes in the redox conditions of thiol groups and iron availability. These results suggest the existence of tissue-specific post-translational regulatory mechanisms affecting the distribution and conformation of the FAD7 enzymes related with the control of its activity.

  8. CEM-designer: design of custom expression microarrays in the post-ENCODE Era.

    PubMed

    Arnold, Christian; Externbrink, Fabian; Hackermüller, Jörg; Reiche, Kristin

    2014-11-10

    Microarrays are widely used in gene expression studies, and custom expression microarrays are popular to monitor expression changes of a customer-defined set of genes. However, the complexity of transcriptomes uncovered recently make custom expression microarray design a non-trivial task. Pervasive transcription and alternative processing of transcripts generate a wealth of interweaved transcripts that requires well-considered probe design strategies and is largely neglected in existing approaches. We developed the web server CEM-Designer that facilitates microarray platform independent design of custom expression microarrays for complex transcriptomes. CEM-Designer covers (i) the collection and generation of a set of unique target sequences from different sources and (ii) the selection of a set of sensitive and specific probes that optimally represents the target sequences. Probe design itself is left to third party software to ensure that probes meet provider-specific constraints. CEM-Designer is available at http://designpipeline.bioinf.uni-leipzig.de. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. Structure, inheritance, and expression of hybrid poplar (Populus trichocarpa x Populus deltoides) phenylalanine ammonia-lyase genes.

    PubMed Central

    Subramaniam, R; Reinold, S; Molitor, E K; Douglas, C J

    1993-01-01

    A heterologous probe encoding phenylalanine ammonia-lyase (PAL) was used to identify PAL clones in cDNA libraries made with RNA from young leaf tissue of two Populus deltoides x P. trichocarpa F1 hybrid clones. Sequence analysis of a 2.4-kb cDNA confirmed its identity as a full-length PAl clone. The predicted amino acid sequence is conserved in comparison with that of PAL genes from several other plants. Southern blot analysis of popular genomic DNA from parental and hybrid individuals, restriction site polymorphism in PAL cDNA clones, and sequence heterogeneity in the 3' ends of several cDNA clones suggested that PAL is encoded by at least two genes that can be distinguished by HindIII restriction site polymorphisms. Clones containing each type of PAL gene were isolated from a poplar genomic library. Analysis of the segregation of PAL-specific HindIII restriction fragment-length polymorphisms demonstrated the existence of two independently segregating PAL loci, one of which was mapped to a linkage group of the poplar genetic map. Developmentally regulated PAL expression in poplar was analyzed using RNA blots. Highest expression was observed in young stems, apical buds, and young leaves. Expression was lower in older stems and undetectable in mature leaves. Cellular localization of PAL expression by in situ hybridization showed very high levels of expression in subepidermal cells of leaves early during leaf development. In stems and petioles, expression was associated with subepidermal cells and vascular tissues. PMID:8108506

  10. The anabolic/androgenic steroid nandrolone exacerbates gene expression modifications induced by mutant SOD1 in muscles of mice models of amyotrophic lateral sclerosis

    PubMed Central

    Galbiati, Mariarita; Onesto, Elisa; Zito, Arianna; Crippa, Valeria; Rusmini, Paola; Mariotti, Raffaella; Bentivoglio, Marina; Bendotti, Caterina; Poletti, Angelo

    2012-01-01

    Anabolic/androgenic steroids (AAS) are drugs that enhance muscle mass, and are often illegally utilized in athletes to improve their performances. Recent data suggest that the increased risk for amyotrophic lateral sclerosis (ALS) in male soccer and football players could be linked to AAS abuse. ALS is a motor neuron disease mainly occurring in sporadic (sALS) forms, but some familial forms (fALS) exist and have been linked to mutations in different genes. Some of these, in their wild type (wt) form, have been proposed as risk factors for sALS, i.e. superoxide dismutase 1 (SOD1) gene, whose mutations are causative of about 20% of fALS. Notably, SOD1 toxicity might occur both in motor neurons and in muscle cells. Using gastrocnemius muscles of mice overexpressing human mutant SOD1 (mutSOD1) at different disease stages, we found that the expression of a selected set of genes associated to muscle atrophy, MyoD, myogenin, atrogin-1, and transforming growth factor (TGF)β1, is up-regulated already at the presymptomatic stage. Atrogin-1 gene expression was increased also in mice overexpressing human wtSOD1. Similar alterations were found in axotomized mouse muscles and in cultured ALS myoblast models. In these ALS models, we then evaluated the pharmacological effects of the synthetic AAS nandrolone on the expression of the genes modified in ALS muscle. Nandrolone administration had no effects on MyoD, myogenin, and atrogin-1 expression, but it significantly increased TGFβ1 expression at disease onset. Altogether, these data suggest that, in fALS, muscle gene expression is altered at early stages, and AAS may exacerbate some of the alterations induced by SOD1 possibly acting as a contributing factor also in sALS. PMID:22178654

  11. The anabolic/androgenic steroid nandrolone exacerbates gene expression modifications induced by mutant SOD1 in muscles of mice models of amyotrophic lateral sclerosis.

    PubMed

    Galbiati, Mariarita; Onesto, Elisa; Zito, Arianna; Crippa, Valeria; Rusmini, Paola; Mariotti, Raffaella; Bentivoglio, Marina; Bendotti, Caterina; Poletti, Angelo

    2012-02-01

    Anabolic/androgenic steroids (AAS) are drugs that enhance muscle mass, and are often illegally utilized in athletes to improve their performances. Recent data suggest that the increased risk for amyotrophic lateral sclerosis (ALS) in male soccer and football players could be linked to AAS abuse. ALS is a motor neuron disease mainly occurring in sporadic (sALS) forms, but some familial forms (fALS) exist and have been linked to mutations in different genes. Some of these, in their wild type (wt) form, have been proposed as risk factors for sALS, i.e. superoxide dismutase 1 (SOD1) gene, whose mutations are causative of about 20% of fALS. Notably, SOD1 toxicity might occur both in motor neurons and in muscle cells. Using gastrocnemius muscles of mice overexpressing human mutant SOD1 (mutSOD1) at different disease stages, we found that the expression of a selected set of genes associated to muscle atrophy, MyoD, myogenin, atrogin-1, and transforming growth factor (TGF)β1, is up-regulated already at the presymptomatic stage. Atrogin-1 gene expression was increased also in mice overexpressing human wtSOD1. Similar alterations were found in axotomized mouse muscles and in cultured ALS myoblast models. In these ALS models, we then evaluated the pharmacological effects of the synthetic AAS nandrolone on the expression of the genes modified in ALS muscle. Nandrolone administration had no effects on MyoD, myogenin, and atrogin-1 expression, but it significantly increased TGFβ1 expression at disease onset. Altogether, these data suggest that, in fALS, muscle gene expression is altered at early stages, and AAS may exacerbate some of the alterations induced by SOD1 possibly acting as a contributing factor also in sALS. Copyright © 2011 Elsevier Ltd. All rights reserved.

  12. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    PubMed

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and repositioning, allowing the grouping and prioritisation of drug repositioning candidates on the basis of putative mode of action.

  13. Specific c-Jun target genes in malignant melanoma.

    PubMed

    Schummer, Patrick; Kuphal, Silke; Vardimon, Lily; Bosserhoff, Anja K; Kappelmann, Melanie

    2016-05-03

    A fundamental event in the development and progression of malignant melanoma is the de-regulation of cancer-relevant transcription factors. We recently showed that c-Jun is a main regulator of melanoma progression and, thus, is the most important member of the AP-1 transcription factor family in this disease. Surprisingly, no cancer-related specific c-Jun target genes in melanoma were described in the literature, so far. Therefore, we focused on pre-existing ChIP-Seq data (Encyclopedia of DNA Elements) of 3 different non-melanoma cell lines to screen direct c-Jun target genes. Here, a specific c-Jun antibody to immunoprecipitate the associated promoter DNA was used. Consequently, we identified 44 direct c-Jun targets and a detailed analysis of 6 selected genes confirmed their deregulation in malignant melanoma. The identified genes were differentially regulated comparing 4 melanoma cell lines and normal human melanocytes and we confirmed their c-Jun dependency. Direct interaction between c-Jun and the promoter/enhancer regions of the identified genes was confirmed by us via ChIP experiments. Interestingly, we revealed that the direct regulation of target gene expression via c-Jun can be independent of the existence of the classical AP-1 (5´-TGA(C/G)TCA-3´) consensus sequence allowing for the subsequent down- or up-regulation of the expression of these cancer-relevant genes. In summary, the results of this study indicate that c-Jun plays a crucial role in the development and progression of malignant melanoma via direct regulation of cancer-relevant target genes and that inhibition of direct c-Jun targets through inhibition of c-Jun is a potential novel therapeutic option for treatment of malignant melanoma.

  14. Metabolic genes in cancer: their roles in tumor progression and clinical implications

    PubMed Central

    Furuta, Eiji; Okuda, Hiroshi; Kobayashi, Aya; Watabe, Kounosuke

    2010-01-01

    Re-programming of metabolic pathways is a hallmark of physiological changes in cancer cells. The expression of certain genes that directly control the rate of key metabolic pathways including glycolysis, lipogenesis and nucleotide synthesis are drastically altered at different stages of tumor progression. These alterations are generally considered as an adaptation of tumor cells; however, they also contribute to the progression of tumor cells to become more aggressive phenotypes. This review summarizes the recent information about the mechanistic link of these genes to oncogenesis and their potential utility as diagnostic markers as well as for therapeutic targets. We particularly focus on three groups of genes; GLUT1, G6PD, TKTL1 and PGI/AMF in glycolytic pathway, ACLY, ACC1 and FAS in lipogenesis and RRM1, RRM2 and TYMS for nucleotide synthesis. All these genes are highly up-regulated in a variety of tumor cells in cancer patients, and they play active roles in tumor progression rather than expressing merely as a consequence of phenotypic change of the cancer cells. Molecular dissection of their orchestrated networks and understanding the exact mechanism of their expression will provide a window of opportunity to target these genes for specific cancer therapy. We also reviewed existing database of gene microarray to validate the utility of these genes for cancer diagnosis. PMID:20122995

  15. Global Expression Profiling of Low Temperature Induced Genes in the Chilling Tolerant Japonica Rice Jumli Marshi

    PubMed Central

    Chawade, Aakash; Lindlöf, Angelica; Olsson, Björn; Olsson, Olof

    2013-01-01

    Low temperature is a key factor that limits growth and productivity of many important agronomical crops worldwide. Rice (Oryza sativa L.) is negatively affected already at temperatures below +10°C and is therefore denoted as chilling sensitive. However, chilling tolerant rice cultivars exist and can be commercially cultivated at altitudes up to 3,050 meters with temperatures reaching as low as +4°C. In this work, the global transcriptional response to cold stress (+4°C) was studied in the Nepalese highland variety Jumli Marshi (spp. japonica) and 4,636 genes were identified as significantly differentially expressed within 24 hours of cold stress. Comparison with previously published microarray data from one chilling tolerant and two sensitive rice cultivars identified 182 genes differentially expressed (DE) upon cold stress in all four rice cultivars and 511 genes DE only in the chilling tolerant rice. Promoter analysis of the 182 genes suggests a complex cross-talk between ABRE and CBF regulons. Promoter analysis of the 511 genes identified over-represented ABRE motifs but not DRE motifs, suggesting a role for ABA signaling in cold tolerance. Moreover, 2,101 genes were DE in Jumli Marshi alone. By chromosomal localization analysis, 473 of these cold responsive genes were located within 13 different QTLs previously identified as cold associated. PMID:24349120

  16. A novel algorithm for simplification of complex gene classifiers in cancer

    PubMed Central

    Wilson, Raphael A.; Teng, Ling; Bachmeyer, Karen M.; Bissonnette, Mei Lin Z.; Husain, Aliya N.; Parham, David M.; Triche, Timothy J.; Wing, Michele R.; Gastier-Foster, Julie M.; Barr, Frederic G.; Hawkins, Douglas S.; Anderson, James R.; Skapek, Stephen X.; Volchenboum, Samuel L.

    2013-01-01

    The clinical application of complex molecular classifiers as diagnostic or prognostic tools has been limited by the time and cost needed to apply them to patients. Using an existing fifty-gene expression signature known to separate two molecular subtypes of the pediatric cancer rhabdomyosarcoma, we show that an exhaustive iterative search algorithm can distill this complex classifier down to two or three features with equal discrimination. We validated the two-gene signatures using three separate and distinct data sets, including one that uses degraded RNA extracted from formalin-fixed, paraffin-embedded material. Finally, to demonstrate the generalizability of our algorithm, we applied it to a lung cancer data set to find minimal gene signatures that can distinguish survival. Our approach can easily be generalized and coupled to existing technical platforms to facilitate the discovery of simplified signatures that are ready for routine clinical use. PMID:23913937

  17. Inference of Gene Regulatory Networks Using Bayesian Nonparametric Regression and Topology Information.

    PubMed

    Fan, Yue; Wang, Xiao; Peng, Qinke

    2017-01-01

    Gene regulatory networks (GRNs) play an important role in cellular systems and are important for understanding biological processes. Many algorithms have been developed to infer the GRNs. However, most algorithms only pay attention to the gene expression data but do not consider the topology information in their inference process, while incorporating this information can partially compensate for the lack of reliable expression data. Here we develop a Bayesian group lasso with spike and slab priors to perform gene selection and estimation for nonparametric models. B-spline basis functions are used to capture the nonlinear relationships flexibly and penalties are used to avoid overfitting. Further, we incorporate the topology information into the Bayesian method as a prior. We present the application of our method on DREAM3 and DREAM4 datasets and two real biological datasets. The results show that our method performs better than existing methods and the topology information prior can improve the result.

  18. Expression Characterization of Stress Genes Under High and Low Temperature Stresses in the Pacific Oyster, Crassostrea gigas.

    PubMed

    Zhu, Qihui; Zhang, Linlin; Li, Li; Que, Huayong; Zhang, Guofan

    2016-04-01

    As a characteristic sessile inhabitant of the intertidal zone, the Pacific oyster Crassostrea gigas occupies one of the most physically stressful environments on earth. With high exposure to terrestrial conditions, oysters must tolerate broad fluctuations in temperature range. However, oysters' cellular and molecular responses to temperature stresses have not been fully characterized. Here, we analyzed oyster transcriptome data under high and low temperatures. We also identified over 30 key temperature stress-responsive candidate genes, which encoded stress proteins such as heat shock proteins and apoptosis-associated proteins. The expression characterization of these genes under short-term cold and hot environments (5 and 35 °C) and long-term cold environments (5 °C) was detected by quantitative real-time PCR. Most of these genes reached expression peaks during the recovery stage after 24 h of heat stress, and these genes were greatly induced around day 3 in long-term cold stress while responded little to short-term cold stress. In addition, in the second heat stress after 2 days of recovery, oysters showed milder expression in these genes and a lower mortality rate, which indicated the existence of plasticity in the oyster's response to heat stress. We confirmed that homeostatic flexibility and anti-apoptosis might be crucial centers of temperature stress responses in oysters. Furthermore, we analyzed stress gene families in 11 different species and found that the linage-specific expansion of stress genes might be implicated in adaptive evolution. These results indicated that both plasticity and evolution played an important role in the stress response adaptation of oysters.

  19. Transcriptional Activity, Chromosomal Distribution and Expression Effects of Transposable Elements in Coffea Genomes

    PubMed Central

    da Silva, Carlos R. M.; Andrade, Alan C.; Marraccini, Pierre; Teixeira, João B.; Carazzolle, Marcelo F.; Pereira, Gonçalo A. G.; Pereira, Luiz Filipe P.; Vanzela, André L. L.; Wang, Lu; Jordan, I. King; Carareto, Claudia M. A.

    2013-01-01

    Plant genomes are massively invaded by transposable elements (TEs), many of which are located near host genes and can thus impact gene expression. In flowering plants, TE expression can be activated (de-repressed) under certain stressful conditions, both biotic and abiotic, as well as by genome stress caused by hybridization. In this study, we examined the effects of these stress agents on TE expression in two diploid species of coffee, Coffea canephora and C. eugenioides, and their allotetraploid hybrid C. arabica. We also explored the relationship of TE repression mechanisms to host gene regulation via the effects of exonized TE sequences. Similar to what has been seen for other plants, overall TE expression levels are low in Coffea plant cultivars, consistent with the existence of effective TE repression mechanisms. TE expression patterns are highly dynamic across the species and conditions assayed here are unrelated to their classification at the level of TE class or family. In contrast to previous results, cell culture conditions per se do not lead to the de-repression of TE expression in C. arabica. Results obtained here indicate that differing plant drought stress levels relate strongly to TE repression mechanisms. TEs tend to be expressed at significantly higher levels in non-irrigated samples for the drought tolerant cultivars but in drought sensitive cultivars the opposite pattern was shown with irrigated samples showing significantly higher TE expression. Thus, TE genome repression mechanisms may be finely tuned to the ideal growth and/or regulatory conditions of the specific plant cultivars in which they are active. Analysis of TE expression levels in cell culture conditions underscored the importance of nonsense-mediated mRNA decay (NMD) pathways in the repression of Coffea TEs. These same NMD mechanisms can also regulate plant host gene expression via the repression of genes that bear exonized TE sequences. PMID:24244387

  20. The Malus domestica sugar transporter gene family: identifications based on genome and expression profiling related to the accumulation of fruit sugars

    PubMed Central

    Wei, Xiaoyu; Liu, Fengli; Chen, Cheng; Ma, Fengwang; Li, Mingjun

    2014-01-01

    In plants, sugar transporters are involved not only in long-distance transport, but also in sugar accumulations in sink cells. To identify members of sugar transporter gene families and to analyze their function in fruit sugar accumulation, we conducted a phylogenetic analysis of the Malus domestica genome. Expression profiling was performed with shoot tips, mature leaves, and developed fruit of “Gala” apple. Genes for sugar alcohol [including 17 sorbitol transporters (SOTs)], sucrose, and monosaccharide transporters, plus SWEET genes, were selected as candidates in 31, 9, 50, and 27 loci, respectively, of the genome. The monosaccharide transporter family appears to include five subfamilies (30 MdHTs, 8 MdEDR6s, 5 MdTMTs, 3 MdvGTs, and 4 MdpGLTs). Phylogenetic analysis of the protein sequences indicated that orthologs exist among Malus, Vitis, and Arabidopsis. Investigations of transcripts revealed that 68 candidate transporters are expressed in apple, albeit to different extents. Here, we discuss their possible roles based on the relationship between their levels of expression and sugar concentrations. The high accumulation of fructose in apple fruit is possibly linked to the coordination and cooperation between MdTMT1/2 and MdEDR6. By contrast, these fruits show low MdSWEET4.1 expression and a high flux of fructose produced from sorbitol. Our study provides an exhaustive survey of sugar transporter genes and demonstrates that sugar transporter gene families in M. domestica are comparable to those in other species. Expression profiling of these transporters will likely contribute to improving our understanding of their physiological functions in fruit formation and the development of sweetness properties. PMID:25414708

  1. Heterologous expression of the filarial nematode alt gene products reveals their potential to inhibit immune function

    PubMed Central

    Gomez-Escobar, Natalia; Bennett, Clare; Prieto-Lafuente, Lidia; Aebischer, Toni; Blackburn, Clare C; Maizels, Rick M

    2005-01-01

    Background Parasites exploit sophisticated strategies to evade host immunity that require both adaptation of existing genes and evolution of new gene families. We have addressed this question by testing the immunological function of novel genes from helminth parasites, in which conventional transgenesis is not yet possible. We investigated two such novel genes from Brugia malayi termed abundant larval transcript (alt), expression of which reaches ~5% of total transcript at the time parasites enter the human host. Results To test the hypothesis that ALT proteins modulate host immunity, we adopted an alternative transfection strategy to express these products in the protozoan parasite Leishmania mexicana. We then followed the course of infection in vitro in macrophages and in vivo in mice. Expression of ALT proteins, but not a truncated mutant, conferred greater infectivity of macrophages in vitro, reaching 3-fold higher parasite densities. alt-transfected parasites also caused accelerated disease in vivo, and fewer mice were able to clear infection of organisms expressing ALT. alt-transfected parasites were more resistant to IFN-γ-induced killing by macrophages. Expression profiling of macrophages infected with transgenic L. mexicana revealed consistently higher levels of GATA-3 and SOCS-1 transcripts, both associated with the Th2-type response observed in in vivo filarial infection. Conclusion Leishmania transfection is a tractable and informative approach to determining immunological functions of single genes from heterologous organisms. In the case of the filarial ALT proteins, our data suggest that they may participate in the Th2 bias observed in the response to parasite infection by modulating cytokine-induced signalling within immune system cells. PMID:15788098

  2. Seasonal differences in the testicular transcriptome profile of free-living European beavers (Castor fiber L.) determined by the RNA-Seq method

    PubMed Central

    Paukszto, Łukasz; Jastrzębski, Jan P.; Czerwińska, Joanna; Chojnowska, Katarzyna; Kamińska, Barbara; Kurzyńska, Aleksandra; Smolińska, Nina; Giżejewski, Zygmunt; Kamiński, Tadeusz

    2017-01-01

    The European beaver (Castor fiber L.) is an important free-living rodent that inhabits Eurasian temperate forests. Beavers are often referred to as ecosystem engineers because they create or change existing habitats, enhance biodiversity and prepare the environment for diverse plant and animal species. Beavers are protected in most European Union countries, but their genomic background remains unknown. In this study, gene expression patterns in beaver testes and the variations in genetic expression in breeding and non-breeding seasons were determined by high-throughput transcriptome sequencing. Paired-end sequencing in the Illumina HiSeq 2000 sequencer produced a total of 373.06 million of high-quality reads. De novo assembly of contigs yielded 130,741 unigenes with an average length of 1,369.3 nt, N50 value of 1,734, and average GC content of 46.51%. A comprehensive analysis of the testicular transcriptome revealed more than 26,000 highly expressed unigenes which exhibited the highest homology with Rattus norvegicus and Ictidomys tridecemlineatus genomes. More than 8,000 highly expressed genes were found to be involved in fundamental biological processes, cellular components or molecular pathways. The study also revealed 42 genes whose regulation differed between breeding and non-breeding seasons. During the non-breeding period, the expression of 37 genes was up-regulated, and the expression of 5 genes was down-regulated relative to the breeding season. The identified genes encode molecules which are involved in signaling transduction, DNA repair, stress responses, inflammatory processes, metabolism and steroidogenesis. Our results pave the way for further research into season-dependent variations in beaver testes. PMID:28678806

  3. The Malus domestica sugar transporter gene family: identifications based on genome and expression profiling related to the accumulation of fruit sugars.

    PubMed

    Wei, Xiaoyu; Liu, Fengli; Chen, Cheng; Ma, Fengwang; Li, Mingjun

    2014-01-01

    In plants, sugar transporters are involved not only in long-distance transport, but also in sugar accumulations in sink cells. To identify members of sugar transporter gene families and to analyze their function in fruit sugar accumulation, we conducted a phylogenetic analysis of the Malus domestica genome. Expression profiling was performed with shoot tips, mature leaves, and developed fruit of "Gala" apple. Genes for sugar alcohol [including 17 sorbitol transporters (SOTs)], sucrose, and monosaccharide transporters, plus SWEET genes, were selected as candidates in 31, 9, 50, and 27 loci, respectively, of the genome. The monosaccharide transporter family appears to include five subfamilies (30 MdHTs, 8 MdEDR6s, 5 MdTMTs, 3 MdvGTs, and 4 MdpGLTs). Phylogenetic analysis of the protein sequences indicated that orthologs exist among Malus, Vitis, and Arabidopsis. Investigations of transcripts revealed that 68 candidate transporters are expressed in apple, albeit to different extents. Here, we discuss their possible roles based on the relationship between their levels of expression and sugar concentrations. The high accumulation of fructose in apple fruit is possibly linked to the coordination and cooperation between MdTMT1/2 and MdEDR6. By contrast, these fruits show low MdSWEET4.1 expression and a high flux of fructose produced from sorbitol. Our study provides an exhaustive survey of sugar transporter genes and demonstrates that sugar transporter gene families in M. domestica are comparable to those in other species. Expression profiling of these transporters will likely contribute to improving our understanding of their physiological functions in fruit formation and the development of sweetness properties.

  4. Functional regression method for whole genome eQTL epistasis analysis with sequencing data.

    PubMed

    Xu, Kelin; Jin, Li; Xiong, Momiao

    2017-05-18

    Epistasis plays an essential rule in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position level read count curves. A single number for measuring gene expression which is widely used for microarray measured gene expression analysis is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pair-wises SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and use all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genome Project. The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have a broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice of the epistatic analysis with sequencing data.

  5. Selection and evaluation of novel reference genes for quantitative reverse transcription PCR (qRT-PCR) based on genome and transcriptome data in Brassica napus L.

    PubMed

    Yang, Hongli; Liu, Jing; Huang, Shunmou; Guo, Tingting; Deng, Linbin; Hua, Wei

    2014-03-15

    Selection of reference genes in Brassica napus, a tetraploid (4×) species, is a very difficult task without information on genome and transcriptome. By now, only several traditional reference genes which show significant expression differentiation under different conditions are used in B. napus. In the present study, based on genome and transcriptome data of the rapeseed Zhongshuang-11 cultivar, 14 candidate reference genes were screened for investigation in different tissues, cultivars, and treated conditions of B. napus. These genes were as follows: ELF5, ENTH, F-BOX7, F-BOX2, FYPP1, GDI1, GYF, MCP2d, OTP80, PPR, SPOC, Unknown1, Unknown2 and UBA. Among them, excluding GYF and FYPP1, another 12 genes, were identified to perform better than traditional reference genes ACTIN7 and GAPDH. To further validate the accuracy of the newly developed reference genes in normalization, expression levels of BnCAT1 (B. napus catalase 1) in different rapeseed tissues and seedlings under stress conditions were normalized by the three most stable reference genes PPR, GDI1, and ENTH and little difference existed in normalization results. To the best of our knowledge, this is the first time B. napus reference genes have been provided with the help of complete genome and transcriptome information. The new reference genes provided in this study are more accurate than previously reported reference genes in quantifying expression levels of B. napus genes. Crown Copyright © 2014. Published by Elsevier B.V. All rights reserved.

  6. Integrative analysis for identification of shared markers from various functional cells/tissues for rheumatoid arthritis.

    PubMed

    Xia, Wei; Wu, Jian; Deng, Fei-Yan; Wu, Long-Fei; Zhang, Yong-Hong; Guo, Yu-Fan; Lei, Shu-Feng

    2017-02-01

    Rheumatoid arthritis (RA) is a systemic autoimmune disease. So far, it is unclear whether there exist common RA-related genes shared in different tissues/cells. In this study, we conducted an integrative analysis on multiple datasets to identify potential shared genes that are significant in multiple tissues/cells for RA. Seven microarray gene expression datasets representing various RA-related tissues/cells were downloaded from the Gene Expression Omnibus (GEO). Statistical analyses, testing both marginal and joint effects, were conducted to identify significant genes shared in various samples. Followed-up analyses were conducted on functional annotation clustering analysis, protein-protein interaction (PPI) analysis, gene-based association analysis, and ELISA validation analysis in in-house samples. We identified 18 shared significant genes, which were mainly involved in the immune response and chemokine signaling pathway. Among the 18 genes, eight genes (PPBP, PF4, HLA-F, S100A8, RNASEH2A, P2RY6, JAG2, and PCBP1) interact with known RA genes. Two genes (HLA-F and PCBP1) are significant in gene-based association analysis (P = 1.03E-31, P = 1.30E-2, respectively). Additionally, PCBP1 also showed differential protein expression levels in in-house case-control plasma samples (P = 2.60E-2). This study represented the first effort to identify shared RA markers from different functional cells or tissues. The results suggested that one of the shared genes, i.e., PCBP1, is a promising biomarker for RA.

  7. Genome-wide DNA methylomes from discrete developmental stages reveal the predominance of non-CpG methylation in Tribolium castaneum

    PubMed Central

    Song, Xiaowen; Huang, Fei; Liu, Juanjuan; Li, Chengjun; Gao, Shanshan; Wu, Wei; Zhai, Mengfan; Yu, Xiaojuan; Xiong, Wenfeng; Xie, Jia

    2017-01-01

    Abstract Cytosine DNA methylation is a vital epigenetic regulator of eukaryotic development. Whether this epigenetic modification occurs in Tribolium castaneum has been controversial, its distribution pattern and functions have not been established. Here, using bisulphite sequencing (BS-Seq), we confirmed the existence of DNA methylation and described the methylation profiles of the four life stages of T. castaneum. In the T. castaneum genome, both symmetrical CpG and non-CpG methylcytosines were observed. Symmetrical CpG methylation, which was catalysed by DNMT1 and occupied a small part in T. castaneum methylome, was primarily enriched in gene bodies and was positively correlated with gene expression levels. Asymmetrical non-CpG methylation, which was predominant in the methylome, was strongly concentrated in intergenic regions and introns but absent from exons. Gene body methylation was negatively correlated with gene expression levels. The distribution pattern and functions of this type of methylation were similar only to the methylome of Drosophila melanogaster, which further supports the existence of a novel methyltransferase in the two species responsible for this type of methylation. This first life-cycle methylome of T. castaneum reveals a novel and unique methylation pattern, which will contribute to the further understanding of the variety and functions of DNA methylation in eukaryotes. PMID:28449092

  8. Characterization of two trpE genes encoding anthranilate synthase {alpha}-subunit in Azospirillum brasilense

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ge Shimei; Xie Baoen; Chen Sanfeng

    2006-03-10

    The previous report from our laboratory has recently identified a new trpE gene (termed trpE {sub 2}) which exists independently in Azospirillum brasilense Yu62. In this study, amplification of trpE(G) (termed trpE {sub 1}(G) here) confirmed that there are two copies of trpE gene, one trpE being fused into trpG while the other trpE existed independently. This is First report to suggest that two copies of the trpE gene exist in this bacterium. Comparison of the nucleotide sequence demonstrated that putative leader peptide, terminator, and anti-terminator were found upstream of trpE {sub 1}(G) while these sequence features did not existmore » in front of trpE {sub 2}. The {beta}-galactosidase activity of an A. brasilense strain carrying a trpE {sub 2}-lacZ fusion remained constant at different tryptophan concentrations, but the {beta}-galactosidase activity of the same strain carrying a trpE {sub 1}(G)-lacZ fusion decreased as the tryptophan concentration increased. These data suggest that the expression of trpE {sub 1}(G) is regulated at the transcriptional level by attenuation while trpE {sub 2} is constantly expressed. The anthranilate synthase assays with trpE {sub 1}(G){sup -} and trpE {sub 2} {sup -} mutants demonstrated that TrpE{sub 1}(G) fusion protein is feedback inhibited by tryptophan while TrpE{sub 2} protein is not. We also found that both trpE {sub 1}(G) and trpE {sub 2} gene products were involved in IAA synthesis.« less

  9. Circular RNA is expressed across the eukaryotic tree of life.

    PubMed

    Wang, Peter L; Bao, Yun; Yee, Muh-Ching; Barrett, Steven P; Hogan, Gregory J; Olsen, Mari N; Dinneny, José R; Brown, Patrick O; Salzman, Julia

    2014-01-01

    An unexpectedly large fraction of genes in metazoans (human, mouse, zebrafish, worm, fruit fly) express high levels of circularized RNAs containing canonical exons. Here we report that circular RNA isoforms are found in diverse species whose most recent common ancestor existed more than one billion years ago: fungi (Schizosaccharomyces pombe and Saccharomyces cerevisiae), a plant (Arabidopsis thaliana), and protists (Plasmodium falciparum and Dictyostelium discoideum). For all species studied to date, including those in this report, only a small fraction of the theoretically possible circular RNA isoforms from a given gene are actually observed. Unlike metazoans, Arabidopsis, D. discoideum, P. falciparum, S. cerevisiae, and S. pombe have very short introns (∼ 100 nucleotides or shorter), yet they still produce circular RNAs. A minority of genes in S. pombe and P. falciparum have documented examples of canonical alternative splicing, making it unlikely that all circular RNAs are by-products of alternative splicing or 'piggyback' on signals used in alternative RNA processing. In S. pombe, the relative abundance of circular to linear transcript isoforms changed in a gene-specific pattern during nitrogen starvation. Circular RNA may be an ancient, conserved feature of eukaryotic gene expression programs.

  10. Circular RNA Is Expressed across the Eukaryotic Tree of Life

    PubMed Central

    Wang, Peter L.; Bao, Yun; Yee, Muh-Ching; Barrett, Steven P.; Hogan, Gregory J.; Olsen, Mari N.; Dinneny, José R.; Brown, Patrick O.; Salzman, Julia

    2014-01-01

    An unexpectedly large fraction of genes in metazoans (human, mouse, zebrafish, worm, fruit fly) express high levels of circularized RNAs containing canonical exons. Here we report that circular RNA isoforms are found in diverse species whose most recent common ancestor existed more than one billion years ago: fungi (Schizosaccharomyces pombe and Saccharomyces cerevisiae), a plant (Arabidopsis thaliana), and protists (Plasmodium falciparum and Dictyostelium discoideum). For all species studied to date, including those in this report, only a small fraction of the theoretically possible circular RNA isoforms from a given gene are actually observed. Unlike metazoans, Arabidopsis, D. discoideum, P. falciparum, S. cerevisiae, and S. pombe have very short introns (∼100 nucleotides or shorter), yet they still produce circular RNAs. A minority of genes in S. pombe and P. falciparum have documented examples of canonical alternative splicing, making it unlikely that all circular RNAs are by-products of alternative splicing or ‘piggyback’ on signals used in alternative RNA processing. In S. pombe, the relative abundance of circular to linear transcript isoforms changed in a gene-specific pattern during nitrogen starvation. Circular RNA may be an ancient, conserved feature of eukaryotic gene expression programs. PMID:24609083

  11. Negative and positive regulation by a short segment in the 5'-flanking region of the human cytomegalovirus major immediate-early gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nelson, J.A.; Reynolds-Kohler, C.; Smith, B.A.

    1987-11-01

    To analyze the significance of inducible DNase I-hypersensitive sites occurring in the 5'-flanking sequence of the major immediate-early gene of human cytomegalovirus (HCMV), various deleted portions of the HCMV immediate-early promoter regulatory region were attached to the chloramphenicol acetyltransferase (CAT) gene and assayed for activity in transiently transfected undifferentiated and differentiated human teratocarcinoma cells, Tera-2. Assays of progressive deletions in the promoter regulatory region indicated that removal of a 395-base-pair portion of this element (nucleotides -750 to -1145) containing two inducible DNase I sites which correlate with gene expression resulted in a 7.5-fold increase in CAT activity in undifferentiated cells.more » However, in permissive differentiated Tera-2, human foreskin fibroblast, and HeLa cells, removal of this regulatory region resulted in decreased activity. In addition, attachment of this HCMV upstream element to a homologous or heterologous promoter increased activity three-to fivefold in permissive cells. Therefore, a cis regulatory element exists 5' to the enhancer of the major immediate-early gene of HCMV. This element negatively modulates expression in nonpermissive cells but positively influences expression in permissive cells.« less

  12. Expression profiling of Ribosomal Protein gene family in dehydration stress responses and characterization of transgenic rice plants overexpressing RPL23A for water-use efficiency and tolerance to drought and salt stresses

    NASA Astrophysics Data System (ADS)

    Moin, Mazahar; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.

    2017-11-01

    Our previous findings on the screening of a large-pool of activation tagged rice plants grown under limited water conditions revealed the activation of Ribosomal Protein Large (RPL) subunit genes, RPL6 and RPL23A in two mutants that exhibited high water-use efficiency (WUE) with the genes getting activated by the integrated 4x enhancers (Moin et al., 2016a). In continuation of these findings, we have comprehensively characterized the Ribosomal Protein (RP) gene family including both small (RPS) and large (RPL) subunits, which have been identified to be encoded by at least 70 representative genes; RP-genes exist as multiple expressed copies with high nucleotide and amino acid sequence similarity. The differential expression of all the representative genes in rice was performed under limited water and drought conditions at progressive time intervals in the present study. More than 50% of the RP genes were upregulated in both shoot and root tissues. Some of them exhibited an overlap in the upregulation under both the treatments indicating that they might have a common role in inducing tolerance under limited water and drought conditions. Among the genes that became significantly upregulated in both the tissues and under both the treatments are RPL6, 7, 23A, 24 and 31 and RPS4, 10 and 18a. To further validate the role of RP genes in WUE and inducing tolerance to other stresses, we have raised transgenic plants overexpressing RPL23A in rice. The high expression lines of RPL23A exhibited low Δ13C, increased quantum efficiency along with suitable growth and yield parameters with respect to negative control under the conditions of limited water availability. The constitutive expression of RPL23A was also associated with transcriptional upregulation of many other RPL and RPS genes. The seedlings of RPL23A high expression lines also showed a significant increase in fresh weight, root length, proline and chlorophyll contents under simulated drought and salt stresses. Taken together, our findings provide a secure basis for the RPL gene family expression as a potential resource for exploring abiotic stress tolerant properties in rice.

  13. Molecular cloning and expression analysis of sea bass (Dicentrarchus labrax L.) tumor necrosis factor-alpha (TNF-alpha).

    PubMed

    Nascimento, Diana S; Pereira, Pedro J B; Reis, Marta I R; do Vale, Ana; Zou, Jun; Silva, Manuel T; Secombes, Christopher J; dos Santos, Nuno M S

    2007-09-01

    In the search for pro-inflammatory genes in sea bass a TNF-alpha gene was cloned and sequenced. The sea bass TNF-alpha (sbTNF-alpha) putative protein conserves the TNF-alpha family signature, as well as the two cysteines usually involved in the formation of a disulfide bond. The mouse TNF-alpha Thr-Leu cleavage sequence and a potential transmembrane domain were also found, suggesting that sbTNF-alpha exists as two forms: a approximately 28 kDa membrane-bound form and a approximately 18.4 kDa soluble protein. The single copy sbTNF-alpha gene contains a four exon-three intron structure similar to other known TNF-alpha genes. Homology modeling of sbTNF-alpha is compatible with the trimeric quaternary architecture of its mammalian counterparts. SbTNF-alpha is constitutively expressed in several unstimulated tissues, and was not up-regulated in the spleen and head-kidney, in response to UV-killed Photobacterium damselae subsp. piscicida. However, an increase of sbTNF-alpha expression was detected in the head-kidney during an experimental infection using the same pathogen.

  14. Semi-supervised prediction of gene regulatory networks using machine learning algorithms.

    PubMed

    Patel, Nihir; Wang, Jason T L

    2015-10-01

    Use of computational methods to predict gene regulatory networks (GRNs) from gene expression data is a challenging task. Many studies have been conducted using unsupervised methods to fulfill the task; however, such methods usually yield low prediction accuracies due to the lack of training data. In this article, we propose semi-supervised methods for GRN prediction by utilizing two machine learning algorithms, namely, support vector machines (SVM) and random forests (RF). The semi-supervised methods make use of unlabelled data for training. We investigated inductive and transductive learning approaches, both of which adopt an iterative procedure to obtain reliable negative training data from the unlabelled data. We then applied our semi-supervised methods to gene expression data of Escherichia coli and Saccharomyces cerevisiae, and evaluated the performance of our methods using the expression data. Our analysis indicated that the transductive learning approach outperformed the inductive learning approach for both organisms. However, there was no conclusive difference identified in the performance of SVM and RF. Experimental results also showed that the proposed semi-supervised methods performed better than existing supervised methods for both organisms.

  15. CnidBase: The Cnidarian Evolutionary Genomics Database

    PubMed Central

    Ryan, Joseph F.; Finnerty, John R.

    2003-01-01

    CnidBase, the Cnidarian Evolutionary Genomics Database, is a tool for investigating the evolutionary, developmental and ecological factors that affect gene expression and gene function in cnidarians. In turn, CnidBase will help to illuminate the role of specific genes in shaping cnidarian biodiversity in the present day and in the distant past. CnidBase highlights evolutionary changes between species within the phylum Cnidaria and structures genomic and expression data to facilitate comparisons to non-cnidarian metazoans. CnidBase aims to further the progress that has already been made in the realm of cnidarian evolutionary genomics by creating a central community resource which will help drive future research and facilitate more accurate classification and comparison of new experimental data with existing data. CnidBase is available at http://cnidbase.bu.edu/. PMID:12519972

  16. Evaluation of a toxicogenomic approach to the local lymph node assay (LLNA).

    PubMed

    Boverhof, Darrell R; Gollapudi, B Bhaskar; Hotchkiss, Jon A; Osterloh-Quiroz, Mandy; Woolhiser, Michael R

    2009-02-01

    Genomic technologies have the potential to enhance and complement existing toxicology endpoints; however, assessment of these approaches requires a systematic evaluation including a robust experimental design with genomic endpoints anchored to traditional toxicology endpoints. The present study was conducted to assess the sensitivity of genomic responses when compared with the traditional local lymph node assay (LLNA) endpoint of lymph node cell proliferation and to evaluate the responses for their ability to provide insights into mode of action. Female BALB/c mice were treated with the sensitizer trimellitic anhydride (TMA), following the standard LLNA dosing regimen, at doses of 0.1, 1, or 10% and traditional tritiated thymidine ((3)HTdR) incorporation and gene expression responses were monitored in the auricular lymph nodes. Additional mice dosed with either vehicle or 10% TMA and sacrificed on day 4 or 10, were also included to examine temporal effects on gene expression. Analysis of (3)HTdR incorporation revealed TMA-induced stimulation indices of 2.8, 22.9, and 61.0 relative to vehicle with an EC(3) of 0.11%. Examination of the dose-response gene expression responses identified 9, 833, and 2122 differentially expressed genes relative to vehicle for the 0.1, 1, and 10% TMA dose groups, respectively. Calculation of EC(3) values for differentially expressed genes did not identify a response that was more sensitive than the (3)HTdR value, although a number of genes displayed comparable sensitivity. Examination of temporal responses revealed 1760, 1870, and 953 differentially expressed genes at the 4-, 6-, and 10-day time points respectively. Functional analysis revealed many responses displayed dose- and time-specific induction patterns within the functional categories of cellular proliferation and immune response, including numerous immunoglobin genes which were highly induced at the day 10 time point. Overall, these experiments have systematically illustrated the potential utility of genomic endpoints to enhance the LLNA and support further exploration of this approach through examination of a more diverse array of chemicals.

  17. Comparison of gene expression profiles altered by comfrey and riddelliine in rat liver

    PubMed Central

    Guo, Lei; Mei, Nan; Dial, Stacey; Fuscoe, James; Chen, Tao

    2007-01-01

    Background Comfrey (Symphytum officinale) is a perennial plant and has been consumed by humans as a vegetable, a tea and an herbal medicine for more than 2000 years. It, however, is hepatotoxic and carcinogenic in experimental animals and hepatotoxic in humans. Pyrrolizidine alkaloids (PAs) exist in many plants and many of them cause liver toxicity and/or cancer in humans and experimental animals. In our previous study, we found that the mutagenicity of comfrey was associated with the PAs contained in the plant. Therefore, we suggest that carcinogenicity of comfrey result from those PAs. To confirm our hypothesis, we compared the expression of genes and processes of biological functions that were altered by comfrey (mixture of the plant with PAs) and riddelliine (a prototype of carcinogenic PA) in rat liver for carcinogenesis in this study. Results Groups of 6 Big Blue Fisher 344 rats were treated with riddelliine at 1 mg/kg body weight by gavage five times a week for 12 weeks or fed a diet containing 8% comfrey root for 12 weeks. Animals were sacrificed one day after the last treatment and the livers were isolated for gene expression analysis. The gene expressions were investigated using Applied Biosystems Rat Whole Genome Survey Microarrays and the biological functions were analyzed with Ingenuity Analysis Pathway software. Although there were large differences between the significant genes and between the biological processes that were altered by comfrey and riddelliine, there were a number of common genes and function processes that were related to carcinogenesis. There was a strong correlation between the two treatments for fold-change alterations in expression of drug metabolizing and cancer-related genes. Conclusion Our results suggest that the carcinogenesis-related gene expression patterns resulting from the treatments of comfrey and riddelliine are very similar, and PAs contained in comfrey are the main active components responsible for carcinogenicity of the plant. PMID:18047722

  18. De Novo Foliar Transcriptome of Chenopodium amaranticolor and Analysis of Its Gene Expression During Virus-Induced Hypersensitive Response

    PubMed Central

    Zhang, Yongqiang; Pei, Xinwu; Zhang, Chao; Lu, Zifeng; Wang, Zhixing; Jia, Shirong; Li, Weimin

    2012-01-01

    Background The hypersensitive response (HR) system of Chenopodium spp. confers broad-spectrum virus resistance. However, little knowledge exists at the genomic level for Chenopodium, thus impeding the advanced molecular research of this attractive feature. Hence, we took advantage of RNA-seq to survey the foliar transcriptome of C. amaranticolor, a Chenopodium species widely used as laboratory indicator for pathogenic viruses, in order to facilitate the characterization of the HR-type of virus resistance. Methodology and Principal Findings Using Illumina HiSeq™ 2000 platform, we obtained 39,868,984 reads with 3,588,208,560 bp, which were assembled into 112,452 unigenes (3,847 clusters and 108,605 singletons). BlastX search against the NCBI NR database identified 61,698 sequences with a cut-off E-value above 10−5. Assembled sequences were annotated with gene descriptions, GO, COG and KEGG terms, respectively. A total number of 738 resistance gene analogs (RGAs) and homology sequences of 6 key signaling proteins within the R proteins-directed signaling pathway were identified. Based on this transcriptome data, we investigated the gene expression profiles over the stage of HR induced by Tobacco mosaic virus and Cucumber mosaic virus by using digital gene expression analysis. Numerous candidate genes specifically or commonly regulated by these two distinct viruses at early and late stages of the HR were identified, and the dynamic changes of the differently expressed genes enriched in the pathway of plant-pathogen interaction were particularly emphasized. Conclusions To our knowledge, this study is the first description of the genetic makeup of C. amaranticolor, providing deep insight into the comprehensive gene expression information at transcriptional level in this species. The 738 RGAs as well as the differentially regulated genes, particularly the common genes regulated by both TMV and CMV, are suitable candidates which merit further functional characterization to dissect the molecular mechanisms and regulatory pathways of the HR-type of virus resistance in Chenopodium. PMID:23029338

  19. Discretization provides a conceptually simple tool to build expression networks.

    PubMed

    Vass, J Keith; Higham, Desmond J; Mudaliar, Manikhandan A V; Mao, Xuerong; Crowther, Daniel J

    2011-04-18

    Biomarker identification, using network methods, depends on finding regular co-expression patterns; the overall connectivity is of greater importance than any single relationship. A second requirement is a simple algorithm for ranking patients on how relevant a gene-set is. For both of these requirements discretized data helps to first identify gene cliques, and then to stratify patients.We explore a biologically intuitive discretization technique which codes genes as up- or down-regulated, with values close to the mean set as unchanged; this allows a richer description of relationships between genes than can be achieved by positive and negative correlation. We find a close agreement between our results and the template gene-interactions used to build synthetic microarray-like data by SynTReN, which synthesizes "microarray" data using known relationships which are successfully identified by our method.We are able to split positive co-regulation into up-together and down-together and negative co-regulation is considered as directed up-down relationships. In some cases these exist in only one direction, with real data, but not with the synthetic data. We illustrate our approach using two studies on white blood cells and derived immortalized cell lines and compare the approach with standard correlation-based computations. No attempt is made to distinguish possible causal links as the search for biomarkers would be crippled by losing highly significant co-expression relationships. This contrasts with approaches like ARACNE and IRIS.The method is illustrated with an analysis of gene-expression for energy metabolism pathways. For each discovered relationship we are able to identify the samples on which this is based in the discretized sample-gene matrix, along with a simplified view of the patterns of gene expression; this helps to dissect the gene-sample relevant to a research topic--identifying sets of co-regulated and anti-regulated genes and the samples or patients in which this relationship occurs.

  20. Transcriptome Analysis of Chlorantraniliprole Resistance Development in the Diamondback Moth Plutella xylostella

    PubMed Central

    Hu, Zhendi; Chen, Huanyu; Yin, Fei; Li, Zhenyu; Dong, Xiaolin; Zhang, Deyong; Ren, Shunxiang; Feng, Xia

    2013-01-01

    Background The diamondback moth Plutella xyllostella has developed a high level of resistance to the latest insecticide chlorantraniliprole. A better understanding of P. xylostella’s resistance mechanism to chlorantraniliprole is needed to develop effective approaches for insecticide resistance management. Principal Findings To provide a comprehensive insight into the resistance mechanisms of P. xylostella to chlorantraniliprole, transcriptome assembly and tag-based digital gene expression (DGE) system were performed using Illumina HiSeq™ 2000. The transcriptome analysis of the susceptible strain (SS) provided 45,231 unigenes (with the size ranging from 200 bp to 13,799 bp), which would be efficient for analyzing the differences in different chlorantraniliprole-resistant P. xylostella stains. DGE analysis indicated that a total of 1215 genes (189 up-regulated and 1026 down-regulated) were gradient differentially expressed among the susceptible strain (SS) and different chlorantraniliprole-resistant P. xylostella strains, including low-level resistance (GXA), moderate resistance (LZA) and high resistance strains (HZA). A detailed analysis of gradient differentially expressed genes elucidated the existence of a phase-dependent divergence of biological investment at the molecular level. The genes related to insecticide resistance, such as P450, GST, the ryanodine receptor, and connectin, had different expression profiles in the different chlorantraniliprole-resistant DGE libraries, suggesting that the genes related to insecticide resistance are involved in P. xylostella resistance development against chlorantraniliprole. To confirm the results from the DGE, the expressional profiles of 4 genes related to insecticide resistance were further validated by qRT-PCR analysis. Conclusions The obtained transcriptome information provides large gene resources available for further studying the resistance development of P. xylostella to pesticides. The DGE data provide comprehensive insights into the gene expression profiles of the different chlorantraniliprole-resistant stains. These genes are specifically related to insecticide resistance, with different expressional profiles facilitating the study of the role of each gene in chlorantraniliprole resistance development. PMID:23977278

  1. Meta-analysis of gene expression patterns in animal models of prenatal alcohol exposure suggests role for protein synthesis inhibition and chromatin remodeling

    PubMed Central

    Rogic, Sanja; Wong, Albertina; Pavlidis, Paul

    2017-01-01

    Background Prenatal alcohol exposure (PAE) can result in an array of morphological, behavioural and neurobiological deficits that can range in their severity. Despite extensive research in the field and a significant progress made, especially in understanding the range of possible malformations and neurobehavioral abnormalities, the molecular mechanisms of alcohol responses in development are still not well understood. There have been multiple transcriptomic studies looking at the changes in gene expression after PAE in animal models, however there is a limited apparent consensus among the reported findings. In an effort to address this issue, we performed a comprehensive re-analysis and meta-analysis of all suitable, publically available expression data sets. Methods We assembled ten microarray data sets of gene expression after PAE in mouse and rat models consisting of samples from a total of 63 ethanol-exposed and 80 control animals. We re-analyzed each data set for differential expression and then used the results to perform meta-analyses considering all data sets together or grouping them by time or duration of exposure (pre- and post-natal, acute and chronic, respectively). We performed network and Gene Ontology enrichment analysis to further characterize the identified signatures. Results For each sub-analysis we identified signatures of differential expressed genes that show support from multiple studies. Overall, the changes in gene expression were more extensive after acute ethanol treatment during prenatal development than in other models. Considering the analysis of all the data together, we identified a robust core signature of 104 genes down-regulated after PAE, with no up-regulated genes. Functional analysis reveals over-representation of genes involved in protein synthesis, mRNA splicing and chromatin organization. Conclusions Our meta-analysis shows that existing studies, despite superficial dissimilarity in findings, share features that allow us to identify a common core signature set of transcriptome changes in PAE. This is an important step to identifying the biological processes that underlie the etiology of FASD. PMID:26996386

  2. Expression profiling of tomato pre-abscission pedicels provides insights into abscission zone properties including competence to respond to abscission signals

    PubMed Central

    2013-01-01

    Background Detachment of plant organs occurs in abscission zones (AZs). During plant growth, the AZ forms, but does not develop further until the cells perceive abscission-promoting signals and initiate detachment. Upon signal perception, abscission initiates immediately; if there is no signal, abscission is not induced and the organ remains attached to the plant. However, little attention has been paid to the genes that maintain competence to respond to the abscission signal in the pre-abscission AZ. Recently, we found that the tomato (Solanum lycopersicum) transcription factors BLIND (Bl), GOBLET (GOB), Lateral suppressor (Ls) and a tomato WUSCHEL homologue (LeWUS) are expressed specifically in pre-abscission tissue, the anthesis pedicel AZs. To advance our understanding of abscission, here we profiled genome-wide gene expression in tomato flower pedicels at the pre-abscission stage. Results We examined the transcriptomes of three tomato flower pedicel regions, the AZ and flanking proximal- (Prox) and distal- (Dis) regions, and identified 89 genes that were preferentially expressed in the AZ compared to both Prox and Dis. These genes included several transcription factors that regulate apical or axillary shoot meristem activity. Also, genes associated with auxin activity were regulated in a Prox-Dis region-specific manner, suggesting that a gradient of auxin exists in the pedicel. A MADS-box gene affecting floral transition was preferentially expressed in the Prox region and other MADS-box genes for floral organ identification were preferentially expressed in Dis, implying that the morphologically similar Prox and Dis regions have distinct identities. We also analyzed the expression of known regulators; in anthesis pedicels, Bl, GOB, Ls and LeWUS were expressed in the vascular cells of the AZ region. However, after an abscission signal, Bl was up-regulated, but GOB, Ls and LeWUS were down-regulated, suggesting that Bl may be a positive regulator of abscission, but the others may be negative regulators. Conclusions This study reveals region-specific gene expression in tomato flower pedicels at anthesis and identifies factors that may determine the physiological properties of the pre-abscission pedicel. The region-specific transcriptional regulators and genes for auxin activity identified here may prevent flower abscission in the absence of signal or establish competence to respond to the abscission signal. PMID:23497084

  3. Arabidopsis whole-transcriptome profiling defines the features of coordinated regulations that occur during secondary growth.

    PubMed

    Ko, Jae-Heung; Han, Kyung-Hwan

    2004-05-01

    Secondary growth in the inflorescence stems of Arabidopsis plants was induced by a combination of short-day and long-day treatments. The induced stems were divided into three different stem developmental stages (i.e., immature, intermediate, and mature) with regard to secondary growth. Whole transcriptome microarrays were used to examine the changes in global gene expression occurring at the different stem developmental stages. Over 70% of the Arabidopsis transcriptome was expressed in the stem tissues. In the mature stems with secondary growth, 567 genes were upregulated 5-fold or higher and 530 were downregulated, when compared to immature stems (with no secondary growth) and 10-day old seedlings (with no inflorescence stem). The transcription phenotypes obtained from the stems at different developmental stages largely confirm the existing insights into the biochemical processes involved in the sequential events that lead to wood formation. The major difference found between the stems undergoing secondary growth and only primary growth was in the expression profiles of transcriptional regulation-and signal transduction-related genes. An analysis of several shoot apical meristem (SAM) activity-related gene expression patterns in the stems indicated that the genetic control of secondary meristem activity might be governed by a different mechanism from that of SAM. The current study established the expression patterns of many unknown genes and identified candidate genes that are involved in the genetic regulation of secondary growth. The findings described in this report should improve our understanding of the molecular mechanisms that regulate the growth and development of the stem.

  4. Human Papillomaviruses; Epithelial Tropisms, and the Development of Neoplasia

    PubMed Central

    Egawa, Nagayasu; Egawa, Kiyofumi; Griffin, Heather; Doorbar, John

    2015-01-01

    Papillomaviruses have evolved over many millions of years to propagate themselves at specific epithelial niches in a range of different host species. This has led to the great diversity of papillomaviruses that now exist, and to the appearance of distinct strategies for epithelial persistence. Many papillomaviruses minimise the risk of immune clearance by causing chronic asymptomatic infections, accompanied by long-term virion-production with only limited viral gene expression. Such lesions are typical of those caused by Beta HPV types in the general population, with viral activity being suppressed by host immunity. A second strategy requires the evolution of sophisticated immune evasion mechanisms, and allows some HPV types to cause prominent and persistent papillomas, even in immune competent individuals. Some Alphapapillomavirus types have evolved this strategy, including those that cause genital warts in young adults or common warts in children. These strategies reflect broad differences in virus protein function as well as differences in patterns of viral gene expression, with genotype-specific associations underlying the recent introduction of DNA testing, and also the introduction of vaccines to protect against cervical cancer. Interestingly, it appears that cellular environment and the site of infection affect viral pathogenicity by modulating viral gene expression. With the high-risk HPV gene products, changes in E6 and E7 expression are thought to account for the development of neoplasias at the endocervix, the anal and cervical transformation zones, and the tonsilar crypts and other oropharyngeal sites. A detailed analysis of site-specific patterns of gene expression and gene function is now prompted. PMID:26193301

  5. A comparative study of ripening among berries of the grape cluster reveals an altered transcriptional programme and enhanced ripening rate in delayed berries

    PubMed Central

    Gouthu, Satyanarayana; O’Neil, Shawn T.; Di, Yanming; Ansarolia, Mitra; Megraw, Molly; Deluc, Laurent G.

    2014-01-01

    Transcriptional studies in relation to fruit ripening generally aim to identify the transcriptional states associated with physiological ripening stages and the transcriptional changes between stages within the ripening programme. In non-climacteric fruits such as grape, all ripening-related genes involved in this programme have not been identified, mainly due to the lack of mutants for comparative transcriptomic studies. A feature in grape cluster ripening (Vitis vinifera cv. Pinot noir), where all berries do not initiate the ripening at the same time, was exploited to study their shifted ripening programmes in parallel. Berries that showed marked ripening state differences in a véraison-stage cluster (ripening onset) ultimately reached similar ripeness states toward maturity, indicating the flexibility of the ripening programme. The expression variance between these véraison-stage berry classes, where 11% of the genes were found to be differentially expressed, was reduced significantly toward maturity, resulting in the synchronization of their transcriptional states. Defined quantitative expression changes (transcriptional distances) not only existed between the véraison transitional stages, but also between the véraison to maturity stages, regardless of the berry class. It was observed that lagging berries complete their transcriptional programme in a shorter time through altered gene expressions and ripening-related hormone dynamics, and enhance the rate of physiological ripening progression. Finally, the reduction in expression variance of genes can identify new genes directly associated with ripening and also assess the relevance of gene activity to the phase of the ripening programme. PMID:25135520

  6. Avian Paramyxovirus Type-3 as a Vaccine Vector: Identification of a Genome Location for High Level Expression of a Foreign Gene

    PubMed Central

    Yoshida, Asuka; Samal, Siba K.

    2017-01-01

    Avian paramyxovirus serotype 3 (APMV-3) causes infection in a wide variety of avian species, but it does not cause apparent diseases in chickens. On the contrary, APMV-1, also known as Newcastle disease virus (NDV), can cause severe disease in chickens. Currently, natural low virulence strains of NDV are used as live-attenuated vaccines throughout the world. NDV is also being evaluated as a vaccine vector against poultry pathogens. However, due to routine vaccination programs, chickens often possess pre-existing antibodies against NDV, which may cause the chickens to be less sensitive to recombinant NDV vaccines expressing antigens of other avian pathogens. Therefore, it may be possible for an APMV-3 vector vaccine to circumvent this issue. In this study, we determined the optimal insertion site in the genome of APMV-3 for high level expression of a foreign gene. We generated recombinant APMV-3 viruses expressing the green fluorescent protein (GFP) by inserting the GFP gene at five different intergenic regions in the genome. The levels of GFP transcription and translation were evaluated. Interestingly, the levels of GFP transcription and translation did not follow the 3′-to-5′ attenuation mechanism of non-segmented, negative-sense RNA viruses. The insertion of GFP gene into the P-M gene junction resulted in higher level of expression of GFP than when the gene was inserted into the upstream N-P gene junction. Unlike NDV, insertion of GFP did not attenuate the growth efficiency of AMPV-3. Thus, APMV-3 could be a more useful vaccine vector for avian pathogens than NDV. PMID:28473820

  7. Avian Paramyxovirus Type-3 as a Vaccine Vector: Identification of a Genome Location for High Level Expression of a Foreign Gene.

    PubMed

    Yoshida, Asuka; Samal, Siba K

    2017-01-01

    Avian paramyxovirus serotype 3 (APMV-3) causes infection in a wide variety of avian species, but it does not cause apparent diseases in chickens. On the contrary, APMV-1, also known as Newcastle disease virus (NDV), can cause severe disease in chickens. Currently, natural low virulence strains of NDV are used as live-attenuated vaccines throughout the world. NDV is also being evaluated as a vaccine vector against poultry pathogens. However, due to routine vaccination programs, chickens often possess pre-existing antibodies against NDV, which may cause the chickens to be less sensitive to recombinant NDV vaccines expressing antigens of other avian pathogens. Therefore, it may be possible for an APMV-3 vector vaccine to circumvent this issue. In this study, we determined the optimal insertion site in the genome of APMV-3 for high level expression of a foreign gene. We generated recombinant APMV-3 viruses expressing the green fluorescent protein (GFP) by inserting the GFP gene at five different intergenic regions in the genome. The levels of GFP transcription and translation were evaluated. Interestingly, the levels of GFP transcription and translation did not follow the 3'-to-5' attenuation mechanism of non-segmented, negative-sense RNA viruses. The insertion of GFP gene into the P-M gene junction resulted in higher level of expression of GFP than when the gene was inserted into the upstream N-P gene junction. Unlike NDV, insertion of GFP did not attenuate the growth efficiency of AMPV-3. Thus, APMV-3 could be a more useful vaccine vector for avian pathogens than NDV.

  8. Gene Expression Profiles of Human Dendritic Cells Interacting with Aspergillus fumigatus in a Bilayer Model of the Alveolar Epithelium/Endothelium Interface

    PubMed Central

    Morton, Charles Oliver; Fliesser, Mirjam; Dittrich, Marcus; Mueller, Tobias; Bauer, Ruth; Kneitz, Susanne; Hope, William; Rogers, Thomas Richard; Einsele, Hermann; Loeffler, Juergen

    2014-01-01

    The initial stages of the interaction between the host and Aspergillus fumigatus at the alveolar surface of the human lung are critical in the establishment of aspergillosis. Using an in vitro bilayer model of the alveolus, including both the epithelium (human lung adenocarcinoma epithelial cell line, A549) and endothelium (human pulmonary artery epithelial cells, HPAEC) on transwell membranes, it was possible to closely replicate the in vivo conditions. Two distinct sub-groups of dendritic cells (DC), monocyte-derived DC (moDC) and myeloid DC (mDC), were included in the model to examine immune responses to fungal infection at the alveolar surface. RNA in high quantity and quality was extracted from the cell layers on the transwell membrane to allow gene expression analysis using tailored custom-made microarrays, containing probes for 117 immune-relevant genes. This microarray data indicated minimal induction of immune gene expression in A549 alveolar epithelial cells in response to germ tubes of A. fumigatus. In contrast, the addition of DC to the system greatly increased the number of differentially expressed immune genes. moDC exhibited increased expression of genes including CLEC7A, CD209 and CCL18 in the absence of A. fumigatus compared to mDC. In the presence of A. fumigatus, both DC subgroups exhibited up-regulation of genes identified in previous studies as being associated with the exposure of DC to A. fumigatus and exhibiting chemotactic properties for neutrophils, including CXCL2, CXCL5, CCL20, and IL1B. This model closely approximated the human alveolus allowing for an analysis of the host pathogen interface that complements existing animal models of IA. PMID:24870357

  9. Yin Yang 1 and Adipogenic Gene Network Expression in Longissimus Muscle of Beef Cattle in Response to Nutritional Management

    PubMed Central

    Moisá, Sonia J.; Shike, Daniel W.; Meteer, William T.; Keisler, Duane; Faulkner, Dan B.; Loor, Juan J.

    2013-01-01

    Among 36 differentially-expressed genes during growth in longissimus muscle (LM) of Angus steers, Yin Yang 1 (YY1) had the most relationships with other genes including some associated with adipocyte differentiation. The objective of this study was to examine the effect of nutritional management on mRNA expression of YY1 along with its targets genes PPARG, GTF2B, KAT2B, IGFBP5 and STAT5B. Longissimus from Angus and Angus × Simmental steers (7 total/treatment) on early weaning plus high-starch (EWS), normal weaning plus starch creep feeding (NWS), or normal weaning without starch creep feeding (NWN) was biopsied at 0, 96, and 240 days on treatments. Results suggest that YY1 does not exert control of adipogenesis in LM, and its expression is not sensitive to weaning age. Among the YY1-related genes, EWS led to greater IGFBP5 during growing and finishing phases. Pro-adipogenic transcriptional regulation was detected in EWS due to greater PPARG and VDR at 96 and 240 d vs. 0 d. GTF2B and KAT2B expression was lower in response to NWS and EWS than NWN, and was most pronounced at 240 d. The increase in PPARG and GTF2B expression between 96 and 240 d underscored the existence of a molecular programming mechanism that was sensitive to age and dietary starch. Such response partly explains the greater carcass fat deposition observed in response to NWS. PMID:23700364

  10. Identification, expression and phylogenetic analysis of EgG1Y162 from Echinococcus granulosus.

    PubMed

    Zhang, Fengbo; Ma, Xiumin; Zhu, Yuejie; Wang, Hongying; Liu, Xianfei; Zhu, Min; Ma, Haimei; Wen, Hao; Fan, Haining; Ding, Jianbing

    2014-01-01

    This study was to clone, identify and analyze the characteristics of egG1Y162 gene from Echinococcus granulosus. Genomic DNA and total RNAs were extracted from four different developmental stages of protoscolex, germinal layer, adult and egg of Echinococcus granulosus, respectively. Fluorescent quantitative PCR was used for analyzing the expression of egG1Y162 gene. Prokaryotic expression plasmid of pET41a-EgG1Y162 was constructed to express recombinant His-EgG1Y162 antigen. Western blot analysis was performed to detect antigenicity of EgG1Y162 antigen. Gene sequence, amino acid alignment and phylogenetic tree of EgG1Y162 were analyzed by BLAST, online Spidey and MEGA4 software, respectively. EgG1Y162 gene was expressed in four developmental stages of Echinococcus granulosus. And, egG1Y162 gene expression was the highest in the adult stage, with the relative value of 19.526, significantly higher than other three stages. Additionally, Western blot analysis revealed that EgG1Y162 recombinant protein had good reaction with serum samples from Echinococcus granulosus infected human and dog. Moreover, EgG1Y162 antigen was phylogenetically closest to EmY162 antigen, with the similarity over 90%. Our study identified EgG1Y162 antigen in Echinococcus granulosus for the first time. EgG1Y162 antigen had a high similarity with EmY162 antigen, with the genetic differences mainly existing in the intron region. And, EgG1Y162 recombinant protein showed good antigenicity.

  11. Morphological, Genome and Gene Expression Changes in Newly Induced Autopolyploid Chrysanthemum lavandulifolium (Fisch. ex Trautv.) Makino.

    PubMed

    Gao, Ri; Wang, Haibin; Dong, Bin; Yang, Xiaodong; Chen, Sumei; Jiang, Jiafu; Zhang, Zhaohe; Liu, Chen; Zhao, Nan; Chen, Fadi

    2016-10-09

    Autopolyploidy is widespread in higher plants and plays an important role in the process of evolution. The present study successfully induced autotetraploidys from Chrysanthemum lavandulifolium by colchicine. The plant morphology, genomic, transcriptomic, and epigenetic changes between tetraploid and diploid plants were investigated. Ligulate flower, tubular flower and leaves of tetraploid plants were greater than those of the diploid plants. Compared with diploid plants, the genome changed as a consequence of polyploidization in tetraploid plants, namely, 1.1% lost fragments and 1.6% novel fragments occurred. In addition, DNA methylation increased after genome doubling in tetraploid plants. Among 485 common transcript-derived fragments (TDFs), which existed in tetraploid and diploid progenitors, 62 fragments were detected as differentially expressed TDFs, 6.8% of TDFs exhibited up-regulated gene expression in the tetraploid plants and 6.0% exhibited down-regulation. The present study provides a reference for further studying the autopolyploidization role in the evolution of C. lavandulifolium. In conclusion, the autopolyploid C. lavandulifolium showed a global change in morphology, genome and gene expression compared with corresponding diploid.

  12. Diversity in global gene expression and morphology across a watercress (Nasturtium officinale R. Br.) germplasm collection: first steps to breeding

    PubMed Central

    Payne, Adrienne C.; Clarkson, Graham J.J.; Rothwell, Steve; Taylor, Gail

    2015-01-01

    Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called ‘Boldrewood’) and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop. PMID:26504575

  13. Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments

    PubMed Central

    Maza, Elie; Frasse, Pierre; Senin, Pavel; Bouzayen, Mondher; Zouine, Mohamed

    2013-01-01

    In recent years, RNA-Seq technologies became a powerful tool for transcriptome studies. However, computational methods dedicated to the analysis of high-throughput sequencing data are yet to be standardized. In particular, it is known that the choice of a normalization procedure leads to a great variability in results of differential gene expression analysis. The present study compares the most widespread normalization procedures and proposes a novel one aiming at removing an inherent bias of studied transcriptomes related to their relative size. Comparisons of the normalization procedures are performed on real and simulated data sets. Real RNA-Seq data sets analyses, performed with all the different normalization methods, show that only 50% of significantly differentially expressed genes are common. This result highlights the influence of the normalization step on the differential expression analysis. Real and simulated data sets analyses give similar results showing 3 different groups of procedures having the same behavior. The group including the novel method named “Median Ratio Normalization” (MRN) gives the lower number of false discoveries. Within this group the MRN method is less sensitive to the modification of parameters related to the relative size of transcriptomes such as the number of down- and upregulated genes and the gene expression levels. The newly proposed MRN method efficiently deals with intrinsic bias resulting from relative size of studied transcriptomes. Validation with real and simulated data sets confirmed that MRN is more consistent and robust than existing methods. PMID:26442135

  14. Radiation-induced alternative transcripts as detected in total and polysome-bound mRNA.

    PubMed

    Wahba, Amy; Ryan, Michael C; Shankavaram, Uma T; Camphausen, Kevin; Tofilon, Philip J

    2018-01-02

    Alternative splicing is a critical event in the posttranscriptional regulation of gene expression. To investigate whether this process influences radiation-induced gene expression we defined the effects of ionizing radiation on the generation of alternative transcripts in total cellular mRNA (the transcriptome) and polysome-bound mRNA (the translatome) of the human glioblastoma stem-like cell line NSC11. For these studies, RNA-Seq profiles from control and irradiated cells were compared using the program SpliceSeq to identify transcripts and splice variations induced by radiation. As compared to the transcriptome (total RNA) of untreated cells, the radiation-induced transcriptome contained 92 splice events suggesting that radiation induced alternative splicing. As compared to the translatome (polysome-bound RNA) of untreated cells, the radiation-induced translatome contained 280 splice events of which only 24 were overlapping with the radiation-induced transcriptome. These results suggest that radiation not only modifies alternative splicing of precursor mRNA, but also results in the selective association of existing mRNA isoforms with polysomes. Comparison of radiation-induced alternative transcripts to radiation-induced gene expression in total RNA revealed little overlap (about 3%). In contrast, in the radiation-induced translatome, about 38% of the induced alternative transcripts corresponded to genes whose expression level was affected in the translatome. This study suggests that whereas radiation induces alternate splicing, the alternative transcripts present at the time of irradiation may play a role in the radiation-induced translational control of gene expression and thus cellular radioresponse.

  15. Identification and expression analysis of BoMF25, a novel polygalacturonase gene involved in pollen development of Brassica oleracea.

    PubMed

    Lyu, Meiling; Liang, Ying; Yu, Youjian; Ma, Zhiming; Song, Limin; Yue, Xiaoyan; Cao, Jiashu

    2015-06-01

    BoMF25 acts on pollen wall. Polygalacturonase (PG) is a pectin-digesting enzyme involved in numerous plant developmental processes and is described to be of critical importance for pollen wall development. In the present study, a PG gene, BoMF25, was isolated from Brassica oleracea. BoMF25 is the homologous gene of At4g35670, a PG gene in Arabidopsis thaliana with a high expression level at the tricellular pollen stage. Collinear analysis revealed that the orthologous gene of BoMF25 in Brassica campestris (syn. B. rapa) genome was probably lost because of genome deletion and reshuffling. Sequence analysis indicated that BoMF25 contained four classical conserved domains (I, II, III, and IV) of PG protein. Homology and phylogenetic analyses showed that BoMF25 was clustered in Clade F. The putative promoter sequence, containing classical cis-acting elements and pollen-specific motifs, could drive green fluorescence protein expression in onion epidermal cells. Quantitative RT-PCR analysis suggested that BoMF25 was mainly expressed in the anther at the late stage of pollen development. In situ hybridization analysis also indicated that the strong and specific expression signal of BoMF25 existed in pollen grains at the mature pollen stage. Subcellular localization showed that the fluorescence signal was observed in the cell wall of onion epidermal cells, which suggested that BoMF25 may be a secreted protein localized in the pollen wall.

  16. Extending bicluster analysis to annotate unclassified ORFs and predict novel functional modules using expression data

    PubMed Central

    Bryan, Kenneth; Cunningham, Pádraig

    2008-01-01

    Background Microarrays have the capacity to measure the expressions of thousands of genes in parallel over many experimental samples. The unsupervised classification technique of bicluster analysis has been employed previously to uncover gene expression correlations over subsets of samples with the aim of providing a more accurate model of the natural gene functional classes. This approach also has the potential to aid functional annotation of unclassified open reading frames (ORFs). Until now this aspect of biclustering has been under-explored. In this work we illustrate how bicluster analysis may be extended into a 'semi-supervised' ORF annotation approach referred to as BALBOA. Results The efficacy of the BALBOA ORF classification technique is first assessed via cross validation and compared to a multi-class k-Nearest Neighbour (kNN) benchmark across three independent gene expression datasets. BALBOA is then used to assign putative functional annotations to unclassified yeast ORFs. These predictions are evaluated using existing experimental and protein sequence information. Lastly, we employ a related semi-supervised method to predict the presence of novel functional modules within yeast. Conclusion In this paper we demonstrate how unsupervised classification methods, such as bicluster analysis, may be extended using of available annotations to form semi-supervised approaches within the gene expression analysis domain. We show that such methods have the potential to improve upon supervised approaches and shed new light on the functions of unclassified ORFs and their co-regulation. PMID:18831786

  17. Characterization of the definitive classical calpain family of vertebrates using phylogenetic, evolutionary and expression analyses.

    PubMed

    Macqueen, Daniel J; Wilcox, Alexander H

    2014-04-09

    The calpains are a superfamily of proteases with extensive relevance to human health and welfare. Vast research attention is given to the vertebrate 'classical' subfamily, making it surprising that the evolutionary origins, distribution and relationships of these genes is poorly characterized. Consequently, there exists uncertainty about the conservation of gene family structure, function and expression that has been principally defined from work with mammals. Here, more than 200 vertebrate classical calpains were incorporated in phylogenetic analyses spanning an unprecedented range of taxa, including jawless and cartilaginous fish. We demonstrate that the common vertebrate ancestor had at least six classical calpains, including a single gene that gave rise to CAPN11, 1, 2 and 8 in the early jawed fish lineage, plus CAPN3, 9, 12, 13 and a novel calpain gene, hereafter named CAPN17. We reveal that while all vertebrate classical calpains have been subject to persistent purifying selection during evolution, the degree and nature of selective pressure has often been lineage-dependent. The tissue expression of the complete classic calpain family was assessed in representative teleost fish, amphibians, reptiles and mammals. This highlighted systematic divergence in expression across vertebrate taxa, with most classic calpain genes from fish and amphibians having more extensive tissue distribution than in amniotes. Our data suggest that classical calpain functions have frequently diverged during vertebrate evolution and challenge the ongoing value of the established system of classifying calpains by expression.

  18. Characterization of the definitive classical calpain family of vertebrates using phylogenetic, evolutionary and expression analyses

    PubMed Central

    Macqueen, Daniel J.; Wilcox, Alexander H.

    2014-01-01

    The calpains are a superfamily of proteases with extensive relevance to human health and welfare. Vast research attention is given to the vertebrate ‘classical’ subfamily, making it surprising that the evolutionary origins, distribution and relationships of these genes is poorly characterized. Consequently, there exists uncertainty about the conservation of gene family structure, function and expression that has been principally defined from work with mammals. Here, more than 200 vertebrate classical calpains were incorporated in phylogenetic analyses spanning an unprecedented range of taxa, including jawless and cartilaginous fish. We demonstrate that the common vertebrate ancestor had at least six classical calpains, including a single gene that gave rise to CAPN11, 1, 2 and 8 in the early jawed fish lineage, plus CAPN3, 9, 12, 13 and a novel calpain gene, hereafter named CAPN17. We reveal that while all vertebrate classical calpains have been subject to persistent purifying selection during evolution, the degree and nature of selective pressure has often been lineage-dependent. The tissue expression of the complete classic calpain family was assessed in representative teleost fish, amphibians, reptiles and mammals. This highlighted systematic divergence in expression across vertebrate taxa, with most classic calpain genes from fish and amphibians having more extensive tissue distribution than in amniotes. Our data suggest that classical calpain functions have frequently diverged during vertebrate evolution and challenge the ongoing value of the established system of classifying calpains by expression. PMID:24718597

  19. chromoWIZ: a web tool to query and visualize chromosome-anchored genes from cereal and model genomes.

    PubMed

    Nussbaumer, Thomas; Kugler, Karl G; Schweiger, Wolfgang; Bader, Kai C; Gundlach, Heidrun; Spannagl, Manuel; Poursarebani, Naser; Pfeifer, Matthias; Mayer, Klaus F X

    2014-12-10

    Over the last years reference genome sequences of several economically and scientifically important cereals and model plants became available. Despite the agricultural significance of these crops only a small number of tools exist that allow users to inspect and visualize the genomic position of genes of interest in an interactive manner. We present chromoWIZ, a web tool that allows visualizing the genomic positions of relevant genes and comparing these data between different plant genomes. Genes can be queried using gene identifiers, functional annotations, or sequence homology in four grass species (Triticum aestivum, Hordeum vulgare, Brachypodium distachyon, Oryza sativa). The distribution of the anchored genes is visualized along the chromosomes by using heat maps. Custom gene expression measurements, differential expression information, and gene-to-group mappings can be uploaded and can be used for further filtering. This tool is mainly designed for breeders and plant researchers, who are interested in the location and the distribution of candidate genes as well as in the syntenic relationships between different grass species. chromoWIZ is freely available and online accessible at http://mips.helmholtz-muenchen.de/plant/chromoWIZ/index.jsp.

  20. Inferring Gene Regulatory Networks by Singular Value Decomposition and Gravitation Field Algorithm

    PubMed Central

    Zheng, Ming; Wu, Jia-nan; Huang, Yan-xin; Liu, Gui-xia; Zhou, You; Zhou, Chun-guang

    2012-01-01

    Reconstruction of gene regulatory networks (GRNs) is of utmost interest and has become a challenge computational problem in system biology. However, every existing inference algorithm from gene expression profiles has its own advantages and disadvantages. In particular, the effectiveness and efficiency of every previous algorithm is not high enough. In this work, we proposed a novel inference algorithm from gene expression data based on differential equation model. In this algorithm, two methods were included for inferring GRNs. Before reconstructing GRNs, singular value decomposition method was used to decompose gene expression data, determine the algorithm solution space, and get all candidate solutions of GRNs. In these generated family of candidate solutions, gravitation field algorithm was modified to infer GRNs, used to optimize the criteria of differential equation model, and search the best network structure result. The proposed algorithm is validated on both the simulated scale-free network and real benchmark gene regulatory network in networks database. Both the Bayesian method and the traditional differential equation model were also used to infer GRNs, and the results were used to compare with the proposed algorithm in our work. And genetic algorithm and simulated annealing were also used to evaluate gravitation field algorithm. The cross-validation results confirmed the effectiveness of our algorithm, which outperforms significantly other previous algorithms. PMID:23226565

Top