Sample records for silico transcriptome analysis

  1. Metabolic engineering of Escherichia coli for the production of l-valine based on transcriptome analysis and in silico gene knockout simulation

    PubMed Central

    Park, Jin Hwan; Lee, Kwang Ho; Kim, Tae Yong; Lee, Sang Yup

    2007-01-01

    The l-valine production strain of Escherichia coli was constructed by rational metabolic engineering and stepwise improvement based on transcriptome analysis and gene knockout simulation of the in silico genome-scale metabolic network. Feedback inhibition of acetohydroxy acid synthase isoenzyme III by l-valine was removed by site-directed mutagenesis, and the native promoter containing the transcriptional attenuator leader regions of the ilvGMEDA and ilvBN operon was replaced with the tac promoter. The ilvA, leuA, and panB genes were deleted to make more precursors available for l-valine biosynthesis. This engineered Val strain harboring a plasmid overexpressing the ilvBN genes produced 1.31 g/liter l-valine. Comparative transcriptome profiling was performed during batch fermentation of the engineered and control strains. Among the down-regulated genes, the lrp and ygaZH genes, which encode a global regulator Lrp and l-valine exporter, respectively, were overexpressed. Amplification of the lrp, ygaZH, and lrp-ygaZH genes led to the enhanced production of l-valine by 21.6%, 47.1%, and 113%, respectively. Further improvement was achieved by using in silico gene knockout simulation, which identified the aceF, mdh, and pfkA genes as knockout targets. The VAMF strain (Val ΔaceF Δmdh ΔpfkA) overexpressing the ilvBN, ilvCED, ygaZH, and lrp genes was able to produce 7.55 g/liter l-valine from 20 g/liter glucose in batch culture, resulting in a high yield of 0.378 g of l-valine per gram of glucose. These results suggest that an industrially competitive strain can be efficiently developed by metabolic engineering based on combined rational modification, transcriptome profiling, and systems-level in silico analysis. PMID:17463081

  2. In silico Pathway Activation Network Decomposition Analysis (iPANDA) as a method for biomarker development.

    PubMed

    Ozerov, Ivan V; Lezhnina, Ksenia V; Izumchenko, Evgeny; Artemov, Artem V; Medintsev, Sergey; Vanhaelen, Quentin; Aliper, Alexander; Vijg, Jan; Osipov, Andreyan N; Labat, Ivan; West, Michael D; Buzdin, Anton; Cantor, Charles R; Nikolsky, Yuri; Borisov, Nikolay; Irincheeva, Irina; Khokhlovich, Edward; Sidransky, David; Camargo, Miguel Luiz; Zhavoronkov, Alex

    2016-11-16

    Signalling pathway activation analysis is a powerful approach for extracting biologically relevant features from large-scale transcriptomic and proteomic data. However, modern pathway-based methods often fail to provide stable pathway signatures of a specific phenotype or reliable disease biomarkers. In the present study, we introduce the in silico Pathway Activation Network Decomposition Analysis (iPANDA) as a scalable robust method for biomarker identification using gene expression data. The iPANDA method combines precalculated gene coexpression data with gene importance factors based on the degree of differential gene expression and pathway topology decomposition for obtaining pathway activation scores. Using Microarray Analysis Quality Control (MAQC) data sets and pretreatment data on Taxol-based neoadjuvant breast cancer therapy from multiple sources, we demonstrate that iPANDA provides significant noise reduction in transcriptomic data and identifies highly robust sets of biologically relevant pathway signatures. We successfully apply iPANDA for stratifying breast cancer patients according to their sensitivity to neoadjuvant therapy.

  3. In silico Pathway Activation Network Decomposition Analysis (iPANDA) as a method for biomarker development

    PubMed Central

    Ozerov, Ivan V.; Lezhnina, Ksenia V.; Izumchenko, Evgeny; Artemov, Artem V.; Medintsev, Sergey; Vanhaelen, Quentin; Aliper, Alexander; Vijg, Jan; Osipov, Andreyan N.; Labat, Ivan; West, Michael D.; Buzdin, Anton; Cantor, Charles R.; Nikolsky, Yuri; Borisov, Nikolay; Irincheeva, Irina; Khokhlovich, Edward; Sidransky, David; Camargo, Miguel Luiz; Zhavoronkov, Alex

    2016-01-01

    Signalling pathway activation analysis is a powerful approach for extracting biologically relevant features from large-scale transcriptomic and proteomic data. However, modern pathway-based methods often fail to provide stable pathway signatures of a specific phenotype or reliable disease biomarkers. In the present study, we introduce the in silico Pathway Activation Network Decomposition Analysis (iPANDA) as a scalable robust method for biomarker identification using gene expression data. The iPANDA method combines precalculated gene coexpression data with gene importance factors based on the degree of differential gene expression and pathway topology decomposition for obtaining pathway activation scores. Using Microarray Analysis Quality Control (MAQC) data sets and pretreatment data on Taxol-based neoadjuvant breast cancer therapy from multiple sources, we demonstrate that iPANDA provides significant noise reduction in transcriptomic data and identifies highly robust sets of biologically relevant pathway signatures. We successfully apply iPANDA for stratifying breast cancer patients according to their sensitivity to neoadjuvant therapy. PMID:27848968

  4. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling.

    PubMed

    Puente-Marin, Sara; Nombela, Iván; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio; Ortega-Villaizan, María Del Mar

    2018-04-09

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation.

  5. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling

    PubMed Central

    Puente-Marin, Sara; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio

    2018-01-01

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation. PMID:29642539

  6. A comparative in silico linear B-cell epitope prediction and characterization for South American and African Trypanosoma vivax strains.

    PubMed

    Guedes, Rafael Lucas Muniz; Rodrigues, Carla Monadeli Filgueira; Coatnoan, Nicolas; Cosson, Alain; Cadioli, Fabiano Antonio; Garcia, Herakles Antonio; Gerber, Alexandra Lehmkuhl; Machado, Rosangela Zacarias; Minoprio, Paola Marcella Camargo; Teixeira, Marta Maria Geraldes; de Vasconcelos, Ana Tereza Ribeiro

    2018-02-27

    Trypanosoma vivax is a parasite widespread across Africa and South America. Immunological methods using recombinant antigens have been developed aiming at specific and sensitive detection of infections caused by T. vivax. Here, we sequenced for the first time the transcriptome of a virulent T. vivax strain (Lins), isolated from an outbreak of severe disease in South America (Brazil) and performed a computational integrated analysis of genome, transcriptome and in silico predictions to identify and characterize putative linear B-cell epitopes from African and South American T. vivax. A total of 2278, 3936 and 4062 linear B-cell epitopes were respectively characterized for the transcriptomes of T. vivax LIEM-176 (Venezuela), T. vivax IL1392 (Nigeria) and T. vivax Lins (Brazil) and 4684 for the genome of T. vivax Y486 (Nigeria). The results presented are a valuable theoretical source that may pave the way for highly sensitive and specific diagnostic tools. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  7. De novo transcriptome analysis of rose-scented geranium provides insights into the metabolic specificity of terpene and tartaric acid biosynthesis.

    PubMed

    Narnoliya, Lokesh K; Kaushal, Girija; Singh, Sudhir P; Sangwan, Rajender S

    2017-01-13

    Rose-scented geranium (Pelargonium sp.) is a perennial herb that produces a high value essential oil of fragrant significance due to the characteristic compositional blend of rose-oxide and acyclic monoterpenoids in foliage. Recently, the plant has also been shown to produce tartaric acid in leaf tissues. Rose-scented geranium represents top-tier cash crop in terms of economic returns and significance of the plant and plant products. However, there has hardly been any study on its metabolism and functional genomics, nor any genomic expression dataset resource is available in public domain. Therefore, to begin the gains in molecular understanding of specialized metabolic pathways of the plant, de novo sequencing of rose-scented geranium leaf transcriptome, transcript assembly, annotation, expression profiling as well as their validation were carried out. De novo transcriptome analysis resulted a total of 78,943 unique contigs (average length: 623 bp, and N50 length: 752 bp) from 15.44 million high quality raw reads. In silico functional annotation led to the identification of several putative genes representing terpene, ascorbic acid and tartaric acid biosynthetic pathways, hormone metabolism, and transcription factors. Additionally, a total of 6,040 simple sequence repeat (SSR) motifs were identified in 6.8% of the expressed transcripts. The highest frequency of SSR was of tri-nucleotides (50%). Further, transcriptome assembly was validated for randomly selected putative genes by standard PCR-based approach. In silico expression profile of assembled contigs were validated by real-time PCR analysis of selected transcripts. Being the first report on transcriptome analysis of rose-scented geranium the data sets and the leads and directions reflected in this investigation will serve as a foundation for pursuing and understanding molecular aspects of its biology, and specialized metabolic pathways, metabolic engineering, genetic diversity as well as molecular breeding.

  8. Spliced synthetic genes as internal controls in RNA sequencing experiments.

    PubMed

    Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R

    2016-09-01

    RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.

  9. In silico lineage tracing through single cell transcriptomics identifies a neural stem cell population in planarians.

    PubMed

    Molinaro, Alyssa M; Pearson, Bret J

    2016-04-27

    The planarian Schmidtea mediterranea is a master regenerator with a large adult stem cell compartment. The lack of transgenic labeling techniques in this animal has hindered the study of lineage progression and has made understanding the mechanisms of tissue regeneration a challenge. However, recent advances in single-cell transcriptomics and analysis methods allow for the discovery of novel cell lineages as differentiation progresses from stem cell to terminally differentiated cell. Here we apply pseudotime analysis and single-cell transcriptomics to identify adult stem cells belonging to specific cellular lineages and identify novel candidate genes for future in vivo lineage studies. We purify 168 single stem and progeny cells from the planarian head, which were subjected to single-cell RNA sequencing (scRNAseq). Pseudotime analysis with Waterfall and gene set enrichment analysis predicts a molecularly distinct neoblast sub-population with neural character (νNeoblasts) as well as a novel alternative lineage. Using the predicted νNeoblast markers, we demonstrate that a novel proliferative stem cell population exists adjacent to the brain. scRNAseq coupled with in silico lineage analysis offers a new approach for studying lineage progression in planarians. The lineages identified here are extracted from a highly heterogeneous dataset with minimal prior knowledge of planarian lineages, demonstrating that lineage purification by transgenic labeling is not a prerequisite for this approach. The identification of the νNeoblast lineage demonstrates the usefulness of the planarian system for computationally predicting cellular lineages in an adult context coupled with in vivo verification.

  10. Chloroplast microsatellite markers for Artocarpus (Moraceae) developed from transcriptome sequences

    USDA-ARS?s Scientific Manuscript database

    Premise of the study: Chloroplast microsatellite loci were characterized from transcriptomes of Artocarpus (A.) altilis (breadfruit) and A. camansi (breadnut). They were tested in A. odoratissimus (terap) and A. altilis and evaluated in silico for two congeners. Methods and Results: 15 simple seque...

  11. Chloroplast microsatellite markers for Artocarpus (Moraceae) developed from transcriptome sequences1

    PubMed Central

    Gardner, Elliot M.; Laricchia, Kristen M.; Murphy, Matthew; Ragone, Diane; Scheffler, Brian E.; Simpson, Sheron; Williams, Evelyn W.; Zerega, Nyree J. C.

    2015-01-01

    Premise of the study: Chloroplast microsatellite loci were characterized from transcriptomes of Artocarpus altilis (breadfruit) and A. camansi (breadnut). They were tested in A. odoratissimus (terap) and A. altilis and evaluated in silico for two congeners. Methods and Results: Fifteen simple sequence repeats (SSRs) were identified in chloroplast sequences from four Artocarpus transcriptome assemblies. The markers were evaluated using capillary electrophoresis in A. odoratissimus (105 accessions) and A. altilis (73). They were also evaluated in silico in A. altilis (10), A. camansi (6), and A. altilis × A. mariannensis (7) transcriptomes. All loci were polymorphic in at least one species, with all 15 polymorphic in A. camansi. Per species, average alleles per locus ranged between 2.2 and 2.5. Three loci had evidence of fragment-length homoplasy. Conclusions: These markers will complement existing nuclear markers by enabling confident identification of maternal and clone lines, which are often important in vegetatively propagated crops such as breadfruit. PMID:26421253

  12. Transcriptome mining and in silico structural and functional analysis of ascorbic acid and tartaric acid biosynthesis pathway enzymes in rose-scanted geranium.

    PubMed

    Narnoliya, Lokesh K; Sangwan, Rajender S; Singh, Sudhir P

    2018-06-01

    Rose-scented geranium (Pelargonium sp.) is widely known as aromatic and medicinal herb, accumulating specialized metabolites of high economic importance, such as essential oils, ascorbic acid, and tartaric acid. Ascorbic acid and tartaric acid are multifunctional metabolites of human value to be used as vital antioxidants and flavor enhancing agents in food products. No information is available related to the structural and functional properties of the enzymes involved in ascorbic acid and tartaric acid biosynthesis in rose-scented geranium. In the present study, transcriptome mining was done to identify full-length genes, followed by their bioinformatic and molecular modeling investigations and understanding of in silico structural and functional properties of these enzymes. Evolutionary conserved domains were identified in the pathway enzymes. In silico physicochemical characterization of the catalytic enzymes revealed isoelectric point (pI), instability index, aliphatic index, and grand average hydropathy (GRAVY) values of the enzymes. Secondary structural prediction revealed abundant proportion of alpha helix and random coil confirmations in the pathway enzymes. Three-dimensional homology models were developed for these enzymes. The predicted structures showed significant structural similarity with their respective templates in root mean square deviation analysis. Ramachandran plot analysis of the modeled enzymes revealed that more than 84% of the amino acid residues were within the favored regions. Further, functionally important residues were identified corresponding to catalytic sites located in the enzymes. To, our best knowledge, this is the first report which provides a foundation on functional annotation and structural determination of ascorbic acid and tartaric acid pathway enzymes in rose-scanted geranium.

  13. In silico gene expression analysis – an overview

    PubMed Central

    Murray, David; Doran, Peter; MacMathuna, Padraic; Moss, Alan C

    2007-01-01

    Efforts aimed at deciphering the molecular basis of complex disease are underpinned by the availability of high throughput strategies for the identification of biomolecules that drive the disease process. The completion of the human genome-sequencing project, coupled to major technological developments, has afforded investigators myriad opportunities for multidimensional analysis of biological systems. Nowhere has this research explosion been more evident than in the field of transcriptomics. Affordable access and availability to the technology that supports such investigations has led to a significant increase in the amount of data generated. As most biological distinctions are now observed at a genomic level, a large amount of expression information is now openly available via public databases. Furthermore, numerous computational based methods have been developed to harness the power of these data. In this review we provide a brief overview of in silico methodologies for the analysis of differential gene expression such as Serial Analysis of Gene Expression and Digital Differential Display. The performance of these strategies, at both an operational and result/output level is assessed and compared. The key considerations that must be made when completing an in silico expression analysis are also presented as a roadmap to facilitate biologists. Furthermore, to highlight the importance of these in silico methodologies in contemporary biomedical research, examples of current studies using these approaches are discussed. The overriding goal of this review is to present the scientific community with a critical overview of these strategies, so that they can be effectively added to the tool box of biomedical researchers focused on identifying the molecular mechanisms of disease. PMID:17683638

  14. Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome.

    PubMed

    de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

    2013-07-01

    The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees.

  15. Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome

    PubMed Central

    de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

    2013-01-01

    The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees. PMID:23885214

  16. RNA-Seq and Gene Network Analysis Uncover Activation of an ABA-Dependent Signalosome During the Cork Oak Root Response to Drought

    PubMed Central

    Magalhães, Alexandre P.; Verde, Nuno; Reis, Francisca; Martins, Inês; Costa, Daniela; Lino-Neto, Teresa; Castro, Pedro H.; Tavares, Rui M.; Azevedo, Herlânder

    2016-01-01

    Quercus suber (cork oak) is a West Mediterranean species of key economic interest, being extensively explored for its ability to generate cork. Like other Mediterranean plants, Q. suber is significantly threatened by climatic changes, imposing the need to quickly understand its physiological and molecular adaptability to drought stress imposition. In the present report, we uncovered the differential transcriptome of Q. suber roots exposed to long-term drought, using an RNA-Seq approach. 454-sequencing reads were used to de novo assemble a reference transcriptome, and mapping of reads allowed the identification of 546 differentially expressed unigenes. These were enriched in both effector genes (e.g., LEA, chaperones, transporters) as well as regulatory genes, including transcription factors (TFs) belonging to various different classes, and genes associated with protein turnover. To further extend functional characterization, we identified the orthologs of differentially expressed unigenes in the model species Arabidopsis thaliana, which then allowed us to perform in silico functional inference, including gene network analysis for protein function, protein subcellular localization and gene co-expression, and in silico enrichment analysis for TFs and cis-elements. Results indicated the existence of extensive transcriptional regulatory events, including activation of ABA-responsive genes and ABF-dependent signaling. We were then able to establish that a core ABA-signaling pathway involving PP2C-SnRK2-ABF components was induced in stressed Q. suber roots, identifying a key mechanism in this species’ response to drought. PMID:26793200

  17. In silico mining and PCR-based approaches to transcription factor discovery in non-model plants: gene discovery of the WRKY transcription factors in conifers.

    PubMed

    Liu, Jun-Jun; Xiang, Yu

    2011-01-01

    WRKY transcription factors are key regulators of numerous biological processes in plant growth and development, as well as plant responses to abiotic and biotic stresses. Research on biological functions of plant WRKY genes has focused in the past on model plant species or species with largely characterized transcriptomes. However, a variety of non-model plants, such as forest conifers, are essential as feed, biofuel, and wood or for sustainable ecosystems. Identification of WRKY genes in these non-model plants is equally important for understanding the evolutionary and function-adaptive processes of this transcription factor family. Because of limited genomic information, the rarity of regulatory gene mRNAs in transcriptomes, and the sequence divergence to model organism genes, identification of transcription factors in non-model plants using methods similar to those generally used for model plants is difficult. This chapter describes a gene family discovery strategy for identification of WRKY transcription factors in conifers by a combination of in silico-based prediction and PCR-based experimental approaches. Compared to traditional cDNA library screening or EST sequencing at transcriptome scales, this integrated gene discovery strategy provides fast, simple, reliable, and specific methods to unveil the WRKY gene family at both genome and transcriptome levels in non-model plants.

  18. Specific Transcriptome Changes Associated with Blood Pressure Reduction in Hypertensive Patients After Relaxation Response Training.

    PubMed

    Bhasin, Manoj K; Denninger, John W; Huffman, Jeff C; Joseph, Marie G; Niles, Halsey; Chad-Friedman, Emma; Goldman, Roberta; Buczynski-Kelley, Beverly; Mahoney, Barbara A; Fricchione, Gregory L; Dusek, Jeffery A; Benson, Herbert; Zusman, Randall M; Libermann, Towia A

    2018-05-01

    Mind-body practices that elicit the relaxation response (RR) have been demonstrated to reduce blood pressure (BP) in essential hypertension (HTN) and may be an adjunct to antihypertensive drug therapy. However, the molecular mechanisms by which the RR reduces BP remain undefined. Genomic determinants associated with responsiveness to an 8-week RR-based mind-body intervention for lowering HTN in 13 stage 1 hypertensive patients classified as BP responders and 11 as nonresponders were identified. Transcriptome analysis in peripheral blood mononuclear cells identified 1771 genes regulated by the RR in responders. Biological process- and pathway-based analysis of transcriptome data demonstrated enrichment in the following gene categories: immune regulatory pathways and metabolism (among downregulated genes); glucose metabolism, cardiovascular system development, and circadian rhythm (among upregulated genes). Further in silico estimation of cell abundance from the microarray data showed enrichment of the anti-inflammatory M2 subtype of macrophages in BP responders. Nuclear factor-κB, vascular endothelial growth factor, and insulin were critical molecules emerging from interactive network analysis. These findings provide the first insights into the molecular mechanisms that are associated with the beneficial effects of the RR on HTN.

  19. Specific Transcriptome Changes Associated with Blood Pressure Reduction in Hypertensive Patients After Relaxation Response Training

    PubMed Central

    Bhasin, Manoj K.; Denninger, John W.; Huffman, Jeff C.; Joseph, Marie G.; Niles, Halsey; Chad-Friedman, Emma; Goldman, Roberta; Buczynski-Kelley, Beverly; Mahoney, Barbara A.; Fricchione, Gregory L.; Dusek, Jeffery A.; Benson, Herbert; Zusman, Randall M.

    2018-01-01

    Abstract Objective: Mind–body practices that elicit the relaxation response (RR) have been demonstrated to reduce blood pressure (BP) in essential hypertension (HTN) and may be an adjunct to antihypertensive drug therapy. However, the molecular mechanisms by which the RR reduces BP remain undefined. Design: Genomic determinants associated with responsiveness to an 8-week RR-based mind–body intervention for lowering HTN in 13 stage 1 hypertensive patients classified as BP responders and 11 as nonresponders were identified. Results: Transcriptome analysis in peripheral blood mononuclear cells identified 1771 genes regulated by the RR in responders. Biological process- and pathway-based analysis of transcriptome data demonstrated enrichment in the following gene categories: immune regulatory pathways and metabolism (among downregulated genes); glucose metabolism, cardiovascular system development, and circadian rhythm (among upregulated genes). Further in silico estimation of cell abundance from the microarray data showed enrichment of the anti-inflammatory M2 subtype of macrophages in BP responders. Nuclear factor-κB, vascular endothelial growth factor, and insulin were critical molecules emerging from interactive network analysis. Conclusions: These findings provide the first insights into the molecular mechanisms that are associated with the beneficial effects of the RR on HTN. PMID:29616846

  20. Petri Net computational modelling of Langerhans cell Interferon Regulatory Factor Network predicts their role in T cell activation.

    PubMed

    Polak, Marta E; Ung, Chuin Ying; Masapust, Joanna; Freeman, Tom C; Ardern-Jones, Michael R

    2017-04-06

    Langerhans cells (LCs) are able to orchestrate adaptive immune responses in the skin by interpreting the microenvironmental context in which they encounter foreign substances, but the regulatory basis for this has not been established. Utilising systems immunology approaches combining in silico modelling of a reconstructed gene regulatory network (GRN) with in vitro validation of the predictions, we sought to determine the mechanisms of regulation of immune responses in human primary LCs. The key role of Interferon regulatory factors (IRFs) as controllers of the human Langerhans cell response to epidermal cytokines was revealed by whole transcriptome analysis. Applying Boolean logic we assembled a Petri net-based model of the IRF-GRN which provides molecular pathway predictions for the induction of different transcriptional programmes in LCs. In silico simulations performed after model parameterisation with transcription factor expression values predicted that human LC activation of antigen-specific CD8 T cells would be differentially regulated by epidermal cytokine induction of specific IRF-controlled pathways. This was confirmed by in vitro measurement of IFN-γ production by activated T cells. As a proof of concept, this approach shows that stochastic modelling of a specific immune networks renders transcriptome data valuable for the prediction of functional outcomes of immune responses.

  1. De Novo Assembly and Comparative Transcriptome Analyses of Red and Green Morphs of Sweet Basil Grown in Full Sunlight.

    PubMed

    Torre, Sara; Tattini, Massimiliano; Brunetti, Cecilia; Guidi, Lucia; Gori, Antonella; Marzano, Cristina; Landi, Marco; Sebastiani, Federico

    2016-01-01

    Sweet basil (Ocimum basilicum), one of the most popular cultivated herbs worldwide, displays a number of varieties differing in several characteristics, such as the color of the leaves. The development of a reference transcriptome for sweet basil, and the analysis of differentially expressed genes in acyanic and cyanic cultivars exposed to natural sunlight irradiance, has interest from horticultural and biological point of views. There is still great uncertainty about the significance of anthocyanins in photoprotection, and how green and red morphs may perform when exposed to photo-inhibitory light, a condition plants face on daily and seasonal basis. We sequenced the leaf transcriptome of the green-leaved Tigullio (TIG) and the purple-leaved Red Rubin (RR) exposed to full sunlight over a four-week experimental period. We assembled and annotated 111,007 transcripts. A total of 5,468 and 5,969 potential SSRs were identified in TIG and RR, respectively, out of which 66 were polymorphic in silico. Comparative analysis of the two transcriptomes showed 2,372 differentially expressed genes (DEGs) clustered in 222 enriched Gene ontology terms. Green and red basil mostly differed for transcripts abundance of genes involved in secondary metabolism. While the biosynthesis of waxes was up-regulated in red basil, the biosynthesis of flavonols and carotenoids was up-regulated in green basil. Data from our study provides a comprehensive transcriptome survey, gene sequence resources and microsatellites that can be used for further investigations in sweet basil. The analysis of DEGs and their functional classification also offers new insights on the functional role of anthocyanins in photoprotection.

  2. In Silico Identification of Protein Disulfide Isomerase Gene Families in the De Novo Assembled Transcriptomes of Four Different Species of the Genus Conus.

    PubMed

    Figueroa-Montiel, Andrea; Ramos, Marco A; Mares, Rosa E; Dueñas, Salvador; Pimienta, Genaro; Ortiz, Ernesto; Possani, Lourival D; Licea-Navarro, Alexei F

    2016-01-01

    Small peptides isolated from the venom of the marine snails belonging to the genus Conus have been largely studied because of their therapeutic value. These peptides can be classified in two groups. The largest one is composed by peptides rich in disulfide bonds, and referred to as conotoxins. Despite the importance of conotoxins given their pharmacology value, little is known about the protein disulfide isomerase (PDI) enzymes that are required to catalyze their correct folding. To discover the PDIs that may participate in the folding and structural maturation of conotoxins, the transcriptomes of the venom duct of four different species of Conus from the peninsula of Baja California (Mexico) were assembled. Complementary DNA (cDNA) libraries were constructed for each species and sequenced using a Genome Analyzer Illumina platform. The raw RNA-seq data was converted into transcript sequences using Trinity, a de novo assembler that allows the grouping of reads into contigs without a reference genome. An N50 value of 605 was established as a reference for future assemblies of Conus transcriptomes using this software. Transdecoder was used to extract likely coding sequences from Trinity transcripts, and PDI-specific sequence motif "APWCGHCK" was used to capture potential PDIs. An in silico analysis was performed to characterize the group of PDI protein sequences encoded by the duct-transcriptome of each species. The computational approach entailed a structural homology characterization, based on the presence of functional Thioredoxin-like domains. Four different PDI families were characterized, which are constituted by a total of 41 different gene sequences. The sequences had an average of 65% identity with other PDIs. Using MODELLER 9.14, the homology-based three-dimensional structure prediction of a subset of the sequences reported, showed the expected thioredoxin fold which was confirmed by a "simulated annealing" method.

  3. Transcriptome difference and potential crosstalk between liver and mammary tissue in mid-lactation primiparous dairy cows.

    PubMed

    Bu, Dengpan; Bionaz, Massimo; Wang, Mengzhi; Nan, Xuemei; Ma, Lu; Wang, Jiaqi

    2017-01-01

    Liver and mammary gland are among the most important organs during lactation in dairy cows. With the purpose of understanding both the different and the complementary roles and the crosstalk of those two organs during lactation, a transcriptome analysis was performed on liver and mammary tissues of 10 primiparous dairy cows in mid-lactation. The analysis was performed using a 4×44K Bovine Agilent microarray chip. The transcriptome difference between the two tissues was analyzed using SAS JMP Genomics using ANOVA with a false discovery rate correction (FDR). The analysis uncovered >9,000 genes differentially expressed (DEG) between the two tissues with a FDR<0.001. The functional analysis of the DEG uncovered a larger metabolic (especially related to lipid) and inflammatory response capacity in liver compared with mammary tissue while the mammary tissue had a larger protein synthesis and secretion, proliferation/differentiation, signaling, and innate immune system capacity compared with the liver. A plethora of endogenous compounds, cytokines, and transcription factors were estimated to control the DEG between the two tissues. Compared with mammary tissue, the liver transcriptome appeared to be under control of a large array of ligand-dependent nuclear receptors and, among endogenous chemical, fatty acids and bacteria-derived compounds. Compared with liver, the transcriptome of the mammary tissue was potentially under control of a large number of growth factors and miRNA. The in silico crosstalk analysis between the two tissues revealed an overall large communication with a reciprocal control of lipid metabolism, innate immune system adaptation, and proliferation/differentiation. In summary the transcriptome analysis confirmed prior known differences between liver and mammary tissue, especially considering the indication of a larger metabolic activity in liver compared with the mammary tissue and the larger protein synthesis, communication, and proliferative capacity in mammary tissue compared with the liver. Relatively novel is the indication by the data that the transcriptome of the liver is highly regulated by dietary and bacteria-related compounds while the mammary transcriptome is more under control of hormones, growth factors, and miRNA. A large crosstalk between the two tissues with a reciprocal control of metabolism and innate immune-adaptation was indicated by the network analysis that allowed uncovering previously unknown crosstalk between liver and mammary tissue for several signaling molecules.

  4. Transcriptome difference and potential crosstalk between liver and mammary tissue in mid-lactation primiparous dairy cows

    PubMed Central

    Bu, Dengpan; Bionaz, Massimo; Wang, Mengzhi; Nan, Xuemei; Ma, Lu; Wang, Jiaqi

    2017-01-01

    Liver and mammary gland are among the most important organs during lactation in dairy cows. With the purpose of understanding both the different and the complementary roles and the crosstalk of those two organs during lactation, a transcriptome analysis was performed on liver and mammary tissues of 10 primiparous dairy cows in mid-lactation. The analysis was performed using a 4×44K Bovine Agilent microarray chip. The transcriptome difference between the two tissues was analyzed using SAS JMP Genomics using ANOVA with a false discovery rate correction (FDR). The analysis uncovered >9,000 genes differentially expressed (DEG) between the two tissues with a FDR<0.001. The functional analysis of the DEG uncovered a larger metabolic (especially related to lipid) and inflammatory response capacity in liver compared with mammary tissue while the mammary tissue had a larger protein synthesis and secretion, proliferation/differentiation, signaling, and innate immune system capacity compared with the liver. A plethora of endogenous compounds, cytokines, and transcription factors were estimated to control the DEG between the two tissues. Compared with mammary tissue, the liver transcriptome appeared to be under control of a large array of ligand-dependent nuclear receptors and, among endogenous chemical, fatty acids and bacteria-derived compounds. Compared with liver, the transcriptome of the mammary tissue was potentially under control of a large number of growth factors and miRNA. The in silico crosstalk analysis between the two tissues revealed an overall large communication with a reciprocal control of lipid metabolism, innate immune system adaptation, and proliferation/differentiation. In summary the transcriptome analysis confirmed prior known differences between liver and mammary tissue, especially considering the indication of a larger metabolic activity in liver compared with the mammary tissue and the larger protein synthesis, communication, and proliferative capacity in mammary tissue compared with the liver. Relatively novel is the indication by the data that the transcriptome of the liver is highly regulated by dietary and bacteria-related compounds while the mammary transcriptome is more under control of hormones, growth factors, and miRNA. A large crosstalk between the two tissues with a reciprocal control of metabolism and innate immune-adaptation was indicated by the network analysis that allowed uncovering previously unknown crosstalk between liver and mammary tissue for several signaling molecules. PMID:28291785

  5. De Novo Assembly and Comparative Transcriptome Analyses of Red and Green Morphs of Sweet Basil Grown in Full Sunlight

    PubMed Central

    Torre, Sara; Tattini, Massimiliano; Brunetti, Cecilia; Guidi, Lucia; Gori, Antonella; Marzano, Cristina; Landi, Marco; Sebastiani, Federico

    2016-01-01

    Sweet basil (Ocimum basilicum), one of the most popular cultivated herbs worldwide, displays a number of varieties differing in several characteristics, such as the color of the leaves. The development of a reference transcriptome for sweet basil, and the analysis of differentially expressed genes in acyanic and cyanic cultivars exposed to natural sunlight irradiance, has interest from horticultural and biological point of views. There is still great uncertainty about the significance of anthocyanins in photoprotection, and how green and red morphs may perform when exposed to photo-inhibitory light, a condition plants face on daily and seasonal basis. We sequenced the leaf transcriptome of the green-leaved Tigullio (TIG) and the purple-leaved Red Rubin (RR) exposed to full sunlight over a four-week experimental period. We assembled and annotated 111,007 transcripts. A total of 5,468 and 5,969 potential SSRs were identified in TIG and RR, respectively, out of which 66 were polymorphic in silico. Comparative analysis of the two transcriptomes showed 2,372 differentially expressed genes (DEGs) clustered in 222 enriched Gene ontology terms. Green and red basil mostly differed for transcripts abundance of genes involved in secondary metabolism. While the biosynthesis of waxes was up-regulated in red basil, the biosynthesis of flavonols and carotenoids was up-regulated in green basil. Data from our study provides a comprehensive transcriptome survey, gene sequence resources and microsatellites that can be used for further investigations in sweet basil. The analysis of DEGs and their functional classification also offers new insights on the functional role of anthocyanins in photoprotection. PMID:27483170

  6. FusionAnalyser: a new graphical, event-driven tool for fusion rearrangements discovery

    PubMed Central

    Piazza, Rocco; Pirola, Alessandra; Spinelli, Roberta; Valletta, Simona; Redaelli, Sara; Magistroni, Vera; Gambacorti-Passerini, Carlo

    2012-01-01

    Gene fusions are common driver events in leukaemias and solid tumours; here we present FusionAnalyser, a tool dedicated to the identification of driver fusion rearrangements in human cancer through the analysis of paired-end high-throughput transcriptome sequencing data. We initially tested FusionAnalyser by using a set of in silico randomly generated sequencing data from 20 known human translocations occurring in cancer and subsequently using transcriptome data from three chronic and three acute myeloid leukaemia samples. in all the cases our tool was invariably able to detect the presence of the correct driver fusion event(s) with high specificity. In one of the acute myeloid leukaemia samples, FusionAnalyser identified a novel, cryptic, in-frame ETS2–ERG fusion. A fully event-driven graphical interface and a flexible filtering system allow complex analyses to be run in the absence of any a priori programming or scripting knowledge. Therefore, we propose FusionAnalyser as an efficient and robust graphical tool for the identification of functional rearrangements in the context of high-throughput transcriptome sequencing data. PMID:22570408

  7. FusionAnalyser: a new graphical, event-driven tool for fusion rearrangements discovery.

    PubMed

    Piazza, Rocco; Pirola, Alessandra; Spinelli, Roberta; Valletta, Simona; Redaelli, Sara; Magistroni, Vera; Gambacorti-Passerini, Carlo

    2012-09-01

    Gene fusions are common driver events in leukaemias and solid tumours; here we present FusionAnalyser, a tool dedicated to the identification of driver fusion rearrangements in human cancer through the analysis of paired-end high-throughput transcriptome sequencing data. We initially tested FusionAnalyser by using a set of in silico randomly generated sequencing data from 20 known human translocations occurring in cancer and subsequently using transcriptome data from three chronic and three acute myeloid leukaemia samples. in all the cases our tool was invariably able to detect the presence of the correct driver fusion event(s) with high specificity. In one of the acute myeloid leukaemia samples, FusionAnalyser identified a novel, cryptic, in-frame ETS2-ERG fusion. A fully event-driven graphical interface and a flexible filtering system allow complex analyses to be run in the absence of any a priori programming or scripting knowledge. Therefore, we propose FusionAnalyser as an efficient and robust graphical tool for the identification of functional rearrangements in the context of high-throughput transcriptome sequencing data.

  8. A case study of an integrative genomic and experimental therapeutic approach for rare tumors: identification of vulnerabilities in a pediatric poorly differentiated carcinoma.

    PubMed

    Dela Cruz, Filemon S; Diolaiti, Daniel; Turk, Andrew T; Rainey, Allison R; Ambesi-Impiombato, Alberto; Andrews, Stuart J; Mansukhani, Mahesh M; Nagy, Peter L; Alvarez, Mariano J; Califano, Andrea; Forouhar, Farhad; Modzelewski, Beata; Mitchell, Chelsey M; Yamashiro, Darrell J; Marks, Lianna J; Glade Bender, Julia L; Kung, Andrew L

    2016-10-31

    Precision medicine approaches are ideally suited for rare tumors where comprehensive characterization may have diagnostic, prognostic, and therapeutic value. We describe the clinical case and molecular characterization of an adolescent with metastatic poorly differentiated carcinoma (PDC). Given the rarity and poor prognosis associated with PDC in children, we utilized genomic analysis and preclinical models to validate oncogenic drivers and identify molecular vulnerabilities. We utilized whole exome sequencing (WES) and transcriptome analysis to identify germline and somatic alterations in the patient's tumor. In silico and in vitro studies were used to determine the functional consequences of genomic alterations. Primary tumor was used to generate a patient-derived xenograft (PDX) model, which was used for in vivo assessment of predicted therapeutic options. WES revealed a novel germline frameshift variant (p.E1554fs) in APC, establishing a diagnosis of Gardner syndrome, along with a somatic nonsense (p.R790*) APC mutation in the tumor. Somatic mutations in TP53, MAX, BRAF, ROS1, and RPTOR were also identified and transcriptome and immunohistochemical analyses suggested hyperactivation of the Wnt/ß-catenin and AKT/mTOR pathways. In silico and biochemical assays demonstrated that the MAX p.R60Q and BRAF p.K483E mutations were activating mutations, whereas the ROS1 and RPTOR mutations were of lower utility for therapeutic targeting. Utilizing a patient-specific PDX model, we demonstrated in vivo activity of mTOR inhibition with temsirolimus and partial response to inhibition of MEK. This clinical case illustrates the depth of investigation necessary to fully characterize the functional significance of the breadth of alterations identified through genomic analysis.

  9. Fungal proteomics: from identification to function.

    PubMed

    Doyle, Sean

    2011-08-01

    Some fungi cause disease in humans and plants, while others have demonstrable potential for the control of insect pests. In addition, fungi are also a rich reservoir of therapeutic metabolites and industrially useful enzymes. Detailed analysis of fungal biochemistry is now enabled by multiple technologies including protein mass spectrometry, genome and transcriptome sequencing and advances in bioinformatics. Yet, the assignment of function to fungal proteins, encoded either by in silico annotated, or unannotated genes, remains problematic. The purpose of this review is to describe the strategies used by many researchers to reveal protein function in fungi, and more importantly, to consolidate the nomenclature of 'unknown function protein' as opposed to 'hypothetical protein' - once any protein has been identified by protein mass spectrometry. A combination of approaches including comparative proteomics, pathogen-induced protein expression and immunoproteomics are outlined, which, when used in combination with a variety of other techniques (e.g. functional genomics, microarray analysis, immunochemical and infection model systems), appear to yield comprehensive and definitive information on protein function in fungi. The relative advantages of proteomic, as opposed to transcriptomic-only, analyses are also described. In the future, combined high-throughput, quantitative proteomics, allied to transcriptomic sequencing, are set to reveal much about protein function in fungi. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  10. Transcriptomics-based strain optimization tool for designing secondary metabolite overproducing strains of Streptomyces coelicolor.

    PubMed

    Kim, Minsuk; Yi, Jeong Sang; Lakshmanan, Meiyappan; Lee, Dong-Yup; Kim, Byung-Gee

    2016-03-01

    In silico model-driven analysis using genome-scale model of metabolism (GEM) has been recognized as a promising method for microbial strain improvement. However, most of the current GEM-based strain design algorithms based on flux balance analysis (FBA) heavily rely on the steady-state and optimality assumptions without considering any regulatory information. Thus, their practical usage is quite limited, especially in its application to secondary metabolites overproduction. In this study, we developed a transcriptomics-based strain optimization tool (tSOT) in order to overcome such limitations by integrating transcriptomic data into GEM. Initially, we evaluated existing algorithms for integrating transcriptomic data into GEM using Streptomyces coelicolor dataset, and identified iMAT algorithm as the only and the best algorithm for characterizing the secondary metabolism of S. coelicolor. Subsequently, we developed tSOT platform where iMAT is adopted to predict the reaction states, and successfully demonstrated its applicability to secondary metabolites overproduction by designing actinorhodin (ACT), a polyketide antibiotic, overproducing strain of S. coelicolor. Mutants overexpressing tSOT targets such as ribulose 5-phosphate 3-epimerase and NADP-dependent malic enzyme showed 2 and 1.8-fold increase in ACT production, thereby validating the tSOT prediction. It is expected that tSOT can be used for solving other metabolic engineering problems which could not be addressed by current strain design algorithms, especially for the secondary metabolite overproductions. © 2015 Wiley Periodicals, Inc.

  11. A Meta-Analysis of Multiple Matched Copy Number and Transcriptomics Data Sets for Inferring Gene Regulatory Relationships

    PubMed Central

    Newton, Richard; Wernisch, Lorenz

    2014-01-01

    Inferring gene regulatory relationships from observational data is challenging. Manipulation and intervention is often required to unravel causal relationships unambiguously. However, gene copy number changes, as they frequently occur in cancer cells, might be considered natural manipulation experiments on gene expression. An increasing number of data sets on matched array comparative genomic hybridisation and transcriptomics experiments from a variety of cancer pathologies are becoming publicly available. Here we explore the potential of a meta-analysis of thirty such data sets. The aim of our analysis was to assess the potential of in silico inference of trans-acting gene regulatory relationships from this type of data. We found sufficient correlation signal in the data to infer gene regulatory relationships, with interesting similarities between data sets. A number of genes had highly correlated copy number and expression changes in many of the data sets and we present predicted potential trans-acted regulatory relationships for each of these genes. The study also investigates to what extent heterogeneity between cell types and between pathologies determines the number of statistically significant predictions available from a meta-analysis of experiments. PMID:25148247

  12. In silico characterization and transcriptomic analysis of nif family genes from Anabaena sp. PCC7120.

    PubMed

    Singh, Shilpi; Shrivastava, Alok Kumar

    2017-10-01

    In silico approaches in conjunction with morphology, nitrogenase activity, and qRT-PCR explore the impact of selected abiotic stressor such as arsenic, salt, cadmium, copper, and butachlor on nitrogen fixing (nif family) genes of diazotrophic cyanobacterium Anabaena sp. PCC7120. A total of 19 nif genes are present within the Anabaena genome that is involved in the process of nitrogen fixation. Docking studies revealed the interaction between these nif gene-encoded proteins and the selected abiotic stressors which were further validated through decreased heterocyst frequency, fragmentation of filaments, and downregulation of nitrogenase activity under these stresses indicating towards their toxic impact on nitrogen fixation potential of filamentous cyanobacterium Anabaena sp. PCC7120. Another appealing finding of this study is even though having similar binding energy and similar interacting residues between arsenic/salt and copper/cadmium to nif-encoded proteins, arsenic and cadmium are more toxic than salt and copper for nitrogenase activity of Anabaena which is crucial for growth and yield of rice paddy and soil reclamation.

  13. Next-generation sequencing (NGS) transcriptomes reveal association of multiple genes and pathways contributing to secondary metabolites accumulation in tuberous roots of Aconitum heterophyllum Wall.

    PubMed

    Pal, Tarun; Malhotra, Nikhil; Chanumolu, Sree Krishna; Chauhan, Rajinder Singh

    2015-07-01

    The transcriptomes of Aconitum heterophyllum were assembled and characterized for the first time to decipher molecular components contributing to biosynthesis and accumulation of metabolites in tuberous roots. Aconitum heterophyllum Wall., popularly known as Atis, is a high-value medicinal herb of North-Western Himalayas. No information exists as of today on genetic factors contributing to the biosynthesis of secondary metabolites accumulating in tuberous roots, thereby, limiting genetic interventions towards genetic improvement of A. heterophyllum. Illumina paired-end sequencing followed by de novo assembly yielded 75,548 transcripts for root transcriptome and 39,100 transcripts for shoot transcriptome with minimum length of 200 bp. Biological role analysis of root versus shoot transcriptomes assigned 27,596 and 16,604 root transcripts; 12,340 and 9398 shoot transcripts into gene ontology and clusters of orthologous group, respectively. KEGG pathway mapping assigned 37 and 31 transcripts onto starch-sucrose metabolism while 329 and 341 KEGG orthologies associated with transcripts were found to be involved in biosynthesis of various secondary metabolites for root and shoot transcriptomes, respectively. In silico expression profiling of the mevalonate/2-C-methyl-D-erythritol 4-phosphate (non-mevalonate) pathway genes for aconites biosynthesis revealed 4 genes HMGR (3-hydroxy-3-methylglutaryl-CoA reductase), MVK (mevalonate kinase), MVDD (mevalonate diphosphate decarboxylase) and HDS (1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase) with higher expression in root transcriptome compared to shoot transcriptome suggesting their key role in biosynthesis of aconite alkaloids. Five genes, GMPase (geranyl diphosphate mannose pyrophosphorylase), SHAGGY, RBX1 (RING-box protein 1), SRF receptor kinases and β-amylase, implicated in tuberous root formation in other plant species showed higher levels of expression in tuberous roots compared to shoots. A total of 15,487 transcription factors belonging to bHLH, MYB, bZIP families and 399 ABC transporters which regulate biosynthesis and accumulation of bioactive compounds were identified in root and shoot transcriptomes. The expression of 5 ABC transporters involved in tuberous root development was validated by quantitative PCR analysis. Network connectivity diagrams were drawn for starch-sucrose metabolism and isoquinoline alkaloid biosynthesis associated with tuberous root growth and secondary metabolism, respectively, in root transcriptome of A. heterophyllum. The current endeavor will be of practical importance in planning a suitable genetic intervention strategy for the improvement of A. heterophyllum.

  14. Transcriptome-wide targets of alternative splicing by RBM4 and possible role in cancer.

    PubMed

    Markus, M Andrea; Yang, Yee Hwa J; Morris, Brian J

    2016-04-01

    This study determined transcriptome-wide targets of the splicing factor RBM4 using Affymetrix GeneChip(®) Human Exon 1.0 ST Arrays and HeLa cells treated with RBM4-specific siRNA. This revealed 238 transcripts that were targeted for alternative splicing. Cross-linking and immunoprecipitation experiments identified 945 RBM4 targets in mouse HEK293 cells, 39% of which were ascribed to "alternative splicing" by in silico pathway analysis. Mouse embryonic stem cells transfected with Rbm4 siRNA hairpins exhibited reduced colony numbers and size consistent with involvement of RBM4 in cell proliferation. RBM4 cDNA probing of a cancer cDNA array involving 18 different tumor types from 13 different tissues and matching normal tissue found overexpression of RBM4 mRNA (p<0.01) in cervical, breast, lung, colon, ovarian and rectal cancers. Many RBM4 targets we identified have been implicated in these cancers. In conclusion, our findings reveal transcriptome-wide targets of RBM4 and point to potential cancer-related targets and mechanisms that may involve RBM4. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. The transcriptome of nitrofen-induced pulmonary hypoplasia in the rat model of congenital diaphragmatic hernia.

    PubMed

    Mahood, Thomas H; Johar, Dina R; Iwasiow, Barbara M; Xu, Wayne; Keijzer, Richard

    2016-05-01

    We currently do not know how the herbicide nitrofen induces lung hypoplasia and congenital diaphragmatic hernia in rats. Our aim was to compare the differentially expressed transcriptome of nitrofen-induced hypoplastic lungs to control lungs in embryonic day 13 rat embryos before the development of embryonic diaphragmatic defects. Using next-generation sequencing technology, we identified the expression profile of microRNA (miRNA) and mRNA genes. Once the dataset was validated by both RT-qPCR and digital-PCR, we conducted gene ontology, miRNA target analysis, and orthologous miRNA sequence matching for the deregulated miRNAs in silico. Our study identified 186 known mRNA and 100 miRNAs which were differentially expressed in nitrofen-induced hypoplastic lungs. Sixty-four rat miRNAs homologous to known human miRNAs were identified. A subset of these genes may promote lung hypoplasia in rat and/or human, and we discuss their associations. Potential miRNA pathways relevant to nitrofen-induced lung hypoplasia include PI3K, TGF-β, and cell cycle kinases. Nitrofen-induced hypoplastic lungs have an abnormal transcriptome that may lead to impaired development.

  16. RNA-Seq transcriptome profiling of mouse oocytes after in vitro maturation and/or vitrification.

    PubMed

    Gao, Lei; Jia, Gongxue; Li, Ai; Ma, Haojia; Huang, Zhengyuan; Zhu, Shien; Hou, Yunpeng; Fu, Xiangwei

    2017-10-16

    In vitro maturation (IVM) and vitrification have been widely used to prepare oocytes before fertilization; however, potential effects of these procedures, such as expression profile changes, are poorly understood. In this study, mouse oocytes were divided into four groups and subjected to combinations of in vitro maturation and/or vitrification treatments. RNA-seq and in silico pathway analysis were used to identify differentially expressed genes (DEGs) that may be involved in oocyte viability after in vitro maturation and/or vitrification. Our results showed that 1) 69 genes were differentially expressed after IVM, 66 of which were up-regulated. Atp5e and Atp5o were enriched in the most significant gene ontology term "mitochondrial membrane part"; thus, these genes may be promising candidate biomarkers for oocyte viability after IVM. 2) The influence of vitrification on the transcriptome of oocytes was negligible, as no DEGs were found between vitrified and fresh oocytes. 3) The MII stage is more suitable for oocyte vitrification with respect to the transcriptome. This study provides a valuable new theoretical basis to further improve the efficiency of in vitro maturation and/or oocyte vitrification.

  17. Transcriptome dynamics in the asexual cycle of the chordate Botryllus schlosseri.

    PubMed

    Campagna, Davide; Gasparini, Fabio; Franchi, Nicola; Vitulo, Nicola; Ballin, Francesca; Manni, Lucia; Valle, Giorgio; Ballarin, Loriano

    2016-04-02

    We performed an analysis of the transcriptome during the blastogenesis of the chordate Botryllus schlosseri, focusing in particular on genes involved in cell death by apoptosis. The tunicate B. schlosseri is an ascidian forming colonies characterized by the coexistence of three blastogenetic generations: filter-feeding adults, buds on adults, and budlets on buds. Cyclically, adult tissues undergo apoptosis and are progressively resorbed and replaced by their buds originated by asexual reproduction. This is a feature of colonial tunicates, the only known chordates that can reproduce asexually. Thanks to a newly developed web-based platform ( http://botryllus.cribi.unipd.it ), we compared the transcriptomes of the mid-cycle, the pre-take-over, and the take-over phases of the colonial blastogenetic cycle. The platform is equipped with programs for comparative analysis and allows to select the statistical stringency. We enriched the genome annotation with 11,337 new genes; 581 transcripts were resolved as complete open reading frames, translated in silico into amino acid sequences and then aligned onto the non-redundant sequence database. Significant differentially expressed genes were classified within the gene ontology categories. Among them, we recognized genes involved in apoptosis activation, de-activation, and regulation. With the current work, we contributed to the improvement of the first released B. schlosseri genome assembly and offer an overview of the transcriptome changes during the blastogenetic cycle, showing up- and down-regulated genes. These results are important for the comprehension of the events underlying colony growth and regression, cell proliferation, colony homeostasis, and competition among different generations.

  18. Stem cells isolated from adipose tissue of obese patients show changes in their transcriptomic profile that indicate loss in stemcellness and increased commitment to an adipocyte-like phenotype

    PubMed Central

    2013-01-01

    Background The adipose tissue is an endocrine regulator and a risk factor for atherosclerosis and cardiovascular disease when by excessive accumulation induces obesity. Although the adipose tissue is also a reservoir for stem cells (ASC) their function and “stemcellness” has been questioned. Our aim was to investigate the mechanisms by which obesity affects subcutaneous white adipose tissue (WAT) stem cells. Results Transcriptomics, in silico analysis, real-time polymerase chain reaction (PCR) and western blots were performed on isolated stem cells from subcutaneous abdominal WAT of morbidly obese patients (ASCmo) and of non-obese individuals (ASCn). ASCmo and ASCn gene expression clustered separately from each other. ASCmo showed downregulation of “stemness” genes and upregulation of adipogenic and inflammatory genes with respect to ASCn. Moreover, the application of bioinformatics and Ingenuity Pathway Analysis (IPA) showed that the transcription factor Smad3 was tentatively affected in obese ASCmo. Validation of this target confirmed a significantly reduced Smad3 nuclear translocation in the isolated ASCmo. Conclusions The transcriptomic profile of the stem cells reservoir in obese subcutaneous WAT is highly modified with significant changes in genes regulating stemcellness, lineage commitment and inflammation. In addition to body mass index, cardiovascular risk factor clustering further affect the ASC transcriptomic profile inducing loss of multipotency and, hence, capacity for tissue repair. In summary, the stem cells in the subcutaneous WAT niche of obese patients are already committed to adipocyte differentiation and show an upregulated inflammatory gene expression associated to their loss of stemcellness. PMID:24040759

  19. Decoding the regulatory landscape of melanoma reveals TEADS as regulators of the invasive cell state

    PubMed Central

    Verfaillie, Annelien; Imrichova, Hana; Atak, Zeynep Kalender; Dewaele, Michael; Rambow, Florian; Hulselmans, Gert; Christiaens, Valerie; Svetlichnyy, Dmitry; Luciani, Flavie; Van den Mooter, Laura; Claerhout, Sofie; Fiers, Mark; Journe, Fabrice; Ghanem, Ghanem-Elias; Herrmann, Carl; Halder, Georg; Marine, Jean-Christophe; Aerts, Stein

    2015-01-01

    Transcriptional reprogramming of proliferative melanoma cells into a phenotypically distinct invasive cell subpopulation is a critical event at the origin of metastatic spreading. Here we generate transcriptome, open chromatin and histone modification maps of melanoma cultures; and integrate this data with existing transcriptome and DNA methylation profiles from tumour biopsies to gain insight into the mechanisms underlying this key reprogramming event. This shows thousands of genomic regulatory regions underlying the proliferative and invasive states, identifying SOX10/MITF and AP-1/TEAD as regulators, respectively. Knockdown of TEADs shows a previously unrecognized role in the invasive gene network and establishes a causative link between these transcription factors, cell invasion and sensitivity to MAPK inhibitors. Using regulatory landscapes and in silico analysis, we show that transcriptional reprogramming underlies the distinct cellular states present in melanoma. Furthermore, it reveals an essential role for the TEADs, linking it to clinically relevant mechanisms such as invasion and resistance. PMID:25865119

  20. In silico method for modelling metabolism and gene product expression at genome scale

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lerman, Joshua A.; Hyduke, Daniel R.; Latif, Haythem

    2012-07-03

    Transcription and translation use raw materials and energy generated metabolically to create the macromolecular machinery responsible for all cellular functions, including metabolism. A biochemically accurate model of molecular biology and metabolism will facilitate comprehensive and quantitative computations of an organism's molecular constitution as a function of genetic and environmental parameters. Here we formulate a model of metabolism and macromolecular expression. Prototyping it using the simple microorganism Thermotoga maritima, we show our model accurately simulates variations in cellular composition and gene expression. Moreover, through in silico comparative transcriptomics, the model allows the discovery of new regulons and improving the genome andmore » transcription unit annotations. Our method presents a framework for investigating molecular biology and cellular physiology in silico and may allow quantitative interpretation of multi-omics data sets in the context of an integrated biochemical description of an organism.« less

  1. Integrated analyses using RNA-Seq data reveal viral genomes, single nucleotide variations, the phylogenetic relationship, and recombination for Apple stem grooving virus.

    PubMed

    Jo, Yeonhwa; Choi, Hoseong; Kim, Sang-Min; Kim, Sun-Lim; Lee, Bong Choon; Cho, Won Kyong

    2016-08-09

    Next-generation sequencing (NGS) provides many possibilities for plant virology research. In this study, we performed integrated analyses using plant transcriptome data for plant virus identification using Apple stem grooving virus (ASGV) as an exemplar virus. We used 15 publicly available transcriptome libraries from three different studies, two mRNA-Seq studies and a small RNA-Seq study. We de novo assembled nearly complete genomes of ASGV isolates Fuji and Cuiguan from apple and pear transcriptomes, respectively, and identified single nucleotide variations (SNVs) of ASGV within the transcriptomes. We demonstrated the application of NGS raw data to confirm viral infections in the plant transcriptomes. In addition, we compared the usability of two de novo assemblers, Trinity and Velvet, for virus identification and genome assembly. A phylogenetic tree revealed that ASGV and Citrus tatter leaf virus (CTLV) are the same virus, which was divided into two clades. Recombination analyses identified six recombination events from 21 viral genomes. Taken together, our in silico analyses using NGS data provide a successful application of plant transcriptomes to reveal extensive information associated with viral genome assembly, SNVs, phylogenetic relationships, and genetic recombination.

  2. In Silico Comparative Transcriptome Analysis of Two Color Morphs of the Common Coral Trout (Plectropomus Leopardus)

    PubMed Central

    Wang, Le; Yu, Cuiping; Guo, Liang; Lin, Haoran; Meng, Zining

    2015-01-01

    The common coral trout is one species of major importance in commercial fisheries and aquaculture. Recently, two different color morphs of Plectropomus leopardus were discovered and the biological importance of the color difference is unknown. Since coral trout species are poorly characterized at the molecular level, we undertook the transcriptomic characterization of the two color morphs, one black and one red coral trout, using Illumina next generation sequencing technologies. The study produced 55162966 and 54588952 paired-end reads, for black and red trout, respectively. De novo transcriptome assembly generated 95367 and 99424 unique sequences in black and red trout, respectively, with 88813 sequences shared between them. Approximately 50% of both trancriptomes were functionally annotated by BLAST searches against protein databases. The two trancriptomes were enriched into 25 functional categories and showed similar profiles of Gene Ontology category compositions. 34110 unigenes were grouped into 259 KEGG pathways. Moreover, we identified 14649 simple sequence repeats (SSRs) and designed primers for potential application. We also discovered 130524 putative single nucleotide polymorphisms (SNPs) in the two transcriptomes, supplying potential genomic resources for the coral trout species. In addition, we identified 936 fast-evolving genes and 165 candidate genes under positive selection between the two color morphs. Finally, 38 candidate genes underlying the mechanism of color and pigmentation were also isolated. This study presents the first transcriptome resources for the common coral trout and provides basic information for the development of genomic tools for the identification, conservation, and understanding of the speciation and local adaptation of coral reef fish species. PMID:26713756

  3. Linking the salt transcriptome with physiological responses of a salt-resistant Populus species as a strategy to identify genes important for stress acclimation.

    PubMed

    Brinker, Monika; Brosché, Mikael; Vinocur, Basia; Abo-Ogiala, Atef; Fayyaz, Payam; Janz, Dennis; Ottow, Eric A; Cullmann, Andreas D; Saborowski, Joachim; Kangasjärvi, Jaakko; Altman, Arie; Polle, Andrea

    2010-12-01

    To investigate early salt acclimation mechanisms in a salt-tolerant poplar species (Populus euphratica), the kinetics of molecular, metabolic, and physiological changes during a 24-h salt exposure were measured. Three distinct phases of salt stress were identified by analyses of the osmotic pressure and the shoot water potential: dehydration, salt accumulation, and osmotic restoration associated with ionic stress. The duration and intensity of these phases differed between leaves and roots. Transcriptome analysis using P. euphratica-specific microarrays revealed clusters of coexpressed genes in these phases, with only 3% overlapping salt-responsive genes in leaves and roots. Acclimation of cellular metabolism to high salt concentrations involved remodeling of amino acid and protein biosynthesis and increased expression of molecular chaperones (dehydrins, osmotin). Leaves suffered initially from dehydration, which resulted in changes in transcript levels of mitochondrial and photosynthetic genes, indicating adjustment of energy metabolism. Initially, decreases in stress-related genes were found, whereas increases occurred only when leaves had restored the osmotic balance by salt accumulation. Comparative in silico analysis of the poplar stress regulon with Arabidopsis (Arabidopsis thaliana) orthologs was used as a strategy to reduce the number of candidate genes for functional analysis. Analysis of Arabidopsis knockout lines identified a lipocalin-like gene (AtTIL) and a gene encoding a protein with previously unknown functions (AtSIS) to play roles in salt tolerance. In conclusion, by dissecting the stress transcriptome of tolerant species, novel genes important for salt endurance can be identified.

  4. Linking the Salt Transcriptome with Physiological Responses of a Salt-Resistant Populus Species as a Strategy to Identify Genes Important for Stress Acclimation1[W][OA

    PubMed Central

    Brinker, Monika; Brosché, Mikael; Vinocur, Basia; Abo-Ogiala, Atef; Fayyaz, Payam; Janz, Dennis; Ottow, Eric A.; Cullmann, Andreas D.; Saborowski, Joachim; Kangasjärvi, Jaakko; Altman, Arie; Polle, Andrea

    2010-01-01

    To investigate early salt acclimation mechanisms in a salt-tolerant poplar species (Populus euphratica), the kinetics of molecular, metabolic, and physiological changes during a 24-h salt exposure were measured. Three distinct phases of salt stress were identified by analyses of the osmotic pressure and the shoot water potential: dehydration, salt accumulation, and osmotic restoration associated with ionic stress. The duration and intensity of these phases differed between leaves and roots. Transcriptome analysis using P. euphratica-specific microarrays revealed clusters of coexpressed genes in these phases, with only 3% overlapping salt-responsive genes in leaves and roots. Acclimation of cellular metabolism to high salt concentrations involved remodeling of amino acid and protein biosynthesis and increased expression of molecular chaperones (dehydrins, osmotin). Leaves suffered initially from dehydration, which resulted in changes in transcript levels of mitochondrial and photosynthetic genes, indicating adjustment of energy metabolism. Initially, decreases in stress-related genes were found, whereas increases occurred only when leaves had restored the osmotic balance by salt accumulation. Comparative in silico analysis of the poplar stress regulon with Arabidopsis (Arabidopsis thaliana) orthologs was used as a strategy to reduce the number of candidate genes for functional analysis. Analysis of Arabidopsis knockout lines identified a lipocalin-like gene (AtTIL) and a gene encoding a protein with previously unknown functions (AtSIS) to play roles in salt tolerance. In conclusion, by dissecting the stress transcriptome of tolerant species, novel genes important for salt endurance can be identified. PMID:20959419

  5. Shedding Some Light over the Floral Metabolism by Arum Lily (Zantedeschia aethiopica) Spathe De Novo Transcriptome Assembly

    PubMed Central

    Cândido, Elizabete de Souza; Fernandes, Gabriel da Rocha; de Alencar, Sérgio Amorim; Cardoso, Marlon Henrique e Silva; Lima, Stella Maris de Freitas; Miranda, Vívian de Jesus; Porto, William Farias; Nolasco, Diego Oliveira; de Oliveira-Júnior, Nelson Gomes; Barbosa, Aulus Estevão Anjos de Deus; Pogue, Robert Edward; Rezende, Taia Maria Berto; Dias, Simoni Campos; Franco, Octávio Luiz

    2014-01-01

    Zantedeschia aethiopica is an evergreen perennial plant cultivated worldwide and commonly used for ornamental and medicinal purposes including the treatment of bacterial infections. However, the current understanding of molecular and physiological mechanisms in this plant is limited, in comparison to other non-model plants. In order to improve understanding of the biology of this botanical species, RNA-Seq technology was used for transcriptome assembly and characterization. Following Z. aethiopica spathe tissue RNA extraction, high-throughput RNA sequencing was performed with the aim of obtaining both abundant and rare transcript data. Functional profiling based on KEGG Orthology (KO) analysis highlighted contigs that were involved predominantly in genetic information (37%) and metabolism (34%) processes. Predicted proteins involved in the plant circadian system, hormone signal transduction, secondary metabolism and basal immunity are described here. In silico screening of the transcriptome data set for antimicrobial peptide (AMP) –encoding sequences was also carried out and three lipid transfer proteins (LTP) were identified as potential AMPs involved in plant defense. Spathe predicted protein maps were drawn, and suggested that major plant efforts are expended in guaranteeing the maintenance of cell homeostasis, characterized by high investment in carbohydrate, amino acid and energy metabolism as well as in genetic information. PMID:24614014

  6. The Air Force In Silico -- Computational Biology in 2025

    DTIC Science & Technology

    2007-11-01

    and chromosome) these new fields are commonly referred to as “~omics.” Proteomics , transcriptomics, metabolomics , epigenomics, physiomics... Bioinformatics , 2006, http://journal.imbio.de/ http://www-bm.ipk-gatersleben.de/stable/php/ journal /articles/pdf/jib-22.pdf (accessed 30 September...Chirino, G. Tansley and I. Dryden, “The implications for Bioinformatics of integration across physical scales,” Journal of Integrative Bioinformatics

  7. Subtractive transcriptome analysis of leaf and rhizome reveals differentially expressed transcripts in Panax sokpayensis.

    PubMed

    Gurung, Bhusan; Bhardwaj, Pardeep K; Talukdar, Narayan C

    2016-11-01

    In the present study, suppression subtractive hybridization (SSH) strategy was used to identify rare and differentially expressed transcripts in leaf and rhizome tissues of Panax sokpayensis. Out of 1102 randomly picked clones, 513 and 374 high quality expressed sequenced tags (ESTs) were generated from leaf and rhizome subtractive libraries, respectively. Out of them, 64.92 % ESTs from leaf and 69.26 % ESTs from rhizome SSH libraries were assembled into different functional categories, while others were of unknown function. In particular, ESTs encoding galactinol synthase 2, ribosomal RNA processing Brix domain protein, and cell division cycle protein 20.1, which are involved in plant growth and development, were most abundant in the leaf SSH library. Other ESTs encoding protein KIAA0664 homologue, ubiquitin-activating enzyme e11, and major latex protein, which are involved in plant immunity and defense response, were most abundant in the rhizome SSH library. Subtractive ESTs also showed similarity with genes involved in ginsenoside biosynthetic pathway, namely farnesyl pyrophosphate synthase, squalene synthase, and dammarenediol synthase. Expression profiles of selected ESTs validated the quality of libraries and confirmed their differential expression in the leaf, stem, and rhizome tissues. In silico comparative analyses revealed that around 13.75 % of unigenes from the leaf SSH library were not represented in the available leaf transcriptome of Panax ginseng. Similarly, around 18.12, 23.75, 25, and 6.25 % of unigenes from the rhizome SSH library were not represented in available root/rhizome transcriptomes of P. ginseng, Panax notoginseng, Panax quinquefolius, and Panax vietnamensis, respectively, indicating a major fraction of novel ESTs. Therefore, these subtractive transcriptomes provide valuable resources for gene discovery in P. sokpayensis and would complement the available transcriptomes from other Panax species.

  8. In-Silico Identification Of Micro-Loops In Myelodysplastic Syndromes

    NASA Astrophysics Data System (ADS)

    Beck, Dominik; Brandl, Miriam; Pham, Tuan D.; Chang, Chung-Che; Zhou, Xiaobo

    2011-06-01

    Micro-loops are regulatory network motifs that leverage transcriptional and posttranscriptional control to effectively regulate the transcriptome. In this paper a regulatory network for Myelodysplastic Syndromes (MDSs) was constructed from the literature and publicly available data sources. The network was filtered using data from deep-sequencing of small RNAs, exon and microarrays. Motif discovery showed that micro-loops might exist in MDS. We further used the identified micro-loops and performed basic network analysis to identify the known disease gene RUNX1/AML, as well as miRNA family hsa-mir-181. This suggested that the concept of micro-loops can be applied to enhance disease gene identification and biomarker discovery.

  9. Reconstruction of the Fatty Acid Biosynthetic Pathway of Exiguobacterium antarcticum B7 Based on Genomic and Bibliomic Data.

    PubMed

    Kawasaki, Regiane; Baraúna, Rafael A; Silva, Artur; Carepo, Marta S P; Oliveira, Rui; Marques, Rodolfo; Ramos, Rommel T J; Schneider, Maria P C

    2016-01-01

    Exiguobacterium antarcticum B7 is extremophile Gram-positive bacteria able to survive in cold environments. A key factor to understanding cold adaptation processes is related to the modification of fatty acids composing the cell membranes of psychrotrophic bacteria. In our study we show the in silico reconstruction of the fatty acid biosynthesis pathway of E. antarcticum B7. To build the stoichiometric model, a semiautomatic procedure was applied, which integrates genome information using KEGG and RAST/SEED. Constraint-based methods, namely, Flux Balance Analysis (FBA) and elementary modes (EM), were applied. FBA was implemented in the sense of hexadecenoic acid production maximization. To evaluate the influence of the gene expression in the fluxome analysis, FBA was also calculated using the log2⁡FC values obtained in the transcriptome analysis at 0°C and 37°C. The fatty acid biosynthesis pathway showed a total of 13 elementary flux modes, four of which showed routes for the production of hexadecenoic acid. The reconstructed pathway demonstrated the capacity of E. antarcticum B7 to de novo produce fatty acid molecules. Under the influence of the transcriptome, the fluxome was altered, promoting the production of short-chain fatty acids. The calculated models contribute to better understanding of the bacterial adaptation at cold environments.

  10. An in-depth comparison of the porcine, murine and human inflammasomes; lessons from the porcine genome and transcriptome.

    PubMed

    Dawson, Harry D; Smith, Allen D; Chen, Celine; Urban, Joseph F

    2017-04-01

    Emerging evidence suggests that swine are a scientifically acceptable intermediate species between rodents and humans to model immune function relevant to humans. The swine genome has recently been sequenced and several preliminary structural and functional analysis of the porcine immunome have been published. Herein we provide an expanded in silico analysis using an improved assembly of the porcine transcriptome that provides an in depth analysis of genes that are related to inflammasomes, responses to Toll-like receptor ligands, and M1 macrophage polarization and Escherichia coli as a model organism. Comparisons of the expansion or contraction of orthologous gene families indicated more similar rates and classes of genes in humans and pigs than in mice; however several novel porcine or artiodactyl-specific paralogs or pseudogenes were identified. Conservation of homology and structural motifs of orthologs revealed that the overall similarity to human proteins was significantly higher for pigs compared to mouse. Despite these similarities, two out of four canonical inflammasome pathways, Absent in melanoma 2 (AIM2) and NLR family and CARD domain containing 4 (NLRC4), were found to be missing in pigs. Pig M1 Mφ polarization in response to interferon-γ (IFN-γ) and lipopolysaccharide (LPS) was assessed, via the transcriptome, using next generation sequencing. Our analysis revealed predominantly human-like responses however some, mouse-like responses were observed, as well as induction of numerous pig or artiodactyl-specific genes. This work supports using swine to model both human immunological and inflammatory responses to infection. However, caution must be exercised as pigs differ from humans in several fundamental pathways. Published by Elsevier B.V.

  11. Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh].

    PubMed

    Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K

    2011-01-20

    Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.

  12. Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh

    PubMed Central

    2011-01-01

    Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263

  13. Analysis of the floral transcriptome of Tarenaya hassleriana (Cleomaceae), a member of the sister group to the Brassicaceae: towards understanding the base of morphological diversity in Brassicales

    PubMed Central

    2014-01-01

    Background Arabidopsis thaliana, a member of the Brassicaceae family is the dominant genetic model plant. However, while the flowers within the Brassicaceae members are rather uniform, mainly radially symmetrical, mostly white with fixed organ numbers, species within the Cleomaceae, the sister family to the Brassicaceae show a more variable floral morphology. We were interested in understanding the molecular basis for these morphological differences. To this end, the floral transcriptome of a hybrid Tarenaya hassleriana, a Cleomaceae with monosymmetric, bright purple flowers was sequenced, annotated and analyzed in respect to floral regulators. Results We obtained a comprehensive floral transcriptome with high depth and coverage close to saturation analyzed using rarefaction analysis a method well known in biodiversity studies. Gene expression was analyzed by calculating reads per kilobase gene model per million reads (RPKM) and for selected genes in silico expression data was corroborated by qRT-PCR analysis. Candidate transcription factors were identified based on differences in expression pattern between A. thaliana and T. hassleriana, which are likely key regulators of the T. hassleriana specific floral characters such as coloration and male sterility in the hybrid plant used. Analysis of lineage specific genes was carried out with members of the fabids and malvids. Conclusions The floral transcriptome of T. hassleriana provides insights into key pathways involved in the regulation of late anthocyanin biosynthesis, male fertility, flowering time and organ growth regulation which are unique traits compared the model organism A. thaliana. Analysis of lineage specific genes carried out with members of the fabids and malvids suggests an extensive gene birth rate in the lineage leading to core Brassicales while only few genes were potentially lost during core Brassicales evolution, which possibly reflects the result of the At-β whole genome duplication. Our analysis should facilitate further analyses into the molecular mechanisms of floral morphogenesis and pigmentation and the mechanisms underlying the rather diverse floral morphologies in the Cleomaceae. PMID:24548348

  14. Transcriptome Wide Identification and Validation of Calcium Sensor Gene Family in the Developing Spikes of Finger Millet Genotypes for Elucidating Its Role in Grain Calcium Accumulation

    PubMed Central

    Singh, Uma M.; Chandra, Muktesh; Shankhdhar, Shailesh C.; Kumar, Anil

    2014-01-01

    Background In finger millet, calcium is one of the important and abundant mineral elements. The molecular mechanisms involved in calcium accumulation in plants remains poorly understood. Transcriptome sequencing of genetically diverse genotypes of finger millet differing in grain calcium content will help in understanding the trait. Principal Finding In this study, the transcriptome sequencing of spike tissues of two genotypes of finger millet differing in their grain calcium content, were performed for the first time. Out of 109,218 contigs, 78 contigs in case of GP-1 (Low Ca genotype) and out of 120,130 contigs 76 contigs in case of GP-45 (High Ca genotype), were identified as calcium sensor genes. Through in silico analysis all 82 unique calcium sensor genes were classified into eight calcium sensor gene family viz., CaM & CaMLs, CBLs, CIPKs, CRKs, PEPRKs, CDPKs, CaMKs and CCaMK. Out of 82 genes, 12 were found diverse from the rice orthologs. The differential expression analysis on the basis of FPKM value resulted in 24 genes highly expressed in GP-45 and 11 genes highly expressed in GP-1. Ten of the 35 differentially expressed genes could be assigned to three documented pathways involved mainly in stress responses. Furthermore, validation of selected calcium sensor responder genes was also performed by qPCR, in developing spikes of both genotypes grown on different concentration of exogenous calcium. Conclusion Through de novo transcriptome data assembly and analysis, we reported the comprehensive identification and functional characterization of calcium sensor gene family. The calcium sensor gene family identified and characterized in this study will facilitate in understanding the molecular basis of calcium accumulation and development of calcium biofortified crops. Moreover, this study also supported that identification and characterization of gene family through Illumina paired-end sequencing is a potential tool for generating the genomic information of gene family in non-model species. PMID:25157851

  15. Transcriptome wide identification and validation of calcium sensor gene family in the developing spikes of finger millet genotypes for elucidating its role in grain calcium accumulation.

    PubMed

    Singh, Uma M; Chandra, Muktesh; Shankhdhar, Shailesh C; Kumar, Anil

    2014-01-01

    In finger millet, calcium is one of the important and abundant mineral elements. The molecular mechanisms involved in calcium accumulation in plants remains poorly understood. Transcriptome sequencing of genetically diverse genotypes of finger millet differing in grain calcium content will help in understanding the trait. In this study, the transcriptome sequencing of spike tissues of two genotypes of finger millet differing in their grain calcium content, were performed for the first time. Out of 109,218 contigs, 78 contigs in case of GP-1 (Low Ca genotype) and out of 120,130 contigs 76 contigs in case of GP-45 (High Ca genotype), were identified as calcium sensor genes. Through in silico analysis all 82 unique calcium sensor genes were classified into eight calcium sensor gene family viz., CaM & CaMLs, CBLs, CIPKs, CRKs, PEPRKs, CDPKs, CaMKs and CCaMK. Out of 82 genes, 12 were found diverse from the rice orthologs. The differential expression analysis on the basis of FPKM value resulted in 24 genes highly expressed in GP-45 and 11 genes highly expressed in GP-1. Ten of the 35 differentially expressed genes could be assigned to three documented pathways involved mainly in stress responses. Furthermore, validation of selected calcium sensor responder genes was also performed by qPCR, in developing spikes of both genotypes grown on different concentration of exogenous calcium. Through de novo transcriptome data assembly and analysis, we reported the comprehensive identification and functional characterization of calcium sensor gene family. The calcium sensor gene family identified and characterized in this study will facilitate in understanding the molecular basis of calcium accumulation and development of calcium biofortified crops. Moreover, this study also supported that identification and characterization of gene family through Illumina paired-end sequencing is a potential tool for generating the genomic information of gene family in non-model species.

  16. RNA sequencing, de novo assembly and differential analysis of the gill transcriptome of freshwater climbing perch Anabas testudineus after six days of seawater exposure.

    PubMed

    Chen, X L; Lui, E Y; Ip, Y Kwong; Lam, S H

    2018-06-21

    To obtain transcriptomic insights into branchial responses to salinity challenge in Anabas testudineus, this study employed RNA sequencing (RNA-Seq) to analyse the gill transcriptome of A. testudineus exposed to seawater (SW) for 6 days compared with the freshwater (FW) control group. A combined FW and SW gill transcriptome was de novo assembled from 169.9 million 101 bp paired-end reads. In silico validation employing 17 A. testudineus Sanger full-length coding sequences showed that 15/17 of them had greater than 80% of their sequences aligned to the de novo assembled contigs where 5/17 had their full-length (100%) aligned and 9/17 had greater than 90% of their sequences aligned. The combined FW and SW gill transcriptome was mapped to 13780 unique human identifiers at E-value < 1.0E-20 while 952 and 886 identifiers were determined as up and down-regulated by 1.5 fold, respectively, in the gills of A. testudineus in SW when compared with FW. These genes were found to be associated with at least 23 biological processes. A larger proportion of genes encoding enzymes and transporters associated with molecular transport, energy production, metabolisms were up-regulated, while a larger proportion of genes encoding transmembrane receptors, G-protein coupled receptors, kinases and transcription regulators associated with cell cycle, growth, development, signalling, morphology and gene expression were relatively lower in the gills of A. testudineus in SW when compared with FW. High correlation (R = 0.99) was observed between RNA-Seq data and real-time quantitative PCR validation for 13 selected genes. The transcriptomic sequence information will facilitate development of molecular resources and tools while the findings will provide insights for future studies into branchial iono-osmoregulation and related cellular processes in A. testudineus. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  17. Application of Genomic Technologies to the Breeding of Trees

    PubMed Central

    Badenes, Maria L.; Fernández i Martí, Angel; Ríos, Gabino; Rubio-Cabetas, María J.

    2016-01-01

    The recent introduction of next generation sequencing (NGS) technologies represents a major revolution in providing new tools for identifying the genes and/or genomic intervals controlling important traits for selection in breeding programs. In perennial fruit trees with long generation times and large sizes of adult plants, the impact of these techniques is even more important. High-throughput DNA sequencing technologies have provided complete annotated sequences in many important tree species. Most of the high-throughput genotyping platforms described are being used for studies of genetic diversity and population structure. Dissection of complex traits became possible through the availability of genome sequences along with phenotypic variation data, which allow to elucidate the causative genetic differences that give rise to observed phenotypic variation. Association mapping facilitates the association between genetic markers and phenotype in unstructured and complex populations, identifying molecular markers for assisted selection and breeding. Also, genomic data provide in silico identification and characterization of genes and gene families related to important traits, enabling new tools for molecular marker assisted selection in tree breeding. Deep sequencing of transcriptomes is also a powerful tool for the analysis of precise expression levels of each gene in a sample. It consists in quantifying short cDNA reads, obtained by NGS technologies, in order to compare the entire transcriptomes between genotypes and environmental conditions. The miRNAs are non-coding short RNAs involved in the regulation of different physiological processes, which can be identified by high-throughput sequencing of RNA libraries obtained by reverse transcription of purified short RNAs, and by in silico comparison with known miRNAs from other species. All together, NGS techniques and their applications have increased the resources for plant breeding in tree species, closing the former gap of genetic tools between trees and annual species. PMID:27895664

  18. Application of Genomic Technologies to the Breeding of Trees.

    PubMed

    Badenes, Maria L; Fernández I Martí, Angel; Ríos, Gabino; Rubio-Cabetas, María J

    2016-01-01

    The recent introduction of next generation sequencing (NGS) technologies represents a major revolution in providing new tools for identifying the genes and/or genomic intervals controlling important traits for selection in breeding programs. In perennial fruit trees with long generation times and large sizes of adult plants, the impact of these techniques is even more important. High-throughput DNA sequencing technologies have provided complete annotated sequences in many important tree species. Most of the high-throughput genotyping platforms described are being used for studies of genetic diversity and population structure. Dissection of complex traits became possible through the availability of genome sequences along with phenotypic variation data, which allow to elucidate the causative genetic differences that give rise to observed phenotypic variation. Association mapping facilitates the association between genetic markers and phenotype in unstructured and complex populations, identifying molecular markers for assisted selection and breeding. Also, genomic data provide in silico identification and characterization of genes and gene families related to important traits, enabling new tools for molecular marker assisted selection in tree breeding. Deep sequencing of transcriptomes is also a powerful tool for the analysis of precise expression levels of each gene in a sample. It consists in quantifying short cDNA reads, obtained by NGS technologies, in order to compare the entire transcriptomes between genotypes and environmental conditions. The miRNAs are non-coding short RNAs involved in the regulation of different physiological processes, which can be identified by high-throughput sequencing of RNA libraries obtained by reverse transcription of purified short RNAs, and by in silico comparison with known miRNAs from other species. All together, NGS techniques and their applications have increased the resources for plant breeding in tree species, closing the former gap of genetic tools between trees and annual species.

  19. Transcriptome sequencing of different narrow-leafed lupin tissue types provides a comprehensive uni-gene assembly and extensive gene-based molecular markers

    PubMed Central

    Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B

    2015-01-01

    Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816

  20. Global analysis of gene expression in maize leaves treated with low temperature. II. Combined effect of severe cold (8 °C) and circadian rhythm.

    PubMed

    Jończyk, M; Sobkowiak, A; Trzcinska-Danielewicz, J; Skoneczny, M; Solecka, D; Fronk, J; Sowiński, P

    2017-10-01

    In maize seedlings, severe cold results in dysregulation of circadian pattern of gene expression causing profound modulation of transcription of genes related to photosynthesis and other key biological processes. Plants live highly cyclic life and their response to environmental stresses must allow for underlying biological rhythms. To study the interplay of a stress and a rhythmic cue we investigated transcriptomic response of maize seedlings to low temperature in the context of diurnal gene expression. Severe cold stress had pronounced effect on the circadian rhythm of a substantial proportion of genes. Their response was strikingly dual, comprising either flattening (partial or complete) of the diel amplitude or delay of expression maximum/minimum by several hours. Genes encoding central oscillator components behaved in the same dual manner, unlike their Arabidopsis counterparts reported earlier to cease cycling altogether upon cold treatment. Also numerous genes lacking circadian rhythm responded to the cold by undergoing up- or down-regulation. Notably, the transcriptome changes preceded major physiological manifestations of cold stress. In silico analysis of metabolic processes likely affected by observed gene expression changes indicated major down-regulation of photosynthesis, profound and multifarious modulation of plant hormone levels, and of chromatin structure, transcription, and translation. A role of trehalose and stachyose in cold stress signaling was also suggested. Meta-analysis of published transcriptomic data allowed discrimination between general stress response of maize and that unique to severe cold. Several cis- and trans-factors likely involved in the latter were predicted, albeit none of them seemed to have a major role. These results underscore a key role of modulation of diel gene expression in maize response to severe cold and the unique character of the cold-response of the maize circadian clock.

  1. An in silico argument for mitochondrial microRNA as a determinant of primary non function in liver transplantation.

    PubMed

    Khorsandi, Shirin Elizabeth; Salehi, Siamak; Cortes, Miriam; Vilca-Melendez, Hector; Menon, Krishna; Srinivasan, Parthi; Prachalias, Andreas; Jassem, Wayel; Heaton, Nigel

    2018-02-15

    Mitochondria have their own genomic, transcriptomic and proteomic machinery but are unable to be autonomous, needing both nuclear and mitochondrial genomes. The aim of this work was to use computational biology to explore the involvement of Mitochondrial microRNAs (MitomiRs) and their interactions with the mitochondrial proteome in a clinical model of primary non function (PNF) of the donor after cardiac death (DCD) liver. Archival array data on the differential expression of miRNA in DCD PNF was re-analyzed using a number of publically available computational algorithms. 10 MitomiRs were identified of importance in DCD PNF, 7 with predicted interaction of their seed sequence with the mitochondrial transcriptome that included both coding, and non coding areas of the hypervariability region 1 (HVR1) and control region. Considering miRNA regulation of the nuclear encoded mitochondrial proteome, 7 hypothetical small proteins were identified with homolog function that ranged from co-factor for formation of ATP Synthase, REDOX balance and an importin/exportin protein. In silico, unconventional seed interactions, both non canonical and alternative seed sites, appear to be of greater importance in MitomiR regulation of the mitochondrial genome. Additionally, a number of novel small proteins of relevance in transplantation have been identified which need further characterization.

  2. RNA-Seq mediated root transcriptome analysis of Chlorophytum borivilianum for identification of genes involved in saponin biosynthesis.

    PubMed

    Kumar, Sunil; Kalra, Shikha; Singh, Baljinder; Kumar, Avneesh; Kaur, Jagdeep; Singh, Kashmir

    2016-01-01

    Chlorophytum borivilianum is an important species of liliaceae family, owing to its vital medicinal properties. Plant roots are used for aphrodisiac, adaptogen, anti-aging, health-restorative and health-promoting purposes. Saponins, are considered to be the principal bioactive components responsible for the wide variety of pharmacological properties of this plant. In the present study, we have performed de novo root transcriptome sequencing of C. borivilianum using Illumina Hiseq 2000 platform, to gain molecular insight into saponins biosynthesis. A total of 33,963,356 high-quality reads were obtained after quality filtration. Sequences were assembled using various programs which generated 97,344 transcripts with a size range of 100-5,216 bp and N50 value of 342. Data was analyzed against non-redundant proteins, gene ontology (GO), and enzyme commission (EC) databases. All the genes involved in saponins biosynthesis along with five full-length genes namely farnesyl pyrophosphate synthase, cycloartenol synthase, β-amyrin synthase, cytochrome p450, and sterol-3-glucosyltransferase were identified. Read per exon kilobase per million (RPKM)-based comparative expression profiling was done to study the differential regulation of the genes. In silico expression analysis of seven selected genes of saponin biosynthetic pathway was validated by qRT-PCR.

  3. Single-cell entropy for accurate estimation of differentiation potency from a cell's transcriptome

    NASA Astrophysics Data System (ADS)

    Teschendorff, Andrew E.; Enver, Tariq

    2017-06-01

    The ability to quantify differentiation potential of single cells is a task of critical importance. Here we demonstrate, using over 7,000 single-cell RNA-Seq profiles, that differentiation potency of a single cell can be approximated by computing the signalling promiscuity, or entropy, of a cell's transcriptome in the context of an interaction network, without the need for feature selection. We show that signalling entropy provides a more accurate and robust potency estimate than other entropy-based measures, driven in part by a subtle positive correlation between the transcriptome and connectome. Signalling entropy identifies known cell subpopulations of varying potency and drug resistant cancer stem-cell phenotypes, including those derived from circulating tumour cells. It further reveals that expression heterogeneity within single-cell populations is regulated. In summary, signalling entropy allows in silico estimation of the differentiation potency and plasticity of single cells and bulk samples, providing a means to identify normal and cancer stem-cell phenotypes.

  4. Single-cell entropy for accurate estimation of differentiation potency from a cell's transcriptome

    PubMed Central

    Teschendorff, Andrew E.; Enver, Tariq

    2017-01-01

    The ability to quantify differentiation potential of single cells is a task of critical importance. Here we demonstrate, using over 7,000 single-cell RNA-Seq profiles, that differentiation potency of a single cell can be approximated by computing the signalling promiscuity, or entropy, of a cell's transcriptome in the context of an interaction network, without the need for feature selection. We show that signalling entropy provides a more accurate and robust potency estimate than other entropy-based measures, driven in part by a subtle positive correlation between the transcriptome and connectome. Signalling entropy identifies known cell subpopulations of varying potency and drug resistant cancer stem-cell phenotypes, including those derived from circulating tumour cells. It further reveals that expression heterogeneity within single-cell populations is regulated. In summary, signalling entropy allows in silico estimation of the differentiation potency and plasticity of single cells and bulk samples, providing a means to identify normal and cancer stem-cell phenotypes. PMID:28569836

  5. Reconstruction of the Fatty Acid Biosynthetic Pathway of Exiguobacterium antarcticum B7 Based on Genomic and Bibliomic Data

    PubMed Central

    Kawasaki, Regiane; Carepo, Marta S. P.; Oliveira, Rui; Marques, Rodolfo; Ramos, Rommel T. J.; Schneider, Maria P. C.

    2016-01-01

    Exiguobacterium antarcticum B7 is extremophile Gram-positive bacteria able to survive in cold environments. A key factor to understanding cold adaptation processes is related to the modification of fatty acids composing the cell membranes of psychrotrophic bacteria. In our study we show the in silico reconstruction of the fatty acid biosynthesis pathway of E. antarcticum B7. To build the stoichiometric model, a semiautomatic procedure was applied, which integrates genome information using KEGG and RAST/SEED. Constraint-based methods, namely, Flux Balance Analysis (FBA) and elementary modes (EM), were applied. FBA was implemented in the sense of hexadecenoic acid production maximization. To evaluate the influence of the gene expression in the fluxome analysis, FBA was also calculated using the log2⁡FC values obtained in the transcriptome analysis at 0°C and 37°C. The fatty acid biosynthesis pathway showed a total of 13 elementary flux modes, four of which showed routes for the production of hexadecenoic acid. The reconstructed pathway demonstrated the capacity of E. antarcticum B7 to de novo produce fatty acid molecules. Under the influence of the transcriptome, the fluxome was altered, promoting the production of short-chain fatty acids. The calculated models contribute to better understanding of the bacterial adaptation at cold environments. PMID:27595107

  6. Integrated Quantitative Transcriptome Maps of Human Trisomy 21 Tissues and Cells

    PubMed Central

    Pelleri, Maria Chiara; Cattani, Chiara; Vitale, Lorenza; Antonaros, Francesca; Strippoli, Pierluigi; Locatelli, Chiara; Cocchi, Guido; Piovesan, Allison; Caracausi, Maria

    2018-01-01

    Down syndrome (DS) is due to the presence of an extra full or partial chromosome 21 (Hsa21). The identification of genes contributing to DS pathogenesis could be the key to any rational therapy of the associated intellectual disability. We aim at generating quantitative transcriptome maps in DS integrating all gene expression profile datasets available for any cell type or tissue, to obtain a complete model of the transcriptome in terms of both expression values for each gene and segmental trend of gene expression along each chromosome. We used the TRAM (Transcriptome Mapper) software for this meta-analysis, comparing transcript expression levels and profiles between DS and normal brain, lymphoblastoid cell lines, blood cells, fibroblasts, thymus and induced pluripotent stem cells, respectively. TRAM combined, normalized, and integrated datasets from different sources and across diverse experimental platforms. The main output was a linear expression value that may be used as a reference for each of up to 37,181 mapped transcripts analyzed, related to both known genes and expression sequence tag (EST) clusters. An independent example in vitro validation of fibroblast transcriptome map data was performed through “Real-Time” reverse transcription polymerase chain reaction showing an excellent correlation coefficient (r = 0.93, p < 0.0001) with data obtained in silico. The availability of linear expression values for each gene allowed the testing of the gene dosage hypothesis of the expected 3:2 DS/normal ratio for Hsa21 as well as other human genes in DS, in addition to listing genes differentially expressed with statistical significance. Although a fraction of Hsa21 genes escapes dosage effects, Hsa21 genes are selectively over-expressed in DS samples compared to genes from other chromosomes, reflecting a decisive role in the pathogenesis of the syndrome. Finally, the analysis of chromosomal segments reveals a high prevalence of Hsa21 over-expressed segments over the other genomic regions, suggesting, in particular, a specific region on Hsa21 that appears to be frequently over-expressed (21q22). Our complete datasets are released as a new framework to investigate transcription in DS for individual genes as well as chromosomal segments in different cell types and tissues. PMID:29740474

  7. Transcriptome Profiles of the Protoscoleces of Echinococcus granulosus Reveal that Excretory-Secretory Products Are Essential to Metabolic Adaptation

    PubMed Central

    Pan, Wei; Shen, Yujuan; Han, Xiuming; Wang, Ying; Liu, Hua; Jiang, Yanyan; Zhang, Yumei; Wang, Yanjuan; Xu, Yuxin; Cao, Jianping

    2014-01-01

    Background Cystic hydatid disease (CHD) is caused by the larval stages of the cestode and affects humans and domestic animals worldwide. Protoscoleces (PSCs) are one component of the larval stages that can interact with both definitive and intermediate hosts. Previous genomic and transcriptomic data have provided an overall snapshot of the genomics of the growth and development of this parasite. However, our understanding of how PSCs subvert the immune response of hosts and maintains metabolic adaptation remains unclear. In this study, we used Roche 454 sequencing technology and in silico secretome analysis to explore the transcriptome profiles of the PSCs from E. granulosus and elucidate the potential functions of the excretory-secretory proteins (ESPs) released by the parasite. Methodology/Principal Findings A large number of nonredundant sequences as unigenes were generated (26,514), of which 22,910 (86.4%) were mapped to the newly published E. granulosus genome and 17,705 (66.8%) were distributed within the coding sequence (CDS) regions. Of the 2,280 ESPs predicted from the transcriptome, 138 ESPs were inferred to be involved in the metabolism of carbohydrates, while 124 ESPs were inferred to be involved in the metabolism of protein. Eleven ESPs were identified as intracellular enzymes that regulate glycolysis/gluconeogenesis (GL/GN) pathways, while a further 44 antigenic proteins, 25 molecular chaperones and four proteases were highly represented. Many proteins were also found to be significantly enriched in development-related signaling pathways, such as the TGF-β receptor pathways and insulin pathways. Conclusions/Significance This study provides valuable information on the metabolic adaptation of parasites to their hosts that can be used to aid the development of novel intervention targets for hydatid treatment and control. PMID:25500817

  8. Prediction of the neuropeptidomes of members of the Astacidea (Crustacea, Decapoda) using publicly accessible transcriptome shotgun assembly (TSA) sequence data.

    PubMed

    Christie, Andrew E; Chi, Megan

    2015-12-01

    The decapod infraorder Astacidea is comprised of clawed lobsters and freshwater crayfish. Due to their economic importance and their use as models for investigating neurochemical signaling, much work has focused on elucidating their neurochemistry, particularly their peptidergic systems. Interestingly, no astacidean has been the subject of large-scale peptidomic analysis via in silico transcriptome mining, this despite growing transcriptomic resources for members of this taxon. Here, the publicly accessible astacidean transcriptome shotgun assembly data were mined for putative peptide-encoding transcripts; these sequences were used to predict the structures of mature neuropeptides. One hundred seventy-six distinct peptides were predicted for Procambarus clarkii, including isoforms of adipokinetic hormone-corazonin-like peptide (ACP), allatostatin A (AST-A), allatostatin B, allatostatin C (AST-C) bursicon α, bursicon β, CCHamide, crustacean hyperglycemic hormone (CHH)/ion transport peptide (ITP), diuretic hormone 31 (DH31), eclosion hormone (EH), FMRFamide-like peptide, GSEFLamide, intocin, leucokinin, neuroparsin, neuropeptide F, pigment dispersing hormone, pyrokinin, RYamide, short neuropeptide F (sNPF), SIFamide, sulfakinin and tachykinin-related peptide (TRP). Forty-six distinct peptides, including isoforms of AST-A, AST-C, bursicon α, CCHamide, CHH/ITP, DH31, EH, intocin, myosuppressin, neuroparsin, red pigment concentrating hormone, sNPF and TRP, were predicted for Pontastacus leptodactylus, with a bursicon β and a neuroparsin predicted for Cherax quadricarinatus. The identification of ACP is the first from a decapod, while the predictions of CCHamide, EH, GSEFLamide, intocin, neuroparsin and RYamide are firsts for the Astacidea. Collectively, these data greatly expand the catalog of known astacidean neuropeptides and provide a foundation for functional studies of peptidergic signaling in members of this decapod infraorder. Copyright © 2015 Elsevier Inc. All rights reserved.

  9. Transcriptome-wide identification and characterization of CAD isoforms specific for podophyllotoxin biosynthesis from Podophyllum hexandrum.

    PubMed

    Bhattacharyya, Dipto; Hazra, Saptarshi; Banerjee, Anindyajit; Datta, Riddhi; Kumar, Deepak; Chakrabarti, Saikat; Chattopadhyay, Sharmila

    2016-09-01

    Podophyllotoxin (ptox) is a therapeutically important lignan derived from Podophyllum hexandrum and is used as a precursor for the synthesis of anticancer drugs etoposide, teniposide and etopophose. In spite of its enormous economic significance, genomic information on this endangered medicinal herb is scarce. We have performed de novo transcriptome analysis of methyl jasmonate (MeJA)-treated P. hexandrum cell cultures exhibiting enhanced ptox accumulation. The results revealed the maximum up-regulation of several isoforms of cinnamyl alcohol dehydrogenase (CAD). CAD catalyzes the synthesis of coniferyl alcohol and sinapyl alcohol from coniferaldehyde (CAld) and sinapaldehyde respectively. Coniferyl alcohol can produce both lignin and lignan while sinapyl alcohol produces only lignin. To isolate the CAD isoforms favoring ptox, we deduced full length cDNA sequences of four CAD isoforms: PhCAD1, PhCAD2, PhCAD3 and PhCAD4 from the contigs of the transcriptome data. In vitro enzyme assays indicated a higher affinity for CAld over sinapaldehyde for each isoform. In silico molecular docking analyses also suggested that PhCAD3 has a higher binding preference with CAld over sinapaldehyde, followed by PhCAD4, PhCAD2, and PhCAD1, respectively. The transgenic cell cultures overexpressing these isoforms independently revealed that PhCAD3 favored the maximum accumulation of ptox as compared to lignin followed by PhCAD4 and PhCAD2, whereas, PhCAD1 favored both equally. Together, our study reveals transcriptome-wide identification and characterization of ptox specific CAD isoforms from P. hexandrum. It provides a useful resource for future research not only on the ptox biosynthetic pathway but on overall P. hexandrum, an endangered medicinal herb with immense therapeutic importance.

  10. TLR and IMD signaling pathways from Caligus rogercresseyi (Crustacea: Copepoda): in silico gene expression and SNPs discovery.

    PubMed

    Valenzuela-Muñoz, V; Gallardo-Escárate, C

    2014-02-01

    The Toll and IMD signaling pathways represent one of the first lines of innate immune defense in invertebrates like Drosophila. However, for crustaceans like Caligus rogercresseyi, there is very little genomic information and, consequently, understanding of immune mechanisms. Massive sequencing data obtained for three developmental stages of C. rogercresseyi were used to evaluate in silico the expression patterns and presence of SNPs variants in genes involved in the Toll and IMD pathways. Through RNA-seq analysis, which used 20 contigs corresponding to relevant genes of the Toll and IMD pathways, an overexpression of genes linked to the Toll pathway, such as toll3 and Dorsal, were observed in the copepod stage. For the chalimus and adult stages, overexpression of genes in both pathways, such as Akirin and Tollip and IAP and Toll9, respectively, were observed. On the other hand, PCA statistical analysis inferred that in the chalimus and adult stages, the immune response mechanism was more developed, as evidenced by a relation between these two stages and the genes of both pathways. Moreover, 136 SNPs were identified for 20 contigs in genes of the Toll and IMD pathways. This study provides transcriptomic information about the immune response mechanisms of Caligus, thus providing a foundation for the development of new control strategies through blocking the innate immune response. Copyright © 2013 Elsevier Ltd. All rights reserved.

  11. A Guideline to Family-Wide Comparative State-of-the-Art Quantitative RT-PCR Analysis Exemplified with a Brassicaceae Cross-Species Seed Germination Case Study[W][OA

    PubMed Central

    Graeber, Kai; Linkies, Ada; Wood, Andrew T.A.; Leubner-Metzger, Gerhard

    2011-01-01

    Comparative biology includes the comparison of transcriptome and quantitative real-time RT-PCR (qRT-PCR) data sets in a range of species to detect evolutionarily conserved and divergent processes. Transcript abundance analysis of target genes by qRT-PCR requires a highly accurate and robust workflow. This includes reference genes with high expression stability (i.e., low intersample transcript abundance variation) for correct target gene normalization. Cross-species qRT-PCR for proper comparative transcript quantification requires reference genes suitable for different species. We addressed this issue using tissue-specific transcriptome data sets of germinating Lepidium sativum seeds to identify new candidate reference genes. We investigated their expression stability in germinating seeds of L. sativum and Arabidopsis thaliana by qRT-PCR, combined with in silico analysis of Arabidopsis and Brassica napus microarray data sets. This revealed that reference gene expression stability is higher for a given developmental process between distinct species than for distinct developmental processes within a given single species. The identified superior cross-species reference genes may be used for family-wide comparative qRT-PCR analysis of Brassicaceae seed germination. Furthermore, using germinating seeds, we exemplify optimization of the qRT-PCR workflow for challenging tissues regarding RNA quality, transcript stability, and tissue abundance. Our work therefore can serve as a guideline for moving beyond Arabidopsis by establishing high-quality cross-species qRT-PCR. PMID:21666000

  12. Transcript variations, phylogenetic tree and chromosomal localization of porcine aryl hydrocarbon receptor (AhR) and AhR nuclear translocator (ARNT) genes.

    PubMed

    Sadowska, Agnieszka; Paukszto, Lukasz; Nynca, Anna; Szczerbal, Izabela; Orlowska, Karina; Swigonska, Sylwia; Ruszkowska, Monika; Molcan, Tomasz; Jastrzebski, Jan P; Panasiewicz, Grzegorz; Ciereszko, Renata E

    2017-03-01

    Aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor best known for mediating xenobiotic-induced toxicity. AhR requires aryl hydrocarbon receptor nuclear translocator (ARNT) to form an active transcription complex and promote the activation of genes which have dioxin responsive element in their regulatory regions. The present study was performed to determine the complete cDNA sequences of porcine AhR and ARNT genes and their chromosomal localization. Total RNA from porcine livers were used to obtain the sequence of the entire porcine transcriptome by next-generation sequencing (NGS; lllumina HiSeq2500). In addition, both, in silico analysis and fluorescence in situ hybridization (FISH) were used to determine chromosomal localization of porcine AhR and ARNT genes. In silico analysis of nucleotide sequences showed that there were two transcript variants of AhR and ARNT genes in the pig. In addition, computer analysis revealed that AhR gene in the pig is located on chromosome 9 and ARNT on chromosome 4. The results of FISH experiment confirmed the localization of porcine AhR and ARNT genes. In the present study, for the first time, the full cDNAs of AhR and ARNT were demonstrated in the pig. In future, it would be interesting to determine the tissue distribution of AhR and ARNT transcript variants in the pig and to test whether these variants are associated with different biological functions and/or different activation pathways.

  13. Systems biology approaches to understand the effects of nutrition and promote health.

    PubMed

    Badimon, Lina; Vilahur, Gemma; Padro, Teresa

    2017-01-01

    Within the last years the implementation of systems biology in nutritional research has emerged as a powerful tool to understand the mechanisms by which dietary components promote health and prevent disease as well as to identify the biologically active molecules involved in such effects. Systems biology, by combining several '-omics' disciplines (mainly genomics/transcriptomics, proteomics and metabolomics), creates large data sets that upon computational integration provide in silico predictive networks that allow a more extensive analysis of the individual response to a nutritional intervention and provide a more global comprehensive understanding of how diet may influence health and disease. Numerous studies have demonstrated that diet and particularly bioactive food components play a pivotal role in helping to counteract environmental-related oxidative damage. Oxidative stress is considered to be strongly implicated in ageing and the pathophysiology of numerous diseases including neurodegenerative disease, cancers, metabolic disorders and cardiovascular diseases. In the following review we will provide insights into the role of systems biology in nutritional research and focus on transcriptomic, proteomic and metabolomics studies that have demonstrated the ability of functional foods and their bioactive components to fight against oxidative damage and contribute to health benefits. © 2016 The British Pharmacological Society.

  14. Transcriptomic Analysis of Neuropeptides and Peptide Hormones in the Barnacle Balanus amphitrite: Evidence of Roles in Larval Settlement

    PubMed Central

    Yan, Xing-Cheng; Chen, Zhang-Fan; Sun, Jin; Matsumura, Kiyotaka; Wu, Rudolf S. S.; Qian, Pei-Yuan

    2012-01-01

    The barnacle Balanus amphitrite is a globally distributed marine crustacean and has been used as a model species for intertidal ecology and biofouling studies. Its life cycle consists of seven planktonic larval stages followed by a sessile juvenile/adult stage. The transitional processes between larval stages and juveniles are crucial for barnacle development and recruitment. Although some studies have been conducted on the neuroanatomy and neuroactive substances of the barnacle, a comprehensive understanding of neuropeptides and peptide hormones remains lacking. To better characterize barnacle neuropeptidome and its potential roles in larval settlement, an in silico identification of putative transcripts encoding neuropeptides/peptide hormones was performed, based on transcriptome of the barnacle B. amphitrite that has been recently sequenced. Potential cleavage sites andstructure of mature peptides were predicted through homology search of known arthropod peptides. In total, 16 neuropeptide families/subfamilies were predicted from the barnacle transcriptome, and 14 of them were confirmed as genuine neuropeptides by Rapid Amplification of cDNA Ends. Analysis of peptide precursor structures and mature sequences showed that some neuropeptides of B. amphitrite are novel isoforms and shared similar characteristics with their homologs from insects. The expression profiling of predicted neuropeptide genes revealed that pigment dispersing hormone, SIFamide, calcitonin, and B-type allatostatin had the highest expression level in cypris stage, while tachykinin-related peptide was down regulated in both cyprids and juveniles. Furthermore, an inhibitor of proprotein convertase related to peptide maturation effectively delayed larval metamorphosis. Combination of real-time PCR results and bioassay indicated that certain neuropeptides may play an important role in cypris settlement. Overall, new insight into neuropeptides/peptide hormones characterized in this study shall provide a platform for unraveling peptidergic control of barnacle larval behavior and settlement process. PMID:23056329

  15. GigaTON: an extensive publicly searchable database providing a new reference transcriptome in the pacific oyster Crassostrea gigas.

    PubMed

    Riviere, Guillaume; Klopp, Christophe; Ibouniyamine, Nabihoudine; Huvet, Arnaud; Boudry, Pierre; Favrel, Pascal

    2015-12-02

    The Pacific oyster, Crassostrea gigas, is one of the most important aquaculture shellfish resources worldwide. Important efforts have been undertaken towards a better knowledge of its genome and transcriptome, which makes now C. gigas becoming a model organism among lophotrochozoans, the under-described sister clade of ecdysozoans within protostomes. These massive sequencing efforts offer the opportunity to assemble gene expression data and make such resource accessible and exploitable for the scientific community. Therefore, we undertook this assembly into an up-to-date publicly available transcriptome database: the GigaTON (Gigas TranscriptOme pipeliNe) database. We assembled 2204 million sequences obtained from 114 publicly available RNA-seq libraries that were realized using all embryo-larval development stages, adult organs, different environmental stressors including heavy metals, temperature, salinity and exposure to air, which were mostly performed as part of the Crassostrea gigas genome project. This data was analyzed in silico and resulted into 56621 newly assembled contigs that were deposited into a publicly available database, the GigaTON database. This database also provides powerful and user-friendly request tools to browse and retrieve information about annotation, expression level, UTRs, splice and polymorphism, and gene ontology associated to all the contigs into each, and between all libraries. The GigaTON database provides a convenient, potent and versatile interface to browse, retrieve, confront and compare massive transcriptomic information in an extensive range of conditions, tissues and developmental stages in Crassostrea gigas. To our knowledge, the GigaTON database constitutes the most extensive transcriptomic database to date in marine invertebrates, thereby a new reference transcriptome in the oyster, a highly valuable resource to physiologists and evolutionary biologists.

  16. Systems-level effects of ectopic galectin-7 reconstitution in cervical cancer and its microenvironment.

    PubMed

    Higareda-Almaraz, Juan Carlos; Ruiz-Moreno, Juan S; Klimentova, Jana; Barbieri, Daniela; Salvador-Gallego, Raquel; Ly, Regina; Valtierra-Gutierrez, Ilse A; Dinsart, Christiane; Rabinovich, Gabriel A; Stulik, Jiri; Rösl, Frank; Rincon-Orozco, Bladimiro

    2016-08-24

    Galectin-7 (Gal-7) is negatively regulated in cervical cancer, and appears to be a link between the apoptotic response triggered by cancer and the anti-tumoral activity of the immune system. Our understanding of how cervical cancer cells and their molecular networks adapt in response to the expression of Gal-7 remains limited. Meta-analysis of Gal-7 expression was conducted in three cervical cancer cohort studies and TCGA. In silico prediction and bisulfite sequencing were performed to inquire epigenetic alterations. To study the effect of Gal-7 on cervical cancer, we ectopically re-expressed it in the HeLa and SiHa cervical cancer cell lines, and analyzed their transcriptome and SILAC-based proteome. We also examined the tumor and microenvironment host cell transcriptomes after xenotransplantation into immunocompromised mice. Differences between samples were assessed with the Kruskall-Wallis, Dunn's Multiple Comparison and T tests. Kaplan-Meier and log-rank tests were used to determine overall survival. Gal-7 was constantly downregulated in our meta-analysis (p < 0.0001). Tumors with combined high Gal-7 and low galectin-1 expression (p = 0.0001) presented significantly better prognoses (p = 0.005). In silico and bisulfite sequencing assays showed de novo methylation in the Gal-7 promoter and first intron. Cells re-expressing Gal-7 showed a high apoptosis ratio (p < 0.05) and their xenografts displayed strong growth retardation (p < 0.001). Multiple gene modules and transcriptional regulators were modulated in response to Gal-7 reconstitution, both in cervical cancer cells and their microenvironments (FDR < 0.05 %). Most of these genes and modules were associated with tissue morphogenesis, metabolism, transport, chemokine activity, and immune response. These functional modules could exert the same effects in vitro and in vivo, even despite different compositions between HeLa and SiHa samples. Gal-7 re-expression affects the regulation of molecular networks in cervical cancer that are involved in diverse cancer hallmarks, such as metabolism, growth control, invasion and evasion of apoptosis. The effect of Gal-7 extends to the microenvironment, where networks involved in its configuration and in immune surveillance are particularly affected.

  17. The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans

    PubMed Central

    Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

    2015-01-01

    Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. PMID:26199191

  18. Transcriptome profiling of Diachasmimorpha longicaudata towards useful molecular tools for population management.

    PubMed

    Mannino, M Constanza; Rivarola, Máximo; Scannapieco, Alejandra C; González, Sergio; Farber, Marisa; Cladera, Jorge L; Lanzavecchia, Silvia B

    2016-10-12

    Diachasmimorpha longicaudata (Hymenoptera: Braconidae) is a solitary parasitoid of Tephritidae (Diptera) fruit flies of economic importance currently being mass-reared in bio-factories and successfully used worldwide. A peculiar biological aspect of Hymenoptera is its haplo-diploid life cycle, where females (diploid) develop from fertilized eggs and males (haploid) from unfertilized eggs. Diploid males were described in many species and recently evidenced in D. longicaudata by mean of inbreeding studies. Sex determination in this parasitoid is based on the Complementary Sex Determination (CSD) system, with alleles from at least one locus involved in early steps of this pathway. Since limited information is available about genetics of this parasitoid species, a deeper analysis on D. longicaudata's genomics is required to provide molecular tools for achieving a more cost effective production under artificial rearing conditions. We report here the first transcriptome analysis of male-larvae, adult females and adult males of D. longicaudata using 454-pyrosequencing. A total of 469766 reads were analyzed and 8483 high-quality isotigs were assembled. After functional annotation, a total of 51686 unigenes were produced, from which, 7021 isotigs and 20227 singletons had at least one BLAST hit against the NCBI non-redundant protein database. A preliminary comparison of adult female and male evidenced that 98 transcripts showed differential expression profiles, with at least a 10-fold difference. Among the functionally annotated transcripts we detected four sequences potentially involved in sex determination and three homologues to two known genes involved in the sex determination cascade. Finally, a total of 4674SimpleSequence Repeats (SSRs) were in silico identified and characterized. The information obtained here will significantly contribute to the development of D. longicaudata functional genomics, genetics and population-based genome studies. Thousands of new microsatellite markers were identified as toolkits for population genetics analysis. The transcriptome characterized here is the starting point to elucidate the molecular bases of the sex determination mechanism in this species.

  19. Structural characterization of a novel peptide with antimicrobial activity from the venom gland of the scorpion Tityus stigmurus: Stigmurin.

    PubMed

    de Melo, Edinara Targino; Estrela, Andréia Bergamo; Santos, Elizabeth Cristina Gomes; Machado, Paula Renata Lima; Farias, Kleber Juvenal Silva; Torres, Taffarel Melo; Carvalho, Enéas; Lima, João Paulo Matos Santos; Silva-Júnior, Arnóbio Antonio; Barbosa, Euzébio Guimarães; Fernandes-Pedrosa, Matheus de Freitas

    2015-06-01

    A new antimicrobial peptide, herein named Stigmurin, was selected based on a transcriptomic analysis of the Brazilian yellow scorpion Tityus stigmurus venom gland, an underexplored source for toxic peptides with possible biotechnological applications. Stigmurin was investigated in silico, by circular dichroism (CD) spectroscopy, and in vitro. The CD spectra suggested that this peptide interacts with membranes, changing its conformation in the presence of an amphipathic environment, with predominance of random coil and beta-sheet structures. Stigmurin exhibited antibacterial and antifungal activity, with minimal inhibitory concentrations ranging from 8.7 to 69.5μM. It was also showed that Stigmurin is toxic against SiHa and Vero E6 cell lines. The results suggest that Stigmurin can be considered a potential anti-infective drug. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Integrative transcriptome, proteome, phosphoproteome and genetic mapping reveals new aspects in a fiberless mutant of cotton

    PubMed Central

    Ma, Qi-Feng; Wu, Chun-Hui; Wu, Man; Pei, Wen-Feng; Li, Xing-Li; Wang, Wen-Kui; Zhang, Jinfa; Yu, Ji-Wen; Yu, Shu-Xun

    2016-01-01

    To investigate the molecular mechanisms of fiber initiation in cotton (Gossypium spp.), an integrated approach combining transcriptome, iTRAQ-based proteome and genetic mapping was taken to compare the ovules of the Xuzhou 142 wild type (WT) with its fuzzless-lintless (fl) mutant at −3 and 0 day post-anthesis. A total of 1,953 mRNAs, 187 proteins, and 131 phosphoproteins were differentially expressed (DE) between WT and fl, and the levels of transcripts and their encoded proteins and phosphoproteins were highly congruent. A functional analysis suggested that the abundance of proteins were mainly involved in amino sugar, nucleotide sugar and fatty acid metabolism, one carbon pool for folate metabolism and flavonoid biosynthesis. qRT-PCR, Western blotting, and enzymatic assays were performed to confirm the regulation of these transcripts and proteins. A molecular mapping located the lintless gene li3 in the fl mutant on chromosome 26 for the first time. A further in-silico physical mapping of DE genes with sequence variations between fl and WT identified one and four candidate genes in the li3 and n2 regions, respectively. Taken together, the transcript abundance, phosphorylation status of proteins at the fiber initiation stage and candidate genes have provided insights into regulatory processes underlying cotton fiber initiation. PMID:27075604

  1. Integrative biology approach identifies cytokine targeting strategies for psoriasis.

    PubMed

    Perera, Gayathri K; Ainali, Chrysanthi; Semenova, Ekaterina; Hundhausen, Christian; Barinaga, Guillermo; Kassen, Deepika; Williams, Andrew E; Mirza, Muddassar M; Balazs, Mercedesz; Wang, Xiaoting; Rodriguez, Robert Sanchez; Alendar, Andrej; Barker, Jonathan; Tsoka, Sophia; Ouyang, Wenjun; Nestle, Frank O

    2014-02-12

    Cytokines are critical checkpoints of inflammation. The treatment of human autoimmune disease has been revolutionized by targeting inflammatory cytokines as key drivers of disease pathogenesis. Despite this, there exist numerous pitfalls when translating preclinical data into the clinic. We developed an integrative biology approach combining human disease transcriptome data sets with clinically relevant in vivo models in an attempt to bridge this translational gap. We chose interleukin-22 (IL-22) as a model cytokine because of its potentially important proinflammatory role in epithelial tissues. Injection of IL-22 into normal human skin grafts produced marked inflammatory skin changes resembling human psoriasis. Injection of anti-IL-22 monoclonal antibody in a human xenotransplant model of psoriasis, developed specifically to test potential therapeutic candidates, efficiently blocked skin inflammation. Bioinformatic analysis integrating both the IL-22 and anti-IL-22 cytokine transcriptomes and mapping them onto a psoriasis disease gene coexpression network identified key cytokine-dependent hub genes. Using knockout mice and small-molecule blockade, we show that one of these hub genes, the so far unexplored serine/threonine kinase PIM1, is a critical checkpoint for human skin inflammation and potential future therapeutic target in psoriasis. Using in silico integration of human data sets and biological models, we were able to identify a new target in the treatment of psoriasis.

  2. Leaf transcriptome of two highly divergent genotypes of Urochloa humidicola (Poaceae), a tropical polyploid forage grass adapted to acidic soils and temporary flooding areas.

    PubMed

    Vigna, Bianca Baccili Zanotto; de Oliveira, Fernanda Ancelmo; de Toledo-Silva, Guilherme; da Silva, Carla Cristina; do Valle, Cacilda Borges; de Souza, Anete Pereira

    2016-11-11

    Urochloa humidicola (Koronivia grass) is a polyploid (6x to 9x) species that is used as forage in the tropics. Facultative apospory apomixis is present in most of the genotypes of this species, although one individual has been described as sexual. Molecular studies have been restricted to molecular marker approaches for genetic diversity estimations and linkage map construction. The objectives of the present study were to describe and compare the leaf transcriptome of two important genotypes that are highly divergent in terms of their phenotypes and reproduction modes: the sexual BH031 and the aposporous apomictic cultivar BRS Tupi. We sequenced the leaf transcriptome of Koronivia grass using an Illumina GAIIx system, which produced 13.09 Gb of data that consisted of 163,575,526 paired-end reads between the two libraries. We de novo-assembled 76,196 transcripts with an average length of 1,152 bp and filtered 35,093 non-redundant unigenes. A similarity search against the non-redundant National Center of Biotechnology Information (NCBI) protein database returned 65 % hits. We annotated 24,133 unigenes in the Phytozome database and 14,082 unigenes in the UniProtKB/Swiss-Prot database, assigned 108,334 gene ontology terms to 17,255 unigenes and identified 5,324 unigenes in 327 known metabolic pathways. Comparisons with other grasses via a reciprocal BLAST search revealed a larger number of orthologous genes for the Panicum species. The unigenes were involved in C4 photosynthesis, lignocellulose biosynthesis and flooding stress responses. A search for functional molecular markers revealed 4,489 microsatellites and 560,298 single nucleotide polymorphisms (SNPs). A quantitative real-time PCR analysis validated the RNA-seq expression analysis and allowed for the identification of transcriptomic differences between the two evaluated genotypes. Moreover, 192 unannotated sequences were classified as containing complete open reading frames, suggesting that the new, potentially exclusive genes should be further investigated. The present study represents the first whole-transcriptome sequencing of U. humidicola leaves, providing an important public information source of transcripts and functional molecular markers. The qPCR analysis indicated that the expression of certain transcripts confirmed the differential expression observed in silico, which demonstrated that RNA-seq is useful for identifying differentially expressed and unique genes. These results corroborate the findings from previous studies and suggest a hybrid origin for BH031.

  3. The phosphoproteome of toll-like receptor-activated macrophages

    PubMed Central

    Weintz, Gabriele; Olsen, Jesper V; Frühauf, Katja; Niedzielska, Magdalena; Amit, Ido; Jantsch, Jonathan; Mages, Jörg; Frech, Cornelie; Dölken, Lars; Mann, Matthias; Lang, Roland

    2010-01-01

    Recognition of microbial danger signals by toll-like receptors (TLR) causes re-programming of macrophages. To investigate kinase cascades triggered by the TLR4 ligand lipopolysaccharide (LPS) on systems level, we performed a global, quantitative and kinetic analysis of the phosphoproteome of primary macrophages using stable isotope labelling with amino acids in cell culture, phosphopeptide enrichment and high-resolution mass spectrometry. In parallel, nascent RNA was profiled to link transcription factor (TF) phosphorylation to TLR4-induced transcriptional activation. We reproducibly identified 1850 phosphoproteins with 6956 phosphorylation sites, two thirds of which were not reported earlier. LPS caused major dynamic changes in the phosphoproteome (24% up-regulation and 9% down-regulation). Functional bioinformatic analyses confirmed canonical players of the TLR pathway and highlighted other signalling modules (e.g. mTOR, ATM/ATR kinases) and the cytoskeleton as hotspots of LPS-regulated phosphorylation. Finally, weaving together phosphoproteome and nascent transcriptome data by in silico promoter analysis, we implicated several phosphorylated TFs in primary LPS-controlled gene expression. PMID:20531401

  4. De novo transcriptome sequencing reveals a considerable bias in the incidence of simple sequence repeats towards the downstream of 'Pre-miRNAs' of black pepper.

    PubMed

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of '43 pre-miRNA candidates bearing different types of SSR motifs'. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted 'pre-miRNA candidates bearing SSRs'. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted 'pre-miRNA candidates'. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of 'tandem repeats' in miRNAs.

  5. De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

    PubMed Central

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176

  6. The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans.

    PubMed

    Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

    2015-07-20

    Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Meta-Analysis of Global Transcriptomics Suggests that Conserved Genetic Pathways are Responsible for Quercetin and Tannic Acid Mediated Longevity in C. elegans

    PubMed Central

    Pietsch, Kerstin; Saul, Nadine; Swain, Suresh C.; Menzel, Ralph; Steinberg, Christian E. W.; Stürzenbaum, Stephen R.

    2012-01-01

    Recent research has highlighted that the polyphenols Quercetin and Tannic acid are capable of extending the lifespan of Caenorhabditis elegans. To gain a deep understanding of the underlying molecular genetics, we analyzed the global transcriptional patterns of nematodes exposed to three concentrations of Quercetin or Tannic acid, respectively. By means of an intricate meta-analysis it was possible to compare the transcriptomes of polyphenol exposure to recently published datasets derived from (i) longevity mutants or (ii) infection. This detailed comparative in silico analysis facilitated the identification of compound specific and overlapping transcriptional profiles and allowed the prediction of putative mechanistic models of Quercetin and Tannic acid mediated longevity. Lifespan extension due to Quercetin was predominantly driven by the metabolome, TGF-beta signaling, Insulin-like signaling, and the p38 MAPK pathway and Tannic acid’s impact involved, in part, the amino acid metabolism and was modulated by the TGF-beta and the p38 MAPK pathways. DAF-12, which integrates TGF-beta and Insulin-like downstream signaling, and genetic players of the p38 MAPK pathway therefore seem to be crucial regulators for both polyphenols. Taken together, this study underlines how meta-analyses can provide an insight of molecular events that go beyond the traditional categorization into gene ontology-terms and Kyoto encyclopedia of genes and genomes-pathways. It also supports the call to expand the generation of comparative and integrative databases, an effort that is currently still in its infancy. PMID:22493606

  8. RNA-seq analysis identifies potential modulators of gravity response in spores of Ceratopteris (Parkeriaceae): evidence for modulation by calcium pumps and apyrase activity.

    PubMed

    Bushart, Thomas J; Cannon, Ashley E; Ul Haque, Aeraj; San Miguel, Phillip; Mostajeran, Kathy; Clark, Gregory B; Porterfield, D Marshall; Roux, Stanley J

    2013-01-01

    Gravity regulates the magnitude and direction of a trans-cell calcium current in germinating spores of Ceratopteris richardii. Blocking this current with nifedipine blocks the spore's downward polarity alignment, a polarization that is fixed by gravity ∼10 h after light induces the spores to germinate. RNA-seq analysis at 10 h was used to identify genes potentially important for the gravity response. The data set will be valuable for other developmental and phylogenetic studies. De novo Newbler assembly of 958 527 reads from Roche 454 sequencing was executed. The sequences were identified and analyzed using in silico methods. The roles of endomembrane Ca(2+)-ATPase pumps and apyrases in the gravity response were further tested using pharmacological agents. Transcripts related to calcium signaling and ethylene biosynthesis were identified as notable constituents of the transcriptome. Inhibiting the activity of endomembrane Ca(2+)-ATPase pumps with 2,5-di-(t-butyl)-1,4-hydroquinone diminished the trans-cell current, but increased the orientation of the polar axis to gravity. The effects of applied nucleotides and purinoceptor antagonists gave novel evidence implicating extracellular nucleotides as regulators of the gravity response in these fern spores. In addition to revealing general features of the transcriptome of germinating spores, the results highlight a number of calcium-responsive and light-receptive transcripts. Pharmacologic assays indicate endomembrane Ca(2+)-ATPases and extracellular nucleotides may play regulatory roles in the gravity response of Ceratopteris spores.

  9. Analyses of advanced rice anther transcriptomes reveal global tapetum secretory functions and potential proteins for lipid exine formation.

    PubMed

    Huang, Ming-Der; Wei, Fu-Jin; Wu, Cheng-Cheih; Hsing, Yue-Ie Caroline; Huang, Anthony H C

    2009-02-01

    The anthers in flowers perform important functions in sexual reproduction. Several recent studies used microarrays to study anther transcriptomes to explore genes controlling anther development. To analyze the secretion and other functions of the tapetum, we produced transcriptomes of anthers of rice (Oryza sativa subsp. japonica) at six progressive developmental stages and pollen with sequencing-by-synthesis technology. The transcriptomes included at least 18,000 unique transcripts, about 25% of which had antisense transcripts. In silico anther-minus-pollen subtraction produced transcripts largely unique to the tapetum; these transcripts include all the reported tapetum-specific transcripts of orthologs in other species. The differential developmental profiles of the transcripts and their antisense transcripts signify extensive regulation of gene expression in the anther, especially the tapetum, during development. The transcriptomes were used to dissect two major cell/biochemical functions of the tapetum. First, we categorized and charted the developmental profiles of all transcripts encoding secretory proteins present in the cellular exterior; these transcripts represent about 12% and 30% of the those transcripts having more than 100 and 1,000 transcripts per million, respectively. Second, we successfully selected from hundreds of transcripts several transcripts encoding potential proteins for lipid exine synthesis during early anther development. These proteins include cytochrome P450, acyltransferases, and lipid transfer proteins in our hypothesized mechanism of exine synthesis in and export from the tapetum. Putative functioning of these proteins in exine formation is consistent with proteins and metabolites detected in the anther locule fluid obtained by micropipetting.

  10. New insights into plant glycoside hydrolase family 32 in Agave species

    PubMed Central

    Avila de Dios, Emmanuel; Gomez Vargas, Alan D.; Damián Santos, Maura L.; Simpson, June

    2015-01-01

    In order to optimize the use of agaves for commercial applications, an understanding of fructan metabolism in these species at the molecular and genetic level is essential. Based on transcriptome data, this report describes the identification and molecular characterization of cDNAs and deduced amino acid sequences for genes encoding fructosyltransferases, invertases and fructan exohydrolases (FEH) (enzymes belonging to plant glycoside hydrolase family 32) from four different agave species (A. tequilana, A. deserti, A. victoriae-reginae, and A. striata). Conserved amino acid sequences and a hypervariable domain allowed classification of distinct isoforms for each enzyme type. Notably however neither 1-FFT nor 6-SFT encoding cDNAs were identified. In silico analysis revealed that distinct isoforms for certain enzymes found in a single species, showed different levels and tissue specific patterns of expression whereas in other cases expression patterns were conserved both within the species and between different species. Relatively high levels of in silico expression for specific isoforms of both invertases and fructosyltransferases were observed in floral tissues in comparison to vegetative tissues such as leaves and stems and this pattern was confirmed by Quantitative Real Time PCR using RNA obtained from floral and leaf tissue of A. tequilana. Thin layer chromatography confirmed the presence of fructans with degree of polymerization (DP) greater than DP three in both immature buds and fully opened flowers also obtained from A. tequilana. PMID:26300895

  11. New insights into plant glycoside hydrolase family 32 in Agave species.

    PubMed

    Avila de Dios, Emmanuel; Gomez Vargas, Alan D; Damián Santos, Maura L; Simpson, June

    2015-01-01

    In order to optimize the use of agaves for commercial applications, an understanding of fructan metabolism in these species at the molecular and genetic level is essential. Based on transcriptome data, this report describes the identification and molecular characterization of cDNAs and deduced amino acid sequences for genes encoding fructosyltransferases, invertases and fructan exohydrolases (FEH) (enzymes belonging to plant glycoside hydrolase family 32) from four different agave species (A. tequilana, A. deserti, A. victoriae-reginae, and A. striata). Conserved amino acid sequences and a hypervariable domain allowed classification of distinct isoforms for each enzyme type. Notably however neither 1-FFT nor 6-SFT encoding cDNAs were identified. In silico analysis revealed that distinct isoforms for certain enzymes found in a single species, showed different levels and tissue specific patterns of expression whereas in other cases expression patterns were conserved both within the species and between different species. Relatively high levels of in silico expression for specific isoforms of both invertases and fructosyltransferases were observed in floral tissues in comparison to vegetative tissues such as leaves and stems and this pattern was confirmed by Quantitative Real Time PCR using RNA obtained from floral and leaf tissue of A. tequilana. Thin layer chromatography confirmed the presence of fructans with degree of polymerization (DP) greater than DP three in both immature buds and fully opened flowers also obtained from A. tequilana.

  12. Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm

    PubMed Central

    Seaver, Samuel M. D.; Bradbury, Louis M. T.; Frelin, Océane; Zarecki, Raphy; Ruppin, Eytan; Hanson, Andrew D.; Henry, Christopher S.

    2015-01-01

    There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions and possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes. PMID:25806041

  13. Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm

    DOE PAGES

    Seaver, Samuel M.D.; Bradbury, Louis M.T.; Frelin, Océane; ...

    2015-03-10

    There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions andmore » possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes.« less

  14. Expansion of the neuropeptidome of the globally invasive marine crab Carcinus maenas.

    PubMed

    Christie, Andrew E

    2016-09-01

    Carcinus maenas is widely recognized as one of the world's most successful marine invasive species; its success as an invader is due largely to its ability to thrive under varied environmental conditions. The physiological/behavioral control systems that allow C. maenas to adapt to new environments are undoubtedly under hormonal control, the largest single class of hormones being peptides. While numerous studies have focused on identifying native C. maenas peptides, none has taken advantage of mining transcriptome shotgun assembly (TSA) sequence data, a strategy proven highly successful for peptide discovery in other crustaceans. Here, a C. maenas peptidome was predicted via in silico transcriptome mining. Thirty-seven peptide families were searched for in the extant TSA database, with transcripts encoding precursors for 29 groups identified. The pre/preprohormones deduced from the identified sequences allowed for the prediction of 263 distinct mature peptides, 193 of which are new discoveries for C. maenas. The predicted peptides include isoforms of adipokinetic hormone-corazonin-like peptide, allatostatin A, allatostatin B, allatostatin C, bursicon, CCHamide, corazonin, crustacean cardioactive peptide, crustacean hyperglycemic hormone, diuretic hormone 31, diuretic hormone 44, eclosion hormone, FMRFamide-like peptide, HIGSLYRamide, intocin, leucokinin, myosuppressin, neuroparsin, neuropeptide F, orcokinin, pigment dispersing hormone, proctolin, pyrokinin, red pigment concentrating hormone, RYamide, short neuropeptide F, SIFamide, and tachykinin-related peptide. This peptidome is the largest predicted from any single crustacean using the in silico approach, and provides a platform for investigating peptidergic signaling in C. maenas, including control of the processes that allow for its success as a global marine invader. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis

    PubMed Central

    Gadelha, Luiz; Ribeiro-Alves, Marcelo; Porto, Fábio

    2017-01-01

    There are many steps in analyzing transcriptome data, from the acquisition of raw data to the selection of a subset of representative genes that explain a scientific hypothesis. The data produced can be represented as networks of interactions among genes and these may additionally be integrated with other biological databases, such as Protein-Protein Interactions, transcription factors and gene annotation. However, the results of these analyses remain fragmented, imposing difficulties, either for posterior inspection of results, or for meta-analysis by the incorporation of new related data. Integrating databases and tools into scientific workflows, orchestrating their execution, and managing the resulting data and its respective metadata are challenging tasks. Additionally, a great amount of effort is equally required to run in-silico experiments to structure and compose the information as needed for analysis. Different programs may need to be applied and different files are produced during the experiment cycle. In this context, the availability of a platform supporting experiment execution is paramount. We present GeNNet, an integrated transcriptome analysis platform that unifies scientific workflows with graph databases for selecting relevant genes according to the evaluated biological systems. It includes GeNNet-Wf, a scientific workflow that pre-loads biological data, pre-processes raw microarray data and conducts a series of analyses including normalization, differential expression inference, clusterization and gene set enrichment analysis. A user-friendly web interface, GeNNet-Web, allows for setting parameters, executing, and visualizing the results of GeNNet-Wf executions. To demonstrate the features of GeNNet, we performed case studies with data retrieved from GEO, particularly using a single-factor experiment in different analysis scenarios. As a result, we obtained differentially expressed genes for which biological functions were analyzed. The results are integrated into GeNNet-DB, a database about genes, clusters, experiments and their properties and relationships. The resulting graph database is explored with queries that demonstrate the expressiveness of this data model for reasoning about gene interaction networks. GeNNet is the first platform to integrate the analytical process of transcriptome data with graph databases. It provides a comprehensive set of tools that would otherwise be challenging for non-expert users to install and use. Developers can add new functionality to components of GeNNet. The derived data allows for testing previous hypotheses about an experiment and exploring new ones through the interactive graph database environment. It enables the analysis of different data on humans, rhesus, mice and rat coming from Affymetrix platforms. GeNNet is available as an open source platform at https://github.com/raquele/GeNNet and can be retrieved as a software container with the command docker pull quelopes/gennet. PMID:28695067

  16. GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis.

    PubMed

    Costa, Raquel L; Gadelha, Luiz; Ribeiro-Alves, Marcelo; Porto, Fábio

    2017-01-01

    There are many steps in analyzing transcriptome data, from the acquisition of raw data to the selection of a subset of representative genes that explain a scientific hypothesis. The data produced can be represented as networks of interactions among genes and these may additionally be integrated with other biological databases, such as Protein-Protein Interactions, transcription factors and gene annotation. However, the results of these analyses remain fragmented, imposing difficulties, either for posterior inspection of results, or for meta-analysis by the incorporation of new related data. Integrating databases and tools into scientific workflows, orchestrating their execution, and managing the resulting data and its respective metadata are challenging tasks. Additionally, a great amount of effort is equally required to run in-silico experiments to structure and compose the information as needed for analysis. Different programs may need to be applied and different files are produced during the experiment cycle. In this context, the availability of a platform supporting experiment execution is paramount. We present GeNNet, an integrated transcriptome analysis platform that unifies scientific workflows with graph databases for selecting relevant genes according to the evaluated biological systems. It includes GeNNet-Wf, a scientific workflow that pre-loads biological data, pre-processes raw microarray data and conducts a series of analyses including normalization, differential expression inference, clusterization and gene set enrichment analysis. A user-friendly web interface, GeNNet-Web, allows for setting parameters, executing, and visualizing the results of GeNNet-Wf executions. To demonstrate the features of GeNNet, we performed case studies with data retrieved from GEO, particularly using a single-factor experiment in different analysis scenarios. As a result, we obtained differentially expressed genes for which biological functions were analyzed. The results are integrated into GeNNet-DB, a database about genes, clusters, experiments and their properties and relationships. The resulting graph database is explored with queries that demonstrate the expressiveness of this data model for reasoning about gene interaction networks. GeNNet is the first platform to integrate the analytical process of transcriptome data with graph databases. It provides a comprehensive set of tools that would otherwise be challenging for non-expert users to install and use. Developers can add new functionality to components of GeNNet. The derived data allows for testing previous hypotheses about an experiment and exploring new ones through the interactive graph database environment. It enables the analysis of different data on humans, rhesus, mice and rat coming from Affymetrix platforms. GeNNet is available as an open source platform at https://github.com/raquele/GeNNet and can be retrieved as a software container with the command docker pull quelopes/gennet.

  17. Transcriptome-wide single nucleotide polymorphisms (SNPs) for abalone (Haliotis midae): validation and application using GoldenGate medium-throughput genotyping assays.

    PubMed

    Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay

    2013-09-23

    Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.

  18. Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction.

    PubMed

    Foth, Bernardo J; Tsai, Isheng J; Reid, Adam J; Bancroft, Allison J; Nichol, Sarah; Tracey, Alan; Holroyd, Nancy; Cotton, James A; Stanley, Eleanor J; Zarowiecki, Magdalena; Liu, Jimmy Z; Huckvale, Thomas; Cooper, Philip J; Grencis, Richard K; Berriman, Matthew

    2014-07-01

    Whipworms are common soil-transmitted helminths that cause debilitating chronic infections in man. These nematodes are only distantly related to Caenorhabditis elegans and have evolved to occupy an unusual niche, tunneling through epithelial cells of the large intestine. We report here the whole-genome sequences of the human-infective Trichuris trichiura and the mouse laboratory model Trichuris muris. On the basis of whole-transcriptome analyses, we identify many genes that are expressed in a sex- or life stage-specific manner and characterize the transcriptional landscape of a morphological region with unique biological adaptations, namely, bacillary band and stichosome, found only in whipworms and related parasites. Using RNA sequencing data from whipworm-infected mice, we describe the regulated T helper 1 (TH1)-like immune response of the chronically infected cecum in unprecedented detail. In silico screening identified numerous new potential drug targets against trichuriasis. Together, these genomes and associated functional data elucidate key aspects of the molecular host-parasite interactions that define chronic whipworm infection.

  19. Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction

    PubMed Central

    Nichol, Sarah; Tracey, Alan; Holroyd, Nancy; Cotton, James A.; Stanley, Eleanor J.; Zarowiecki, Magdalena; Liu, Jimmy Z.; Huckvale, Thomas; Cooper, Philip J.; Grencis, Richard K.; Berriman, Matthew

    2014-01-01

    Whipworms are common soil-transmitted helminths that cause debilitating chronic infections in man. These nematodes are only distantly related to Caenorhabditis elegans and have evolved to occupy an unusual niche, tunneling through epithelial cells of the large intestine. Here we present the genome sequences of the human-infective Trichuris trichiura and the murine laboratory model T. muris. Based on whole transcriptome analyses we identify many genes that are expressed in a gender- or life stage-specific manner and characterise the transcriptional landscape of a morphological region with unique biological adaptations, namely bacillary band and stichosome, found only in whipworms and related parasites. Using RNAseq data from whipworm-infected mice we describe the regulated Th1-like immune response of the chronically infected cecum in unprecedented detail. In silico screening identifies numerous potential new drug targets against trichuriasis. Together, these genomes and associated functional data elucidate key aspects of the molecular host-parasite interactions that define chronic whipworm infection. PMID:24929830

  20. De novo assembly and characterization of leaf transcriptome for the development of functional molecular markers of the extremophile multipurpose tree species Prosopis alba

    PubMed Central

    2013-01-01

    Background Prosopis alba (Fabaceae) is an important native tree adapted to arid and semiarid regions of north-western Argentina which is of great value as multipurpose species. Despite its importance, the genomic resources currently available for the entire Prosopis genus are still limited. Here we describe the development of a leaf transcriptome and the identification of new molecular markers that could support functional genetic studies in natural and domesticated populations of this genus. Results Next generation DNA pyrosequencing technology applied to P. alba transcripts produced a total of 1,103,231 raw reads with an average length of 421 bp. De novo assembling generated a set of 15,814 isotigs and 71,101 non-assembled sequences (singletons) with an average of 991 bp and 288 bp respectively. A total of 39,000 unique singletons were identified after clustering natural and artificial duplicates from pyrosequencing reads. Regarding the non-redundant sequences or unigenes, 22,095 out of 54,814 were successfully annotated with Gene Ontology terms. Moreover, simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 5,992 and 6,236 markers, respectively, throughout the genome. For the validation of the the predicted SSR markers, a subset of 87 SSRs selected through functional annotation evidence was successfully amplified from six DNA samples of seedlings. From this analysis, 11 of these 87 SSRs were identified as polymorphic. Additionally, another set of 123 nuclear polymorphic SSRs were determined in silico, of which 50% have the probability of being effectively polymorphic. Conclusions This study generated a successful global analysis of the P. alba leaf transcriptome after bioinformatic and wet laboratory validations of RNA-Seq data. The limited set of molecular markers currently available will be significantly increased with the thousands of new markers that were identified in this study. This information will strongly contribute to genomics resources for P. alba functional analysis and genetics. Finally, it will also potentially contribute to the development of population-based genome studies in the genera. PMID:24125525

  1. Comparative Analysis and Modeling of the Severity of Steatohepatitis in DDC-Treated Mouse Strains

    PubMed Central

    Pandey, Vikash; Sultan, Marc; Kashofer, Karl; Ralser, Meryem; Amstislavskiy, Vyacheslav; Starmann, Julia; Osprian, Ingrid; Grimm, Christina; Hache, Hendrik; Yaspo, Marie-Laure; Sültmann, Holger; Trauner, Michael; Denk, Helmut; Zatloukal, Kurt; Lehrach, Hans; Wierling, Christoph

    2014-01-01

    Background Non-alcoholic fatty liver disease (NAFLD) has a broad spectrum of disease states ranging from mild steatosis characterized by an abnormal retention of lipids within liver cells to steatohepatitis (NASH) showing fat accumulation, inflammation, ballooning and degradation of hepatocytes, and fibrosis. Ultimately, steatohepatitis can result in liver cirrhosis and hepatocellular carcinoma. Methodology and Results In this study we have analyzed three different mouse strains, A/J, C57BL/6J, and PWD/PhJ, that show different degrees of steatohepatitis when administered a 3,5-diethoxycarbonyl-1,4-dihydrocollidine (DDC) containing diet. RNA-Seq gene expression analysis, protein analysis and metabolic profiling were applied to identify differentially expressed genes/proteins and perturbed metabolite levels of mouse liver samples upon DDC-treatment. Pathway analysis revealed alteration of arachidonic acid (AA) and S-adenosylmethionine (SAMe) metabolism upon other pathways. To understand metabolic changes of arachidonic acid metabolism in the light of disease expression profiles a kinetic model of this pathway was developed and optimized according to metabolite levels. Subsequently, the model was used to study in silico effects of potential drug targets for steatohepatitis. Conclusions We identified AA/eicosanoid metabolism as highly perturbed in DDC-induced mice using a combination of an experimental and in silico approach. Our analysis of the AA/eicosanoid metabolic pathway suggests that 5-hydroxyeicosatetraenoic acid (5-HETE), 15-hydroxyeicosatetraenoic acid (15-HETE) and prostaglandin D2 (PGD2) are perturbed in DDC mice. We further demonstrate that a dynamic model can be used for qualitative prediction of metabolic changes based on transcriptomics data in a disease-related context. Furthermore, SAMe metabolism was identified as being perturbed due to DDC treatment. Several genes as well as some metabolites of this module show differences between A/J and C57BL/6J on the one hand and PWD/PhJ on the other. PMID:25347188

  2. Comparative analysis and modeling of the severity of steatohepatitis in DDC-treated mouse strains.

    PubMed

    Pandey, Vikash; Sultan, Marc; Kashofer, Karl; Ralser, Meryem; Amstislavskiy, Vyacheslav; Starmann, Julia; Osprian, Ingrid; Grimm, Christina; Hache, Hendrik; Yaspo, Marie-Laure; Sültmann, Holger; Trauner, Michael; Denk, Helmut; Zatloukal, Kurt; Lehrach, Hans; Wierling, Christoph

    2014-01-01

    Non-alcoholic fatty liver disease (NAFLD) has a broad spectrum of disease states ranging from mild steatosis characterized by an abnormal retention of lipids within liver cells to steatohepatitis (NASH) showing fat accumulation, inflammation, ballooning and degradation of hepatocytes, and fibrosis. Ultimately, steatohepatitis can result in liver cirrhosis and hepatocellular carcinoma. In this study we have analyzed three different mouse strains, A/J, C57BL/6J, and PWD/PhJ, that show different degrees of steatohepatitis when administered a 3,5-diethoxycarbonyl-1,4-dihydrocollidine (DDC) containing diet. RNA-Seq gene expression analysis, protein analysis and metabolic profiling were applied to identify differentially expressed genes/proteins and perturbed metabolite levels of mouse liver samples upon DDC-treatment. Pathway analysis revealed alteration of arachidonic acid (AA) and S-adenosylmethionine (SAMe) metabolism upon other pathways. To understand metabolic changes of arachidonic acid metabolism in the light of disease expression profiles a kinetic model of this pathway was developed and optimized according to metabolite levels. Subsequently, the model was used to study in silico effects of potential drug targets for steatohepatitis. We identified AA/eicosanoid metabolism as highly perturbed in DDC-induced mice using a combination of an experimental and in silico approach. Our analysis of the AA/eicosanoid metabolic pathway suggests that 5-hydroxyeicosatetraenoic acid (5-HETE), 15-hydroxyeicosatetraenoic acid (15-HETE) and prostaglandin D2 (PGD2) are perturbed in DDC mice. We further demonstrate that a dynamic model can be used for qualitative prediction of metabolic changes based on transcriptomics data in a disease-related context. Furthermore, SAMe metabolism was identified as being perturbed due to DDC treatment. Several genes as well as some metabolites of this module show differences between A/J and C57BL/6J on the one hand and PWD/PhJ on the other.

  3. Dual Transcriptomic Profiling of Host and Microbiota during Health and Disease in Pediatric Asthma.

    PubMed

    Pérez-Losada, Marcos; Castro-Nallar, Eduardo; Bendall, Matthew L; Freishtat, Robert J; Crandall, Keith A

    2015-01-01

    High-throughput sequencing (HTS) analysis of microbial communities from the respiratory airways has heavily relied on the 16S rRNA gene. Given the intrinsic limitations of this approach, airway microbiome research has focused on assessing bacterial composition during health and disease, and its variation in relation to clinical and environmental factors, or other microbiomes. Consequently, very little effort has been dedicated to describing the functional characteristics of the airway microbiota and even less to explore the microbe-host interactions. Here we present a simultaneous assessment of microbiome and host functional diversity and host-microbe interactions from the same RNA-seq experiment, while accounting for variation in clinical metadata. Transcriptomic (host) and metatranscriptomic (microbiota) sequences from the nasal epithelium of 8 asthmatics and 6 healthy controls were separated in silico and mapped to available human and NCBI-NR protein reference databases. Human genes differentially expressed in asthmatics and controls were then used to infer upstream regulators involved in immune and inflammatory responses. Concomitantly, microbial genes were mapped to metabolic databases (COG, SEED, and KEGG) to infer microbial functions differentially expressed in asthmatics and controls. Finally, multivariate analysis was applied to find associations between microbiome characteristics and host upstream regulators while accounting for clinical variation. Our study showed significant differences in the metabolism of microbiomes from asthmatic and non-asthmatic children for up to 25% of the functional properties tested. Enrichment analysis of 499 differentially expressed host genes for inflammatory and immune responses revealed 43 upstream regulators differentially activated in asthma. Microbial adhesion (virulence) and Proteobacteria abundance were significantly associated with variation in the expression of the upstream regulator IL1A; suggesting that microbiome characteristics modulate host inflammatory and immune systems during asthma.

  4. Transcriptome analysis of resistant soybean roots infected by Meloidogyne javanica

    PubMed Central

    de Sá, Maria Eugênia Lisei; Conceição Lopes, Marcus José; de Araújo Campos, Magnólia; Paiva, Luciano Vilela; dos Santos, Regina Maria Amorim; Beneventi, Magda Aparecida; Firmino, Alexandre Augusto Pereira; de Sá, Maria Fátima Grossi

    2012-01-01

    Soybean is an important crop for Brazilian agribusiness. However, many factors can limit its production, especially root-knot nematode infection. Studies on the mechanisms employed by the resistant soybean genotypes to prevent infection by these nematodes are of great interest for breeders. For these reasons, the aim of this work is to characterize the transcriptome of soybean line PI 595099-Meloidogyne javanica interaction through expression analysis. Two cDNA libraries were obtained using a pool of RNA from PI 595099 uninfected and M. javanica (J2) infected roots, collected at 6, 12, 24, 48, 96, 144 and 192 h after inoculation. Around 800 ESTs (Expressed Sequence Tags) were sequenced and clustered into 195 clusters. In silico subtraction analysis identified eleven differentially expressed genes encoding putative proteins sharing amino acid sequence similarities by using BlastX: metallothionein, SLAH4 (SLAC1 Homologue 4), SLAH1 (SLAC1 Homologue 1), zinc-finger proteins, AN1-type proteins, auxin-repressed proteins, thioredoxin and nuclear transport factor 2 (NTF-2). Other genes were also found exclusively in nematode stressed soybean roots, such as NAC domain-containing proteins, MADS-box proteins, SOC1 (suppressor of overexpression of constans 1) proteins, thioredoxin-like protein 4-Coumarate-CoA ligase and the transcription factor (TF) MYBZ2. Among the genes identified in non-stressed roots only were Ser/Thr protein kinases, wound-induced basic protein, ethylene-responsive family protein, metallothionein-like protein cysteine proteinase inhibitor (cystatin) and Putative Kunitz trypsin protease inhibitor. An understanding of the roles of these differentially expressed genes will provide insights into the resistance mechanisms and candidate genes involved in soybean-M. javanica interaction and contribute to more effective control of this pathogen. PMID:22802712

  5. Unraveling the Light-Specific Metabolic and Regulatory Signatures of Rice through Combined in Silico Modeling and Multiomics Analysis1[OPEN

    PubMed Central

    Lim, Sun-Hyung; Kim, Jae Kwang; Ha, Sun-Hwa

    2015-01-01

    Light quality is an important signaling component upon which plants orchestrate various morphological processes, including seed germination and seedling photomorphogenesis. However, it is still unclear how plants, especially food crops, sense various light qualities and modulate their cellular growth and other developmental processes. Therefore, in this work, we initially profiled the transcripts of a model crop, rice (Oryza sativa), under four different light treatments (blue, green, red, and white) as well as in the dark. Concurrently, we reconstructed a fully compartmentalized genome-scale metabolic model of rice cells, iOS2164, containing 2,164 unique genes, 2,283 reactions, and 1,999 metabolites. We then combined the model with transcriptome profiles to elucidate the light-specific transcriptional signatures of rice metabolism. Clearly, light signals mediated rice gene expressions, differentially regulating numerous metabolic pathways: photosynthesis and secondary metabolism were up-regulated in blue light, whereas reserve carbohydrates degradation was pronounced in the dark. The topological analysis of gene expression data with the rice genome-scale metabolic model further uncovered that phytohormones, such as abscisate, ethylene, gibberellin, and jasmonate, are the key biomarkers of light-mediated regulation, and subsequent analysis of the associated genes’ promoter regions identified several light-specific transcription factors. Finally, the transcriptional control of rice metabolism by red and blue light signals was assessed by integrating the transcriptome and metabolome data with constraint-based modeling. The biological insights gained from this integrative systems biology approach offer several potential applications, such as improving the agronomic traits of food crops and designing light-specific synthetic gene circuits in microbial and mammalian systems. PMID:26453433

  6. Genome-wide analysis of copper, iron and zinc transporters in the arbuscular mycorrhizal fungus Rhizophagus irregularis.

    PubMed

    Tamayo, Elisabeth; Gómez-Gallego, Tamara; Azcón-Aguilar, Concepción; Ferrol, Nuria

    2014-01-01

    Arbuscular mycorrhizal fungi (AMF), belonging to the Glomeromycota, are soil microorganisms that establish mutualistic symbioses with the majority of higher plants. The efficient uptake of low mobility mineral nutrients by the fungal symbiont and their further transfer to the plant is a major feature of this symbiosis. Besides improving plant mineral nutrition, AMF can alleviate heavy metal toxicity to their host plants and are able to tolerate high metal concentrations in the soil. Nevertheless, we are far from understanding the key molecular determinants of metal homeostasis in these organisms. To get some insights into these mechanisms, a genome-wide analysis of Cu, Fe and Zn transporters was undertaken, making use of the recently published whole genome of the AMF Rhizophagus irregularis. This in silico analysis allowed identification of 30 open reading frames in the R. irregularis genome, which potentially encode metal transporters. Phylogenetic comparisons with the genomes of a set of reference fungi showed an expansion of some metal transporter families. Analysis of the published transcriptomic profiles of R. irregularis revealed that a set of genes were up-regulated in mycorrhizal roots compared to germinated spores and extraradical mycelium, which suggests that metals are important for plant colonization.

  7. Mining whole genomes and transcriptomes of Jatropha (Jatropha curcas) and Castor bean (Ricinus communis) for NBS-LRR genes and defense response associated transcription factors.

    PubMed

    Sood, Archit; Jaiswal, Varun; Chanumolu, Sree Krishna; Malhotra, Nikhil; Pal, Tarun; Chauhan, Rajinder Singh

    2014-11-01

    Jatropha (Jatropha curcas L.) and Castor bean (Ricinus communis) are oilseed crops of family Euphorbiaceae with the potential of producing high quality biodiesel and having industrial value. Both the bioenergy plants are becoming susceptible to various biotic stresses directly affecting the oil quality and content. No report exists as of today on analysis of Nucleotide Binding Site-Leucine Rich Repeat (NBS-LRR) gene repertoire and defense response transcription factors in both the plant species. In silico analysis of whole genomes and transcriptomes identified 47 new NBS-LRR genes in both the species and 122 and 318 defense response related transcription factors in Jatropha and Castor bean, respectively. The identified NBS-LRR genes and defense response transcription factors were mapped onto the respective genomes. Common and unique NBS-LRR genes and defense related transcription factors were identified in both the plant species. All NBS-LRR genes in both the species were characterized into Toll/interleukin-1 receptor NBS-LRRs (TNLs) and coiled-coil NBS-LRRs (CNLs), position on contigs, gene clusters and motifs and domains distribution. Transcript abundance or expression values were measured for all NBS-LRR genes and defense response transcription factors, suggesting their functional role. The current study provides a repertoire of NBS-LRR genes and transcription factors which can be used in not only dissecting the molecular basis of disease resistance phenotype but also in developing disease resistant genotypes in Jatropha and Castor bean through transgenic or molecular breeding approaches.

  8. Evaluation of the impact of RNA preservation methods of spiders for de novo transcriptome assembly.

    PubMed

    Kono, Nobuaki; Nakamura, Hiroyuki; Ito, Yusuke; Tomita, Masaru; Arakawa, Kazuharu

    2016-05-01

    With advances in high-throughput sequencing technologies, de novo transcriptome sequencing and assembly has become a cost-effective method to obtain comprehensive genetic information of a species of interest, especially in nonmodel species with large genomes such as spiders. However, high-quality RNA is essential for successful sequencing, and sample preservation conditions require careful consideration for the effective storage of field-collected samples. To this end, we report a streamlined feasibility study of various storage conditions and their effects on de novo transcriptome assembly results. The storage parameters considered include temperatures ranging from room temperature to -80°C; preservatives, including ethanol, RNAlater, TRIzol and RNAlater-ICE; and sample submersion states. As a result, intact RNA was extracted and assembly was successful when samples were preserved at low temperatures regardless of the type of preservative used. The assemblies as well as the gene expression profiles were shown to be robust to RNA degradation, when 30 million 150-bp paired-end reads are obtained. The parameters for sample storage, RNA extraction, library preparation, sequencing and in silico assembly considered in this work provide a guideline for the study of field-collected samples of spiders. © 2015 John Wiley & Sons Ltd.

  9. Identification of ovule transcripts from the Apospory-Specific Genomic Region (ASGR)-carrier chromosome

    PubMed Central

    2011-01-01

    Background Apomixis, asexual seed production in plants, holds great potential for agriculture as a means to fix hybrid vigor. Apospory is a form of apomixis where the embryo develops from an unreduced egg that is derived from a somatic nucellar cell, the aposporous initial, via mitosis. Understanding the molecular mechanism regulating aposporous initial specification will be a critical step toward elucidation of apomixis and also provide insight into developmental regulation and downstream signaling that results in apomixis. To discover candidate transcripts for regulating aposporous initial specification in P. squamulatum, we compared two transcriptomes derived from microdissected ovules at the stage of aposporous initial formation between the apomictic donor parent, P. squamulatum (accession PS26), and an apomictic derived backcross 8 (BC8) line containing only the Apospory-Specific Genomic Region (ASGR)-carrier chromosome from P. squamulatum. Toward this end, two transcriptomes derived from ovules of an apomictic donor parent and its apomictic backcross derivative at the stage of apospory initiation, were sequenced using 454-FLX technology. Results Using 454-FLX technology, we generated 332,567 reads with an average read length of 147 base pairs (bp) for the PS26 ovule transcriptome library and 363,637 reads with an average read length of 142 bp for the BC8 ovule transcriptome library. A total of 33,977 contigs from the PS26 ovule transcriptome library and 26,576 contigs from the BC8 ovule transcriptome library were assembled using the Multifunctional Inertial Reference Assembly program. Using stringent in silico parameters, 61 transcripts were predicted to map to the ASGR-carrier chromosome, of which 49 transcripts were verified as ASGR-carrier chromosome specific. One of the alien expressed genes could be assigned as tightly linked to the ASGR by screening of apomictic and sexual F1s. Only one transcript, which did not map to the ASGR, showed expression primarily in reproductive tissue. Conclusions Our results suggest that a strategy of comparative sequencing of transcriptomes between donor parent and backcross lines containing an alien chromosome of interest can be an efficient method of identifying transcripts derived from an alien chromosome in a chromosome addition line. PMID:21521529

  10. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks.

    PubMed

    Lepoivre, Cyrille; Bergon, Aurélie; Lopez, Fabrice; Perumal, Narayanan B; Nguyen, Catherine; Imbert, Jean; Puthier, Denis

    2012-01-31

    Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration of heterogeneous biological information and is a productive avenue in generating new hypotheses. The second objective of InteractomeBrowser is to fill the gap between interaction databases and dynamic modeling. It is thus compatible with the network analysis software Cytoscape and with the Gene Interaction Network simulation software (GINsim). We provide examples underlying the benefits of this visualization tool for large gene set analysis related to thymocyte differentiation. The InteractomeBrowser plugin is a powerful tool to get quick access to a knowledge database that includes both predicted and validated molecular interactions. InteractomeBrowser is available through the TranscriptomeBrowser framework and can be found at: http://tagc.univ-mrs.fr/tbrowser/. Our database is updated on a regular basis.

  11. Improved intra-array and interarray normalization of peptide microarray phosphorylation for phosphorylome and kinome profiling by rational selection of relevant spots

    PubMed Central

    Scholma, Jetse; Fuhler, Gwenny M.; Joore, Jos; Hulsman, Marc; Schivo, Stefano; List, Alan F.; Reinders, Marcel J. T.; Peppelenbosch, Maikel P.; Post, Janine N.

    2016-01-01

    Massive parallel analysis using array technology has become the mainstay for analysis of genomes and transcriptomes. Analogously, the predominance of phosphorylation as a regulator of cellular metabolism has fostered the development of peptide arrays of kinase consensus substrates that allow the charting of cellular phosphorylation events (often called kinome profiling). However, whereas the bioinformatical framework for expression array analysis is well-developed, no advanced analysis tools are yet available for kinome profiling. Especially intra-array and interarray normalization of peptide array phosphorylation remain problematic, due to the absence of “housekeeping” kinases and the obvious fallacy of the assumption that different experimental conditions should exhibit equal amounts of kinase activity. Here we describe the development of analysis tools that reliably quantify phosphorylation of peptide arrays and that allow normalization of the signals obtained. We provide a method for intraslide gradient correction and spot quality control. We describe a novel interarray normalization procedure, named repetitive signal enhancement, RSE, which provides a mathematical approach to limit the false negative results occuring with the use of other normalization procedures. Using in silico and biological experiments we show that employing such protocols yields superior insight into cellular physiology as compared to classical analysis tools for kinome profiling. PMID:27225531

  12. A transcriptomic insight into the infective juvenile stage of the insect parasitic nematode, Heterorhabditis indica.

    PubMed

    Somvanshi, Vishal S; Gahoi, Shachi; Banakar, Prakash; Thakur, Prasoon Kumar; Kumar, Mukesh; Sajnani, Manisha; Pandey, Priyatama; Rao, Uma

    2016-03-01

    Nematodes are the most numerous animals in the soil. Insect parasitic nematodes of the genus Heterorhabditis are capable of selectively seeking, infecting and killing their insect-hosts in the soil. The infective juvenile (IJ) stage of the Heterorhabditis nematodes is analogous to Caenorhabditis elegans dauer juvenile stage, which remains in 'arrested development' till it finds and infects a new insect-host in the soil. H. indica is the most prevalent species of Heterorhabditis in India. To understand the genes and molecular processes that govern the biology of the IJ stage, and to create a resource to facilitate functional genomics and genetic exploration, we sequenced the transcriptome of H. indica IJs. The de-novo sequence assembly using Velvet-Oases pipeline resulted in 13,593 unique transcripts at N50 of 1,371 bp, of which 53 % were annotated by blastx. H. indica transcripts showed higher orthology with parasitic nematodes as compared to free living nematodes. In-silico expression analysis showed 30 % of transcripts expressing with ≥100 FPKM value. All the four canonical dauer formation pathways like cGMP-PKG, insulin, dafachronic acid and TGF-β were active in the IJ stage. Several other signaling pathways were highly represented in the transcriptome. Twenty-four orthologs of C. elegans RNAi pathway effector genes were discovered in H. indica, including nrde-3 that is reported for the first time in any of the parasitic nematodes. An ortholog of C. elegans tol-1 was also identified. Further, 272 kinases belonging to 137 groups, and several previously unidentified members of important gene classes were identified. We generated high-quality transcriptome sequence data from H. indica IJs for the first time. The transcripts showed high similarity with the parasitic nematodes, M. hapla, and A. suum as opposed to C. elegans, a species to which H. indica is more closely related. The high representation of transcripts from several signaling pathways in the IJs indicates that despite being a developmentally arrested stage; IJs are a hotbed of signaling and are actively interacting with their environment.

  13. Pharmacogenomic identification of small molecules for lineage specific manipulation of subventricular zone germinal activity.

    PubMed

    Azim, Kasum; Angonin, Diane; Marcy, Guillaume; Pieropan, Francesca; Rivera, Andrea; Donega, Vanessa; Cantù, Claudio; Williams, Gareth; Berninger, Benedikt; Butt, Arthur M; Raineteau, Olivier

    2017-03-01

    Strategies for promoting neural regeneration are hindered by the difficulty of manipulating desired neural fates in the brain without complex genetic methods. The subventricular zone (SVZ) is the largest germinal zone of the forebrain and is responsible for the lifelong generation of interneuron subtypes and oligodendrocytes. Here, we have performed a bioinformatics analysis of the transcriptome of dorsal and lateral SVZ in early postnatal mice, including neural stem cells (NSCs) and their immediate progenies, which generate distinct neural lineages. We identified multiple signaling pathways that trigger distinct downstream transcriptional networks to regulate the diversity of neural cells originating from the SVZ. Next, we used a novel in silico genomic analysis, searchable platform-independent expression database/connectivity map (SPIED/CMAP), to generate a catalogue of small molecules that can be used to manipulate SVZ microdomain-specific lineages. Finally, we demonstrate that compounds identified in this analysis promote the generation of specific cell lineages from NSCs in vivo, during postnatal life and adulthood, as well as in regenerative contexts. This study unravels new strategies for using small bioactive molecules to direct germinal activity in the SVZ, which has therapeutic potential in neurodegenerative diseases.

  14. Unlocking the potential of publicly available microarray data using inSilicoDb and inSilicoMerging R/Bioconductor packages.

    PubMed

    Taminau, Jonatan; Meganck, Stijn; Lazar, Cosmin; Steenhoff, David; Coletta, Alain; Molter, Colin; Duque, Robin; de Schaetzen, Virginie; Weiss Solís, David Y; Bersini, Hugues; Nowé, Ann

    2012-12-24

    With an abundant amount of microarray gene expression data sets available through public repositories, new possibilities lie in combining multiple existing data sets. In this new context, analysis itself is no longer the problem, but retrieving and consistently integrating all this data before delivering it to the wide variety of existing analysis tools becomes the new bottleneck. We present the newly released inSilicoMerging R/Bioconductor package which, together with the earlier released inSilicoDb R/Bioconductor package, allows consistent retrieval, integration and analysis of publicly available microarray gene expression data sets. Inside the inSilicoMerging package a set of five visual and six quantitative validation measures are available as well. By providing (i) access to uniformly curated and preprocessed data, (ii) a collection of techniques to remove the batch effects between data sets from different sources, and (iii) several validation tools enabling the inspection of the integration process, these packages enable researchers to fully explore the potential of combining gene expression data for downstream analysis. The power of using both packages is demonstrated by programmatically retrieving and integrating gene expression studies from the InSilico DB repository [https://insilicodb.org/app/].

  15. The Molecular Phenotype of Endocapillary Proliferation: Novel Therapeutic Targets for IgA Nephropathy

    PubMed Central

    John, Rohan; Grone, Elisabeth; Porubsky, Stefan; Gröne, Hermann-Josef; Herzenberg, Andrew M.; Scholey, James W.; Hladunewich, Michelle; Cattran, Daniel C.

    2014-01-01

    IgA nephropathy (IgAN) is a clinically and pathologically heterogeneous disease. Endocapillary proliferation is associated with higher risk of progressive disease, and clinical studies suggest that corticosteroids mitigate this risk. However, corticosteroids are associated with protean cellular effects and significant toxicity. Furthermore the precise mechanism by which they modulate kidney injury in IgAN is not well delineated. To better understand molecular pathways involved in the development of endocapillary proliferation and to identify novel specific therapeutic targets, we evaluated the glomerular transcriptome of microdissected kidney biopsies from 22 patients with IgAN. Endocapillary proliferation was defined according to the Oxford scoring system independently by 3 nephropathologists. We analyzed mRNA expression using microarrays and identified transcripts differentially expressed in patients with endocapillary proliferation compared to IgAN without endocapillary lesions. Next, we employed both transcription factor analysis and in silico drug screening and confirmed that the endocapillary proliferation transcriptome is significantly enriched with pathways that can be impacted by corticosteroids. With this approach we also identified novel therapeutic targets and bioactive small molecules that may be considered for therapeutic trials for the treatment of IgAN, including resveratrol and hydroquinine. In summary, we have defined the distinct molecular profile of a pathologic phenotype associated with progressive renal insufficiency in IgAN. Exploration of the pathways associated with endocapillary proliferation confirms a molecular basis for the clinical effectiveness of corticosteroids in this subgroup of IgAN, and elucidates new therapeutic strategies for IgAN. PMID:25133636

  16. A comprehensive transcriptome assembly of Pigeonpea (Cajanus cajan L.) using sanger and second-generation sequencing platforms.

    PubMed

    Kudapa, Himabindu; Bharti, Arvind K; Cannon, Steven B; Farmer, Andrew D; Mulaosmanovic, Benjamin; Kramer, Robin; Bohra, Abhishek; Weeks, Nathan T; Crow, John A; Tuteja, Reetu; Shah, Trushar; Dutta, Sutapa; Gupta, Deepak K; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; May, Gregory D; Singh, Nagendra K; Varshney, Rajeev K

    2012-09-01

    A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.

  17. Exploring the Transcriptome of Ciliated Cells Using In Silico Dissection of Human Tissues

    PubMed Central

    Ivliev, Alexander E.; 't Hoen, Peter A. C.; van Roon-Mom, Willeke M. C.; Peters, Dorien J. M.; Sergeeva, Marina G.

    2012-01-01

    Cilia are cell organelles that play important roles in cell motility, sensory and developmental functions and are involved in a range of human diseases, known as ciliopathies. Here, we search for novel human genes related to cilia using a strategy that exploits the previously reported tendency of cell type-specific genes to be coexpressed in the transcriptome of complex tissues. Gene coexpression networks were constructed using the noise-resistant WGCNA algorithm in 12 publicly available microarray datasets from human tissues rich in motile cilia: airways, fallopian tubes and brain. A cilia-related coexpression module was detected in 10 out of the 12 datasets. A consensus analysis of this module's gene composition recapitulated 297 known and predicted 74 novel cilia-related genes. 82% of the novel candidates were supported by tissue-specificity expression data from GEO and/or proteomic data from the Human Protein Atlas. The novel findings included a set of genes (DCDC2, DYX1C1, KIAA0319) related to a neurological disease dyslexia suggesting their potential involvement in ciliary functions. Furthermore, we searched for differences in gene composition of the ciliary module between the tissues. A multidrug-and-toxin extrusion transporter MATE2 (SLC47A2) was found as a brain-specific central gene in the ciliary module. We confirm the localization of MATE2 in cilia by immunofluorescence staining using MDCK cells as a model. While MATE2 has previously gained attention as a pharmacologically relevant transporter, its potential relation to cilia is suggested for the first time. Taken together, our large-scale analysis of gene coexpression networks identifies novel genes related to human cell cilia. PMID:22558177

  18. hSAGEing: an improved SAGE-based software for identification of human tissue-specific or common tumor markers and suppressors.

    PubMed

    Yang, Cheng-Hong; Chuang, Li-Yeh; Shih, Tsung-Mu; Chang, Hsueh-Wei

    2010-12-17

    SAGE (serial analysis of gene expression) is a powerful method of analyzing gene expression for the entire transcriptome. There are currently many well-developed SAGE tools. However, the cross-comparison of different tissues is seldom addressed, thus limiting the identification of common- and tissue-specific tumor markers. To improve the SAGE mining methods, we propose a novel function for cross-tissue comparison of SAGE data by combining the mathematical set theory and logic with a unique "multi-pool method" that analyzes multiple pools of pair-wise case controls individually. When all the settings are in "inclusion", the common SAGE tag sequences are mined. When one tissue type is in "inclusion" and the other types of tissues are not in "inclusion", the selected tissue-specific SAGE tag sequences are generated. They are displayed in tags-per-million (TPM) and fold values, as well as visually displayed in four kinds of scales in a color gradient pattern. In the fold visualization display, the top scores of the SAGE tag sequences are provided, along with cluster plots. A user-defined matrix file is designed for cross-tissue comparison by selecting libraries from publically available databases or user-defined libraries. The hSAGEing tool provides a combination of friendly cross-tissue analysis and an interface for comparing SAGE libraries for the first time. Some up- or down-regulated genes with tissue-specific or common tumor markers and suppressors are identified computationally. The tool is useful and convenient for in silico cancer transcriptomic studies and is freely available at http://bio.kuas.edu.tw/hSAGEing.

  19. Discovery of novel antimicrobial peptides: A transcriptomic study of the sea anemone Cnidopus japonicus.

    PubMed

    Grafskaia, Ekaterina N; Polina, Nadezhda F; Babenko, Vladislav V; Kharlampieva, Daria D; Bobrovsky, Pavel A; Manuvera, Valentin A; Farafonova, Tatyana E; Anikanov, Nikolay A; Lazarev, Vassili N

    2018-04-01

    As essential conservative component of the innate immune systems of living organisms, antimicrobial peptides (AMPs) could complement pharmaceuticals that increasingly fail to combat various pathogens exhibiting increased resistance to microbial antibiotics. Among the properties of AMPs that suggest their potential as therapeutic agents, diverse peptides in the venoms of various predators demonstrate antimicrobial activity and kill a wide range of microorganisms. To identify potent AMPs, the study reported here involved a transcriptomic profiling of the tentacle secretion of the sea anemone Cnidopus japonicus. An in silico search algorithm designed to discover toxin-like proteins containing AMPs was developed based on the evaluation of the properties and structural peculiarities of amino acid sequences. The algorithm revealed new proteins of the anemone containing antimicrobial candidate sequences, and 10 AMPs verified using high-throughput proteomics were synthesized. The antimicrobial activity of the candidate molecules was experimentally estimated against Gram-positive and -negative bacteria. Ultimately, three peptides exhibited antimicrobial activity against bacterial strains, which suggests that the method can be applied to reveal new AMPs in the venoms of other predators as well.

  20. A Case-Matched Gender Comparison Transcriptomic Screen Identifies eIF4E and eIF5 as Potential Prognostic Markers in Male Breast Cancer.

    PubMed

    Humphries, Matthew P; Sundara Rajan, Sreekumar; Droop, Alastair; Suleman, Charlotte A B; Carbone, Carmine; Nilsson, Cecilia; Honarpisheh, Hedieh; Cserni, Gabor; Dent, Jo; Fulford, Laura; Jordan, Lee B; Jones, J Louise; Kanthan, Rani; Litwiniuk, Maria; Di Benedetto, Anna; Mottolese, Marcella; Provenzano, Elena; Shousha, Sami; Stephens, Mark; Walker, Rosemary A; Kulka, Janina; Ellis, Ian O; Jeffery, Margaret; Thygesen, Helene H; Cappelletti, Vera; Daidone, Maria G; Hedenfalk, Ingrid A; Fjällskog, Marie-Louise; Melisi, Davide; Stead, Lucy F; Shaaban, Abeer M; Speirs, Valerie

    2017-05-15

    Purpose: Breast cancer affects both genders, but is understudied in men. Although still rare, male breast cancer (MBC) is being diagnosed more frequently. Treatments are wholly informed by clinical studies conducted in women, based on assumptions that underlying biology is similar. Experimental Design: A transcriptomic investigation of male and female breast cancer was performed, confirming transcriptomic data in silico Biomarkers were immunohistochemically assessed in 697 MBCs ( n = 477, training; n = 220, validation set) and quantified in pre- and posttreatment samples from an MBC patient receiving everolimus and PI3K/mTOR inhibitor. Results: Gender-specific gene expression patterns were identified. eIF transcripts were upregulated in MBC. eIF4E and eIF5 were negatively prognostic for overall survival alone (log-rank P = 0.013; HR = 1.77, 1.12-2.8 and P = 0.035; HR = 1.68, 1.03-2.74, respectively), or when coexpressed ( P = 0.01; HR = 2.66, 1.26-5.63), confirmed in the validation set. This remained upon multivariate Cox regression analysis [eIF4E P = 0.016; HR = 2.38 (1.18-4.8), eIF5 P = 0.022; HR = 2.55 (1.14-5.7); coexpression P = 0.001; HR = 7.04 (2.22-22.26)]. Marked reduction in eIF4E and eIF5 expression was seen post BEZ235/everolimus, with extended survival. Conclusions: Translational initiation pathway inhibition could be of clinical utility in MBC patients overexpressing eIF4E and eIF5. With mTOR inhibitors that target this pathway now in the clinic, these biomarkers may represent new targets for therapeutic intervention, although further independent validation is required. Clin Cancer Res; 23(10); 2575-83. ©2016 AACR . ©2016 American Association for Cancer Research.

  1. Root Cell-Specific Regulators of Phosphate-Dependent Growth1[OPEN

    PubMed Central

    Ding, Wona

    2017-01-01

    Cellular specialization in abiotic stress responses is an important regulatory feature driving plant acclimation. Our in silico approach of iterative coexpression, interaction, and enrichment analyses predicted root cell-specific regulators of phosphate starvation response networks in Arabidopsis (Arabidopsis thaliana). This included three uncharacterized genes termed Phosphate starvation-induced gene interacting Root Cell Enriched (PRCE1, PRCE2, and PRCE3). Root cell-specific enrichment of 12 candidates was confirmed in promoter-GFP lines. T-DNA insertion lines of 11 genes showed changes in phosphate status and growth responses to phosphate availability compared with the wild type. Some mutants (cbl1, cipk2, prce3, and wdd1) displayed strong biomass gain irrespective of phosphate supply, while others (cipk14, mfs1, prce1, prce2, and s6k2) were able to sustain growth under low phosphate supply better than the wild type. Notably, root or shoot phosphate accumulation did not strictly correlate with organ growth. Mutant response patterns markedly differed from those of master regulators of phosphate homeostasis, PHOSPHATE STARVATION RESPONSE1 (PHR1) and PHOSPHATE2 (PHO2), demonstrating that negative growth responses in the latter can be overcome when cell-specific regulators are targeted. RNA sequencing analysis highlighted the transcriptomic plasticity in these mutants and revealed PHR1-dependent and -independent regulatory circuits with gene coexpression profiles that were highly correlated to the quantified physiological traits. The results demonstrate how in silico prediction of cell-specific, stress-responsive genes uncovers key regulators and how their manipulation can have positive impacts on plant growth under abiotic stress. PMID:28465462

  2. Deep mRNA Sequencing of the Tritonia diomedea Brain Transcriptome Provides Access to Gene Homologues for Neuronal Excitability, Synaptic Transmission and Peptidergic Signalling

    PubMed Central

    Senatore, Adriano; Edirisinghe, Neranjan; Katz, Paul S.

    2015-01-01

    Background The sea slug Tritonia diomedea (Mollusca, Gastropoda, Nudibranchia), has a simple and highly accessible nervous system, making it useful for studying neuronal and synaptic mechanisms underlying behavior. Although many important contributions have been made using Tritonia, until now, a lack of genetic information has impeded exploration at the molecular level. Results We performed Illumina sequencing of central nervous system mRNAs from Tritonia, generating 133.1 million 100 base pair, paired-end reads. De novo reconstruction of the RNA-Seq data yielded a total of 185,546 contigs, which partitioned into 123,154 non-redundant gene clusters (unigenes). BLAST comparison with RefSeq and Swiss-Prot protein databases, as well as mRNA data from other invertebrates (gastropod molluscs: Aplysia californica, Lymnaea stagnalis and Biomphalaria glabrata; cnidarian: Nematostella vectensis) revealed that up to 76,292 unigenes in the Tritonia transcriptome have putative homologues in other databases, 18,246 of which are below a more stringent E-value cut-off of 1x10-6. In silico prediction of secreted proteins from the Tritonia transcriptome shotgun assembly (TSA) produced a database of 579 unique sequences of secreted proteins, which also exhibited markedly higher expression levels compared to other genes in the TSA. Conclusions Our efforts greatly expand the availability of gene sequences available for Tritonia diomedea. We were able to extract full length protein sequences for most queried genes, including those involved in electrical excitability, synaptic vesicle release and neurotransmission, thus confirming that the transcriptome will serve as a useful tool for probing the molecular correlates of behavior in this species. We also generated a neurosecretome database that will serve as a useful tool for probing peptidergic signalling systems in the Tritonia brain. PMID:25719197

  3. The current status of alternatives to animal testing and predictive toxicology methods using liver microfluidic biochips.

    PubMed

    Prot, Jean Matthieu; Leclerc, Eric

    2012-06-01

    In this paper, we will consider new in vitro cell culture platforms and the progress made, based on the microfluidic liver biochips dedicated to pharmacological and toxicological studies. Particular emphasis will be given to recent developments in the microfluidic tools dedicated to cell culture (more particularly liver cell culture), in silico opportunities for Physiologically Based PharmacoKinetic (PBPK) modelling, the challenge of the mechanistic interpretations offered by the approaches resulting from "multi-omics" data (transcriptomics, proteomics, metabolomics, cytomics) and imaging microfluidic platforms. Finally, we will discuss the critical features regarding microfabrication, design and materials, and cell functionality as the key points for the future development of new microfluidic liver biochips.

  4. PIVOT: platform for interactive analysis and visualization of transcriptomics data.

    PubMed

    Zhu, Qin; Fisher, Stephen A; Dueck, Hannah; Middleton, Sarah; Khaladkar, Mugdha; Kim, Junhyong

    2018-01-05

    Many R packages have been developed for transcriptome analysis but their use often requires familiarity with R and integrating results of different packages requires scripts to wrangle the datatypes. Furthermore, exploratory data analyses often generate multiple derived datasets such as data subsets or data transformations, which can be difficult to track. Here we present PIVOT, an R-based platform that wraps open source transcriptome analysis packages with a uniform user interface and graphical data management that allows non-programmers to interactively explore transcriptomics data. PIVOT supports more than 40 popular open source packages for transcriptome analysis and provides an extensive set of tools for statistical data manipulations. A graph-based visual interface is used to represent the links between derived datasets, allowing easy tracking of data versions. PIVOT further supports automatic report generation, publication-quality plots, and program/data state saving, such that all analysis can be saved, shared and reproduced. PIVOT will allow researchers with broad background to easily access sophisticated transcriptome analysis tools and interactively explore transcriptome datasets.

  5. Digital Gene Expression Analysis Based on De Novo Transcriptome Assembly Reveals New Genes Associated with Floral Organ Differentiation of the Orchid Plant Cymbidium ensifolium

    PubMed Central

    Yang, Fengxi; Zhu, Genfa

    2015-01-01

    Cymbidium ensifolium belongs to the genus Cymbidium of the orchid family. Owing to its spectacular flower morphology, C. ensifolium has considerable ecological and cultural value. However, limited genetic data is available for this non-model plant, and the molecular mechanism underlying floral organ identity is still poorly understood. In this study, we characterize the floral transcriptome of C. ensifolium and present, for the first time, extensive sequence and transcript abundance data of individual floral organs. After sequencing, over 10 Gb clean sequence data were generated and assembled into 111,892 unigenes with an average length of 932.03 base pairs, including 1,227 clusters and 110,665 singletons. Assembled sequences were annotated with gene descriptions, gene ontology, clusters of orthologous group terms, the Kyoto Encyclopedia of Genes and Genomes, and the plant transcription factor database. From these annotations, 131 flowering-associated unigenes, 61 CONSTANS-LIKE (COL) unigenes and 90 floral homeotic genes were identified. In addition, four digital gene expression libraries were constructed for the sepal, petal, labellum and gynostemium, and 1,058 genes corresponding to individual floral organ development were identified. Among them, eight MADS-box genes were further investigated by full-length cDNA sequence analysis and expression validation, which revealed two APETALA1/AGL9-like MADS-box genes preferentially expressed in the sepal and petal, two AGAMOUS-like genes particularly restricted to the gynostemium, and four DEF-like genes distinctively expressed in different floral organs. The spatial expression of these genes varied distinctly in different floral mutant corresponding to different floral morphogenesis, which validated the specialized roles of them in floral patterning and further supported the effectiveness of our in silico analysis. This dataset generated in our study provides new insights into the molecular mechanisms underlying floral patterning of Cymbidium and supports a valuable resource for molecular breeding of the orchid plant. PMID:26580566

  6. Genome and transcriptome adaptation accompanying emergence of the definitive type 2 host-restricted Salmonella enterica serovar Typhimurium pathovar.

    PubMed

    Kingsley, Robert A; Kay, Sally; Connor, Thomas; Barquist, Lars; Sait, Leanne; Holt, Kathryn E; Sivaraman, Karthi; Wileman, Thomas; Goulding, David; Clare, Simon; Hale, Christine; Seshasayee, Aswin; Harris, Simon; Thomson, Nicholas R; Gardner, Paul; Rabsch, Wolfgang; Wigley, Paul; Humphrey, Tom; Parkhill, Julian; Dougan, Gordon

    2013-08-27

    Salmonella enterica serovar Typhimurium definitive type 2 (DT2) is host restricted to Columba livia (rock or feral pigeon) but is also closely related to S. Typhimurium isolates that circulate in livestock and cause a zoonosis characterized by gastroenteritis in humans. DT2 isolates formed a distinct phylogenetic cluster within S. Typhimurium based on whole-genome-sequence polymorphisms. Comparative genome analysis of DT2 94-213 and S. Typhimurium SL1344, DT104, and D23580 identified few differences in gene content with the exception of variations within prophages. However, DT2 94-213 harbored 22 pseudogenes that were intact in other closely related S. Typhimurium strains. We report a novel in silico approach to identify single amino acid substitutions in proteins that have a high probability of a functional impact. One polymorphism identified using this method, a single-residue deletion in the Tar protein, abrogated chemotaxis to aspartate in vitro. DT2 94-213 also exhibited an altered transcriptional profile in response to culture at 42°C compared to that of SL1344. Such differentially regulated genes included a number involved in flagellum biosynthesis and motility. IMPORTANCE Whereas Salmonella enterica serovar Typhimurium can infect a wide range of animal species, some variants within this serovar exhibit a more limited host range and altered disease potential. Phylogenetic analysis based on whole-genome sequences can identify lineages associated with specific virulence traits, including host adaptation. This study represents one of the first to link pathogen-specific genetic signatures, including coding capacity, genome degradation, and transcriptional responses to host adaptation within a Salmonella serovar. We performed comparative genome analysis of reference and pigeon-adapted definitive type 2 (DT2) S. Typhimurium isolates alongside phenotypic and transcriptome analyses, to identify genetic signatures linked to host adaptation within the DT2 lineage.

  7. Insights into the olfactory system of the ectoparasite Caligus rogercresseyi: molecular characterization and gene transcription analysis of novel ionotropic receptors.

    PubMed

    Núñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Marambio, Jorge Pino; Wadsworth, Simon; Gallardo-Escárate, Cristian

    2014-10-01

    Although various elements of the olfactory system have been elucidated in insects, it remains practically unstudied in crustaceans at a molecular level. Among crustaceans, some species are classified as ectoparasites that impact the finfish aquaculture industry. Thus, there is an urgent need to identify and comprehend the signaling pathways used by these in host recognition. The present study, through RNA-seq and qPCR analyses, found novel transcripts involved in the olfactory system of Caligus rogercresseyi, in addition to the transcriptomic patterns expressed during different stages of salmon lice development. From a transcriptomic library generated by Illumina sequencing, contigs that annotated for ionotropic receptors and other genes implicated in the olfactory system were identified and extracted. Full length mRNA was obtained for the ionotropic glutamate receptor 25, which had 3923 bp, and for the glutamate receptor ionotropic kainate 2, which had 2737 bp. Furthermore, two other transcripts identified as glutamate receptor, ionotropic kainate 2-like were found. In silico analysis was performed for the transcription expression from different stages of development in C. rogercresseyi, and clusters according to RPKM values were constructed. Gene transcription data were validated through qPCR assays in ionotropic receptors, and showed an expression of glutamate receptor 25 associated with the copepodid stage whereas adults, especially male adults, were associated with the kainate 2 and kainate 2-like transcripts. Additionally, gene transcription analysis of the ionotropic receptors showed an overexpression in response to the presence of masking compounds and immunostimulant in salmon diets. This response correlated to a reduction in sea lice infection following in vivo challenge. Diets with masking compounds showed a decrease of lice infestation of up to 25%. This work contributes to the available knowledge on chemosensory systems in this ectoparasite, providing novel elements towards understanding the host-finding process of the salmon louse C. rogercresseyi. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. Transcriptome association analysis identifies miR-375 as a major determinant of variable acetaminophen glucuronidation by human liver.

    PubMed

    Papageorgiou, Ioannis; Freytsis, Marina; Court, Michael H

    2016-10-01

    Acetaminophen is the leading cause of acute liver failure (ALF) in many countries including the United States. Hepatic glucuronidation by UDP-glucuronosyltransferase (UGT) 1A subfamily enzymes is the major route of acetaminophen elimination. Reduced glucuronidation may predispose some individuals to acetaminophen-induced ALF, but mechanisms underlying reduced glucuronidation are poorly understood. We hypothesized that specific microRNAs (miRNAs) may reduce UGT1A activity by direct effects on the UGT1A 3'-UTR shared by all UGT1A enzyme transcripts, or by indirect effects on transcription factors regulating UGT1A expression. We performed an unbiased miRNA whole transcriptome association analysis using a bank of human livers with known acetaminophen glucuronidation activities. Of 754 miRNAs evaluated, 9 miRNAs were identified that were significantly overexpressed (p<0.05; >2-fold) in livers with low acetaminophen glucuronidation activities compared with those with high activities. miR-375 showed the highest difference (>10-fold), and was chosen for further mechanistic validation. We demonstrated using in silico analysis and luciferase reporter assays that miR-375 has a unique functional binding site in the 3'-UTR of the aryl hydrocarbon receptor (AhR) gene. Furthermore overexpression of miR-375 in LS180 cells demonstrated significant repression of endogenous AhR protein (by 40%) and mRNA (by 10%), as well as enzyme activity and/or mRNA of AhR regulated enzymes including UGT1A1, UGT1A6, and CYP1A2, without affecting UGT2B7, which is not regulated by AhR. Thus miR-375 is identified as a novel repressor of UGT1A-mediated hepatic acetaminophen glucuronidation through reduced AhR expression, which could predispose some individuals to increased risk for acetaminophen-induced ALF. Published by Elsevier Inc.

  9. Trinity | Informatics Technology for Cancer Research (ITCR)

    Cancer.gov

    Trinity Cancer Transcriptome Analysis Toolkit (CTAT) including de novo transcriptome assembly with downstream support for expression analysis and focused analyses on cancer transcriptomes, incorporating mutation and fusion transcript discovery, and single cell analysis.

  10. Transcriptomic analysis of Ruditapes philippinarum hemocytes reveals cytoskeleton disruption after in vitro Vibrio tapetis challenge.

    PubMed

    Brulle, Franck; Jeffroy, Fanny; Madec, Stéphanie; Nicolas, Jean-Louis; Paillard, Christine

    2012-10-01

    The Manila clam, Ruditapes philippinarum, is an economically-important, commercial shellfish; harvests are diminished in some European waters by a pathogenic bacterium, Vibrio tapetis, that causes Brown Ring disease. To identify molecular characteristics associated with susceptibility or resistance to Brown Ring disease, Suppression Subtractive Hybridization (SSH) analyzes were performed to construct cDNA libraries enriched in up- or down-regulated transcripts from clam immune cells, hemocytes, after a 3-h in vitro challenge with cultured V. tapetis. Nine hundred and ninety eight sequences from the two libraries were sequenced, and an in silico analysis identified 235 unique genes. BLAST and "Gene ontology" classification analyzes revealed that 60.4% of the Expressed Sequence Tags (ESTs) have high similarities with genes involved in various physiological functions, such as immunity, apoptosis and cytoskeleton organization; whereas, 39.6% remain unidentified. From the 235 unique genes, we selected 22 candidates based upon physiological function and redundancy in the libraries. Then, Real-Time PCR analysis identified 3 genes related to cytoskeleton organization showing significant variation in expression attributable to V. tapetis exposure. Disruption in regulation of these genes is consistent with the etiologic agent of Brown Ring disease in Manila clams. Copyright © 2012 Elsevier Ltd. All rights reserved.

  11. In silico prediction of the G-protein coupled receptors expressed during the metamorphic molt of Sagmariasus verreauxi (Crustacea: Decapoda) by mining transcriptomic data: RNA-seq to repertoire.

    PubMed

    Buckley, Sean J; Fitzgibbon, Quinn P; Smith, Gregory G; Ventura, Tomer

    2016-03-01

    Against a backdrop of food insecurity, the farming of decapod crustaceans is a rapidly expanding and globally significant source of food protein. Sagmariasus verreauxi spiny lobster, the subject of this study, are decapods of underdeveloped aquaculture potential. Crustacean neuropeptide G-protein coupled receptors (GPCRs) mediate endocrine pathways that are integral to animal fecundity, growth and survival. The potential use of novel biotechnologies to enhance GPCR-mediated physiology may assist in improving the health and productivity of farmed decapod populations. This study catalogues the GPCRs expressed in the early developmental stages, as well as adult tissues, with a view to illuminating key neuropeptide receptors. De novo assembled contiguous sequences generated from transcriptomic reads of metamorphic and post metamorphic S. verreauxi were filtered for seven transmembrane domains, and used as a reference for iterative re-mapping. Subsequent putative GPCR open reading frames (ORFs) were BLAST annotated, categorised, and compared to published orthologues based on phylogenetic analysis. A total of 85 GPCRs were digitally predicted, that represented each of the four arthropod subfamilies. They generally displayed low-level and non-differential metamorphic expression with few exceptions that we examined using RT-PCR and qPCR. Two putative CHH-like neuropeptide receptors were annotated. Three dimensional structural modelling suggests that these receptors exhibit a conserved extracellular ligand binding pocket, providing support to the notion that these receptors co-evolved with their ligands across Decapoda. This perhaps narrows the search for means to increase productivity of farmed decapod populations. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Lipid transfer proteins and protease inhibitors as key factors in the priming of barley responses to Fusarium head blight disease by a biocontrol strain of Pseudomonas fluorescens.

    PubMed

    Petti, Carloalberto; Khan, Mojibur; Doohan, Fiona

    2010-11-01

    Strains of non-pathogenic pseudomonad bacteria, can elicit host defence responses against pathogenic microorganisms. Pseudomonas fluorescens strain MKB158 can protect cereals from pathogenesis by Fusarium fungi, including Fusarium head blight which is an economically important disease due to its association with both yield loss and mycotoxin contamination of grain. Using the 22 K barley Affymetrix chip, trancriptome studies were undertaken to determine the local effect of P. fluorescens strain MKB158 on the transcriptome of barley head tissue, and to discriminate transcripts primed by the bacterium to respond to challenge by Fusarium culmorum, a causal agent of the economically important Fusarium head blight disease of cereals. The bacterium significantly affected the accumulation of 1203 transcripts and primed 74 to positively, and 14 to negatively, respond to the pathogen (P = 0.05). This is the first study to give insights into bacterium priming in the Triticeae tribe of grasses and associated transcripts were classified into 13 functional classes, associated with diverse functions, including detoxification, cell wall biosynthesis and the amplification of host defence responses. In silico analysis of Arabidopsis homologs of bacterium-primed barley genes indicated that, as is the case in dicots, jasmonic acid plays a role in pseudomonad priming of host responses. Additionally, the transcriptome studies described herein also reveal new insights into bacterium-mediated priming of host defences against necrotrophs, including the positive effects on grain filling, lignin deposition, oxidative stress responses, and the inhibition of protease inhibitors and proteins that play a key role in programmed cell death.

  13. Transcriptomic analysis of the autophagy machinery in crustaceans.

    PubMed

    Suwansa-Ard, Saowaros; Kankuan, Wilairat; Thongbuakaew, Tipsuda; Saetan, Jirawat; Kornthong, Napamanee; Kruangkum, Thanapong; Khornchatri, Kanjana; Cummins, Scott F; Isidoro, Ciro; Sobhon, Prasert

    2016-08-09

    The giant freshwater prawn, Macrobrachium rosenbergii, is a decapod crustacean that is commercially important as a food source. Farming of commercial crustaceans requires an efficient management strategy because the animals are easily subjected to stress and diseases during the culture. Autophagy, a stress response process, is well-documented and conserved in most animals, yet it is poorly studied in crustaceans. In this study, we have performed an in silico search for transcripts encoding autophagy-related (Atg) proteins within various tissue transcriptomes of M. rosenbergii. Basic Local Alignment Search Tool (BLAST) search using previously known Atg proteins as queries revealed 41 transcripts encoding homologous M. rosenbergii Atg proteins. Among these Atg proteins, we selected commonly used autophagy markers, including Beclin 1, vacuolar protein sorting (Vps) 34, microtubule-associated proteins 1A/1B light chain 3B (MAP1LC3B), p62/sequestosome 1 (SQSTM1), and lysosomal-associated membrane protein 1 (Lamp-1) for further sequence analyses using comparative alignment and protein structural prediction. We found that crustacean autophagy marker proteins contain conserved motifs typical of other animal Atg proteins. Western blotting using commercial antibodies raised against human Atg marker proteins indicated their presence in various M. rosenbergii tissues, while immunohistochemistry localized Atg marker proteins within ovarian tissue, specifically late stage oocytes. This study demonstrates that the molecular components of autophagic process are conserved in crustaceans, which is comparable to autophagic process in mammals. Furthermore, it provides a foundation for further studies of autophagy in crustaceans that may lead to more understanding of the reproduction- and stress-related autophagy, which will enable the efficient aquaculture practices.

  14. Global gene expression under nitrogen starvation in Xylella fastidiosa: contribution of the σ54 regulon

    PubMed Central

    2010-01-01

    Background Xylella fastidiosa, a Gram-negative fastidious bacterium, grows in the xylem of several plants causing diseases such as citrus variegated chlorosis. As the xylem sap contains low concentrations of amino acids and other compounds, X. fastidiosa needs to cope with nitrogen limitation in its natural habitat. Results In this work, we performed a whole-genome microarray analysis of the X. fastidiosa nitrogen starvation response. A time course experiment (2, 8 and 12 hours) of cultures grown in defined medium under nitrogen starvation revealed many differentially expressed genes, such as those related to transport, nitrogen assimilation, amino acid biosynthesis, transcriptional regulation, and many genes encoding hypothetical proteins. In addition, a decrease in the expression levels of many genes involved in carbon metabolism and energy generation pathways was also observed. Comparison of gene expression profiles between the wild type strain and the rpoN null mutant allowed the identification of genes directly or indirectly induced by nitrogen starvation in a σ54-dependent manner. A more complete picture of the σ54 regulon was achieved by combining the transcriptome data with an in silico search for potential σ54-dependent promoters, using a position weight matrix approach. One of these σ54-predicted binding sites, located upstream of the glnA gene (encoding glutamine synthetase), was validated by primer extension assays, confirming that this gene has a σ54-dependent promoter. Conclusions Together, these results show that nitrogen starvation causes intense changes in the X. fastidiosa transcriptome and some of these differentially expressed genes belong to the σ54 regulon. PMID:20799976

  15. Mechanisms of action of sacubitril/valsartan on cardiac remodeling: a systems biology approach.

    PubMed

    Iborra-Egea, Oriol; Gálvez-Montón, Carolina; Roura, Santiago; Perea-Gil, Isaac; Prat-Vidal, Cristina; Soler-Botija, Carolina; Bayes-Genis, Antoni

    2017-01-01

    Sacubitril/Valsartan, proved superiority over other conventional heart failure management treatments, but its mechanisms of action remains obscure. In this study, we sought to explore the mechanistic details for Sacubitril/Valsartan in heart failure and post-myocardial infarction remodeling, using an in silico, systems biology approach. Myocardial transcriptome obtained in response to myocardial infarction in swine was analyzed to address post-infarction ventricular remodeling. Swine transcriptome hits were mapped to their human equivalents using Reciprocal Best (blast) Hits, Gene Name Correspondence, and InParanoid database. Heart failure remodeling was studied using public data available in gene expression omnibus (accession GSE57345, subseries GSE57338), processed using the GEO2R tool. Using the Therapeutic Performance Mapping System technology, dedicated mathematical models trained to fit a set of molecular criteria, defining both pathologies and including all the information available on Sacubitril/Valsartan, were generated. All relationships incorporated into the biological network were drawn from public resources (including KEGG, REACTOME, INTACT, BIOGRID, and MINT). An artificial neural network analysis revealed that Sacubitril/Valsartan acts synergistically against cardiomyocyte cell death and left ventricular extracellular matrix remodeling via eight principal synergistic nodes. When studying each pathway independently, Valsartan was found to improve cardiac remodeling by inhibiting members of the guanine nucleotide-binding protein family, while Sacubitril attenuated cardiomyocyte cell death, hypertrophy, and impaired myocyte contractility by inhibiting PTEN. The complex molecular mechanisms of action of Sacubitril/Valsartan upon post-myocardial infarction and heart failure cardiac remodeling were delineated using a systems biology approach. Further, this dataset provides pathophysiological rationale for the use of Sacubitril/Valsartan to prevent post-infarct remodeling.

  16. Systems metabolic engineering of Escherichia coli for L-threonine production.

    PubMed

    Lee, Kwang Ho; Park, Jin Hwan; Kim, Tae Yong; Kim, Hyun Uk; Lee, Sang Yup

    2007-01-01

    Amino-acid producers have traditionally been developed by repeated random mutagenesis owing to the difficulty in rationally engineering the complex and highly regulated metabolic network. Here, we report the development of the genetically defined L-threonine overproducing Escherichia coli strain by systems metabolic engineering. Feedback inhibitions of aspartokinase I and III (encoded by thrA and lysC, respectively) and transcriptional attenuation regulations (located in thrL) were removed. Pathways for Thr degradation were removed by deleting tdh and mutating ilvA. The metA and lysA genes were deleted to make more precursors available for Thr biosynthesis. Further target genes to be engineered were identified by transcriptome profiling combined with in silico flux response analysis, and their expression levels were manipulated accordingly. The final engineered E. coli strain was able to produce Thr with a high yield of 0.393 g per gram of glucose, and 82.4 g/l Thr by fed-batch culture. The systems metabolic engineering strategy reported here may be broadly employed for developing genetically defined organisms for the efficient production of various bioproducts.

  17. Reconstruction and Analysis of Human Kidney-Specific Metabolic Network Based on Omics Data

    PubMed Central

    Zhang, Ai-Di; Dai, Shao-Xing; Huang, Jing-Fei

    2013-01-01

    With the advent of the high-throughput data production, recent studies of tissue-specific metabolic networks have largely advanced our understanding of the metabolic basis of various physiological and pathological processes. However, for kidney, which plays an essential role in the body, the available kidney-specific model remains incomplete. This paper reports the reconstruction and characterization of the human kidney metabolic network based on transcriptome and proteome data. In silico simulations revealed that house-keeping genes were more essential than kidney-specific genes in maintaining kidney metabolism. Importantly, a total of 267 potential metabolic biomarkers for kidney-related diseases were successfully explored using this model. Furthermore, we found that the discrepancies in metabolic processes of different tissues are directly corresponding to tissue's functions. Finally, the phenotypes of the differentially expressed genes in diabetic kidney disease were characterized, suggesting that these genes may affect disease development through altering kidney metabolism. Thus, the human kidney-specific model constructed in this study may provide valuable information for the metabolism of kidney and offer excellent insights into complex kidney diseases. PMID:24222897

  18. Phosphoglycerolipids are master players in plant hormone signal transduction.

    PubMed

    Janda, Martin; Planchais, Severine; Djafi, Nabila; Martinec, Jan; Burketova, Lenka; Valentova, Olga; Zachowski, Alain; Ruelland, Eric

    2013-06-01

    Phosphoglycerolipids are essential structural constituents of membranes and some also have important cell signalling roles. In this review, we focus on phosphoglycerolipids that are mediators in hormone signal transduction in plants. We first describe the structures of the main signalling phosphoglycerolipids and the metabolic pathways that generate them, namely the phospholipase and lipid kinase pathways. In silico analysis of Arabidopsis transcriptome data provides evidence that the genes encoding the enzymes of these pathways are transcriptionally regulated in responses to hormones, suggesting some link with hormone signal transduction. The involvement of phosphoglycerolipid signalling in the early responses to abscisic acid, salicylic acid and auxins is then detailed. One of the most important signalling lipids in plants is phosphatidic acid. It can activate or inactivate protein kinases and/or protein phosphatases involved in hormone signalling. It can also activate NADPH oxidase leading to the production of reactive oxygen species. We will interrogate the mechanisms that allow the activation/deactivation of the lipid pathways, in particular the roles of G proteins and calcium. Mediating lipids thus appear as master players of cell signalling, modulating, if not controlling, major transducing steps of hormone signals.

  19. Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals

    PubMed Central

    Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek

    2009-01-01

    Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre–mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102

  20. A combination of LongSAGE with Solexa sequencing is well suited to explore the depth and the complexity of transcriptome

    PubMed Central

    Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier

    2008-01-01

    Background "Open" transcriptome analysis methods allow to study gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the mostly used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 pb tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several millions of tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of mouse hypothalamus transcriptome avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 millions of tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152

  1. De novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes.

    PubMed

    Ashrafi, Hamid; Hill, Theresa; Stoffel, Kevin; Kozik, Alexander; Yao, Jiqiang; Chin-Wo, Sebastian Reyes; Van Deynze, Allen

    2012-10-30

    Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80-120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.

  2. The aquatic animals' transcriptome resource for comparative functional analysis.

    PubMed

    Chou, Chih-Hung; Huang, Hsi-Yuan; Huang, Wei-Chih; Hsu, Sheng-Da; Hsiao, Chung-Der; Liu, Chia-Yu; Chen, Yu-Hung; Liu, Yu-Chen; Huang, Wei-Yun; Lee, Meng-Lin; Chen, Yi-Chang; Huang, Hsien-Da

    2018-05-09

    Aquatic animals have great economic and ecological importance. Among them, non-model organisms have been studied regarding eco-toxicity, stress biology, and environmental adaptation. Due to recent advances in next-generation sequencing techniques, large amounts of RNA-seq data for aquatic animals are publicly available. However, currently there is no comprehensive resource exist for the analysis, unification, and integration of these datasets. This study utilizes computational approaches to build a new resource of transcriptomic maps for aquatic animals. This aquatic animal transcriptome map database dbATM provides de novo assembly of transcriptome, gene annotation and comparative analysis of more than twenty aquatic organisms without draft genome. To improve the assembly quality, three computational tools (Trinity, Oases and SOAPdenovo-Trans) were employed to enhance individual transcriptome assembly, and CAP3 and CD-HIT-EST software were then used to merge these three assembled transcriptomes. In addition, functional annotation analysis provides valuable clues to gene characteristics, including full-length transcript coding regions, conserved domains, gene ontology and KEGG pathways. Furthermore, all aquatic animal genes are essential for comparative genomics tasks such as constructing homologous gene groups and blast databases and phylogenetic analysis. In conclusion, we establish a resource for non model organism aquatic animals, which is great economic and ecological importance and provide transcriptomic information including functional annotation and comparative transcriptome analysis. The database is now publically accessible through the URL http://dbATM.mbc.nctu.edu.tw/ .

  3. Characterizing the Grape Transcriptome. Analysis of Expressed Sequence Tags from Multiple Vitis Species and Development of a Compendium of Gene Expression during Berry Development1[w

    PubMed Central

    Silva, Francisco Goes da; Iandolino, Alberto; Al-Kayal, Fadi; Bohlmann, Marlene C.; Cushman, Mary Ann; Lim, Hyunju; Ergul, Ali; Figueroa, Rubi; Kabuloglu, Elif K.; Osborne, Craig; Rowe, Joan; Tattersall, Elizabeth; Leslie, Anna; Xu, Jane; Baek, JongMin; Cramer, Grant R.; Cushman, John C.; Cook, Douglas R.

    2005-01-01

    We report the analysis and annotation of 146,075 expressed sequence tags from Vitis species. The majority of these sequences were derived from different cultivars of Vitis vinifera, comprising an estimated 25,746 unique contig and singleton sequences that survey transcription in various tissues and developmental stages and during biotic and abiotic stress. Putatively homologous proteins were identified for over 17,752 of the transcripts, with 1,962 transcripts further subdivided into one or more Gene Ontology categories. A simple structured vocabulary, with modules for plant genotype, plant development, and stress, was developed to describe the relationship between individual expressed sequence tags and cDNA libraries; the resulting vocabulary provides query terms to facilitate data mining within the context of a relational database. As a measure of the extent to which characterized metabolic pathways were encompassed by the data set, we searched for homologs of the enzymes leading from glycolysis, through the oxidative/nonoxidative pentose phosphate pathway, and into the general phenylpropanoid pathway. Homologs were identified for 65 of these 77 enzymes, with 86% of enzymatic steps represented by paralogous genes. Differentially expressed transcripts were identified by means of a stringent believability index cutoff of ≥98.4%. Correlation analysis and two-dimensional hierarchical clustering grouped these transcripts according to similarity of expression. In the broadest analysis, 665 differentially expressed transcripts were identified across 29 cDNA libraries, representing a range of developmental and stress conditions. The groupings revealed expected associations between plant developmental stages and tissue types, with the notable exception of abiotic stress treatments. A more focused analysis of flower and berry development identified 87 differentially expressed transcripts and provides the basis for a compendium that relates gene expression and annotation to previously characterized aspects of berry development and physiology. Comparison with published results for select genes, as well as correlation analysis between independent data sets, suggests that the inferred in silico patterns of expression are likely to be an accurate representation of transcript abundance for the conditions surveyed. Thus, the combined data set reveals the in silico expression patterns for hundreds of genes in V. vinifera, the majority of which have not been previously studied within this species. PMID:16219919

  4. Peptidergic signaling in the crab Cancer borealis: Tapping the power of transcriptomics for neuropeptidome expansion.

    PubMed

    Christie, Andrew E; Pascual, Micah G

    2016-10-01

    The crab Cancer borealis has long been used as a model for understanding neural control of rhythmic behavior. One significant discovery made through its use is that even numerically simple neural circuits are capable of producing an essentially infinite array of distinct motor outputs via the actions of locally released and circulating neuromodulators, the largest class being peptides. While much work has focused on elucidating the peptidome of C. borealis, no investigation has used in silico transcriptome mining for peptide discovery in this species, a strategy proven highly effective for identifying neuropeptides in other crustaceans. Here, we mined a C. borealis neural transcriptome for putative peptide-encoding transcripts, and predicted 200 distinct mature neuropeptides from the proteins deduced from these sequences. The identified peptides include isoforms of allatostatin A, allatostatin B, allatostatin C, CCHamide, crustacean cardioactive peptide, crustacean hyperglycemic hormone, diuretic hormone 31 (DH31), diuretic hormone 44 (DH44), FMRFamide-like peptide, GSEFLamide, HIGSLYRamide, insulin-like peptide (ILP), intocin, leucokinin, neuroparsin, pigment dispersing hormone, pyrokinin, red pigment concentrating hormone, short neuropeptide F and SIFamide. While some of the predicted peptides were known previously from C. borealis, most (159) are new discoveries for the species, e.g., the isoforms of CCHamide, DH31, DH44, GSEFLamide, ILP, intocin and neuroparsin, which are the first members of these peptide families identified from C. borealis. Collectively, the peptides predicted here approximately double the peptidome known for C. borealis, and in so doing provide an expanded platform from which to launch new investigations of peptidergic neuromodulation in this species. Copyright © 2016 Elsevier Inc. All rights reserved.

  5. Proteome analysis of digestive fluids in Nepenthes pitchers

    PubMed Central

    Rottloff, Sandy; Miguel, Sissi; Biteau, Flore; Nisse, Estelle; Hammann, Philippe; Kuhn, Lauriane; Chicher, Johana; Bazile, Vincent; Gaume, Laurence; Mignard, Benoit; Hehn, Alain; Bourgaud, Frédéric

    2016-01-01

    Background and Aims Carnivorous plants have developed strategies to enable growth in nutrient-poor soils. For the genus Nepenthes, this strategy represents producing pitcher-modified leaves that can trap and digest various prey. These pitchers produce a digestive fluid composed of proteins, including hydrolytic enzymes. The focus of this study was on the identification of these proteins. Methods In order to better characterize and have an overview of these proteins, digestive fluid was sampled from pitchers at different stages of maturity from five species of Nepenthes (N. mirabilis, N. alata, N. sanguinea, N. bicalcarata and N. albomarginata) that vary in their ecological niches and grew under different conditions. Three complementary approaches based on transcriptomic resources, mass spectrometry and in silico analysis were used. Key Results This study permitted the identification of 29 proteins excreted in the pitchers. Twenty of these proteins were never reported in Nepenthes previously and included serine carboxypeptidases, α- and β-galactosidases, lipid transfer proteins and esterases/lipases. These 20 proteins display sequence signals allowing their secretion into the pitcher fluid. Conclusions Nepenthes pitcher plants have evolved an arsenal of enzymes to digest prey caught in their traps. The panel of new proteins identified in this study provides new insights into the digestive process of these carnivorous plants. PMID:26912512

  6. Smooth Muscle Cell Genome Browser: Enabling the Identification of Novel Serum Response Factor Target Genes

    PubMed Central

    Lee, Moon Young; Park, Chanjae; Berent, Robyn M.; Park, Paul J.; Fuchs, Robert; Syn, Hannah; Chin, Albert; Townsend, Jared; Benson, Craig C.; Redelman, Doug; Shen, Tsai-wei; Park, Jong Kun; Miano, Joseph M.; Sanders, Kenton M.; Ro, Seungil

    2015-01-01

    Genome-scale expression data on the absolute numbers of gene isoforms offers essential clues in cellular functions and biological processes. Smooth muscle cells (SMCs) perform a unique contractile function through expression of specific genes controlled by serum response factor (SRF), a transcription factor that binds to DNA sites known as the CArG boxes. To identify SRF-regulated genes specifically expressed in SMCs, we isolated SMC populations from mouse small intestine and colon, obtained their transcriptomes, and constructed an interactive SMC genome and CArGome browser. To our knowledge, this is the first online resource that provides a comprehensive library of all genetic transcripts expressed in primary SMCs. The browser also serves as the first genome-wide map of SRF binding sites. The browser analysis revealed novel SMC-specific transcriptional variants and SRF target genes, which provided new and unique insights into the cellular and biological functions of the cells in gastrointestinal (GI) physiology. The SRF target genes in SMCs, which were discovered in silico, were confirmed by proteomic analysis of SMC-specific Srf knockout mice. Our genome browser offers a new perspective into the alternative expression of genes in the context of SRF binding sites in SMCs and provides a valuable reference for future functional studies. PMID:26241044

  7. Comprehensive analysis of the transcriptional profile of the Mediator complex across human cancer types.

    PubMed

    Syring, Isabella; Klümper, Niklas; Offermann, Anne; Braun, Martin; Deng, Mario; Boehm, Diana; Queisser, Angela; von Mässenhausen, Anne; Brägelmann, Johannes; Vogel, Wenzel; Schmidt, Doris; Majores, Michael; Schindler, Anne; Kristiansen, Glen; Müller, Stefan C; Ellinger, Jörg; Shaikhibrahim, Zaki; Perner, Sven

    2016-04-26

    The Mediator complex is a key regulator of gene transcription and several studies demonstrated altered expressions of particular subunits in diverse human diseases, especially cancer. However a systematic study deciphering the transcriptional expression of the Mediator across different cancer entities is still lacking.We therefore performed a comprehensive in silico cancer vs. benign analysis of the Mediator complex subunits (MEDs) for 20 tumor entities using Oncomine datasets. The transcriptional expression profiles across almost all cancer entities showed differentially expressed MEDs as compared to benign tissue. Differential expression of MED8 in renal cell carcinoma (RCC) and MED12 in lung cancer (LCa) were validated and further investigated by immunohistochemical staining on tissue microarrays containing large numbers of specimen. MED8 in clear cell RCC (ccRCC) associated with shorter survival and advanced TNM stage and showed higher expression in metastatic than primary tumors. In vitro, siRNA mediated MED8 knockdown significantly impaired proliferation and motility in ccRCC cell lines, hinting at a role for MED8 to serve as a novel therapeutic target in ccRCC. Taken together, our Mediator complex transcriptome proved to be a valid tool for identifying cancer-related shifts in Mediator complex composition, revealing that MEDs do exhibit cancer specific transcriptional expression profiles.

  8. Developmental and Environmental Regulation of Aquaporin Gene Expression across Populus Species: Divergence or Redundancy?

    PubMed Central

    Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène

    2013-01-01

    Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected. PMID:23393587

  9. Developmental and environmental regulation of Aquaporin gene expression across Populus species: divergence or redundancy?

    PubMed

    Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène

    2013-01-01

    Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected.

  10. Asian Citrus Psyllid Expression Profiles Suggest Candidatus Liberibacter Asiaticus-Mediated Alteration of Adult Nutrition and Metabolism, and of Nymphal Development and Immunity

    PubMed Central

    He, Ruifeng; Nelson, William; Yin, Guohua; Cicero, Joseph M.; Willer, Mark; Kim, Ryan; Kramer, Robin; May, Greg A.; Crow, John A.; Soderlund, Carol A.; Gang, David R.; Brown, Judith K.

    2015-01-01

    The Asian citrus psyllid (ACP) Diaphorina citri Kuwayama (Hemiptera: Psyllidae) is the insect vector of the fastidious bacterium Candidatus Liberibacter asiaticus (CLas), the causal agent of citrus greening disease, or Huanglongbing (HLB). The widespread invasiveness of the psyllid vector and HLB in citrus trees worldwide has underscored the need for non-traditional approaches to manage the disease. One tenable solution is through the deployment of RNA interference technology to silence protein-protein interactions essential for ACP-mediated CLas invasion and transmission. To identify psyllid interactor-bacterial effector combinations associated with psyllid-CLas interactions, cDNA libraries were constructed from CLas-infected and CLas-free ACP adults and nymphs, and analyzed for differential expression. Library assemblies comprised 24,039,255 reads and yielded 45,976 consensus contigs. They were annotated (UniProt), classified using Gene Ontology, and subjected to in silico expression analyses using the Transcriptome Computational Workbench (TCW) (http://www.sohomoptera.org/ACPPoP/). Functional-biological pathway interpretations were carried out using the Kyoto Encyclopedia of Genes and Genomes databases. Differentially expressed contigs in adults and/or nymphs represented genes and/or metabolic/pathogenesis pathways involved in adhesion, biofilm formation, development-related, immunity, nutrition, stress, and virulence. Notably, contigs involved in gene silencing and transposon-related responses were documented in a psyllid for the first time. This is the first comparative transcriptomic analysis of ACP adults and nymphs infected and uninfected with CLas. The results provide key initial insights into host-parasite interactions involving CLas effectors that contribute to invasion-virulence, and to host nutritional exploitation and immune-related responses that appear to be essential for successful ACP-mediated circulative, propagative CLas transmission. PMID:26091106

  11. In silico pharmacology for drug discovery: applications to targets and beyond

    PubMed Central

    Ekins, S; Mestres, J; Testa, B

    2007-01-01

    Computational (in silico) methods have been developed and widely applied to pharmacology hypothesis development and testing. These in silico methods include databases, quantitative structure-activity relationships, similarity searching, pharmacophores, homology models and other molecular modeling, machine learning, data mining, network analysis tools and data analysis tools that use a computer. Such methods have seen frequent use in the discovery and optimization of novel molecules with affinity to a target, the clarification of absorption, distribution, metabolism, excretion and toxicity properties as well as physicochemical characterization. The first part of this review discussed the methods that have been used for virtual ligand and target-based screening and profiling to predict biological activity. The aim of this second part of the review is to illustrate some of the varied applications of in silico methods for pharmacology in terms of the targets addressed. We will also discuss some of the advantages and disadvantages of in silico methods with respect to in vitro and in vivo methods for pharmacology research. Our conclusion is that the in silico pharmacology paradigm is ongoing and presents a rich array of opportunities that will assist in expediating the discovery of new targets, and ultimately lead to compounds with predicted biological activity for these novel targets. PMID:17549046

  12. TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis

    PubMed Central

    Ji, Zhicheng; Ji, Hongkai

    2016-01-01

    When analyzing single-cell RNA-seq data, constructing a pseudo-temporal path to order cells based on the gradual transition of their transcriptomes is a useful way to study gene expression dynamics in a heterogeneous cell population. Currently, a limited number of computational tools are available for this task, and quantitative methods for comparing different tools are lacking. Tools for Single Cell Analysis (TSCAN) is a software tool developed to better support in silico pseudo-Time reconstruction in Single-Cell RNA-seq ANalysis. TSCAN uses a cluster-based minimum spanning tree (MST) approach to order cells. Cells are first grouped into clusters and an MST is then constructed to connect cluster centers. Pseudo-time is obtained by projecting each cell onto the tree, and the ordered sequence of cells can be used to study dynamic changes of gene expression along the pseudo-time. Clustering cells before MST construction reduces the complexity of the tree space. This often leads to improved cell ordering. It also allows users to conveniently adjust the ordering based on prior knowledge. TSCAN has a graphical user interface (GUI) to support data visualization and user interaction. Furthermore, quantitative measures are developed to objectively evaluate and compare different pseudo-time reconstruction methods. TSCAN is available at https://github.com/zji90/TSCAN and as a Bioconductor package. PMID:27179027

  13. TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis.

    PubMed

    Ji, Zhicheng; Ji, Hongkai

    2016-07-27

    When analyzing single-cell RNA-seq data, constructing a pseudo-temporal path to order cells based on the gradual transition of their transcriptomes is a useful way to study gene expression dynamics in a heterogeneous cell population. Currently, a limited number of computational tools are available for this task, and quantitative methods for comparing different tools are lacking. Tools for Single Cell Analysis (TSCAN) is a software tool developed to better support in silico pseudo-Time reconstruction in Single-Cell RNA-seq ANalysis. TSCAN uses a cluster-based minimum spanning tree (MST) approach to order cells. Cells are first grouped into clusters and an MST is then constructed to connect cluster centers. Pseudo-time is obtained by projecting each cell onto the tree, and the ordered sequence of cells can be used to study dynamic changes of gene expression along the pseudo-time. Clustering cells before MST construction reduces the complexity of the tree space. This often leads to improved cell ordering. It also allows users to conveniently adjust the ordering based on prior knowledge. TSCAN has a graphical user interface (GUI) to support data visualization and user interaction. Furthermore, quantitative measures are developed to objectively evaluate and compare different pseudo-time reconstruction methods. TSCAN is available at https://github.com/zji90/TSCAN and as a Bioconductor package. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Dynamic changes in global microRNAome and transcriptome reveal complex miRNA-mRNA regulated host response to Japanese Encephalitis Virus in microglial cells

    PubMed Central

    Kumari, Bharti; Jain, Pratistha; Das, Shaoli; Ghosal, Suman; Hazra, Bibhabasu; Trivedi, Ashish Chandra; Basu, Anirban; Chakrabarti, Jayprokas; Vrati, Sudhanshu; Banerjee, Arup

    2016-01-01

    Microglia cells in the brain play essential role during Japanese Encephalitis Virus (JEV) infection and may lead to change in microRNA (miRNA) and mRNA profile. These changes may together control disease outcome. Using Affymetrix microarray platform, we profiled cellular miRNA and mRNA expression at multiple time points during viral infection in human microglial (CHME3) cells. In silico analysis of microarray data revealed a phased pattern of miRNAs expression, associated with JEV replication and provided unique signatures of infection. Target prediction and pathway enrichment analysis identified anti correlation between differentially expressed miRNA and the gene expression at multiple time point which ultimately affected diverse signaling pathways including Notch signaling pathways in microglia. Activation of Notch pathway during JEV infection was demonstrated in vitro and in vivo. The expression of a subset of miRNAs that target multiple genes in Notch signaling pathways were suppressed and their overexpression could affect JEV induced immune response. Further analysis provided evidence for the possible presence of cellular competing endogenous RNA (ceRNA) associated with innate immune response. Collectively, our data provide a uniquely comprehensive view of the changes in the host miRNAs induced by JEV during cellular infection and identify Notch pathway in modulating microglia mediated inflammation. PMID:26838068

  15. Dynamic changes in global microRNAome and transcriptome reveal complex miRNA-mRNA regulated host response to Japanese Encephalitis Virus in microglial cells.

    PubMed

    Kumari, Bharti; Jain, Pratistha; Das, Shaoli; Ghosal, Suman; Hazra, Bibhabasu; Trivedi, Ashish Chandra; Basu, Anirban; Chakrabarti, Jayprokas; Vrati, Sudhanshu; Banerjee, Arup

    2016-02-03

    Microglia cells in the brain play essential role during Japanese Encephalitis Virus (JEV) infection and may lead to change in microRNA (miRNA) and mRNA profile. These changes may together control disease outcome. Using Affymetrix microarray platform, we profiled cellular miRNA and mRNA expression at multiple time points during viral infection in human microglial (CHME3) cells. In silico analysis of microarray data revealed a phased pattern of miRNAs expression, associated with JEV replication and provided unique signatures of infection. Target prediction and pathway enrichment analysis identified anti correlation between differentially expressed miRNA and the gene expression at multiple time point which ultimately affected diverse signaling pathways including Notch signaling pathways in microglia. Activation of Notch pathway during JEV infection was demonstrated in vitro and in vivo. The expression of a subset of miRNAs that target multiple genes in Notch signaling pathways were suppressed and their overexpression could affect JEV induced immune response. Further analysis provided evidence for the possible presence of cellular competing endogenous RNA (ceRNA) associated with innate immune response. Collectively, our data provide a uniquely comprehensive view of the changes in the host miRNAs induced by JEV during cellular infection and identify Notch pathway in modulating microglia mediated inflammation.

  16. In Silico PCR Tools for a Fast Primer, Probe, and Advanced Searching.

    PubMed

    Kalendar, Ruslan; Muterko, Alexandr; Shamekova, Malika; Zhambakin, Kabyl

    2017-01-01

    The polymerase chain reaction (PCR) is fundamental to molecular biology and is the most important practical molecular technique for the research laboratory. The principle of this technique has been further used and applied in plenty of other simple or complex nucleic acid amplification technologies (NAAT). In parallel to laboratory "wet bench" experiments for nucleic acid amplification technologies, in silico or virtual (bioinformatics) approaches have been developed, among which in silico PCR analysis. In silico NAAT analysis is a useful and efficient complementary method to ensure the specificity of primers or probes for an extensive range of PCR applications from homology gene discovery, molecular diagnosis, DNA fingerprinting, and repeat searching. Predicting sensitivity and specificity of primers and probes requires a search to determine whether they match a database with an optimal number of mismatches, similarity, and stability. In the development of in silico bioinformatics tools for nucleic acid amplification technologies, the prospects for the development of new NAAT or similar approaches should be taken into account, including forward-looking and comprehensive analysis that is not limited to only one PCR technique variant. The software FastPCR and the online Java web tool are integrated tools for in silico PCR of linear and circular DNA, multiple primer or probe searches in large or small databases and for advanced search. These tools are suitable for processing of batch files that are essential for automation when working with large amounts of data. The FastPCR software is available for download at http://primerdigital.com/fastpcr.html and the online Java version at http://primerdigital.com/tools/pcr.html .

  17. The low-abundance transcriptome reveals novel biomarkers, specific intracellular pathways and targetable genes associated with advanced gastric cancer.

    PubMed

    Bizama, Carolina; Benavente, Felipe; Salvatierra, Edgardo; Gutiérrez-Moraga, Ana; Espinoza, Jaime A; Fernández, Elmer A; Roa, Iván; Mazzolini, Guillermo; Sagredo, Eduardo A; Gidekel, Manuel; Podhajcer, Osvaldo L

    2014-02-15

    Studies on the low-abundance transcriptome are of paramount importance for identifying the intimate mechanisms of tumor progression that can lead to novel therapies. The aim of the present study was to identify novel markers and targetable genes and pathways in advanced human gastric cancer through analyses of the low-abundance transcriptome. The procedure involved an initial subtractive hybridization step, followed by global gene expression analysis using microarrays. We observed profound differences, both at the single gene and gene ontology levels, between the low-abundance transcriptome and the whole transcriptome. Analysis of the low-abundance transcriptome led to the identification and validation by tissue microarrays of novel biomarkers, such as LAMA3 and TTN; moreover, we identified cancer type-specific intracellular pathways and targetable genes, such as IRS2, IL17, IFNγ, VEGF-C, WISP1, FZD5 and CTBP1 that were not detectable by whole transcriptome analyses. We also demonstrated that knocking down the expression of CTBP1 sensitized gastric cancer cells to mainstay chemotherapeutic drugs. We conclude that the analysis of the low-abundance transcriptome provides useful insights into the molecular basis and treatment of cancer. © 2013 UICC.

  18. Assessment of the predictive accuracy of five in silico prediction tools, alone or in combination, and two metaservers to classify long QT syndrome gene mutations.

    PubMed

    Leong, Ivone U S; Stuckey, Alexander; Lai, Daniel; Skinner, Jonathan R; Love, Donald R

    2015-05-13

    Long QT syndrome (LQTS) is an autosomal dominant condition predisposing to sudden death from malignant arrhythmia. Genetic testing identifies many missense single nucleotide variants of uncertain pathogenicity. Establishing genetic pathogenicity is an essential prerequisite to family cascade screening. Many laboratories use in silico prediction tools, either alone or in combination, or metaservers, in order to predict pathogenicity; however, their accuracy in the context of LQTS is unknown. We evaluated the accuracy of five in silico programs and two metaservers in the analysis of LQTS 1-3 gene variants. The in silico tools SIFT, PolyPhen-2, PROVEAN, SNPs&GO and SNAP, either alone or in all possible combinations, and the metaservers Meta-SNP and PredictSNP, were tested on 312 KCNQ1, KCNH2 and SCN5A gene variants that have previously been characterised by either in vitro or co-segregation studies as either "pathogenic" (283) or "benign" (29). The accuracy, sensitivity, specificity and Matthews Correlation Coefficient (MCC) were calculated to determine the best combination of in silico tools for each LQTS gene, and when all genes are combined. The best combination of in silico tools for KCNQ1 is PROVEAN, SNPs&GO and SIFT (accuracy 92.7%, sensitivity 93.1%, specificity 100% and MCC 0.70). The best combination of in silico tools for KCNH2 is SIFT and PROVEAN or PROVEAN, SNPs&GO and SIFT. Both combinations have the same scores for accuracy (91.1%), sensitivity (91.5%), specificity (87.5%) and MCC (0.62). In the case of SCN5A, SNAP and PROVEAN provided the best combination (accuracy 81.4%, sensitivity 86.9%, specificity 50.0%, and MCC 0.32). When all three LQT genes are combined, SIFT, PROVEAN and SNAP is the combination with the best performance (accuracy 82.7%, sensitivity 83.0%, specificity 80.0%, and MCC 0.44). Both metaservers performed better than the single in silico tools; however, they did not perform better than the best performing combination of in silico tools. The combination of in silico tools with the best performance is gene-dependent. The in silico tools reported here may have some value in assessing variants in the KCNQ1 and KCNH2 genes, but caution should be taken when the analysis is applied to SCN5A gene variants.

  19. High-throughput SNP discovery and transcriptome expression profiles from the salmon louse Caligus rogercresseyi (Copepoda: Caligidae).

    PubMed

    Nuñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Gallardo-Escárate, Cristian

    2014-06-01

    The salmon louse Caligus rogercresseyi is the dominant ectoparasite species affecting the salmon aquaculture industry in the Southern hemisphere, and it is currently the main cause for economic losses in Chilean aquaculture. However, despite the great concern over Caligus infestations, genomic information on this louse is still scarce, even while the need to develop high-resolution molecular markers is growing. This study provides the first deep transcriptome survey to identify thousands of SNP markers from C. rogercresseyi, with a total of 69,466 SNPs identified using the MiSeq platform (Illumina®), 30,605 (52%) of which were found in contigs successfully annotated against known protein databases. Furthermore, in silico gene expression profiles associated with SNP variants were evaluated, and the results evidenced a wide array of genes that were down- and upregulated throughout the developmental stages of C. rogercresseyi. Interestingly, putative KEGG pathways involved in resistance to antiparasitic agents were also identified, where ten pathways were associated with the nervous system and one was related to ABC transporters. Taken together, this information could be highly useful for investigating the molecular underpinnings involved in the susceptibility or resistance of salmon lice to chemical treatments. Copyright © 2014 Elsevier Inc. All rights reserved.

  20. Two novel male-associated peroxinectin genes are downregulated by exposure to delousing drugs in Caligus rogercresseyi.

    PubMed

    Núñez-Acuña, Gustavo; Gallardo-Escárate, Cristian

    2015-02-15

    Peroxinectin (PX) is a protein involved in cell adhesion, peroxidase activities, and the encapsulation of invaders in diverse species, including parasitic copepods. Recently, a transcript denoted peroxinectin-like was identified in the salmon louse Lepeophtheirus salmonis, and this was significantly correlated with the immune response of host fish. Thus, the PX gene is a potential candidate to evaluate host-parasite interactions, as well as to analyze responses to delousing drugs used in the salmon aquaculture industry worldwide. The objective of this study was to identify Peroxinectin transcripts in the Chilean salmon louse Caligus rogercresseyi, and to determine expression levels after exposition to the antiparasitics deltamethrin and azamethiphos. Two novel transcript homologs to peroxinectins were identified from a transcriptomic library of C. rogercresseyi. Moreover, in silico gene transcription levels were evaluated through RNA-seq analyses based on unique gene read levels in transcriptomic libraries that were constructed from sea lice exposed to delousing drugs. The identified transcripts were named Peroxinectin-Cr1 and Peroxinectin-Cr2, which, respectively, had lengths of 2543 and 2555 base pairs. Both PX transcripts were highly associated with male adults, and transcription levels were significantly reduced by deltamethrin and azamethiphos. This result suggests a modulation of peroxinectin in response to delousing drugs. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Transcriptome assembly and digital gene expression atlas of the rainbow trout

    USDA-ARS?s Scientific Manuscript database

    Background: Transcriptome analysis is a preferred method for gene discovery, marker development and gene expression profiling in non-model organisms. Previously, we sequenced a transcriptome reference using Sanger-based and 454-pyrosequencing, however, a transcriptome assembly is still incomplete an...

  2. Preliminary profiling of blood transcriptome in a rat model of hemorrhagic shock.

    PubMed

    Braga, D; Barcella, M; D'Avila, F; Lupoli, S; Tagliaferri, F; Santamaria, M H; DeLano, F A; Baselli, G; Schmid-Schönbein, G W; Kistler, E B; Aletti, F; Barlassina, C

    2017-08-01

    Hemorrhagic shock is a leading cause of morbidity and mortality worldwide. Significant blood loss may lead to decreased blood pressure and inadequate tissue perfusion with resultant organ failure and death, even after replacement of lost blood volume. One reason for this high acuity is that the fundamental mechanisms of shock are poorly understood. Proteomic and metabolomic approaches have been used to investigate the molecular events occurring in hemorrhagic shock but, to our knowledge, a systematic analysis of the transcriptomic profile is missing. Therefore, a pilot analysis using paired-end RNA sequencing was used to identify changes that occur in the blood transcriptome of rats subjected to hemorrhagic shock after blood reinfusion. Hemorrhagic shock was induced using a Wigger's shock model. The transcriptome of whole blood from shocked animals shows modulation of genes related to inflammation and immune response (Tlr13, Il1b, Ccl6, Lgals3), antioxidant functions (Mt2A, Mt1), tissue injury and repair pathways (Gpnmb, Trim72) and lipid mediators (Alox5ap, Ltb4r, Ptger2) compared with control animals. These findings are congruent with results obtained in hemorrhagic shock analysis by other authors using metabolomics and proteomics. The analysis of blood transcriptome may be a valuable tool to understand the biological changes occurring in hemorrhagic shock and a promising approach for the identification of novel biomarkers and therapeutic targets. Impact statement This study provides the first pilot analysis of the changes occurring in transcriptome expression of whole blood in hemorrhagic shock (HS) rats. We showed that the analysis of blood transcriptome is a useful approach to investigate pathways and functional alterations in this disease condition. This pilot study encourages the possible application of transcriptome analysis in the clinical setting, for the molecular profiling of whole blood in HS patients.

  3. RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome.

    PubMed

    Wenger, Yvan; Galliot, Brigitte

    2013-03-25

    Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.

  4. RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome

    PubMed Central

    2013-01-01

    Background Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. Results To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48’909 unique sequences including splice variants, representing approximately 24’450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10’597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11’270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. Conclusions We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events. PMID:23530871

  5. Transcriptome analysis of the Spodoptera frugiperda ascovirus in vivo provides insights into how its apoptosis inhibitors and caspase promote increased synthesis of viral vesicles and virion progeny.

    PubMed

    Zaghloul, Heba; Hice, Robert; Arensburger, Peter; Federici, Brian A

    2017-09-27

    Ascoviruses are ds DNA viruses that attack caterpillars and differ from all other viruses by inducing nuclear lysis followed by cleavage of host cells into numerous anucleate vesicles in which virus replication continues as these grow in the blood. Ascoviruses are also unusual in that most encode apoptosis inhibitors and caspase or caspase-like proteins. A robust cell line to study the novel molecular biology of ascovirus replication in vitro is lacking. Therefore, we used strand-specific RNA-Seq to study transcription in vivo in third instars of Spodoptera frugiperda infected with the Spodoptera frugiperda ascovirus, a member of the type species, Spodoptera frugiperda ascovirus (SfAV-1a), sampling transcripts at different time points after infection. We targeted transcription of two types of SfAV-1a genes; first, 44 core genes that occur in several ascovirus species, and second, 26 genes predicted in silico to have metabolic functions likely involved in synthesizing viral vesicle membranes. Gene cluster analysis showed differences in temporal expression of SfAV-1a genes, enabling their assignment to three temporal classes; early, late and very late. Inhibitors of apoptosis (IAP-like proteins; ORF016, ORF025 and ORF074) were expressed early, whereas its caspase (ORF073) was expressed very late, which correlated with apoptotic events leading to viral vesicle formation. Expression analysis revealed that a Diedel gene homolog (ORF121), the only known "virokine," was highly expressed, implying this ascovirus protein helps evade innate host immunity. Lastly, single-nucleotide resolution of RNA-Seq data revealed 15 bicistronic and tricistronic messages along the genome, an unusual occurrence for large ds DNA viruses. IMPORTANCE Unlike all other DNA viruses, ascoviruses code for an executioner caspase, apparently involved in a novel cytopathology in which viral replication induces nuclear lysis followed by cell cleavage yielding numerous large anucleate viral vesicles that continue to produce virions. Our transcriptome analysis of genome expression in vivo by the Spodoptera frugiperda ascovirus shows that inhibitors of apoptosis are expressed first enabling viral replication to proceed, after which the SfAV-1a caspase is synthesized, leading to viral vesicle synthesis and subsequent extensive production of progeny virions. Moreover, we detected numerous bicistronic and tricistronic mRNA messages in the ascovirus transcriptome, implying ascoviruses use other non-canonical translational mechanisms such as Internal Ribosome Entry Site (IRES). These results provide the first insights into the molecular biology of a unique coordinated gene expression pattern in which cell architecture is markedly modified, more than in any other known eukaryotic virus, to promote viral reproduction and transmission. Copyright © 2017 American Society for Microbiology.

  6. Studies on DNA-binding selectivity of WRKY transcription factors lend structural clues into WRKY-domain function.

    PubMed

    Ciolkowski, Ingo; Wanke, Dierk; Birkenbihl, Rainer P; Somssich, Imre E

    2008-09-01

    WRKY transcription factors have been shown to play a major role in regulating, both positively and negatively, the plant defense transcriptome. Nearly all studied WRKY factors appear to have a stereotypic binding preference to one DNA element termed the W-box. How specificity for certain promoters is accomplished therefore remains completely unknown. In this study, we tested five distinct Arabidopsis WRKY transcription factor subfamily members for their DNA binding selectivity towards variants of the W-box embedded in neighboring DNA sequences. These studies revealed for the first time differences in their binding site preferences, which are partly dependent on additional adjacent DNA sequences outside of the TTGACY-core motif. A consensus WRKY binding site derived from these studies was used for in silico analysis to identify potential target genes within the Arabidopsis genome. Furthermore, we show that even subtle amino acid substitutions within the DNA binding region of AtWRKY11 strongly impinge on its binding activity. Additionally, all five factors were found localized exclusively to the plant cell nucleus and to be capable of trans-activating expression of a reporter gene construct in vivo.

  7. Deep sequencing and in silico analysis of small RNA library reveals novel miRNA from leaf Persicaria minor transcriptome.

    PubMed

    Samad, Abdul Fatah A; Nazaruddin, Nazaruddin; Murad, Abdul Munir Abdul; Jani, Jaeyres; Zainal, Zamri; Ismail, Ismanizan

    2018-03-01

    In current era, majority of microRNA (miRNA) are being discovered through computational approaches which are more confined towards model plants. Here, for the first time, we have described the identification and characterization of novel miRNA in a non-model plant, Persicaria minor ( P . minor ) using computational approach. Unannotated sequences from deep sequencing were analyzed based on previous well-established parameters. Around 24 putative novel miRNAs were identified from 6,417,780 reads of the unannotated sequence which represented 11 unique putative miRNA sequences. PsRobot target prediction tool was deployed to identify the target transcripts of putative novel miRNAs. Most of the predicted target transcripts (mRNAs) were known to be involved in plant development and stress responses. Gene ontology showed that majority of the putative novel miRNA targets involved in cellular component (69.07%), followed by molecular function (30.08%) and biological process (0.85%). Out of 11 unique putative miRNAs, 7 miRNAs were validated through semi-quantitative PCR. These novel miRNAs discoveries in P . minor may develop and update the current public miRNA database.

  8. De novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes

    PubMed Central

    2012-01-01

    Background Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Results Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80–120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Conclusions Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project. PMID:23110314

  9. Scheffersomyces stipitis: a comparative systems biology study with the Crabtree positive yeast Saccharomyces cerevisiae

    PubMed Central

    2012-01-01

    Background Scheffersomyces stipitis is a Crabtree negative yeast, commonly known for its capacity to ferment pentose sugars. Differently from Crabtree positive yeasts such as Saccharomyces cerevisiae, the onset of fermentation in S. stipitis is not dependent on the sugar concentration, but is regulated by a decrease in oxygen levels. Even though S. stipitis has been extensively studied due to its potential application in pentoses fermentation, a limited amount of information is available about its metabolism during aerobic growth on glucose. Here, we provide a systems biology based comparison between the two yeasts, uncovering the metabolism of S. stipitis during aerobic growth on glucose under batch and chemostat cultivations. Results Starting from the analysis of physiological data, we confirmed through 13C-based flux analysis the fully respiratory metabolism of S. stipitis when growing both under glucose limited or glucose excess conditions. The patterns observed showed similarity to the fully respiratory metabolism observed for S. cerevisiae under chemostat cultivations however, intracellular metabolome analysis uncovered the presence of several differences in metabolite patterns. To describe gene expression levels under the two conditions, we performed RNA sequencing and the results were used to quantify transcript abundances of genes from the central carbon metabolism and compared with those obtained with S. cerevisiae. Interestingly, genes involved in central pathways showed different patterns of expression, suggesting different regulatory networks between the two yeasts. Efforts were focused on identifying shared and unique families of transcription factors between the two yeasts through in silico transcription factors analysis, suggesting a different regulation of glycolytic and glucoenogenic pathways. Conclusions The work presented addresses the impact of high-throughput methods in describing and comparing the physiology of Crabtree positive and Crabtree negative yeasts. Based on physiological data and flux analysis we identified the presence of one metabolic condition for S. stipitis under aerobic batch and chemostat cultivations, which shows similarities to the oxidative metabolism observed for S. cerevisiae under chemostat cultivations. Through metabolome analysis and genome-wide transcriptomic analysis several differences were identified. Interestingly, in silico analysis of transciption factors was useful to address a different regulation of mRNAs of genes involved in the central carbon metabolism. To our knowledge, this is the first time that the metabolism of S. stiptis is investigated in details and is compared to S. cerevisiae. Our study provides useful results and allows for the possibility to incorporate these data into recently developed genome-scaled metabolic, thus contributing to improve future industrial applications of S. stipitis as cell factory. PMID:23043429

  10. [Prediction of ETA oligopeptides antagonists from Glycine max based on in silico proteolysis].

    PubMed

    Qiao, Lian-Sheng; Jiang, Lu-di; Luo, Gang-Gang; Lu, Fang; Chen, Yan-Kun; Wang, Ling-Zhi; Li, Gong-Yu; Zhang, Yan-Ling

    2017-02-01

    Oligopeptides are one of the the key pharmaceutical effective constituents of traditional Chinese medicine(TCM). Systematic study on composition and efficacy of TCM oligopeptides is essential for the analysis of material basis and mechanism of TCM. In this study, the potential anti-hypertensive oligopeptides from Glycine max and their endothelin receptor A (ETA) antagonistic activity were discovered and predicted based on in silico technologies.Main protein sequences of G. max were collected and oligopeptides were obtained using in silico gastrointestinal tract proteolysis. Then, the pharmacophore of ETA antagonistic peptides was constructed and included one hydrophobic feature, one ionizable negative feature, one ring aromatic feature and five excluded volumes. Meanwhile, three-dimensional structure of ETA was developed by homology modeling methods for further docking studies. According to docking analysis and consensus score, the key amino acid of GLN165 was identified for ETA antagonistic activity. And 27 oligopeptides from G. max were predicted as the potential ETA antagonists by pharmacophore and docking studies.In silico proteolysis could be used to analyze the protein sequences from TCM. According to combination of in silico proteolysis and molecular simulation, the biological activities of oligopeptides could be predicted rapidly based on the known TCM protein sequence. It might provide the methodology basis for rapidly and efficiently implementing the mechanism analysis of TCM oligopeptides. Copyright© by the Chinese Pharmaceutical Association.

  11. InSilico DB genomic datasets hub: an efficient starting point for analyzing genome-wide studies in GenePattern, Integrative Genomics Viewer, and R/Bioconductor.

    PubMed

    Coletta, Alain; Molter, Colin; Duqué, Robin; Steenhoff, David; Taminau, Jonatan; de Schaetzen, Virginie; Meganck, Stijn; Lazar, Cosmin; Venet, David; Detours, Vincent; Nowé, Ann; Bersini, Hugues; Weiss Solís, David Y

    2012-11-18

    Genomics datasets are increasingly useful for gaining biomedical insights, with adoption in the clinic underway. However, multiple hurdles related to data management stand in the way of their efficient large-scale utilization. The solution proposed is a web-based data storage hub. Having clear focus, flexibility and adaptability, InSilico DB seamlessly connects genomics dataset repositories to state-of-the-art and free GUI and command-line data analysis tools. The InSilico DB platform is a powerful collaborative environment, with advanced capabilities for biocuration, dataset sharing, and dataset subsetting and combination. InSilico DB is available from https://insilicodb.org.

  12. SACCHARIS: an automated pipeline to streamline discovery of carbohydrate active enzyme activities within polyspecific families and de novo sequence datasets.

    PubMed

    Jones, Darryl R; Thomas, Dallas; Alger, Nicholas; Ghavidel, Ata; Inglis, G Douglas; Abbott, D Wade

    2018-01-01

    Deposition of new genetic sequences in online databases is expanding at an unprecedented rate. As a result, sequence identification continues to outpace functional characterization of carbohydrate active enzymes (CAZymes). In this paradigm, the discovery of enzymes with novel functions is often hindered by high volumes of uncharacterized sequences particularly when the enzyme sequence belongs to a family that exhibits diverse functional specificities (i.e., polyspecificity). Therefore, to direct sequence-based discovery and characterization of new enzyme activities we have developed an automated in silico pipeline entitled: Sequence Analysis and Clustering of CarboHydrate Active enzymes for Rapid Informed prediction of Specificity (SACCHARIS). This pipeline streamlines the selection of uncharacterized sequences for discovery of new CAZyme or CBM specificity from families currently maintained on the CAZy website or within user-defined datasets. SACCHARIS was used to generate a phylogenetic tree of a GH43, a CAZyme family with defined subfamily designations. This analysis confirmed that large datasets can be organized into sequence clusters of manageable sizes that possess related functions. Seeding this tree with a GH43 sequence from Bacteroides dorei DSM 17855 (BdGH43b, revealed it partitioned as a single sequence within the tree. This pattern was consistent with it possessing a unique enzyme activity for GH43 as BdGH43b is the first described α-glucanase described for this family. The capacity of SACCHARIS to extract and cluster characterized carbohydrate binding module sequences was demonstrated using family 6 CBMs (i.e., CBM6s). This CBM family displays a polyspecific ligand binding profile and contains many structurally determined members. Using SACCHARIS to identify a cluster of divergent sequences, a CBM6 sequence from a unique clade was demonstrated to bind yeast mannan, which represents the first description of an α-mannan binding CBM. Additionally, we have performed a CAZome analysis of an in-house sequenced bacterial genome and a comparative analysis of B. thetaiotaomicron VPI-5482 and B. thetaiotaomicron 7330, to demonstrate that SACCHARIS can generate "CAZome fingerprints", which differentiate between the saccharolytic potential of two related strains in silico. Establishing sequence-function and sequence-structure relationships in polyspecific CAZyme families are promising approaches for streamlining enzyme discovery. SACCHARIS facilitates this process by embedding CAZyme and CBM family trees generated from biochemically to structurally characterized sequences, with protein sequences that have unknown functions. In addition, these trees can be integrated with user-defined datasets (e.g., genomics, metagenomics, and transcriptomics) to inform experimental characterization of new CAZymes or CBMs not currently curated, and for researchers to compare differential sequence patterns between entire CAZomes. In this light, SACCHARIS provides an in silico tool that can be tailored for enzyme bioprospecting in datasets of increasing complexity and for diverse applications in glycobiotechnology.

  13. Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

    PubMed

    Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

    2015-05-15

    The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.

  14. Full genome survey and dynamics of gene expression in the greater amberjack Seriola dumerili.

    PubMed

    Sarropoulou, Elena; Sundaram, Arvind Y M; Kaitetzidou, Elisavet; Kotoulas, Georgios; Gilfillan, Gregor D; Papandroulakis, Nikos; Mylonas, Constantinos C; Magoulas, Antonios

    2017-12-01

    Teleosts of the genus Seriola, commonly known as amberjacks, are of high commercial value in international markets due to their flesh quality and worldwide distribution. The Seriola species of interest to Mediterranean aquaculture is the greater amberjack (Seriola dumerili). This species holds great potential for the aquaculture industry, but in captivity, reproduction has proved to be challenging, and observed growth dysfunction hinders their domestication. Insights into molecular mechanisms may contribute to a better understanding of traits like growth and sex, but investigations to unravel the molecular background of amberjacks have begun only recently. Illumina HiSeq sequencing generated a high-coverage greater amberjack genome sequence comprising 45 909 scaffolds. Comparative mapping to the Japanese yellowtail (Seriola quinqueriadiata) and to the model species medaka (Oryzias latipes) allowed the generation of in silico groups. Additional gonad transcriptome sequencing identified sex-biased transcripts, including known sex-determining and differentiation genes. Investigation of the muscle transcriptome of slow-growing individuals showed that transcripts involved in oxygen and gas transport were differentially expressed compared with fast/normal-growing individuals. On the other hand, transcripts involved in muscle functions were found to be enriched in fast/normal-growing individuals. The present study provides the first insights into the molecular background of male and female amberjacks and of fast- and slow-growing fish. Therefore, valuable molecular resources have been generated in the form of a first draft genome and a reference transcriptome. Sex-biased genes, which may also have roles in sex determination or differentiation, and genes that may be responsible for slow growth are suggested. © The Authors 2017. Published by Oxford University Press.

  15. Genome-wide organization and expression profiling of the NAC transcription factor family in potato (Solanum tuberosum L.).

    PubMed

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-08-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.

  16. Characterization of irritans mariner-like elements in the olive fruit fly Bactrocera oleae (Diptera: Tephritidae): evolutionary implications.

    PubMed

    Ben Lazhar-Ajroud, Wafa; Caruso, Aurore; Mezghani, Maha; Bouallegue, Maryem; Tastard, Emmanuelle; Denis, Françoise; Rouault, Jacques-Deric; Makni, Hanem; Capy, Pierre; Chénais, Benoît; Makni, Mohamed; Casse, Nathalie

    2016-08-01

    Genomic variation among species is commonly driven by transposable element (TE) invasion; thus, the pattern of TEs in a genome allows drawing an evolutionary history of the studied species. This paper reports in vitro and in silico detection and characterization of irritans mariner-like elements (MLEs) in the genome and transcriptome of Bactrocera oleae (Rossi) (Diptera: Tephritidae). Eleven irritans MLE sequences have been isolated in vitro using terminal inverted repeats (TIRs) as primers, and 215 have been extracted in silico from the sequenced genome of B. oleae. Additionally, the sequenced genomes of Bactrocera tryoni (Froggatt) and Bactrocera cucurbitae (Diptera: Tephritidae) have been explored to identify irritans MLEs. A total of 129 sequences from B. tryoni have been extracted, while the genome of B. cucurbitae appears probably devoid of irritans MLEs. All detected irritans MLEs are defective due to several mutations and are clustered together in a monophyletic group suggesting a common ancestor. The evolutionary history and dynamics of these TEs are discussed in relation with the phylogenetic distribution of their hosts. The knowledge on the structure, distribution, dynamic, and evolution of irritans MLEs in Bactrocera species contributes to the understanding of both their evolutionary history and the invasion history of their hosts. This could also be the basis for genetic control strategies using transposable elements.

  17. High hydrostatic pressure adaptive strategies in an obligate piezophile Pyrococcus yayanosii

    PubMed Central

    Michoud, Grégoire; Jebbar, Mohamed

    2016-01-01

    Pyrococcus yayanosii CH1, as the first and only obligate piezophilic hyperthermophilic microorganism discovered to date, extends the physical and chemical limits of life on Earth. It was isolated from the Ashadze hydrothermal vent at 4,100 m depth. Multi-omics analyses were performed to study the mechanisms used by the cell to cope with high hydrostatic pressure variations. In silico analyses showed that the P. yayanosii genome is highly adapted to its harsh environment, with a loss of aromatic amino acid biosynthesis pathways and the high constitutive expression of the energy metabolism compared with other non-obligate piezophilic Pyrococcus species. Differential proteomics and transcriptomics analyses identified key hydrostatic pressure-responsive genes involved in translation, chemotaxis, energy metabolism (hydrogenases and formate metabolism) and Clustered Regularly Interspaced Short Palindromic Repeats sequences associated with Cellular apoptosis susceptibility proteins. PMID:27250364

  18. High hydrostatic pressure adaptive strategies in an obligate piezophile Pyrococcus yayanosii

    NASA Astrophysics Data System (ADS)

    Michoud, Grégoire; Jebbar, Mohamed

    2016-06-01

    Pyrococcus yayanosii CH1, as the first and only obligate piezophilic hyperthermophilic microorganism discovered to date, extends the physical and chemical limits of life on Earth. It was isolated from the Ashadze hydrothermal vent at 4,100 m depth. Multi-omics analyses were performed to study the mechanisms used by the cell to cope with high hydrostatic pressure variations. In silico analyses showed that the P. yayanosii genome is highly adapted to its harsh environment, with a loss of aromatic amino acid biosynthesis pathways and the high constitutive expression of the energy metabolism compared with other non-obligate piezophilic Pyrococcus species. Differential proteomics and transcriptomics analyses identified key hydrostatic pressure-responsive genes involved in translation, chemotaxis, energy metabolism (hydrogenases and formate metabolism) and Clustered Regularly Interspaced Short Palindromic Repeats sequences associated with Cellular apoptosis susceptibility proteins.

  19. Comparative genomics of grass EST libraries reveals previously uncharacterized splicing events in crop plants.

    PubMed

    Chuang, Trees-Juen; Yang, Min-Yu; Lin, Chuang-Chieh; Hsieh, Ping-Hung; Hung, Li-Yuan

    2015-02-05

    Crop plants such as rice, maize and sorghum play economically-important roles as main sources of food, fuel, and animal feed. However, current genome annotations of crop plants still suffer false-positive predictions; a more comprehensive registry of alternative splicing (AS) events is also in demand. Comparative genomics of crop plants is largely unexplored. We performed a large-scale comparative analysis (ExonFinder) of the expressed sequence tag (EST) library from nine grass plants against three crop genomes (rice, maize, and sorghum) and identified 2,879 previously-unannotated exons (i.e., novel exons) in the three crops. We validated 81% of the tested exons by RT-PCR-sequencing, supporting the effectiveness of our in silico strategy. Evolutionary analysis reveals that the novel exons, comparing with their flanking annotated ones, are generally under weaker selection pressure at the protein level, but under stronger pressure at the RNA level, suggesting that most of the novel exons also represent novel alternatively spliced variants (ASVs). However, we also observed the consistency of evolutionary rates between certain novel exons and their flanking exons, which provided further evidence of their co-occurrence in the transcripts, suggesting that previously-annotated isoforms might be subject to erroneous predictions. Our validation showed that 54% of the tested genes expressed the newly-identified isoforms that contained the novel exons, rather than the previously-annotated isoforms that excluded them. The consistent results were steadily observed across cultivated (Oryza sativa and O. glaberrima) and wild (O. rufipogon and O. nivara) rice species, asserting the necessity of our curation of the crop genome annotations. Our comparative analyses also inferred the common ancestral transcriptome of grass plants and gain- and loss-of-ASV events. We have reannotated the rice, maize, and sorghum genomes, and showed that evolutionary rates might serve as an indicator for determining whether the identified exons were alternatively spliced. This study not only presents an effective in silico strategy for the improvement of plant annotations, but also provides further insights into the role of AS events in the evolution and domestication of crop plants. ExonFinder and the novel exons/ASVs identified are publicly accessible at http://exonfinder.sourceforge.net/ .

  20. Functional proteomic analyses of Bothrops atrox venom reveals phenotypes associated with habitat variation in the Amazon.

    PubMed

    Sousa, Leijiane F; Portes-Junior, José A; Nicolau, Carolina A; Bernardoni, Juliana L; Nishiyama, Milton Y; Amazonas, Diana R; Freitas-de-Sousa, Luciana A; Mourão, Rosa Hv; Chalkidis, Hipócrates M; Valente, Richard H; Moura-da-Silva, Ana M

    2017-04-21

    Venom variability is commonly reported for venomous snakes including Bothrops atrox. Here, we compared the composition of venoms from B. atrox snakes collected at Amazonian conserved habitats (terra-firme upland forest and várzea) and human modified areas (pasture and degraded areas). Venom samples were submitted to shotgun proteomic analysis as a whole or compared after fractionation by reversed-phase chromatography. Whole venom proteomes revealed a similar composition among the venoms with predominance of SVMPs, CTLs, and SVSPs and intermediate amounts of PLA 2 s and LAAOs. However, when distribution of particular isoforms was analyzed by either method, the venom from várzea snakes showed a decrease in hemorrhagic SVMPs and an increase in SVSPs, and procoagulant SVMPs and PLA 2 s. These differences were validated by experimental approaches including both enzymatic and in vivo assays, and indicated restrictions in respect to antivenom efficacy to variable components. Thus, proteomic analysis at the isoform level combined to in silico prediction of functional properties may indicate venom biological activity. These results also suggest that the prevalence of functionally distinct isoforms contributes to the variability of the venoms and could reflect the adaptation of B. atrox to distinct prey communities in different Amazon habitats. In this report, we compared isoforms present in venoms from snakes collected at different Amazonian habitats. By means of a species venom gland transcriptome and the in silico functional prediction of each isoform, we were able to predict the principal venom activities in vitro and in animal models. We also showed remarkable differences in the venom pools from snakes collected at the floodplain (várzea habitat) compared to other habitats. Not only was this venom less hemorrhagic and more procoagulant, when compared to the venom pools from the other three habitats studied, but also this enhanced procoagulant activity was not efficiently neutralized by Bothrops antivenom. Thus, using a functional proteomic approach, we highlighted intraspecific differences in B. atrox venom that could impact both in the ecology of snakes but also in the treatment of snake bite patients in the region. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. A Fashi Lymphoproliferative Phenotype Reveals Non-Apoptotic Fas Signaling in HTLV-1-Associated Neuroinflammation.

    PubMed

    Menezes, Soraya Maria; Leal, Fabio E; Dierckx, Tim; Khouri, Ricardo; Decanine, Daniele; Silva-Santos, Gilvaneia; Schnitman, Saul V; Kruschewsky, Ramon; López, Giovanni; Alvarez, Carolina; Talledo, Michael; Gotuzzo, Eduardo; Nixon, Douglas F; Vercauteren, Jurgen; Brassat, David; Liblau, Roland; Vandamme, Anne Mieke; Galvão-Castro, Bernardo; Van Weyenbergh, Johan

    2017-01-01

    Human T-cell lymphotropic virus (HTLV)-1 was the first human retrovirus to be associated to cancer, namely adult T-cell leukemia (ATL), but its pathogenesis remains enigmatic, since only a minority of infected individuals develops either ATL or the neuroinflammatory disorder HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP). A functional FAS -670 polymorphism in an interferon (IFN)-regulated STAT1-binding site has been associated to both ATL and HAM/TSP susceptibility. Fas hi T stem cell memory (Tscm) cells have been identified as the hierarchical apex of ATL, but have not been investigated in HAM/TSP. In addition, both FAS and STAT1 have been identified in an IFN-inducible HAM/TSP gene signature, but its pathobiological significance remains unclear. We comprehensively explored Fas expression (protein/mRNA) and function in lymphocyte activation, apoptosis, proliferation, and transcriptome, in PBMC from a total of 47 HAM/TSP patients, 40 asymptomatic HTLV-1-infected individuals (AC), and 58 HTLV-1 -uninfected healthy controls. Fas surface expression followed a two-step increase from HC to AC and from AC to HAM/TSP. In HAM/TSP, Fas levels correlated positively to lymphocyte activation markers, but negatively to age of onset, linking Fas hi cells to earlier, more aggressive disease. Surprisingly, increased lymphocyte Fas expression in HAM/TSP was linked to decreased apoptosis and increased lymphoproliferation upon in vitro culture, but not to proviral load. This Fas hi phenotype is HAM/TSP-specific, since both ex vivo and in vitro Fas expression was increased as compared to multiple sclerosis (MS), another neuroinflammatory disorder. To elucidate the molecular mechanism underlying non-apoptotic Fas signaling in HAM/TSP, we combined transcriptome analysis with functional assays, i.e., blocking vs. triggering Fas receptor in vitro with antagonist and agonist-, anti-Fas mAb, respectively. Treatment with agonist anti-Fas mAb restored apoptosis, indicating biased, but not defective Fas signaling in HAM/TSP. In silico analysis revealed biased Fas signaling toward proliferation and inflammation, driven by RelA/NF-κB. Correlation of Fas transcript levels with proliferation (but not apoptosis) was confirmed in HAM/TSP ex vivo transcriptomes. In conclusion, we demonstrated a two-step increase in Fas expression, revealing a unique Fas hi lymphocyte phenotype in HAM/TSP, distinguishable from MS. Non-apoptotic Fas signaling might fuel HAM/TSP pathogenesis, through increased lymphoproliferation, inflammation, and early age of onset.

  2. A Fashi Lymphoproliferative Phenotype Reveals Non-Apoptotic Fas Signaling in HTLV-1-Associated Neuroinflammation

    PubMed Central

    Menezes, Soraya Maria; Leal, Fabio E.; Dierckx, Tim; Khouri, Ricardo; Decanine, Daniele; Silva-Santos, Gilvaneia; Schnitman, Saul V.; Kruschewsky, Ramon; López, Giovanni; Alvarez, Carolina; Talledo, Michael; Gotuzzo, Eduardo; Nixon, Douglas F.; Vercauteren, Jurgen; Brassat, David; Liblau, Roland; Vandamme, Anne Mieke; Galvão-Castro, Bernardo; Van Weyenbergh, Johan

    2017-01-01

    Human T-cell lymphotropic virus (HTLV)-1 was the first human retrovirus to be associated to cancer, namely adult T-cell leukemia (ATL), but its pathogenesis remains enigmatic, since only a minority of infected individuals develops either ATL or the neuroinflammatory disorder HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP). A functional FAS -670 polymorphism in an interferon (IFN)-regulated STAT1-binding site has been associated to both ATL and HAM/TSP susceptibility. Fashi T stem cell memory (Tscm) cells have been identified as the hierarchical apex of ATL, but have not been investigated in HAM/TSP. In addition, both FAS and STAT1 have been identified in an IFN-inducible HAM/TSP gene signature, but its pathobiological significance remains unclear. We comprehensively explored Fas expression (protein/mRNA) and function in lymphocyte activation, apoptosis, proliferation, and transcriptome, in PBMC from a total of 47 HAM/TSP patients, 40 asymptomatic HTLV-1-infected individuals (AC), and 58 HTLV-1 -uninfected healthy controls. Fas surface expression followed a two-step increase from HC to AC and from AC to HAM/TSP. In HAM/TSP, Fas levels correlated positively to lymphocyte activation markers, but negatively to age of onset, linking Fashi cells to earlier, more aggressive disease. Surprisingly, increased lymphocyte Fas expression in HAM/TSP was linked to decreased apoptosis and increased lymphoproliferation upon in vitro culture, but not to proviral load. This Fashi phenotype is HAM/TSP-specific, since both ex vivo and in vitro Fas expression was increased as compared to multiple sclerosis (MS), another neuroinflammatory disorder. To elucidate the molecular mechanism underlying non-apoptotic Fas signaling in HAM/TSP, we combined transcriptome analysis with functional assays, i.e., blocking vs. triggering Fas receptor in vitro with antagonist and agonist-, anti-Fas mAb, respectively. Treatment with agonist anti-Fas mAb restored apoptosis, indicating biased, but not defective Fas signaling in HAM/TSP. In silico analysis revealed biased Fas signaling toward proliferation and inflammation, driven by RelA/NF-κB. Correlation of Fas transcript levels with proliferation (but not apoptosis) was confirmed in HAM/TSP ex vivo transcriptomes. In conclusion, we demonstrated a two-step increase in Fas expression, revealing a unique Fashi lymphocyte phenotype in HAM/TSP, distinguishable from MS. Non-apoptotic Fas signaling might fuel HAM/TSP pathogenesis, through increased lymphoproliferation, inflammation, and early age of onset. PMID:28261198

  3. Preliminary profiling of blood transcriptome in a rat model of hemorrhagic shock

    PubMed Central

    Braga, D; Barcella, M; D’Avila, F; Lupoli, S; Tagliaferri, F; Santamaria, MH; DeLano, FA; Baselli, G; Schmid-Schönbein, GW; Kistler, EB; Aletti, F

    2017-01-01

    Hemorrhagic shock is a leading cause of morbidity and mortality worldwide. Significant blood loss may lead to decreased blood pressure and inadequate tissue perfusion with resultant organ failure and death, even after replacement of lost blood volume. One reason for this high acuity is that the fundamental mechanisms of shock are poorly understood. Proteomic and metabolomic approaches have been used to investigate the molecular events occurring in hemorrhagic shock but, to our knowledge, a systematic analysis of the transcriptomic profile is missing. Therefore, a pilot analysis using paired-end RNA sequencing was used to identify changes that occur in the blood transcriptome of rats subjected to hemorrhagic shock after blood reinfusion. Hemorrhagic shock was induced using a Wigger’s shock model. The transcriptome of whole blood from shocked animals shows modulation of genes related to inflammation and immune response (Tlr13, Il1b, Ccl6, Lgals3), antioxidant functions (Mt2A, Mt1), tissue injury and repair pathways (Gpnmb, Trim72) and lipid mediators (Alox5ap, Ltb4r, Ptger2) compared with control animals. These findings are congruent with results obtained in hemorrhagic shock analysis by other authors using metabolomics and proteomics. The analysis of blood transcriptome may be a valuable tool to understand the biological changes occurring in hemorrhagic shock and a promising approach for the identification of novel biomarkers and therapeutic targets. Impact statement This study provides the first pilot analysis of the changes occurring in transcriptome expression of whole blood in hemorrhagic shock (HS) rats. We showed that the analysis of blood transcriptome is a useful approach to investigate pathways and functional alterations in this disease condition. This pilot study encourages the possible application of transcriptome analysis in the clinical setting, for the molecular profiling of whole blood in HS patients. PMID:28661205

  4. N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

    PubMed

    Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A

    2017-05-24

    Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.

  5. Transcriptomic Analysis of Avocado Hass (Persea americana Mill) in the Interaction System Fruit-Chitosan-Colletotrichum.

    PubMed

    Xoca-Orozco, Luis-Ángel; Cuellar-Torres, Esther Angélica; González-Morales, Sandra; Gutiérrez-Martínez, Porfirio; López-García, Ulises; Herrera-Estrella, Luis; Vega-Arreguín, Julio; Chacón-López, Alejandra

    2017-01-01

    Avocado ( Persea americana ) is one of the most important crops in Mexico as it is the main producer, consumer, and exporter of avocado fruit in the world. However, successful avocado commercialization is often reduced by large postharvest losses due to Colletotrichum sp., the causal agent of anthracnose. Chitosan is known to have a direct antifungal effect and acts also as an elicitor capable of stimulating a defense response in plants. However, there is little information regarding the genes that are either activated or repressed in fruits treated with chitosan. The aim of this study was to identify by RNA-seq the genes differentially regulated by the action of low molecular weight chitosan in the avocado-chitosan- Colletotrichum interaction system. The samples for RNA-seq were obtained from fruits treated with chitosan, fruits inoculated with Colletotrichum and fruits both treated with chitosan and inoculated with the fungus. Non-treated and non-inoculated fruits were also analyzed. Expression profiles showed that in short times, the fruit-chitosan system presented a greater number of differentially expressed genes, compared to the fruit-pathogen system. Gene Ontology analysis of differentially expressed genes showed a large number of metabolic processes regulated by chitosan, including those preventing the spread of Colletotrichum . It was also found that there is a high correlation between the expression of genes in silico and qPCR of several genes involved in different metabolic pathways.

  6. Proteome analysis of digestive fluids in Nepenthes pitchers.

    PubMed

    Rottloff, Sandy; Miguel, Sissi; Biteau, Flore; Nisse, Estelle; Hammann, Philippe; Kuhn, Lauriane; Chicher, Johana; Bazile, Vincent; Gaume, Laurence; Mignard, Benoit; Hehn, Alain; Bourgaud, Frédéric

    2016-03-01

    Carnivorous plants have developed strategies to enable growth in nutrient-poor soils. For the genus Nepenthes, this strategy represents producing pitcher-modified leaves that can trap and digest various prey. These pitchers produce a digestive fluid composed of proteins, including hydrolytic enzymes. The focus of this study was on the identification of these proteins. In order to better characterize and have an overview of these proteins, digestive fluid was sampled from pitchers at different stages of maturity from five species of Nepenthes (N. mirabilis, N. alata, N. sanguinea, N. bicalcarata and N. albomarginata) that vary in their ecological niches and grew under different conditions. Three complementary approaches based on transcriptomic resources, mass spectrometry and in silico analysis were used. This study permitted the identification of 29 proteins excreted in the pitchers. Twenty of these proteins were never reported in Nepenthes previously and included serine carboxypeptidases, α- and β-galactosidases, lipid transfer proteins and esterases/lipases. These 20 proteins display sequence signals allowing their secretion into the pitcher fluid. Nepenthes pitcher plants have evolved an arsenal of enzymes to digest prey caught in their traps. The panel of new proteins identified in this study provides new insights into the digestive process of these carnivorous plants. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. In vitro fatigue tests and in silico finite element analysis of dental implants with different fixture/abutment joint types using computer-aided design models.

    PubMed

    Yamaguchi, Satoshi; Yamanishi, Yasufumi; Machado, Lucas S; Matsumoto, Shuji; Tovar, Nick; Coelho, Paulo G; Thompson, Van P; Imazato, Satoshi

    2018-01-01

    The aim of this study was to evaluate fatigue resistance of dental fixtures with two different fixture-abutment connections by in vitro fatigue testing and in silico three-dimensional finite element analysis (3D FEA) using original computer-aided design (CAD) models. Dental implant fixtures with external connection (EX) or internal connection (IN) abutments were fabricated from original CAD models using grade IV titanium and step-stress accelerated life testing was performed. Fatigue cycles and loads were assessed by Weibull analysis, and fatigue cracking was observed by micro-computed tomography and a stereomicroscope with high dynamic range software. Using the same CAD models, displacement vectors of implant components were also analyzed by 3D FEA. Angles of the fractured line occurring at fixture platforms in vitro and of displacement vectors corresponding to the fractured line in silico were compared by two-way ANOVA. Fatigue testing showed significantly greater reliability for IN than EX (p<0.001). Fatigue crack initiation was primarily observed at implant fixture platforms. FEA demonstrated that crack lines of both implant systems in vitro were observed in the same direction as displacement vectors of the implant fixtures in silico. In silico displacement vectors in the implant fixture are insightful for geometric development of dental implants to reduce complex interactions leading to fatigue failure. Copyright © 2017 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.

  8. The Transcriptome Analysis and Comparison Explorer--T-ACE: a platform-independent, graphical tool to process large RNAseq datasets of non-model organisms.

    PubMed

    Philipp, E E R; Kraemer, L; Mountfort, D; Schilhabel, M; Schreiber, S; Rosenstiel, P

    2012-03-15

    Next generation sequencing (NGS) technologies allow a rapid and cost-effective compilation of large RNA sequence datasets in model and non-model organisms. However, the storage and analysis of transcriptome information from different NGS platforms is still a significant bottleneck, leading to a delay in data dissemination and subsequent biological understanding. Especially database interfaces with transcriptome analysis modules going beyond mere read counts are missing. Here, we present the Transcriptome Analysis and Comparison Explorer (T-ACE), a tool designed for the organization and analysis of large sequence datasets, and especially suited for transcriptome projects of non-model organisms with little or no a priori sequence information. T-ACE offers a TCL-based interface, which accesses a PostgreSQL database via a php-script. Within T-ACE, information belonging to single sequences or contigs, such as annotation or read coverage, is linked to the respective sequence and immediately accessible. Sequences and assigned information can be searched via keyword- or BLAST-search. Additionally, T-ACE provides within and between transcriptome analysis modules on the level of expression, GO terms, KEGG pathways and protein domains. Results are visualized and can be easily exported for external analysis. We developed T-ACE for laboratory environments, which have only a limited amount of bioinformatics support, and for collaborative projects in which different partners work on the same dataset from different locations or platforms (Windows/Linux/MacOS). For laboratories with some experience in bioinformatics and programming, the low complexity of the database structure and open-source code provides a framework that can be customized according to the different needs of the user and transcriptome project.

  9. Comparative multi-omics systems analysis of Escherichia coli strains B and K-12.

    PubMed

    Yoon, Sung Ho; Han, Mee-Jung; Jeong, Haeyoung; Lee, Choong Hoon; Xia, Xiao-Xia; Lee, Dae-Hee; Shim, Ji Hoon; Lee, Sang Yup; Oh, Tae Kwang; Kim, Jihyun F

    2012-05-25

    Elucidation of a genotype-phenotype relationship is critical to understand an organism at the whole-system level. Here, we demonstrate that comparative analyses of multi-omics data combined with a computational modeling approach provide a framework for elucidating the phenotypic characteristics of organisms whose genomes are sequenced. We present a comprehensive analysis of genome-wide measurements incorporating multifaceted holistic data - genome, transcriptome, proteome, and phenome - to determine the differences between Escherichia coli B and K-12 strains. A genome-scale metabolic network of E. coli B was reconstructed and used to identify genetic bases of the phenotypes unique to B compared with K-12 through in silico complementation testing. This systems analysis revealed that E. coli B is well-suited for production of recombinant proteins due to a greater capacity for amino acid biosynthesis, fewer proteases, and lack of flagella. Furthermore, E. coli B has an additional type II secretion system and a different cell wall and outer membrane composition predicted to be more favorable for protein secretion. In contrast, E. coli K-12 showed a higher expression of heat shock genes and was less susceptible to certain stress conditions. This integrative systems approach provides a high-resolution system-wide view and insights into why two closely related strains of E. coli, B and K-12, manifest distinct phenotypes. Therefore, systematic understanding of cellular physiology and metabolism of the strains is essential not only to determine culture conditions but also to design recombinant hosts.

  10. Comparative multi-omics systems analysis of Escherichia coli strains B and K-12

    PubMed Central

    2012-01-01

    Background Elucidation of a genotype-phenotype relationship is critical to understand an organism at the whole-system level. Here, we demonstrate that comparative analyses of multi-omics data combined with a computational modeling approach provide a framework for elucidating the phenotypic characteristics of organisms whose genomes are sequenced. Results We present a comprehensive analysis of genome-wide measurements incorporating multifaceted holistic data - genome, transcriptome, proteome, and phenome - to determine the differences between Escherichia coli B and K-12 strains. A genome-scale metabolic network of E. coli B was reconstructed and used to identify genetic bases of the phenotypes unique to B compared with K-12 through in silico complementation testing. This systems analysis revealed that E. coli B is well-suited for production of recombinant proteins due to a greater capacity for amino acid biosynthesis, fewer proteases, and lack of flagella. Furthermore, E. coli B has an additional type II secretion system and a different cell wall and outer membrane composition predicted to be more favorable for protein secretion. In contrast, E. coli K-12 showed a higher expression of heat shock genes and was less susceptible to certain stress conditions. Conclusions This integrative systems approach provides a high-resolution system-wide view and insights into why two closely related strains of E. coli, B and K-12, manifest distinct phenotypes. Therefore, systematic understanding of cellular physiology and metabolism of the strains is essential not only to determine culture conditions but also to design recombinant hosts. PMID:22632713

  11. MicroRNA biogenesis pathway from the salmon louse (Caligus rogercresseyi): emerging role in delousing drug response.

    PubMed

    Valenzuela-Miranda, Diego; Nuñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Asgari, Sassan; Gallardo-Escárate, Cristian

    2015-01-25

    Despite the increasing evidence of the importance of microRNAs (miRNAs) in the regulation of multiple biological processes, the molecular bases supporting this regulation are still barely understood in crustaceans. Therefore, the molecular characterization and transcriptome modulation of the miRNA biogenesis pathway were evaluated in the salmon louse Caligus rogercresseyi, an ectoparasite that constitutes one of the biggest concerns for salmonid aquaculture industry. Hence, RNA-Seq analysis was conducted from six different developmental stages, and also after bioassays with delousing drugs Deltamethrin and Azamethiphos using adult individuals. In silico analysis evidenced 24 putative genes involved in the miRNA pathway such as biogenesis, transport, maturation and miRNA-target interaction. Moreover, 243 putative single nucleotide polymorphisms (SNPs) were identified, 15 of which showed non-synonym mutations. RNA-Seq analysis revealed that CCR4-Not complex subunit 3 (CNOT3) was upregulated at earlier developmental stages (nauplius I-II and copepodid), and also after the exposure to Azamethiphos, but not to Deltamethrin. In contrast, the subunit 7 (CNOT7) showed an inverse expression pattern. Different Argonaute transcripts were associated to chalimus and adult stages, revealing specific expression patterns in response to antiparasitic drugs. Our results suggest novel insights into the regulatory network of the post-transcriptional gene regulation in C. rogercresseyi mediated by miRNAs, evidencing a putative role during the ontogeny and drug response. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Pre-microRNA and Mature microRNA in Human Mitochondria

    PubMed Central

    Barrey, Eric; Saint-Auret, Gaelle; Bonnamy, Blandine; Damas, Dominique; Boyer, Orane; Gidrol, Xavier

    2011-01-01

    Background Because of the central functions of the mitochondria in providing metabolic energy and initiating apoptosis on one hand and the role that microRNA (miRNA) play in gene expression, we hypothesized that some miRNA could be present in the mitochondria for post-transcriptomic regulation by RNA interference. We intend to identify miRNA localized in the mitochondria isolated from human skeletal primary muscular cells. Methodology/Principal Findings To investigate the potential origin of mitochondrial miRNA, we in-silico searched for microRNA candidates in the mtDNA. Twenty five human pre-miRNA and 33 miRNA aligments (E-value<0.1) were found in the reference mitochondrial sequence and some of the best candidates were chosen for a co-localization test. In situ hybridization of pre-mir-302a, pre-let-7b and mir-365, using specific labelled locked nucleic acids and confocal microscopy, demonstrated that these miRNA were localized in mitochondria of human myoblasts. Total RNA was extracted from enriched mitochondria isolated by an immunomagnetic method from a culture of human myotubes. The detection of 742 human miRNA (miRBase) were monitored by RT-qPCR at three increasing mtRNA inputs. Forty six miRNA were significantly expressed (2nd derivative method Cp>35) for the smallest RNA input concentration and 204 miRNA for the maximum RNA input concentration. In silico analysis predicted 80 putative miRNA target sites in the mitochondrial genome (E-value<0.05). Conclusions/Significance The present study experimentally demonstrated for the first time the presence of pre-miRNA and miRNA in the human mitochondria isolated from skeletal muscular cells. A set of miRNA were significantly detected in mitochondria fraction. The origin of these pre-miRNA and miRNA should be further investigate to determine if they are imported from the cytosol and/or if they are partially processed in the mitochondria. PMID:21637849

  13. Integrated Analysis of Transcriptomic and Proteomic Data

    PubMed Central

    Haider, Saad; Pal, Ranadip

    2013-01-01

    Until recently, understanding the regulatory behavior of cells has been pursued through independent analysis of the transcriptome or the proteome. Based on the central dogma, it was generally assumed that there exist a direct correspondence between mRNA transcripts and generated protein expressions. However, recent studies have shown that the correlation between mRNA and Protein expressions can be low due to various factors such as different half lives and post transcription machinery. Thus, a joint analysis of the transcriptomic and proteomic data can provide useful insights that may not be deciphered from individual analysis of mRNA or protein expressions. This article reviews the existing major approaches for joint analysis of transcriptomic and proteomic data. We categorize the different approaches into eight main categories based on the initial algorithm and final analysis goal. We further present analogies with other domains and discuss the existing research problems in this area. PMID:24082820

  14. RNA-sequence data normalization through in silico prediction of reference genes: the bacterial response to DNA damage as case study.

    PubMed

    Berghoff, Bork A; Karlsson, Torgny; Källman, Thomas; Wagner, E Gerhart H; Grabherr, Manfred G

    2017-01-01

    Measuring how gene expression changes in the course of an experiment assesses how an organism responds on a molecular level. Sequencing of RNA molecules, and their subsequent quantification, aims to assess global gene expression changes on the RNA level (transcriptome). While advances in high-throughput RNA-sequencing (RNA-seq) technologies allow for inexpensive data generation, accurate post-processing and normalization across samples is required to eliminate any systematic noise introduced by the biochemical and/or technical processes. Existing methods thus either normalize on selected known reference genes that are invariant in expression across the experiment, assume that the majority of genes are invariant, or that the effects of up- and down-regulated genes cancel each other out during the normalization. Here, we present a novel method, moose 2 , which predicts invariant genes in silico through a dynamic programming (DP) scheme and applies a quadratic normalization based on this subset. The method allows for specifying a set of known or experimentally validated invariant genes, which guides the DP. We experimentally verified the predictions of this method in the bacterium Escherichia coli , and show how moose 2 is able to (i) estimate the expression value distances between RNA-seq samples, (ii) reduce the variation of expression values across all samples, and (iii) to subsequently reveal new functional groups of genes during the late stages of DNA damage. We further applied the method to three eukaryotic data sets, on which its performance compares favourably to other methods. The software is implemented in C++ and is publicly available from http://grabherr.github.io/moose2/. The proposed RNA-seq normalization method, moose 2 , is a valuable alternative to existing methods, with two major advantages: (i) in silico prediction of invariant genes provides a list of potential reference genes for downstream analyses, and (ii) non-linear artefacts in RNA-seq data are handled adequately to minimize variations between replicates.

  15. The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies.

    PubMed

    Yoshida, Catherine E; Kruczkiewicz, Peter; Laing, Chad R; Lingohr, Erika J; Gannon, Victor P J; Nash, John H E; Taboada, Eduardo N

    2016-01-01

    For nearly 100 years serotyping has been the gold standard for the identification of Salmonella serovars. Despite the increasing adoption of DNA-based subtyping approaches, serotype information remains a cornerstone in food safety and public health activities aimed at reducing the burden of salmonellosis. At the same time, recent advances in whole-genome sequencing (WGS) promise to revolutionize our ability to perform advanced pathogen characterization in support of improved source attribution and outbreak analysis. We present the Salmonella In Silico Typing Resource (SISTR), a bioinformatics platform for rapidly performing simultaneous in silico analyses for several leading subtyping methods on draft Salmonella genome assemblies. In addition to performing serovar prediction by genoserotyping, this resource integrates sequence-based typing analyses for: Multi-Locus Sequence Typing (MLST), ribosomal MLST (rMLST), and core genome MLST (cgMLST). We show how phylogenetic context from cgMLST analysis can supplement the genoserotyping analysis and increase the accuracy of in silico serovar prediction to over 94.6% on a dataset comprised of 4,188 finished genomes and WGS draft assemblies. In addition to allowing analysis of user-uploaded whole-genome assemblies, the SISTR platform incorporates a database comprising over 4,000 publicly available genomes, allowing users to place their isolates in a broader phylogenetic and epidemiological context. The resource incorporates several metadata driven visualizations to examine the phylogenetic, geospatial and temporal distribution of genome-sequenced isolates. As sequencing of Salmonella isolates at public health laboratories around the world becomes increasingly common, rapid in silico analysis of minimally processed draft genome assemblies provides a powerful approach for molecular epidemiology in support of public health investigations. Moreover, this type of integrated analysis using multiple sequence-based methods of sub-typing allows for continuity with historical serotyping data as we transition towards the increasing adoption of genomic analyses in epidemiology. The SISTR platform is freely available on the web at https://lfz.corefacility.ca/sistr-app/.

  16. Strategies for In Vivo Screening and Mitigation of Hepatotoxicity Associated with Antisense Drugs.

    PubMed

    Kamola, Piotr J; Maratou, Klio; Wilson, Paul A; Rush, Kay; Mullaney, Tanya; McKevitt, Tom; Evans, Paula; Ridings, Jim; Chowdhury, Probash; Roulois, Aude; Fairchild, Ann; McCawley, Sean; Cartwright, Karen; Gooderham, Nigel J; Gant, Timothy W; Moores, Kitty; Hughes, Stephen A; Edbrooke, Mark R; Clark, Kenneth; Parry, Joel D

    2017-09-15

    Antisense oligonucleotide (ASO) gapmers downregulate gene expression by inducing enzyme-dependent degradation of targeted RNA and represent a promising therapeutic platform for addressing previously undruggable genes. Unfortunately, their therapeutic application, particularly that of the more potent chemistries (e.g., locked-nucleic-acid-containing gapmers), has been hampered by their frequent hepatoxicity, which could be driven by hybridization-mediated interactions. An early de-risking of this liability is a crucial component of developing safe, ASO-based drugs. To rank ASOs based on their effect on the liver, we have developed an acute screen in the mouse that can be applied early in the drug development cycle. A single-dose (3-day) screen with streamlined endpoints (i.e., plasma transaminase levels and liver weights) was observed to be predictive of ASO hepatotoxicity ranking established based on a repeat-dose (15 day) study. Furthermore, to study the underlying mechanisms of liver toxicity, we applied transcriptome profiling and pathway analyses and show that adverse in vivo liver phenotypes correlate with the number of potent, hybridization-mediated off-target effects (OTEs). We propose that a combination of in silico OTE predictions, streamlined in vivo hepatotoxicity screening, and a transcriptome-wide selectivity screen is a valid approach to identifying and progressing safer compounds. Copyright © 2017 GSK R&D. Published by Elsevier Inc. All rights reserved.

  17. Identification of Novel Placentally Expressed Aspartic Proteinase in Humans

    PubMed Central

    Majewska, Marta; Lipka, Aleksandra; Panasiewicz, Grzegorz; Gowkielewicz, Marek; Jozwik, Marcin; Majewski, Mariusz Krzysztof; Szafranska, Bozena

    2017-01-01

    This study presents pioneering data concerning the human pregnancy-associated glycoprotein-Like family, identified in the genome, of the term placental transcriptome and proteome. RNA-seq allowed the identification of 1364 bp hPAG-L/pep cDNA with at least 56.5% homology with other aspartic proteinases (APs). In silico analyses revealed 388 amino acids (aa) of full-length hPAG-L polypeptide precursor, with 15 aa-signal peptide, 47 aa-blocking peptide and 326 aa-mature protein, and two Asp residues (D), specific for a catalytic cleft of the APs (VVFDTGSSNLWV91-102 and AIVDTGTSLLTG274-285). Capillary sequencing identified 9330 bp of the hPAG-L gene (Gen Bank Acc. No. KX533473), composed of nine exons and eight introns. Heterologous Western blotting revealed the presence of one dominant 60 kDa isoform of the hPAG-L amongst cellular placental proteins. Detection with anti-pPAG-P and anti-Rec pPAG2 polyclonals allowed identification of the hPAG-L proteins located within regions of chorionic villi, especially within the syncytiotrophoblast of term singleton placentas. Our novel data extend the present knowledge about the human genome, as well as placental transcriptome and proteome during term pregnancy. Presumably, this may contribute to establishing a new diagnostic tool for examination of some disturbances during human pregnancy, as well as growing interest from both scientific and clinical perspectives. PMID:28594357

  18. Identification of Novel Placentally Expressed Aspartic Proteinase in Humans.

    PubMed

    Majewska, Marta; Lipka, Aleksandra; Panasiewicz, Grzegorz; Gowkielewicz, Marek; Jozwik, Marcin; Majewski, Mariusz Krzysztof; Szafranska, Bozena

    2017-06-08

    This study presents pioneering data concerning the human pregnancy-associated glycoprotein-Like family, identified in the genome, of the term placental transcriptome and proteome. RNA-seq allowed the identification of 1364 bp hPAG-L/pep cDNA with at least 56.5% homology with other aspartic proteinases (APs). In silico analyses revealed 388 amino acids (aa) of full-length hPAG-L polypeptide precursor, with 15 aa-signal peptide, 47 aa-blocking peptide and 326 aa-mature protein, and two Asp residues (D), specific for a catalytic cleft of the APs (VVFDTGSSNLWV91-102 and AIVDTGTSLLTG274-285). Capillary sequencing identified 9330 bp of the hPAG-L gene (Gen Bank Acc. No. KX533473), composed of nine exons and eight introns. Heterologous Western blotting revealed the presence of one dominant 60 kDa isoform of the hPAG-L amongst cellular placental proteins. Detection with anti-pPAG-P and anti-Rec pPAG2 polyclonals allowed identification of the hPAG-L proteins located within regions of chorionic villi, especially within the syncytiotrophoblast of term singleton placentas. Our novel data extend the present knowledge about the human genome, as well as placental transcriptome and proteome during term pregnancy. Presumably, this may contribute to establishing a new diagnostic tool for examination of some disturbances during human pregnancy, as well as growing interest from both scientific and clinical perspectives.

  19. An ovary transcriptome for all maturational stages of the striped bass (Morone saxatilis), a highly advanced perciform fish.

    PubMed

    Reading, Benjamin J; Chapman, Robert W; Schaff, Jennifer E; Scholl, Elizabeth H; Opperman, Charles H; Sullivan, Craig V

    2012-02-21

    The striped bass and its relatives (genus Morone) are important fisheries and aquaculture species native to estuaries and rivers of the Atlantic coast and Gulf of Mexico in North America. To open avenues of gene expression research on reproduction and breeding of striped bass, we generated a collection of expressed sequence tags (ESTs) from a complementary DNA (cDNA) library representative of their ovarian transcriptome. Sequences of a total of 230,151 ESTs (51,259,448 bp) were acquired by Roche 454 pyrosequencing of cDNA pooled from ovarian tissues obtained at all stages of oocyte growth, at ovulation (eggs), and during preovulatory atresia. Quality filtering of ESTs allowed assembly of 11,208 high-quality contigs ≥ 100 bp, including 2,984 contigs 500 bp or longer (average length 895 bp). Blastx comparisons revealed 5,482 gene orthologues (E-value < 10-3), of which 4,120 (36.7% of total contigs) were annotated with Gene Ontology terms (E-value < 10-6). There were 5,726 remaining unknown unique sequences (51.1% of total contigs). All of the high-quality EST sequences are available in the National Center for Biotechnology Information (NCBI) Short Read Archive (GenBank: SRX007394). Informative contigs were considered to be abundant if they were assembled from groups of ESTs comprising ≥ 0.15% of the total short read sequences (≥ 345 reads/contig). Approximately 52.5% of these abundant contigs were predicted to have predominant ovary expression through digital differential display in silico comparisons to zebrafish (Danio rerio) UniGene orthologues. Over 1,300 Gene Ontology terms from Biological Process classes of Reproduction, Reproductive process, and Developmental process were assigned to this collection of annotated contigs. This first large reference sequence database available for the ecologically and economically important temperate basses (genus Morone) provides a foundation for gene expression studies in these species. The predicted predominance of ovary gene expression and assignment of directly relevant Gene Ontology classes suggests a powerful utility of this dataset for analysis of ovarian gene expression related to fundamental questions of oogenesis. Additionally, a high definition Agilent 60-mer oligo ovary 'UniClone' microarray with 8 × 15,000 probe format has been designed based on this striped bass transcriptome (eArray Group: Striper Group, Design ID: 029004).

  20. Proposal of an in silico profiler for categorisation of repeat dose toxicity data of hair dyes.

    PubMed

    Nelms, M D; Ates, G; Madden, J C; Vinken, M; Cronin, M T D; Rogiers, V; Enoch, S J

    2015-05-01

    This study outlines the analysis of 94 chemicals with repeat dose toxicity data taken from Scientific Committee on Consumer Safety opinions for commonly used hair dyes in the European Union. Structural similarity was applied to group these chemicals into categories. Subsequent mechanistic analysis suggested that toxicity to mitochondria is potentially a key driver of repeat dose toxicity for chemicals within each of the categories. The mechanistic hypothesis allowed for an in silico profiler consisting of four mechanism-based structural alerts to be proposed. These structural alerts related to a number of important chemical classes such as quinones, anthraquinones, substituted nitrobenzenes and aromatic azos. This in silico profiler is intended for grouping chemicals into mechanism-based categories within the adverse outcome pathway paradigm.

  1. Analysis of the Citrullus colocynthis Transcriptome during Water Deficit Stress

    PubMed Central

    Wang, Zhuoyu; Hu, Hongtao; Goertzen, Leslie R.; McElroy, J. Scott; Dane, Fenny

    2014-01-01

    Citrullus colocynthis is a very drought tolerant species, closely related to watermelon (C. lanatus var. lanatus), an economically important cucurbit crop. Drought is a threat to plant growth and development, and the discovery of drought inducible genes with various functions is of great importance. We used high throughput mRNA Illumina sequencing technology and bioinformatic strategies to analyze the C. colocynthis leaf transcriptome under drought treatment. Leaf samples at four different time points (0, 24, 36, or 48 hours of withholding water) were used for RNA extraction and Illumina sequencing. qRT-PCR of several drought responsive genes was performed to confirm the accuracy of RNA sequencing. Leaf transcriptome analysis provided the first glimpse of the drought responsive transcriptome of this unique cucurbit species. A total of 5038 full-length cDNAs were detected, with 2545 genes showing significant changes during drought stress. Principle component analysis indicated that drought was the major contributing factor regulating transcriptome changes. Up regulation of many transcription factors, stress signaling factors, detoxification genes, and genes involved in phytohormone signaling and citrulline metabolism occurred under the water deficit conditions. The C. colocynthis transcriptome data highlight the activation of a large set of drought related genes in this species, thus providing a valuable resource for future functional analysis of candidate genes in defense of drought stress. PMID:25118696

  2. Cancer Transcriptome Dataset Analysis: Comparing Methods of Pathway and Gene Regulatory Network-Based Cluster Identification.

    PubMed

    Nam, Seungyoon

    2017-04-01

    Cancer transcriptome analysis is one of the leading areas of Big Data science, biomarker, and pharmaceutical discovery, not to forget personalized medicine. Yet, cancer transcriptomics and postgenomic medicine require innovation in bioinformatics as well as comparison of the performance of available algorithms. In this data analytics context, the value of network generation and algorithms has been widely underscored for addressing the salient questions in cancer pathogenesis. Analysis of cancer trancriptome often results in complicated networks where identification of network modularity remains critical, for example, in delineating the "druggable" molecular targets. Network clustering is useful, but depends on the network topology in and of itself. Notably, the performance of different network-generating tools for network cluster (NC) identification has been little investigated to date. Hence, using gastric cancer (GC) transcriptomic datasets, we compared two algorithms for generating pathway versus gene regulatory network-based NCs, showing that the pathway-based approach better agrees with a reference set of cancer-functional contexts. Finally, by applying pathway-based NC identification to GC transcriptome datasets, we describe cancer NCs that associate with candidate therapeutic targets and biomarkers in GC. These observations collectively inform future research on cancer transcriptomics, drug discovery, and rational development of new analysis tools for optimal harnessing of omics data.

  3. Neuropeptides encoded by the genomes of the Akoya pearl oyster Pinctata fucata and Pacific oyster Crassostrea gigas: a bioinformatic and peptidomic survey.

    PubMed

    Stewart, Michael J; Favrel, Pascal; Rotgans, Bronwyn A; Wang, Tianfang; Zhao, Min; Sohail, Manzar; O'Connor, Wayne A; Elizur, Abigail; Henry, Joel; Cummins, Scott F

    2014-10-02

    Oysters impart significant socio-ecological benefits from primary production of food supply, to estuarine ecosystems via reduction of water column nutrients, plankton and seston biomass. Little though is known at the molecular level of what genes are responsible for how oysters reproduce, filter nutrients, survive stressful physiological events and form reef communities. Neuropeptides represent a diverse class of chemical messengers, instrumental in orchestrating these complex physiological events in other species. By a combination of in silico data mining and peptide analysis of ganglia, 74 putative neuropeptide genes were identified from genome and transcriptome databases of the Akoya pearl oyster, Pinctata fucata and the Pacific oyster, Crassostrea gigas, encoding precursors for over 300 predicted bioactive peptide products, including three newly identified neuropeptide precursors PFGx8amide, RxIamide and Wx3Yamide. Our findings also include a gene for the gonadotropin-releasing hormone (GnRH) and two egg-laying hormones (ELH) which were identified from both oysters. Multiple sequence alignments and phylogenetic analysis supports similar global organization of these mature peptides. Computer-based peptide modeling of the molecular tertiary structures of ELH highlights the structural homologies within ELH family, which may facilitate ELH activity leading to the release of gametes. Our analysis demonstrates that oysters possess conserved molluscan neuropeptide domains and overall precursor organization whilst highlighting many previously unrecognized bivalve idiosyncrasies. This genomic analysis provides a solid foundation from which further studies aimed at the functional characterization of these molluscan neuropeptides can be conducted to further stimulate advances in understanding the ecology and cultivation of oysters.

  4. MTHFR-Ala222Val and male infertility: a study in Iranian men, an updated meta-analysis and an in silico-analysis.

    PubMed

    Nikzad, Hossein; Karimian, Mohammad; Sareban, Kobra; Khoshsokhan, Maryam; Hosseinzadeh Colagar, Abasalt

    2015-11-01

    Methylenetetrahydrofolate reductase (MTHFR) functions as a main regulatory enzyme in folate metabolism. The association of MTHFR gene Ala222Val polymorphism with male infertility in an Iranian population was investigated by undertaking a meta-analysis and in-silico approach. A genetic association study included 497 men; 242 had unexplained infertility and 255 were healthy controls. Polymerase chain reaction restriction fragment length polymorphism was used for genotyping MTHFR-Ala222Val. OpenMeta[Analyst] software was used to conduct the analysis; 22 studies were identified by searching PubMed and the currently reported genetic association study. A novel in-silico approach was used to analyse the effects of Ala222Val substitution on the structure of mRNA and protein. Genetic association study revealed a significant association of MTHFR-222Val/Val genotype with oligozoospermia (OR 2.32; 95% CI, 1.12 to 4.78; P = 0.0451) and azoospermia (OR 2.59; 95% CI 1.09 to 6.17; P = 0.0314). Meta-analysis for allelic, dominant and codominant models showed a significant association between Ala222Val polymorphism and the risk of male infertility (P < 0.001). In silico-analysis showed MTHFR-Ala222Val affects enzyme structure and could also change the mRNA properties (P = 0.1641; P < 0.2 is significant). The meta-analysis suggested significant association of MTHFR-Ala222Val with risk of male infertility, especially in Asian populations. Copyright © 2015 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.

  5. Salivary Polytene Chromosome Map of Anopheles darlingi, the Main Vector of Neotropical Malaria

    PubMed Central

    Rafael, Míriam S.; Rohde, Cláudia; Bridi, Letícia C.; da Silva Valente Gaiesky, Vera Lúcia; Tadei, Wanderli P.

    2010-01-01

    New photomap of Anopheles (Nyssorhynchus) darlingi Root, 1926, is described for a population from Guajará-Mirim, State of Rondonia, Brazil. The number of sections in the previous A. darlingi reference map was maintained and new subsections were added to the five chromosome arms. Breakage points of paracentric inversions had been previously incorporated into the photomap of this species. An additional inversion is reported, called 3Lc, totaling 14 inversions in the A. darlingi chromosome arms. The proposed photomap is potentially useful for further evolutionary studies in addition to physical and in silico chromosome mapping using A. darlingi genomic and transcriptome sequences. Furthermore, in our attempt to compare sections of the 2R chromosome arm of A. darlingi with Anopheles funestus, Anopheles stephensi, and Anopheles gambiae, we found great differences in the arrangement of the polytene chromosome bands, which are consistent with the known phylogenetic divergence of these species. PMID:20682862

  6. Bioremediation of uranium-contaminated groundwater: a systems approach to subsurface biogeochemistry.

    PubMed

    Williams, Kenneth H; Bargar, John R; Lloyd, Jonathan R; Lovley, Derek R

    2013-06-01

    Adding organic electron donors to stimulate microbial reduction of highly soluble U(VI) to less soluble U(IV) is a promising strategy for immobilizing uranium in contaminated subsurface environments. Studies suggest that diagnosing the in situ physiological status of the subsurface community during uranium bioremediation with environmental transcriptomic and proteomic techniques can identify factors potentially limiting U(VI) reduction activity. Models which couple genome-scale in silico representations of the metabolism of key microbial populations with geochemical and hydrological models may be able to predict the outcome of bioremediation strategies and aid in the development of new approaches. Concerns remain about the long-term stability of sequestered U(IV) minerals and the release of co-contaminants associated with Fe(III) oxides, which might be overcome through targeted delivery of electrons to select microorganisms using in situ electrodes. Copyright © 2012 Elsevier Ltd. All rights reserved.

  7. Evidence for a hierarchical transcriptional circuit in Drosophila male germline involving testis-specific TAF and two gene-specific transcription factors, Mod and Acj6.

    PubMed

    Jiang, Mei; Gao, Zhengliang; Wang, Jian; Nurminsky, Dmitry I

    2018-01-01

    To analyze transcription factors involved in gene regulation by testis-specific TAF (tTAF), tTAF-dependent promoters were mapped and analyzed in silico. Core promoters show decreased AT content, paucity of classical promoter motifs, and enrichment with translation control element CAAAATTY. Scanning of putative regulatory regions for known position frequency matrices identified 19 transcription regulators possibly contributing to tTAF-driven gene expression. Decreased male fertility associated with mutation in one of the regulators, Acj6, indicates its involvement in male reproduction. Transcriptome study of testes from male mutants for tTAF, Acj6, and previously characterized tTAF-interacting factor Modulo implies the existence of a regulatory hierarchy of tTAF, Modulo and Acj6, in which Modulo and/or Acj6 regulate one-third of tTAF-dependent genes. © 2017 Federation of European Biochemical Societies.

  8. 6-Gingerol reduces Pseudomonas aeruginosa biofilm formation and virulence via quorum sensing inhibition.

    PubMed

    Kim, Han-Shin; Lee, Sang-Hoon; Byun, Youngjoo; Park, Hee-Deung

    2015-03-02

    Pseudomonas aeruginosa is a well-known pathogenic bacterium that forms biofilms and produces virulence factors via quorum sensing (QS). Interfering with normal QS interactions between signal molecules and their cognate receptors is a developing strategy for attenuating its virulence. Here we tested the hypothesis that 6-gingerol, a pungent oil of fresh ginger, reduces biofilm formation and virulence by antagonistically binding to P. aeruginosa QS receptors. In silico studies demonstrated molecular binding occurs between 6-gingerol and the QS receptor LasR through hydrogen bonding and hydrophobic interactions. Experimentally 6-gingerol reduced biofilm formation, several virulence factors (e.g., exoprotease, rhamnolipid, and pyocyanin), and mice mortality. Further transcriptome analyses demonstrated that 6-gingerol successfully repressed QS-induced genes, specifically those related to the production of virulence factors. These results strongly support our hypothesis and offer insight into the molecular mechanism that caused QS gene repression.

  9. 6-Gingerol reduces Pseudomonas aeruginosa biofilm formation and virulence via quorum sensing inhibition

    PubMed Central

    Kim, Han-Shin; Lee, Sang-Hoon; Byun, Youngjoo; Park, Hee-Deung

    2015-01-01

    Pseudomonas aeruginosa is a well-known pathogenic bacterium that forms biofilms and produces virulence factors via quorum sensing (QS). Interfering with normal QS interactions between signal molecules and their cognate receptors is a developing strategy for attenuating its virulence. Here we tested the hypothesis that 6-gingerol, a pungent oil of fresh ginger, reduces biofilm formation and virulence by antagonistically binding to P. aeruginosa QS receptors. In silico studies demonstrated molecular binding occurs between 6-gingerol and the QS receptor LasR through hydrogen bonding and hydrophobic interactions. Experimentally 6-gingerol reduced biofilm formation, several virulence factors (e.g., exoprotease, rhamnolipid, and pyocyanin), and mice mortality. Further transcriptome analyses demonstrated that 6-gingerol successfully repressed QS-induced genes, specifically those related to the production of virulence factors. These results strongly support our hypothesis and offer insight into the molecular mechanism that caused QS gene repression. PMID:25728862

  10. Quantitative RNA-seq analysis of the Campylobacter jejuni transcriptome

    PubMed Central

    Chaudhuri, Roy R.; Yu, Lu; Kanji, Alpa; Perkins, Timothy T.; Gardner, Paul P.; Choudhary, Jyoti; Maskell, Duncan J.

    2011-01-01

    Campylobacter jejuni is the most common bacterial cause of foodborne disease in the developed world. Its general physiology and biochemistry, as well as the mechanisms enabling it to colonize and cause disease in various hosts, are not well understood, and new approaches are required to understand its basic biology. High-throughput sequencing technologies provide unprecedented opportunities for functional genomic research. Recent studies have shown that direct Illumina sequencing of cDNA (RNA-seq) is a useful technique for the quantitative and qualitative examination of transcriptomes. In this study we report RNA-seq analyses of the transcriptomes of C. jejuni (NCTC11168) and its rpoN mutant. This has allowed the identification of hitherto unknown transcriptional units, and further defines the regulon that is dependent on rpoN for expression. The analysis of the NCTC11168 transcriptome was supplemented by additional proteomic analysis using liquid chromatography-MS. The transcriptomic and proteomic datasets represent an important resource for the Campylobacter research community. PMID:21816880

  11. Analysis of Transcriptomic Dose Response Data in the ...

    EPA Pesticide Factsheets

    Slide presentation at the HESI-HEALTH Canada-McGill Workshop on Transcriptomic Dose Response Data in the Context of Chemical Risk Assessment Slide presentation at the HESI-HEALTH Canada-McGill Workshop on Transcriptomic Dose Response Data in the Context of Chemical Risk Assessment

  12. Developmental Transcriptome for a Facultatively Eusocial Bee, Megalopta genalis

    PubMed Central

    Jones, Beryl M.; Wcislo, William T.; Robinson, Gene E.

    2015-01-01

    Transcriptomes provide excellent foundational resources for mechanistic and evolutionary analyses of complex traits. We present a developmental transcriptome for the facultatively eusocial bee Megalopta genalis, which represents a potential transition point in the evolution of eusociality. A de novo transcriptome assembly of Megalopta genalis was generated using paired-end Illumina sequencing and the Trinity assembler. Males and females of all life stages were aligned to this transcriptome for analysis of gene expression profiles throughout development. Gene Ontology analysis indicates that stage-specific genes are involved in ion transport, cell–cell signaling, and metabolism. A number of distinct biological processes are upregulated in each life stage, and transitions between life stages involve shifts in dominant functional processes, including shifts from transcriptional regulation in embryos to metabolism in larvae, and increased lipid metabolism in adults. We expect that this transcriptome will provide a useful resource for future analyses to better understand the molecular basis of the evolution of eusociality and, more generally, phenotypic plasticity. PMID:26276382

  13. Developmental Transcriptome for a Facultatively Eusocial Bee, Megalopta genalis.

    PubMed

    Jones, Beryl M; Wcislo, William T; Robinson, Gene E

    2015-08-14

    Transcriptomes provide excellent foundational resources for mechanistic and evolutionary analyses of complex traits. We present a developmental transcriptome for the facultatively eusocial bee Megalopta genalis, which represents a potential transition point in the evolution of eusociality. A de novo transcriptome assembly of Megalopta genalis was generated using paired-end Illumina sequencing and the Trinity assembler. Males and females of all life stages were aligned to this transcriptome for analysis of gene expression profiles throughout development. Gene Ontology analysis indicates that stage-specific genes are involved in ion transport, cell-cell signaling, and metabolism. A number of distinct biological processes are upregulated in each life stage, and transitions between life stages involve shifts in dominant functional processes, including shifts from transcriptional regulation in embryos to metabolism in larvae, and increased lipid metabolism in adults. We expect that this transcriptome will provide a useful resource for future analyses to better understand the molecular basis of the evolution of eusociality and, more generally, phenotypic plasticity. Copyright © 2015 Jones et al.

  14. A survey of the sorghum transcriptome using single-molecule long reads

    DOE PAGES

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...

    2016-06-24

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less

  15. A survey of the sorghum transcriptome using single-molecule long reads

    PubMed Central

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.

    2016-01-01

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290

  16. Transcriptome In Vivo Analysis (TIVA) of spatially defined single cells in intact live mouse and human brain tissue

    PubMed Central

    Lovatt, Ditte; Ruble, Brittani K.; Lee, Jaehee; Dueck, Hannah; Kim, Tae Kyung; Fisher, Stephen; Francis, Chantal; Spaethling, Jennifer M.; Wolf, John A.; Grady, M. Sean; Ulyanova, Alexandra V.; Yeldell, Sean B.; Griepenburg, Julianne C.; Buckley, Peter T.; Kim, Junhyong; Sul, Jai-Yoon; Dmochowski, Ivan J.; Eberwine, James

    2014-01-01

    Transcriptome profiling is an indispensable tool in advancing the understanding of single cell biology, but depends upon methods capable of isolating mRNA at the spatial resolution of a single cell. Current capture methods lack sufficient spatial resolution to isolate mRNA from individual in vivo resident cells without damaging adjacent tissue. Because of this limitation, it has been difficult to assess the influence of the microenvironment on the transcriptome of individual neurons. Here, we engineered a Transcriptome In Vivo Analysis (TIVA)-tag, which upon photoactivation enables mRNA capture from single cells in live tissue. Using the TIVA-tag in combination with RNA-seq to analyze transcriptome variance among single dispersed cells and in vivo resident mouse and human neurons, we show that the tissue microenvironment shapes the transcriptomic landscape of individual cells. The TIVA methodology provides the first noninvasive approach for capturing mRNA from single cells in their natural microenvironment. PMID:24412976

  17. PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.

    PubMed

    Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus

    2016-12-22

    Next-generation sequencing promises the de novo genomic and transcriptomic analysis of samples of interests. However, there are only a few organisms having reference genomic sequences and even fewer having well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and quantification methods for quantifying expression level which not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the searched result, a set of virtual transcripts are generated and served as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning from the comparison among transcriptomes, PARRoT further provides linkage between these virtual transcripts and their potential function through showing best hits in SwissProt, NR database, assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours. In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms which offers the opportunity to quantify and compare transcriptome profiles through a homolog based virtual transcriptome reference. By using the homolog based reference, our strategy effectively avoids the problems that may cause from inconsistencies among transcriptomes. This strategy will shed lights on the field of comparative genomics for non-model organism. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw .

  18. Promzea: a pipeline for discovery of co-regulatory motifs in maize and other plant species and its application to the anthocyanin and phlobaphene biosynthetic pathways and the Maize Development Atlas.

    PubMed

    Liseron-Monfils, Christophe; Lewis, Tim; Ashlock, Daniel; McNicholas, Paul D; Fauteux, François; Strömvik, Martina; Raizada, Manish N

    2013-03-15

    The discovery of genetic networks and cis-acting DNA motifs underlying their regulation is a major objective of transcriptome studies. The recent release of the maize genome (Zea mays L.) has facilitated in silico searches for regulatory motifs. Several algorithms exist to predict cis-acting elements, but none have been adapted for maize. A benchmark data set was used to evaluate the accuracy of three motif discovery programs: BioProspector, Weeder and MEME. Analysis showed that each motif discovery tool had limited accuracy and appeared to retrieve a distinct set of motifs. Therefore, using the benchmark, statistical filters were optimized to reduce the false discovery ratio, and then remaining motifs from all programs were combined to improve motif prediction. These principles were integrated into a user-friendly pipeline for motif discovery in maize called Promzea, available at http://www.promzea.org and on the Discovery Environment of the iPlant Collaborative website. Promzea was subsequently expanded to include rice and Arabidopsis. Within Promzea, a user enters cDNA sequences or gene IDs; corresponding upstream sequences are retrieved from the maize genome. Predicted motifs are filtered, combined and ranked. Promzea searches the chosen plant genome for genes containing each candidate motif, providing the user with the gene list and corresponding gene annotations. Promzea was validated in silico using a benchmark data set: the Promzea pipeline showed a 22% increase in nucleotide sensitivity compared to the best standalone program tool, Weeder, with equivalent nucleotide specificity. Promzea was also validated by its ability to retrieve the experimentally defined binding sites of transcription factors that regulate the maize anthocyanin and phlobaphene biosynthetic pathways. Promzea predicted additional promoter motifs, and genome-wide motif searches by Promzea identified 127 non-anthocyanin/phlobaphene genes that each contained all five predicted promoter motifs in their promoters, perhaps uncovering a broader co-regulated gene network. Promzea was also tested against tissue-specific microarray data from maize. An online tool customized for promoter motif discovery in plants has been generated called Promzea. Promzea was validated in silico by its ability to retrieve benchmark motifs and experimentally defined motifs and was tested using tissue-specific microarray data. Promzea predicted broader networks of gene regulation associated with the historic anthocyanin and phlobaphene biosynthetic pathways. Promzea is a new bioinformatics tool for understanding transcriptional gene regulation in maize and has been expanded to include rice and Arabidopsis.

  19. Phylogenetic analysis of the GST family in Anopheles (Nyssorhynchus) darlingi.

    PubMed

    Azevedo-Júnior, Gilson Martins de; Guimarães-Marques, Giselle Moura; Cegatti Bridi, Leticia; Christine Ohse, Ketlen; Vicentini, Renato; Tadei, Wanderli; Rafael, Míriam Silva

    2014-08-01

    Anopheles darlingi Root, 1926 and Anopheles gambiae (Diptera: Culicidae) are the most important human malaria vectors in South America and Africa, respectively. The two species are estimated to have diverged 100 million years ago. Studies on the phylogenetics and evolution of gene sequences, such as glutathione S-transferase (GST) in disease-transmitting mosquitoes are scarce. The sigma class GST (KC890767) from the transcriptome of An. darlingi captured in the Brazilian Amazon was studied by in silico hybridization, and mapped to chromosome 3 of An. gambiae. The sigma class GST of An. darlingi was used for phylogenetic analyses to understand the GST base composition of the most recent common ancestor between An. darlingi, Anopheles gambiae, Aedes aegypti and Culex quinquefasciatus. The GST (KC890767) of An. darlingi was studied to generate the main divergence branches using a Neighbor-Joining and bootstrapping approaches to confirm confidence levels on the tree nodes that separate the An. darlingi and other mosquito species. The results showed divergence between An. gambiae, Ae. Aegypti, Cx. quinquefasciatus, and Phlebotomus papatasi as outgroup, and the homology relationship between sigma class GST of An. darlingi and GSTS1_1 gene of An. gambiae was valuable for phylogenetic and evolutionary studies. Copyright © 2014 Elsevier B.V. All rights reserved.

  20. Combination of six enzymes of a marine Novosphingobium converts the stereoisomers of β-O-4 lignin model dimers into the respective monomers

    PubMed Central

    Ohta, Yukari; Nishi, Shinro; Hasegawa, Ryoichi; Hatada, Yuji

    2015-01-01

    Lignin, an aromatic polymer of phenylpropane units joined predominantly by β-O-4 linkages, is the second most abundant biomass component on Earth. Despite the continuous discharge of terrestrially produced lignin into marine environments, few studies have examined lignin degradation by marine microorganisms. Here, we screened marine isolates for β-O-4 cleavage activity and determined the genes responsible for this enzymatic activity in one positive isolate. Novosphingobium sp. strain MBES04 converted all four stereoisomers of guaiacylglycerol-β-guaiacyl ether (GGGE), a structural mimic of lignin, to guaiacylhydroxypropanone as an end metabolite in three steps involving six enzymes, including a newly identified Nu-class glutathione-S-transferase (GST). In silico searches of the strain MBES04 genome revealed that four GGGE-metabolizing GST genes were arranged in a cluster. Transcriptome analysis demonstrated that the lignin model compounds GGGE and (2-methoxyphenoxy)hydroxypropiovanillone (MPHPV) enhanced the expression of genes in involved in energy metabolism, including aromatic-monomer assimilation, and evoked defense responses typically expressed upon exposure to toxic compounds. The findings from this study provide insight into previously unidentified bacterial enzymatic systems and the physiological acclimation of microbes associated with the biological transformation of lignin-containing materials in marine environments. PMID:26477321

  1. Transcription of two adjacent carbohydrate utilization gene clusters in Bifidobacterium breve UCC2003 is controlled by LacI- and repressor open reading frame kinase (ROK)-type regulators.

    PubMed

    O'Connell, Kerry Joan; Motherway, Mary O'Connell; Liedtke, Andrea; Fitzgerald, Gerald F; Paul Ross, R; Stanton, Catherine; Zomer, Aldert; van Sinderen, Douwe

    2014-06-01

    Members of the genus Bifidobacterium are commonly found in the gastrointestinal tracts of mammals, including humans, where their growth is presumed to be dependent on various diet- and/or host-derived carbohydrates. To understand transcriptional control of bifidobacterial carbohydrate metabolism, we investigated two genetic carbohydrate utilization clusters dedicated to the metabolism of raffinose-type sugars and melezitose. Transcriptomic and gene inactivation approaches revealed that the raffinose utilization system is positively regulated by an activator protein, designated RafR. The gene cluster associated with melezitose metabolism was shown to be subject to direct negative control by a LacI-type transcriptional regulator, designated MelR1, in addition to apparent indirect negative control by means of a second LacI-type regulator, MelR2. In silico analysis, DNA-protein interaction, and primer extension studies revealed the MelR1 and MelR2 operator sequences, each of which is positioned just upstream of or overlapping the correspondingly regulated promoter sequences. Similar analyses identified the RafR binding operator sequence located upstream of the rafB promoter. This study indicates that transcriptional control of gene clusters involved in carbohydrate metabolism in bifidobacteria is subject to conserved regulatory systems, representing either positive or negative control.

  2. Whole-exome sequencing analysis of Waardenburg syndrome in a Chinese family.

    PubMed

    Chen, Dezhong; Zhao, Na; Wang, Jing; Li, Zhuoyu; Wu, Changxin; Fu, Jie; Xiao, Han

    2017-01-01

    Waardenburg syndrome (WS) is a dominantly inherited, genetically heterogeneous auditory-pigmentary syndrome characterized by non-progressive sensorineural hearing loss and iris discoloration. By whole-exome sequencing (WES), we identified a nonsense mutation (c.598C>T) in PAX3 gene, predicted to be disease causing by in silico analysis. This is the first report of genetically diagnosed case of WS PAX3 c.598C>T nonsense mutation in Chinese ethnic origin by WES and in silico functional prediction methods.

  3. Whole-exome sequencing analysis of Waardenburg syndrome in a Chinese family

    PubMed Central

    Chen, Dezhong; Zhao, Na; Wang, Jing; Li, Zhuoyu; Wu, Changxin; Fu, Jie; Xiao, Han

    2017-01-01

    Waardenburg syndrome (WS) is a dominantly inherited, genetically heterogeneous auditory-pigmentary syndrome characterized by non-progressive sensorineural hearing loss and iris discoloration. By whole-exome sequencing (WES), we identified a nonsense mutation (c.598C>T) in PAX3 gene, predicted to be disease causing by in silico analysis. This is the first report of genetically diagnosed case of WS PAX3 c.598C>T nonsense mutation in Chinese ethnic origin by WES and in silico functional prediction methods. PMID:28690861

  4. Evaluation of a genome-scale in silico metabolic model for Geobacter metallireducens by using proteomic data from a field biostimulation experiment.

    PubMed

    Fang, Yilin; Wilkins, Michael J; Yabusaki, Steven B; Lipton, Mary S; Long, Philip E

    2012-12-01

    Accurately predicting the interactions between microbial metabolism and the physical subsurface environment is necessary to enhance subsurface energy development, soil and groundwater cleanup, and carbon management. This study was an initial attempt to confirm the metabolic functional roles within an in silico model using environmental proteomic data collected during field experiments. Shotgun global proteomics data collected during a subsurface biostimulation experiment were used to validate a genome-scale metabolic model of Geobacter metallireducens-specifically, the ability of the metabolic model to predict metal reduction, biomass yield, and growth rate under dynamic field conditions. The constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes. Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low abundances of proteins associated with amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment.

  5. Insights into transcriptomes of Big and Low sagebrush

    Treesearch

    Mark D. Huynh; Justin T. Page; Bryce A. Richardson; Joshua A. Udall

    2015-01-01

    We report the sequencing and assembly of three transcriptomes from Big (Artemisia tridentatassp. wyomingensis and A. tridentatassp. tridentata) and Low (A. arbuscula ssp. arbuscula) sagebrush. The sequence reads are available in the Sequence Read Archive of NCBI. We demonstrate the utilities of these transcriptomes for gene discovery and phylogenomic analysis. An...

  6. Bioinformatics Identification of Modules of Transcription Factor Binding Sites in Alzheimer's Disease-Related Genes by In Silico Promoter Analysis and Microarrays

    PubMed Central

    Augustin, Regina; Lichtenthaler, Stefan F.; Greeff, Michael; Hansen, Jens; Wurst, Wolfgang; Trümbach, Dietrich

    2011-01-01

    The molecular mechanisms and genetic risk factors underlying Alzheimer's disease (AD) pathogenesis are only partly understood. To identify new factors, which may contribute to AD, different approaches are taken including proteomics, genetics, and functional genomics. Here, we used a bioinformatics approach and found that distinct AD-related genes share modules of transcription factor binding sites, suggesting a transcriptional coregulation. To detect additional coregulated genes, which may potentially contribute to AD, we established a new bioinformatics workflow with known multivariate methods like support vector machines, biclustering, and predicted transcription factor binding site modules by using in silico analysis and over 400 expression arrays from human and mouse. Two significant modules are composed of three transcription factor families: CTCF, SP1F, and EGRF/ZBPF, which are conserved between human and mouse APP promoter sequences. The specific combination of in silico promoter and multivariate analysis can identify regulation mechanisms of genes involved in multifactorial diseases. PMID:21559189

  7. Decoding genes with coexpression networks and metabolomics - 'majority report by precogs'.

    PubMed

    Saito, Kazuki; Hirai, Masami Y; Yonekura-Sakakibara, Keiko

    2008-01-01

    Following the sequencing of whole genomes of model plants, high-throughput decoding of gene function is a major challenge in modern plant biology. In view of remarkable technical advances in transcriptomics and metabolomics, integrated analysis of these 'omics' by data-mining informatics is an excellent tool for prediction and identification of gene function, particularly for genes involved in complicated metabolic pathways. The availability of Arabidopsis public transcriptome datasets containing data of >1000 microarrays reinforces the potential for prediction of gene function by transcriptome coexpression analysis. Here, we review the strategy of combining transcriptome and metabolome as a powerful technology for studying the functional genomics of model plants and also crop and medicinal plants.

  8. Solanum torvum responses to the root-knot nematode Meloidogyne incognita

    PubMed Central

    2013-01-01

    Background Solanum torvum Sw is worldwide employed as rootstock for eggplant cultivation because of its vigour and resistance/tolerance to the most serious soil-borne diseases as bacterial, fungal wilts and root-knot nematodes. The little information on Solanum torvum (hereafter Torvum) resistance mechanisms, is mostly attributable to the lack of genomic tools (e.g. dedicated microarray) as well as to the paucity of database information limiting high-throughput expression studies in Torvum. Results As a first step towards transcriptome profiling of Torvum inoculated with the nematode M. incognita, we built a Torvum 3’ transcript catalogue. One-quarter of a 454 full run resulted in 205,591 quality-filtered reads. De novo assembly yielded 24,922 contigs and 11,875 singletons. Similarity searches of the S. torvum transcript tags catalogue produced 12,344 annotations. A 30,0000 features custom combimatrix chip was then designed and microarray hybridizations were conducted for both control and 14 dpi (day post inoculation) with Meloidogyne incognita-infected roots samples resulting in 390 differentially expressed genes (DEG). We also tested the chip with samples from the phylogenetically-related nematode-susceptible eggplant species Solanum melongena. An in-silico validation strategy was developed based on assessment of sequence similarity among Torvum probes and eggplant expressed sequences available in public repositories. GO term enrichment analyses with the 390 Torvum DEG revealed enhancement of several processes as chitin catabolism and sesquiterpenoids biosynthesis, while no GO term enrichment was found with eggplant DEG. The genes identified from S. torvum catalogue, bearing high similarity to known nematode resistance genes, were further investigated in view of their potential role in the nematode resistance mechanism. Conclusions By combining 454 pyrosequencing and microarray technology we were able to conduct a cost-effective global transcriptome profiling in a non-model species. In addition, the development of an in silico validation strategy allowed to further extend the use of the custom chip to a related species and to assess by comparison the expression of selected genes without major concerns of artifacts. The expression profiling of S. torvum responses to nematode infection points to sesquiterpenoids and chitinases as major effectors of nematode resistance. The availability of the long sequence tags in S. torvum catalogue will allow precise identification of active nematocide/nematostatic compounds and associated enzymes posing the basis for exploitation of these resistance mechanisms in other species. PMID:23937585

  9. Flux analysis and metabolomics for systematic metabolic engineering of microorganisms.

    PubMed

    Toya, Yoshihiro; Shimizu, Hiroshi

    2013-11-01

    Rational engineering of metabolism is important for bio-production using microorganisms. Metabolic design based on in silico simulations and experimental validation of the metabolic state in the engineered strain helps in accomplishing systematic metabolic engineering. Flux balance analysis (FBA) is a method for the prediction of metabolic phenotype, and many applications have been developed using FBA to design metabolic networks. Elementary mode analysis (EMA) and ensemble modeling techniques are also useful tools for in silico strain design. The metabolome and flux distribution of the metabolic pathways enable us to evaluate the metabolic state and provide useful clues to improve target productivity. Here, we reviewed several computational applications for metabolic engineering by using genome-scale metabolic models of microorganisms. We also discussed the recent progress made in the field of metabolomics and (13)C-metabolic flux analysis techniques, and reviewed these applications pertaining to bio-production development. Because these in silico or experimental approaches have their respective advantages and disadvantages, the combined usage of these methods is complementary and effective for metabolic engineering. Copyright © 2013 Elsevier Inc. All rights reserved.

  10. Transcriptome analysis in non-model species: a new method for the analysis of heterologous hybridization on microarrays

    PubMed Central

    2010-01-01

    Background Recent developments in high-throughput methods of analyzing transcriptomic profiles are promising for many areas of biology, including ecophysiology. However, although commercial microarrays are available for most common laboratory models, transcriptome analysis in non-traditional model species still remains a challenge. Indeed, the signal resulting from heterologous hybridization is low and difficult to interpret because of the weak complementarity between probe and target sequences, especially when no microarray dedicated to a genetically close species is available. Results We show here that transcriptome analysis in a species genetically distant from laboratory models is made possible by using MAXRS, a new method of analyzing heterologous hybridization on microarrays. This method takes advantage of the design of several commercial microarrays, with different probes targeting the same transcript. To illustrate and test this method, we analyzed the transcriptome of king penguin pectoralis muscle hybridized to Affymetrix chicken microarrays, two organisms separated by an evolutionary distance of approximately 100 million years. The differential gene expression observed between different physiological situations computed by MAXRS was confirmed by real-time PCR on 10 genes out of 11 tested. Conclusions MAXRS appears to be an appropriate method for gene expression analysis under heterologous hybridization conditions. PMID:20509979

  11. Single-cell analysis of the transcriptome and its application in the characterization of stem cells and early embryos.

    PubMed

    Liu, Na; Liu, Lin; Pan, Xinghua

    2014-07-01

    Cellular heterogeneity within a cell population is a common phenomenon in multicellular organisms, tissues, cultured cells, and even FACS-sorted subpopulations. Important information may be masked if the cells are studied as a mass. Transcriptome profiling is a parameter that has been intensively studied, and relatively easier to address than protein composition. To understand the basis and importance of heterogeneity and stochastic aspects of the cell function and its mechanisms, it is essential to examine transcriptomes of a panel of single cells. High-throughput technologies, starting from microarrays and now RNA-seq, provide a full view of the expression of transcriptomes but are limited by the amount of RNA for analysis. Recently, several new approaches for amplification and sequencing the transcriptome of single cells or a limited low number of cells have been developed and applied. In this review, we summarize these major strategies, such as PCR-based methods, IVT-based methods, phi29-DNA polymerase-based methods, and several other methods, including their principles, characteristics, advantages, and limitations, with representative applications in cancer stem cells, early development, and embryonic stem cells. The prospects for development of future technology and application of transcriptome analysis in a single cell are also discussed.

  12. In Silico Estimation of Translation Efficiency in Human Cell Lines: Potential Evidence for Widespread Translational Control

    PubMed Central

    Stevens, Stewart G.; Brown, Chris M

    2013-01-01

    Recently large scale transcriptome and proteome datasets for human cells have become available. A striking finding from these studies is that the level of an mRNA typically predicts no more than 40% of the abundance of protein. This correlation represents the overall figure for all genes. We present here a bioinformatic analysis of translation efficiency – the rate at which mRNA is translated into protein. We have analysed those human datasets that include genome wide mRNA and protein levels determined in the same study. The analysis comprises five distinct human cell lines that together provide comparable data for 8,170 genes. For each gene we have used levels of mRNA and protein combined with protein stability data from the HeLa cell line to estimate translation efficiency. This was possible for 3,990 genes in one or more cell lines and 1,807 genes in all five cell lines. Interestingly, our analysis and modelling shows that for many genes this estimated translation efficiency has considerable consistency between cell lines. Some deviations from this consistency likely result from the regulation of protein degradation. Others are likely due to known translational control mechanisms. These findings suggest it will be possible to build improved models for the interpretation of mRNA expression data. The results we present here provide a view of translation efficiency for many genes. We provide an online resource allowing the exploration of translation efficiency in genes of interest within different cell lines (http://bioanalysis.otago.ac.nz/TranslationEfficiency). PMID:23460887

  13. Transcriptome Analysis of Barbarea vulgaris Infested with Diamondback Moth (Plutella xylostella) Larvae

    PubMed Central

    Shen, Di; Wang, Haiping; Wu, Qingjun; Lu, Peng; Qiu, Yang; Song, Jiangping; Zhang, Youjun; Li, Xixiang

    2013-01-01

    Background The diamondback moth (DBM, Plutella xylostella) is a crucifer-specific pest that causes significant crop losses worldwide. Barbarea vulgaris (Brassicaceae) can resist DBM and other herbivorous insects by producing feeding-deterrent triterpenoid saponins. Plant breeders have long aimed to transfer this insect resistance to other crops. However, a lack of knowledge on the biosynthetic pathways and regulatory networks of these insecticidal saponins has hindered their practical application. A pyrosequencing-based transcriptome analysis of B. vulgaris during DBM larval feeding was performed to identify genes and gene networks responsible for saponin biosynthesis and its regulation at the genome level. Principal Findings Approximately 1.22, 1.19, 1.16, 1.23, 1.16, 1.20, and 2.39 giga base pairs of clean nucleotides were generated from B. vulgaris transcriptomes sampled 1, 4, 8, 12, 24, and 48 h after onset of P. xylostella feeding and from non-inoculated controls, respectively. De novo assembly using all data of the seven transcriptomes generated 39,531 unigenes. A total of 37,780 (95.57%) unigenes were annotated, 14,399 of which were assigned to one or more gene ontology terms and 19,620 of which were assigned to 126 known pathways. Expression profiles revealed 2,016–4,685 up-regulated and 557–5188 down-regulated transcripts. Secondary metabolic pathways, such as those of terpenoids, glucosinolates, and phenylpropanoids, and its related regulators were elevated. Candidate genes for the triterpene saponin pathway were found in the transcriptome. Orthological analysis of the transcriptome with four other crucifer transcriptomes identified 592 B. vulgaris-specific gene families with a P-value cutoff of 1e−5. Conclusion This study presents the first comprehensive transcriptome analysis of B. vulgaris subjected to a series of DBM feedings. The biosynthetic and regulatory pathways of triterpenoid saponins and other DBM deterrent metabolites in this plant were classified. The results of this study will provide useful data for future investigations on pest-resistance phytochemistry and plant breeding. PMID:23696897

  14. Transcriptomic Analysis of Avocado Hass (Persea americana Mill) in the Interaction System Fruit-Chitosan-Colletotrichum

    PubMed Central

    Xoca-Orozco, Luis-Ángel; Cuellar-Torres, Esther Angélica; González-Morales, Sandra; Gutiérrez-Martínez, Porfirio; López-García, Ulises; Herrera-Estrella, Luis; Vega-Arreguín, Julio; Chacón-López, Alejandra

    2017-01-01

    Avocado (Persea americana) is one of the most important crops in Mexico as it is the main producer, consumer, and exporter of avocado fruit in the world. However, successful avocado commercialization is often reduced by large postharvest losses due to Colletotrichum sp., the causal agent of anthracnose. Chitosan is known to have a direct antifungal effect and acts also as an elicitor capable of stimulating a defense response in plants. However, there is little information regarding the genes that are either activated or repressed in fruits treated with chitosan. The aim of this study was to identify by RNA-seq the genes differentially regulated by the action of low molecular weight chitosan in the avocado-chitosan-Colletotrichum interaction system. The samples for RNA-seq were obtained from fruits treated with chitosan, fruits inoculated with Colletotrichum and fruits both treated with chitosan and inoculated with the fungus. Non-treated and non-inoculated fruits were also analyzed. Expression profiles showed that in short times, the fruit-chitosan system presented a greater number of differentially expressed genes, compared to the fruit-pathogen system. Gene Ontology analysis of differentially expressed genes showed a large number of metabolic processes regulated by chitosan, including those preventing the spread of Colletotrichum. It was also found that there is a high correlation between the expression of genes in silico and qPCR of several genes involved in different metabolic pathways. PMID:28642771

  15. Transcriptome-wide identification of Rauvolfia serpentina microRNAs and prediction of their potential targets.

    PubMed

    Prakash, Pravin; Rajakani, Raja; Gupta, Vikrant

    2016-04-01

    MicroRNAs (miRNAs) are small non-coding RNAs of ∼ 19-24 nucleotides (nt) in length and considered as potent regulators of gene expression at transcriptional and post-transcriptional levels. Here we report the identification and characterization of 15 conserved miRNAs belonging to 13 families from Rauvolfia serpentina through in silico analysis of available nucleotide dataset. The identified mature R. serpentina miRNAs (rse-miRNAs) ranged between 20 and 22nt in length, and the average minimal folding free energy index (MFEI) value of rse-miRNA precursor sequences was found to be -0.815 kcal/mol. Using the identified rse-miRNAs as query, their potential targets were predicted in R. serpentina and other plant species. Gene Ontology (GO) annotation showed that predicted targets of rse-miRNAs include transcription factors as well as genes involved in diverse biological processes such as primary and secondary metabolism, stress response, disease resistance, growth, and development. Few rse-miRNAs were predicted to target genes of pharmaceutically important secondary metabolic pathways such as alkaloids and anthocyanin biosynthesis. Phylogenetic analysis showed the evolutionary relationship of rse-miRNAs and their precursor sequences to homologous pre-miRNA sequences from other plant species. The findings under present study besides giving first hand information about R. serpentina miRNAs and their targets, also contributes towards the better understanding of miRNA-mediated gene regulatory processes in plants. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. Proteins with an Euonymus lectin-like domain are ubiquitous in Embryophyta

    PubMed Central

    2009-01-01

    Background Cloning of the Euonymus lectin led to the discovery of a novel domain that also occurs in some stress-induced plant proteins. The distribution and the diversity of proteins with an Euonymus lectin (EUL) domain were investigated using detailed analysis of sequences in publicly accessible genome and transcriptome databases. Results Comprehensive in silico analyses indicate that the recently identified Euonymus europaeus lectin domain represents a conserved structural unit of a novel family of putative carbohydrate-binding proteins, which will further be referred to as the Euonymus lectin (EUL) family. The EUL domain is widespread among plants. Analysis of retrieved sequences revealed that some sequences consist of a single EUL domain linked to an unrelated N-terminal domain whereas others comprise two in tandem arrayed EUL domains. A new classification system for these lectins is proposed based on the overall domain architecture. Evolutionary relationships among the sequences with EUL domains are discussed. Conclusion The identification of the EUL family provides the first evidence for the occurrence in terrestrial plants of a highly conserved plant specific domain. The widespread distribution of the EUL domain strikingly contrasts the more limited or even narrow distribution of most other lectin domains found in plants. The apparent omnipresence of the EUL domain is indicative for a universal role of this lectin domain in plants. Although there is unambiguous evidence that several EUL domains possess carbohydrate-binding activity further research is required to corroborate the carbohydrate-binding properties of different members of the EUL family. PMID:19930663

  17. Expressed sequence tag based identification and expression analysis of some cold inducible elements in seabuckthorn (Hippophae rhamnoides L.).

    PubMed

    Ghangal, Rajesh; Raghuvanshi, Saurabh; Sharma, Prakash C

    2012-02-01

    A cDNA library was constructed from the mature leaves of seabuckthorn (Hippophae rhamnoides). Expressed Sequence Tags (ESTs) were generated by single pass sequencing of 4500 cDNA clones. We submitted 3412 ESTs to dbEST of NCBI. Clustering of these ESTs yielded 1665 unigenes comprising of 345 contigs and 1320 singletons. Out of 1665 unigenes, 1278 unigenes were annotated by similarity search while the remaining 387 unannotated unigenes were considered as organism specific. Gene Ontology (GO) analysis of the unigene dataset showed 691 unigenes related to biological processes, 727 to molecular functions and 588 to cellular component category. On the basis of similarity search and GO annotation, 43 unigenes were found responsive to biotic and abiotic stresses. To validate this observation, 13 genes that are known to be associated with cold stress tolerance from previous studies in Arabidopsis and 3 novel transcripts were examined by Real time RT-PCR to understand the change in expression pattern under cold/freeze stress. In silico study of occurrence of microsatellites in these ESTs revealed the presence of 62 Simple Sequence Repeats (SSRs), some of which are being explored to assess genetic diversity among seabuckthorn collections. This is the first report of generation of transcriptome data providing information about genes involved in managing plant abiotic stress in seabuckthorn, a plant known for its enormous medicinal and ecological value. Copyright © 2011 Elsevier Masson SAS. All rights reserved.

  18. Hsp70 gene expansions in the scallop Patinopecten yessoensis and their expression regulation after exposure to the toxic dinoflagellate Alexandrium catenella.

    PubMed

    Cheng, Jie; Xun, Xiaogang; Kong, Yifan; Wang, Shuyue; Yang, Zhihui; Li, Yajuan; Kong, Dexu; Wang, Shi; Zhang, Lingling; Hu, Xiaoli; Bao, Zhenmin

    2016-11-01

    Heat shock protein 70 (Hsp70s) family members are present in virtually all living organisms and perform a fundamental role against different types of environmental stressors and pathogenic organisms. Marine bivalves live in highly dynamic environments and may accumulate paralytic shellfish toxins (PSTs), a class of well-known neurotoxins closely associated with harmful algal blooms (HABs). Here, we provide a systematic analysis of Hsp70 genes (PyHsp70s) in the genome of Yesso scallop (Patinopecten yessoensis), an important aquaculture species in China, through in silico analysis using transcriptome and genome databases. Phylogenetic analyses indicated extensive expansion of Hsp70 genes from the Hspa12 sub-family in the Yesso scallop and also the bivalve lineages, with gene duplication events before or after the split between the Yesso scallop and the Pacific oyster. In addition, we determined the expression patterns of PyHsp70s after exposure to Alexandrium catenella, the dinoflagellate producing PSTs. Our results confirmed the inducible expression patterns of PyHsp70s under PSTs stress, and the responses to the toxic stress may have arisen through the adaptive recruitment of tandem duplication of Hsp70 genes. These findings provide a thorough overview of the evolution and modification of the Hsp70 family, which will gain insights into the functional characteristics of scallop Hsp70 genes in response to different stresses. Copyright © 2016. Published by Elsevier Ltd.

  19. Coccidian Merozoite Transcriptome Analysis From Eimeria Maxima In Comparison To Eimeria Tenella And Eimeria Acervulina

    USDA-ARS?s Scientific Manuscript database

    Using the Eimeria spp. population that infect chickens as a model for coccidian biology, we aimed to survey the transcriptome of E. maxima and contrast it to the two other Eimeria spp. for which transcriptome data are available, E. tenella and E. acervulina. Examining specifically the asexual intra...

  20. Transcriptome profiling analysis of cultivar-specific apple fruit ripening and texture attributes

    USDA-ARS?s Scientific Manuscript database

    Molecular events regulating cultivar-specific apple fruit ripening and sensory quality are largely unknown. Such knowledge is essential for genomic-assisted apple breeding and postharvest quality management. In this study, transcriptome profile analysis, scanning electron microscopic examination an...

  1. Characterizing differential gene expression in polyploid grasses lacking a reference transcriptome

    USDA-ARS?s Scientific Manuscript database

    Basal transcriptome characterization and differential gene expression in response to varying conditions are often addressed through next generation sequencing (NGS) and data analysis techniques. While these strategies are commonly used, there are countless tools, pipelines, data analysis methods an...

  2. In Silico Evaluation of the Potential Impact of Bioanalytical Bias Difference between Two Therapeutic Protein Formulations for Pharmacokinetic Assessment in a Biocomparability Study.

    PubMed

    Thway, Theingi M; Macaraeg, Chris; Eschenberg, Michael; Ma, Mark

    2015-05-01

    Formulation changes at later stages of biotherapeutics development require biocomparability (BC) assessment. Using simulation, this study aims to determine the potential effect of bias difference observed between the two formulations after spiking into serum in passing or failing of a critical BC study. An ELISA method with 20% total error was used to assess any bias differences between a reference (RF) and test formulations (TF) in serum. During bioanalytical comparison of these formulations, a 9% difference in bias was observed between the two formulations in sera. To determine acceptable level of bias difference between the RF and TF bioanalytically, two in silico simulations were performed. The in silico analysis showed that the likelihood of the study meeting the BC criteria was >90% when the bias difference between RF and TF in serum was 9% and the number of subjects was ≥20 per treatment arm. An additional simulation showed that when the bias difference was increased to 13% and the number of subjects was <40, the likelihood of meeting the BC criteria decreased to 80%. The result from in silico analysis allowed the bioanalytical laboratory to proceed with sample analysis using a single calibrator and quality controls made from the reference formulation. This modeling approach can be applied to other BC studies with similar situations.

  3. Toxicological evaluation in silico and in vivo of secondary metabolites of Cissampelos sympodialis in Mus musculus mice following inhalation.

    PubMed

    Alves, Mateus Feitosa; Ferreira, Larissa Adilis Maria Paiva; Gadelha, Francisco Allysson Assis Ferreira; Ferreira, Laércia Karla Diega Paiva; Felix, Mayara Barbalho; Scotti, Marcus Tullius; Scotti, Luciana; de Oliveira, Kardilândia Mendes; Dos Santos, Sócrates Golzio; Diniz, Margareth de Fátima Formiga Melo

    2017-12-04

    The ethanolic extract of the leaves of Cissampelos sympodialis showed great pharmacological potential, with inflammatory and immunomodulatory activities, however, it showed some toxicological effects. Therefore, this study aims to verify the toxicological potential of alkaloids of the genus Cissampelos through in silico methodologies, to develop a method in LC-MS/MS verifying the presence of alkaloids in the infusion and to evaluate the toxicity of the infusion of the leaves of C. sympodialis when inhaled by Swiss mice. Results in silico showed that alkaloid 93 presented high toxicological potential along with the products of its metabolism. LC-MS/MS results showed that the infusion of the leaves of this plant contained the alkaloids warifteine and methylwarifteine. Finally, the in vivo toxicological analysis of the C. sympodialis infusion showed results, both in biochemistry, organ weights and histological analysis, that the infusion of C. sympodialis leaves presents a low toxicity.

  4. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.

    PubMed

    Li, Xinguo; Wu, Harry X; Southerton, Simon G

    2010-06-21

    Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.

  5. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants

    PubMed Central

    2010-01-01

    Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution. PMID:20565927

  6. Comparative transcriptomics of early dipteran development

    PubMed Central

    2013-01-01

    Background Modern sequencing technologies have massively increased the amount of data available for comparative genomics. Whole-transcriptome shotgun sequencing (RNA-seq) provides a powerful basis for comparative studies. In particular, this approach holds great promise for emerging model species in fields such as evolutionary developmental biology (evo-devo). Results We have sequenced early embryonic transcriptomes of two non-drosophilid dipteran species: the moth midge Clogmia albipunctata, and the scuttle fly Megaselia abdita. Our analysis includes a third, published, transcriptome for the hoverfly Episyrphus balteatus. These emerging models for comparative developmental studies close an important phylogenetic gap between Drosophila melanogaster and other insect model systems. In this paper, we provide a comparative analysis of early embryonic transcriptomes across species, and use our data for a phylogenomic re-evaluation of dipteran phylogenetic relationships. Conclusions We show how comparative transcriptomics can be used to create useful resources for evo-devo, and to investigate phylogenetic relationships. Our results demonstrate that de novo assembly of short (Illumina) reads yields high-quality, high-coverage transcriptomic data sets. We use these data to investigate deep dipteran phylogenetic relationships. Our results, based on a concatenation of 160 orthologous genes, provide support for the traditional view of Clogmia being the sister group of Brachycera (Megaselia, Episyrphus, Drosophila), rather than that of Culicomorpha (which includes mosquitoes and blackflies). PMID:23432914

  7. BLIND ordering of large-scale transcriptomic developmental timecourses.

    PubMed

    Anavy, Leon; Levin, Michal; Khair, Sally; Nakanishi, Nagayasu; Fernandez-Valverde, Selene L; Degnan, Bernard M; Yanai, Itai

    2014-03-01

    RNA-Seq enables the efficient transcriptome sequencing of many samples from small amounts of material, but the analysis of these data remains challenging. In particular, in developmental studies, RNA-Seq is challenged by the morphological staging of samples, such as embryos, since these often lack clear markers at any particular stage. In such cases, the automatic identification of the stage of a sample would enable previously infeasible experimental designs. Here we present the 'basic linear index determination of transcriptomes' (BLIND) method for ordering samples comprising different developmental stages. The method is an implementation of a traveling salesman algorithm to order the transcriptomes according to their inter-relationships as defined by principal components analysis. To establish the direction of the ordered samples, we show that an appropriate indicator is the entropy of transcriptomic gene expression levels, which increases over developmental time. Using BLIND, we correctly recover the annotated order of previously published embryonic transcriptomic timecourses for frog, mosquito, fly and zebrafish. We further demonstrate the efficacy of BLIND by collecting 59 embryos of the sponge Amphimedon queenslandica and ordering their transcriptomes according to developmental stage. BLIND is thus useful in establishing the temporal order of samples within large datasets and is of particular relevance to the study of organisms with asynchronous development and when morphological staging is difficult.

  8. Transcriptome Analysis at the Single-Cell Level Using SMART Technology.

    PubMed

    Fish, Rachel N; Bostick, Magnolia; Lehman, Alisa; Farmer, Andrew

    2016-10-10

    RNA sequencing (RNA-seq) is a powerful method for analyzing cell state, with minimal bias, and has broad applications within the biological sciences. However, transcriptome analysis of seemingly homogenous cell populations may in fact overlook significant heterogeneity that can be uncovered at the single-cell level. The ultra-low amount of RNA contained in a single cell requires extraordinarily sensitive and reproducible transcriptome analysis methods. As next-generation sequencing (NGS) technologies mature, transcriptome profiling by RNA-seq is increasingly being used to decipher the molecular signature of individual cells. This unit describes an ultra-sensitive and reproducible protocol to generate cDNA and sequencing libraries directly from single cells or RNA inputs ranging from 10 pg to 10 ng. Important considerations for working with minute RNA inputs are given. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.

  9. Transcriptomic analysis of persistent infection with foot-and-mouth disease virus in cattle suggests impairment of cell-mediated immunity in the nasopharynx

    USDA-ARS?s Scientific Manuscript database

    In order to investigate the mechanisms of persistent foot-and-mouth disease virus (FMDV) infection in cattle, transcriptome alterations associated with the FMDV carrier state were characterized using a bovine whole-transcriptome microarray. Eighteen cattle (8 vaccinated with a recombinant FMDV A vac...

  10. New approach for the study of mite reproduction: the first transcriptome analysis of a mite, Phytoseiulus persimilis (Acari: Phytoseiidae)

    USDA-ARS?s Scientific Manuscript database

    Many species of mites and ticks are of agricultural and medical importance. Much can be learned from the study of transcriptomes of acarines which can generate DNA-sequence information of potential target genes for the control of acarine pests. High throughput transcriptome sequencing can also yie...

  11. RNA-Seq Atlas of Glycine max: a guide to the soybean transcriptome

    USDA-ARS?s Scientific Manuscript database

    A first analysis of the Glycine max (L.) Merr. (soybean) transcriptome using next generation sequencing technology and RNA-Sequencing (RNA-Seq) is presented. This analysis will provide an important resource for understanding transcription and gene co-regulatory networks in soybean, the most economic...

  12. The grapevine expression atlas reveals a deep transcriptome shift driving the entire plant into a maturation program.

    PubMed

    Fasoli, Marianna; Dal Santo, Silvia; Zenoni, Sara; Tornielli, Giovanni Battista; Farina, Lorenzo; Zamboni, Anita; Porceddu, Andrea; Venturini, Luca; Bicego, Manuele; Murino, Vittorio; Ferrarini, Alberto; Delledonne, Massimo; Pezzotti, Mario

    2012-09-01

    We developed a genome-wide transcriptomic atlas of grapevine (Vitis vinifera) based on 54 samples representing green and woody tissues and organs at different developmental stages as well as specialized tissues such as pollen and senescent leaves. Together, these samples expressed ∼91% of the predicted grapevine genes. Pollen and senescent leaves had unique transcriptomes reflecting their specialized functions and physiological status. However, microarray and RNA-seq analysis grouped all the other samples into two major classes based on maturity rather than organ identity, namely, the vegetative/green and mature/woody categories. This division represents a fundamental transcriptomic reprogramming during the maturation process and was highlighted by three statistical approaches identifying the transcriptional relationships among samples (correlation analysis), putative biomarkers (O2PLS-DA approach), and sets of strongly and consistently expressed genes that define groups (topics) of similar samples (biclustering analysis). Gene coexpression analysis indicated that the mature/woody developmental program results from the reiterative coactivation of pathways that are largely inactive in vegetative/green tissues, often involving the coregulation of clusters of neighboring genes and global regulation based on codon preference. This global transcriptomic reprogramming during maturation has not been observed in herbaceous annual species and may be a defining characteristic of perennial woody plants.

  13. Elucidating and mining the Tulipa and Lilium transcriptomes.

    PubMed

    Moreno-Pachon, Natalia M; Leeggangers, Hendrika A C F; Nijveen, Harm; Severing, Edouard; Hilhorst, Henk; Immink, Richard G H

    2016-10-01

    Genome sequencing remains a challenge for species with large and complex genomes containing extensive repetitive sequences, of which the bulbous and monocotyledonous plants tulip and lily are examples. In such a case, sequencing of only the active part of the genome, represented by the transcriptome, is a good alternative to obtain information about gene content. In this study we aimed to generate a high quality transcriptome of tulip and lily and to make this data available as an open-access resource via a user-friendly web-based interface. The Illumina HiSeq 2000 platform was applied and the transcribed RNA was sequenced from a collection of different lily and tulip tissues, respectively. In order to obtain good transcriptome coverage and to facilitate effective data mining, assembly was done using different filtering parameters for clearing out contamination and noise of the RNAseq datasets. This analysis revealed limitations of commonly applied methods and parameter settings used in de novo transcriptome assembly. The final created transcriptomes are publicly available via a user friendly Transcriptome browser ( http://www.bioinformatics.nl/bulbs/db/species/index ). The usefulness of this resource has been exemplified by a search for all potential transcription factors in lily and tulip, with special focus on the TCP transcription factor family. This analysis and other quality parameters point out the quality of the transcriptomes, which can serve as a basis for further genomics studies in lily, tulip, and bulbous plants in general.

  14. Comparative analysis of transcriptome in two wheat genotypes with contrasting levels of drought tolerance

    USDA-ARS?s Scientific Manuscript database

    Drought tolerance is a complex trait that is governed by multiple genes. To identify the potential candidate genes, comparative analysis of drought stress-responsive transcriptome between drought-tolerant (Triticum aestivum Cv. C306) and drought-sensitive (Triticum aestivum Cv. WL711) genotypes was ...

  15. Comparative transcriptome analysis during early fruit development between three seedy citrus genotypes and their seedless mutants

    USDA-ARS?s Scientific Manuscript database

    Identification of genes with differential transcript abundance (GDTA) in seedless mutants may enhance understanding of seedless citrus development. Transcriptome analysis was conducted at three time points during early fruit development (Phase 1) of three seedy citrus genotypes: Fallglo [Bower citru...

  16. Evaluation of a Genome-Scale In Silico Metabolic Model for Geobacter metallireducens Using Proteomic Data from a Field Biostimulation Experiment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fang, Yilin; Wilkins, Michael J.; Yabusaki, Steven B.

    2012-12-12

    Biomass and shotgun global proteomics data that reflected relative protein abundances from samples collected during the 2008 experiment at the U.S. Department of Energy Integrated Field-Scale Subsurface Research Challenge site in Rifle, Colorado, provided an unprecedented opportunity to validate a genome-scale metabolic model of Geobacter metallireducens and assess its performance with respect to prediction of metal reduction, biomass yield, and growth rate under dynamic field conditions. Reconstructed from annotated genomic sequence, biochemical, and physiological data, the constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes.more » Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low fluxes through amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment.« less

  17. Optimized approach for Ion Proton RNA sequencing reveals details of RNA splicing and editing features of the transcriptome.

    PubMed

    Brown, Roger B; Madrid, Nathaniel J; Suzuki, Hideaki; Ness, Scott A

    2017-01-01

    RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different types of transcriptome data. For example, some methods have bias for one end of transcripts or rely on low-efficiency steps that limit the complexity of the resulting library, making detection of rare transcripts less likely. We tested several commonly used methods of RNA-seq library preparation and found vast differences in the detection of advanced transcriptome features, such as alternatively spliced isoforms and RNA editing sites. By comparing several different protocols available for the Ion Proton sequencer and by utilizing detailed bioinformatics analysis tools, we were able to develop an optimized random primer based RNA-seq technique that is reliable at uncovering rare transcript isoforms and RNA editing features, as well as fusion reads from oncogenic chromosome rearrangements. The combination of optimized libraries and rapid Ion Proton sequencing provides a powerful platform for the transcriptome analysis of research and clinical samples.

  18. Assessment of pleiotropic transcriptome perturbations in Arabidopsis engineered for indirect insect defence.

    PubMed

    Houshyani, Benyamin; van der Krol, Alexander R; Bino, Raoul J; Bouwmeester, Harro J

    2014-06-19

    Molecular characterization is an essential step of risk/safety assessment of genetically modified (GM) crops. Holistic approaches for molecular characterization using omics platforms can be used to confirm the intended impact of the genetic engineering, but can also reveal the unintended changes at the omics level as a first assessment of potential risks. The potential of omics platforms for risk assessment of GM crops has rarely been used for this purpose because of the lack of a consensus reference and statistical methods to judge the significance or importance of the pleiotropic changes in GM plants. Here we propose a meta data analysis approach to the analysis of GM plants, by measuring the transcriptome distance to untransformed wild-types. In the statistical analysis of the transcriptome distance between GM and wild-type plants, values are compared with naturally occurring transcriptome distances in non-GM counterparts obtained from a database. Using this approach we show that the pleiotropic effect of genes involved in indirect insect defence traits is substantially equivalent to the variation in gene expression occurring naturally in Arabidopsis. Transcriptome distance is a useful screening method to obtain insight in the pleiotropic effects of genetic modification.

  19. New milk protein-derived peptides with potential antimicrobial activity: an approach based on bioinformatic studies.

    PubMed

    Dziuba, Bartłomiej; Dziuba, Marta

    2014-08-20

    New peptides with potential antimicrobial activity, encrypted in milk protein sequences, were searched for with the use of bioinformatic tools. The major milk proteins were hydrolyzed in silico by 28 enzymes. The obtained peptides were characterized by the following parameters: molecular weight, isoelectric point, composition and number of amino acid residues, net charge at pH 7.0, aliphatic index, instability index, Boman index, and GRAVY index, and compared with those calculated for known 416 antimicrobial peptides including 59 antimicrobial peptides (AMPs) from milk proteins listed in the BIOPEP database. A simple analysis of physico-chemical properties and the values of biological activity indicators were insufficient to select potentially antimicrobial peptides released in silico from milk proteins by proteolytic enzymes. The final selection was made based on the results of multidimensional statistical analysis such as support vector machines (SVM), random forest (RF), artificial neural networks (ANN) and discriminant analysis (DA) available in the Collection of Anti-Microbial Peptides (CAMP database). Eleven new peptides with potential antimicrobial activity were selected from all peptides released during in silico proteolysis of milk proteins.

  20. New Milk Protein-Derived Peptides with Potential Antimicrobial Activity: An Approach Based on Bioinformatic Studies

    PubMed Central

    Dziuba, Bartłomiej; Dziuba, Marta

    2014-01-01

    New peptides with potential antimicrobial activity, encrypted in milk protein sequences, were searched for with the use of bioinformatic tools. The major milk proteins were hydrolyzed in silico by 28 enzymes. The obtained peptides were characterized by the following parameters: molecular weight, isoelectric point, composition and number of amino acid residues, net charge at pH 7.0, aliphatic index, instability index, Boman index, and GRAVY index, and compared with those calculated for known 416 antimicrobial peptides including 59 antimicrobial peptides (AMPs) from milk proteins listed in the BIOPEP database. A simple analysis of physico-chemical properties and the values of biological activity indicators were insufficient to select potentially antimicrobial peptides released in silico from milk proteins by proteolytic enzymes. The final selection was made based on the results of multidimensional statistical analysis such as support vector machines (SVM), random forest (RF), artificial neural networks (ANN) and discriminant analysis (DA) available in the Collection of Anti-Microbial Peptides (CAMP database). Eleven new peptides with potential antimicrobial activity were selected from all peptides released during in silico proteolysis of milk proteins. PMID:25141106

  1. in silico identification of cross affinity towards Cry1Ac pesticidal protein with receptor enzyme in Bos taurus and sequence, structure analysis of crystal proteins for stability.

    PubMed

    Ebenezer, King Solomon; Nachimuthu, Ramesh; Thiagarajan, Prabha; Velu, Rajesh Kannan

    2013-01-01

    Any novel protein introduced into the GM crops need to be evaluated for cross affinity on living organisms. Many researchers are currently focusing on the impact of Bacillus thuringiensis cotton on soil and microbial diversity by field experiments. In spite of this, in silico approach might be helpful to elucidate the impact of cry genes. The crystal a protein which was produced by Bt at the time of sporulation has been used as a biological pesticide to target the insectivorous pests like Cry1Ac for Helicoverpa armigera and Cry2Ab for Spodoptera sp. and Heliothis sp. Here, we present the comprehensive in silico analysis of Cry1Ac and Cry2Ab proteins with available in silico tools, databases and docking servers. Molecular docking of Cry1Ac with procarboxypeptidase from Helicoverpa armigera and Cry1Ac with Leucine aminopeptidase from Bos taurus has showed the 125(th) amino acid position to be the preference site of Cry1Ac protein. The structures were compared with each other and it showed 5% of similarity. The cross affinity of this toxin that have confirmed the earlier reports of ill effects of Bt cotton consumed by cattle.

  2. BRCA1/2 missense mutations and the value of in-silico analyses.

    PubMed

    Sadowski, Carolin E; Kohlstedt, Daniela; Meisel, Cornelia; Keller, Katja; Becker, Kerstin; Mackenroth, Luisa; Rump, Andreas; Schröck, Evelin; Wimberger, Pauline; Kast, Karin

    2017-11-01

    The clinical implications of genetic variants in BRCA1/2 in healthy and affected individuals are considerable. Variant interpretation, however, is especially challenging for missense variants. The majority of them are classified as variants of unknown clinical significance (VUS). Computational (in-silico) predictive programs are easy to access, but represent only one tool out of a wide range of complemental approaches to classify VUS. With this single-center study, we aimed to evaluate the impact of in-silico analyses in a spectrum of different BRCA1/2 missense variants. We conducted mutation analysis of BRCA1/2 in 523 index patients with suspected hereditary breast and ovarian cancer (HBOC). Classification of the genetic variants was performed according to the German Consortium (GC)-HBOC database. Additionally, all missense variants were classified by the following three in-silico prediction tools: SIFT, Mutation Taster (MT2) and PolyPhen2 (PPH2). Overall 201 different variants, 68 of which constituted missense variants were ranked as pathogenic, neutral, or unknown. The classification of missense variants by in-silico tools resulted in a higher amount of pathogenic mutations (25% vs. 13.2%) compared to the GC-HBOC-classification. Altogether, more than fifty percent (38/68, 55.9%) of missense variants were ranked differently. Sensitivity of in-silico-tools for mutation prediction was 88.9% (PPH2), 100% (SIFT) and 100% (MT2). We found a relevant discrepancy in variant classification by using in-silico prediction tools, resulting in potential overestimation and/or underestimation of cancer risk. More reliable, notably gene-specific, prediction tools and functional tests are needed to improve clinical counseling. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  3. Impact of S100A8 expression on kidney cancer progression and molecular docking studies for kidney cancer therapeutics.

    PubMed

    Mirza, Zeenat; Schulten, Hans-Juergen; Farsi, Hasan Ma; Al-Maghrabi, Jaudah A; Gari, Mamdooh A; Chaudhary, Adeel Ga; Abuzenadah, Adel M; Al-Qahtani, Mohammed H; Karim, Sajjad

    2014-04-01

    The proinflammatory protein S100A8, which is expressed in myeloid cells under physiological conditions, is strongly expressed in human cancer tissues. Its role in tumor cell differentiation and tumor progression is largely unclear and virtually unstudied in kidney cancer. In the present study, we investigated whether S100A8 could be a potential anticancer drug target and therapeutic biomarker for kidney cancer, and the underlying molecular mechanisms by exploiting its interaction profile with drugs. Microarray-based transcriptomics experiments using Affymetrix HuGene 1.0 ST arrays were applied to renal cell carcinoma specimens from Saudi patients for identification of significant genes associated with kidney cancer. In addition, we retrieved selected expression data from the National Center for Biotechnology Information Gene Expression Omnibus database for comparative analysis and confirmation of S100A8 expression. Ingenuity Pathway Analysis (IPA) was used to elucidate significant molecular networks and pathways associated with kidney cancer. The probable polar and non-polar interactions of possible S100A8 inhibitors (aspirin, celecoxib, dexamethasone and diclofenac) were examined by performing molecular docking and binding free energy calculations. Detailed analysis of bound structures and their binding free energies was carried out for S100A8, its known partner (S100A9), and S100A8-S100A9 complex (calprotectin). In our microarray experiments, we identified 1,335 significantly differentially expressed genes, including S100A8, in kidney cancer using a cut-off of p<0.05 and fold-change of 2. Functional analysis of kidney cancer-associated genes showed overexpression of genes involved in cell-cycle progression, DNA repair, cell death, tumor morphology and tissue development. Pathway analysis showed significant disruption of pathways of atherosclerosis signaling, liver X receptor/retinoid X receptor (LXR/RXR) activation, notch signaling, and interleukin-12 (IL-12) signaling. We identified S100A8 as a prospective biomarker for kidney cancer and in silico analysis showed that aspirin, celecoxib, dexamethasone and diclofenac binds to S100A8 and may inhibit downstream signaling in kidney cancer. The present study provides an initial overview of differentially expressed genes in kidney cancer of Saudi Arabian patients using whole-transcript, high-density expression arrays. Our analysis suggests distinct transcriptomic signatures, with significantly high levels of S100A8, and underlying molecular mechanisms contributing to kidney cancer progression. Our docking-based findings shed insight into S100A8 protein as an attractive anticancer target for therapeutic intervention in kidney cancer. To our knowledge, this is the first structure-based docking study for the selected protein targets using the chosen ligands.

  4. Two-Stage, In Silico Deconvolution of the Lymphocyte Compartment of the Peripheral Whole Blood Transcriptome in the Context of Acute Kidney Allograft Rejection

    PubMed Central

    Shannon, Casey P.; Balshaw, Robert; Ng, Raymond T.; Wilson-McManus, Janet E.; Keown, Paul; McMaster, Robert; McManus, Bruce M.; Landsberg, David; Isbel, Nicole M.; Knoll, Greg; Tebbutt, Scott J.

    2014-01-01

    Acute rejection is a major complication of solid organ transplantation that prevents the long-term assimilation of the allograft. Various populations of lymphocytes are principal mediators of this process, infiltrating graft tissues and driving cell-mediated cytotoxicity. Understanding the lymphocyte-specific biology associated with rejection is therefore critical. Measuring genome-wide changes in transcript abundance in peripheral whole blood cells can deliver a comprehensive view of the status of the immune system. The heterogeneous nature of the tissue significantly affects the sensitivity and interpretability of traditional analyses, however. Experimental separation of cell types is an obvious solution, but is often impractical and, more worrying, may affect expression, leading to spurious results. Statistical deconvolution of the cell type-specific signal is an attractive alternative, but existing approaches still present some challenges, particularly in a clinical research setting. Obtaining time-matched sample composition to biologically interesting, phenotypically homogeneous cell sub-populations is costly and adds significant complexity to study design. We used a two-stage, in silico deconvolution approach that first predicts sample composition to biologically meaningful and homogeneous leukocyte sub-populations, and then performs cell type-specific differential expression analysis in these same sub-populations, from peripheral whole blood expression data. We applied this approach to a peripheral whole blood expression study of kidney allograft rejection. The patterns of differential composition uncovered are consistent with previous studies carried out using flow cytometry and provide a relevant biological context when interpreting cell type-specific differential expression results. We identified cell type-specific differential expression in a variety of leukocyte sub-populations at the time of rejection. The tissue-specificity of these differentially expressed probe-set lists is consistent with the originating tissue and their functional enrichment consistent with allograft rejection. Finally, we demonstrate that the strategy described here can be used to derive useful hypotheses by validating a cell type-specific ratio in an independent cohort using the nanoString nCounter assay. PMID:24733377

  5. Transcriptome landscape of Synechococcus elongatus PCC 7942 for nitrogen starvation responses using RNA-seq

    PubMed Central

    Choi, Sun Young; Park, Byeonghyeok; Choi, In-Geol; Sim, Sang Jun; Lee, Sun-Mi; Um, Youngsoon; Woo, Han Min

    2016-01-01

    The development of high-throughput technology using RNA-seq has allowed understanding of cellular mechanisms and regulations of bacterial transcription. In addition, transcriptome analysis with RNA-seq has been used to accelerate strain improvement through systems metabolic engineering. Synechococcus elongatus PCC 7942, a photosynthetic bacterium, has remarkable potential for biochemical and biofuel production due to photoautotrophic cell growth and direct CO2 conversion. Here, we performed a transcriptome analysis of S. elongatus PCC 7942 using RNA-seq to understand the changes of cellular metabolism and regulation for nitrogen starvation responses. As a result, differentially expressed genes (DEGs) were identified and functionally categorized. With mapping onto metabolic pathways, we probed transcriptional perturbation and regulation of carbon and nitrogen metabolisms relating to nitrogen starvation responses. Experimental evidence such as chlorophyll a and phycobilisome content and the measurement of CO2 uptake rate validated the transcriptome analysis. The analysis suggests that S. elongatus PCC 7942 reacts to nitrogen starvation by not only rearranging the cellular transport capacity involved in carbon and nitrogen assimilation pathways but also by reducing protein synthesis and photosynthesis activities. PMID:27488818

  6. Transcriptomics of cortical gray matter thickness decline during normal aging

    PubMed Central

    Kochunov, P; Charlesworth, J; Winkler, A; Hong, LE; Nichols, T; Curran, JE; Sprooten, E; Jahanshad, N; Thompson, PM; Johnson, MP; Kent, JW; Landman, BA; Mitchell, B; Cole, SA; Dyer, TD; Moses, EK; Goring, HHH; Almasy, L; Duggirala, R; Olvera, RL; Glahn, DC; Blangero, J

    2013-01-01

    Introduction We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathways analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging Methods Transcriptome and GMT data were availabe for 379 individuals (age range=28–85) community-dwelling members of large extended Mexican-American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800µm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Results Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10−6) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Conclusion Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. PMID:23707588

  7. Transcriptomics of cortical gray matter thickness decline during normal aging.

    PubMed

    Kochunov, P; Charlesworth, J; Winkler, A; Hong, L E; Nichols, T E; Curran, J E; Sprooten, E; Jahanshad, N; Thompson, P M; Johnson, M P; Kent, J W; Landman, B A; Mitchell, B; Cole, S A; Dyer, T D; Moses, E K; Goring, H H H; Almasy, L; Duggirala, R; Olvera, R L; Glahn, D C; Blangero, J

    2013-11-15

    We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathway analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging. Transcriptome and GMT data were available for 379 individuals (age range=28-85) community-dwelling members of large extended Mexican American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800 μm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, and HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10(-6)) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. Copyright © 2013 Elsevier Inc. All rights reserved.

  8. Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.

    PubMed

    Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel

    2015-08-07

    The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic adaptation in foxes. Similar to polar bears, fat metabolism seems to play a central role in adaptation of Arctic foxes to the cold climate, as has been identified in the polar bear, another arctic specialist.

  9. DOGMA: domain-based transcriptome and proteome quality assessment.

    PubMed

    Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten

    2016-09-01

    Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  10. Understanding mechanism of in vitro maturation, fertilization and culture of sheep embryoes through in silico analysis.

    PubMed

    Sreenivas, Dulam; Kaladhar, Dowluru Svgk; Samy, A Palni; Kumar, R Sangeeth

    2012-01-01

    Protein interations are presently required to understand the mechanisms of in vitro maturation, fertilization and culture of sheep embryoes through in silico analysis. The present work has been conducted on TCM-199 supplemented with epidermal growth factor (EGF), fetal bovine serum (FBS) or wheat peptones The maturation rate of oocyte was significantly higher in the FBS supplemented group when compared with BSA and wheat peptone supplemented groups. The in silico protein interaction studies has shown that the proteins EGFR (epidermal growth factor receptor), CCK (cholecystokinin)- a peptide hormone, Alb - a serum albumin, ESR- estrogen receptor 1, TGFA- transforming growth factor, STAT- signal transducer and FN1- fibronectin 1 has direct interaction and produces cell growth in in vitro culture. Alb is directly activates EGF and promotes MAPK3 that mediates diverse biological functions such as cell growth, adhesion and proliferation. Alb may also involve in stress response signalling and may be in cell cycle control.

  11. Cloning and characterization of the ONAC106 gene from Oryza sativa cultivar Kuku Belang

    NASA Astrophysics Data System (ADS)

    Basri, Khairunnisa; Sukiran, Noor Liyana; Zainal, Zamri

    2016-11-01

    Plants possess different mechanisms in stress response, where induction of stress-responsive genes provides tolerance to unfavorable conditions. Stress-responsive genes are characterized for functional and regulatory genes that help in overcoming stress by molecular, biochemical and morphological adaptations. NAC transcription factors are one of the regulatory proteins that involved in stress signaling pathway. A putative NAC transcription factor, ONAC016 was identified from drought transcriptomic data. Our data suggested that ONAC106 was induced by drought, but its function in abiotic stress is still unclear. In silico analysis of ONAC106 showed that this gene encodes 334 amino acids, and its protein consists of NAM (No Apical Meristem) domain. The orthologue of ONAC106 was present in several Poaceae family members, suggesting that ONAC106 is unique to monocot plants only. We found that ONAC106 was induced by salt and cold stresses, indicating that this gene involves in abiotic stress response. In addition, we also found that ONAC106 might function in defense response to pathogen invasion. The ABRE (Abscisic Acid Regulatory Element) cis-element was identified in the promoter region of ONAC106, suggesting that it may involve in the abscisic acid (ABA)-dependent signaling pathway. Based on this preliminary result, we hypothesize that ONAC106 may play a role in abiotic stress response by regulating ABA-responsive genes.

  12. Development of Genic and Genomic SSR Markers of Robusta Coffee (Coffea canephora Pierre Ex A. Froehner)

    PubMed Central

    Hendre, Prasad S.; Aggarwal, Ramesh K.

    2014-01-01

    Coffee breeding and improvement efforts can be greatly facilitated by availability of a large repository of simple sequence repeats (SSRs) based microsatellite markers, which provides efficiency and high-resolution in genetic analyses. This study was aimed to improve SSR availability in coffee by developing new genic−/genomic-SSR markers using in-silico bioinformatics and streptavidin-biotin based enrichment approach, respectively. The expressed sequence tag (EST) based genic microsatellite markers (EST-SSRs) were developed using the publicly available dataset of 13,175 unigene ESTs, which showed a distribution of 1 SSR/3.4 kb of coffee transcriptome. Genomic SSRs, on the other hand, were developed from an SSR-enriched small-insert partial genomic library of robusta coffee. In total, 69 new SSRs (44 EST-SSRs and 25 genomic SSRs) were developed and validated as suitable genetic markers. Diversity analysis of selected coffee genotypes revealed these to be highly informative in terms of allelic diversity and PIC values, and eighteen of these markers (∼27%) could be mapped on a robusta linkage map. Notably, the markers described here also revealed a very high cross-species transferability. In addition to the validated markers, we have also designed primer pairs for 270 putative EST-SSRs, which are expected to provide another ca. 200 useful genetic markers considering the high success rate (88%) of marker conversion of similar pairs tested/validated in this study. PMID:25461752

  13. Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seed development.

    PubMed

    da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando

    2016-01-01

    The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir.

  14. Systemic miRNA-7 delivery inhibits tumor angiogenesis and growth in murine xenograft glioblastoma.

    PubMed

    Babae, Negar; Bourajjaj, Meriem; Liu, Yijia; Van Beijnum, Judy R; Cerisoli, Francesco; Scaria, Puthupparampil V; Verheul, Mark; Van Berkel, Maaike P; Pieters, Ebel H E; Van Haastert, Rick J; Yousefi, Afrouz; Mastrobattista, Enrico; Storm, Gert; Berezikov, Eugene; Cuppen, Edwin; Woodle, Martin; Schaapveld, Roel Q J; Prevost, Gregoire P; Griffioen, Arjan W; Van Noort, Paula I; Schiffelers, Raymond M

    2014-08-30

    Tumor-angiogenesis is the multi-factorial process of sprouting of endothelial cells (EC) into micro-vessels to provide tumor cells with nutrients and oxygen. To explore miRNAs as therapeutic angiogenesis-inhibitors, we performed a functional screen to identify miRNAs that are able to decrease EC viability. We identified miRNA-7 (miR-7) as a potent negative regulator of angiogenesis. Introduction of miR-7 in EC resulted in strongly reduced cell viability, tube formation, sprouting and migration. Application of miR-7 in the chick chorioallantoic membrane assay led to a profound reduction of vascularization, similar to anti-angiogenic drug sunitinib. Local administration of miR-7 in an in vivo murine neuroblastoma tumor model significantly inhibited angiogenesis and tumor growth. Finally, systemic administration of miR-7 using a novel integrin-targeted biodegradable polymeric nanoparticles that targets both EC and tumor cells, strongly reduced angiogenesis and tumor proliferation in mice with human glioblastoma xenografts. Transcriptome analysis of miR-7 transfected EC in combination with in silico target prediction resulted in the identification of OGT as novel target gene of miR-7. Our study provides a comprehensive validation of miR-7 as novel anti-angiogenic therapeutic miRNA that can be systemically delivered to both EC and tumor cells and offers promise for miR-7 as novel anti-tumor therapeutic.

  15. Genomic and transcriptomic characterization of the transcription factor family R2R3-MYB in soybean and its involvement in the resistance responses to Phakopsora pachyrhizi.

    PubMed

    Aoyagi, Luciano N; Lopes-Caitar, Valéria S; de Carvalho, Mayra C C G; Darben, Luana M; Polizel-Podanosqui, Adriana; Kuwahara, Marcia K; Nepomuceno, Alexandre L; Abdelnoor, Ricardo V; Marcelino-Guimarães, Francismar C

    2014-12-01

    Myb genes constitute one of the largest transcription factor families in the plant kingdom. Soybean MYB transcription factors have been related to the plant response to biotic stresses. Their involvement in response to Phakopsora pachyrhizi infection has been reported by several transcriptional studies. Due to their apparently highly diverse functions, these genes are promising targets for developing crop varieties resistant to diseases. In the present study, the identification and phylogenetic analysis of the soybean R2R3-MYB (GmMYB) transcription factor family was performed and the expression profiles of these genes under biotic stress were determined. GmMYBs were identified from the soybean genome using bioinformatic tools, and their putative functions were determined based on the phylogenetic tree and classified into subfamilies using guides AtMYBs describing known functions. The transcriptional profiles of GmMYBs upon infection with different pathogen were revealed by in vivo and in silico analyses. Selected target genes potentially involved in disease responses were assessed by RT-qPCR after different times of inoculation with P. pachyrhizi using different genetic backgrounds related to resistance genes (Rpp2 and Rpp5). R2R3-MYB transcription factors related to lignin synthesis and genes responsive to chitin were significantly induced in the resistant genotypes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  16. Germline DNA methylation in reef corals: patterns and potential roles in response to environmental change.

    PubMed

    Dimond, James L; Roberts, Steven B

    2016-04-01

    DNA methylation is an epigenetic mark that plays an inadequately understood role in gene regulation, particularly in nonmodel species. Because it can be influenced by the environment, DNA methylation may contribute to the ability of organisms to acclimatize and adapt to environmental change. We evaluated the distribution of gene body methylation in reef-building corals, a group of organisms facing significant environmental threats. Gene body methylation in six species of corals was inferred from in silico transcriptome analysis of CpG O/E, an estimate of germline DNA methylation that is highly correlated with patterns of methylation enrichment. Consistent with what has been documented in most other invertebrates, all corals exhibited bimodal distributions of germline methylation suggestive of distinct fractions of genes with high and low levels of methylation. The hypermethylated fractions were enriched with genes with housekeeping functions, while genes with inducible functions were highly represented in the hypomethylated fractions. High transcript abundance was associated with intermediate levels of methylation. In three of the coral species, we found that genes differentially expressed in response to thermal stress and ocean acidification exhibited significantly lower levels of methylation. These results support a link between gene body hypomethylation and transcriptional plasticity that may point to a role of DNA methylation in the response of corals to environmental change. © 2015 John Wiley & Sons Ltd.

  17. Transcription of Two Adjacent Carbohydrate Utilization Gene Clusters in Bifidobacterium breve UCC2003 Is Controlled by LacI- and Repressor Open Reading Frame Kinase (ROK)-Type Regulators

    PubMed Central

    O'Connell, Kerry Joan; O'Connell Motherway, Mary; Liedtke, Andrea; Fitzgerald, Gerald F.; Ross, R. Paul; Stanton, Catherine; Zomer, Aldert

    2014-01-01

    Members of the genus Bifidobacterium are commonly found in the gastrointestinal tracts of mammals, including humans, where their growth is presumed to be dependent on various diet- and/or host-derived carbohydrates. To understand transcriptional control of bifidobacterial carbohydrate metabolism, we investigated two genetic carbohydrate utilization clusters dedicated to the metabolism of raffinose-type sugars and melezitose. Transcriptomic and gene inactivation approaches revealed that the raffinose utilization system is positively regulated by an activator protein, designated RafR. The gene cluster associated with melezitose metabolism was shown to be subject to direct negative control by a LacI-type transcriptional regulator, designated MelR1, in addition to apparent indirect negative control by means of a second LacI-type regulator, MelR2. In silico analysis, DNA-protein interaction, and primer extension studies revealed the MelR1 and MelR2 operator sequences, each of which is positioned just upstream of or overlapping the correspondingly regulated promoter sequences. Similar analyses identified the RafR binding operator sequence located upstream of the rafB promoter. This study indicates that transcriptional control of gene clusters involved in carbohydrate metabolism in bifidobacteria is subject to conserved regulatory systems, representing either positive or negative control. PMID:24705323

  18. Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seed development

    PubMed Central

    da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando

    2016-01-01

    The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir. PMID:27610237

  19. Transcription Factor Binding Site Enrichment Analysis in Co-Expression Modules in Celiac Disease

    PubMed Central

    Romero-Garmendia, Irati; Jauregi-Miguel, Amaia; Plaza-Izurieta, Leticia; Cros, Marie-Pierre; Legarda, Maria; Irastorza, Iñaki; Herceg, Zdenko; Fernandez-Jimenez, Nora

    2018-01-01

    The aim of this study was to construct celiac co-expression patterns at a whole genome level and to identify transcription factors (TFs) that could drive the gliadin-related changes in coordination of gene expression observed in celiac disease (CD). Differential co-expression modules were identified in the acute and chronic responses to gliadin using expression data from a previous microarray study in duodenal biopsies. Transcription factor binding site (TFBS) and Gene Ontology (GO) annotation enrichment analyses were performed in differentially co-expressed genes (DCGs) and selection of candidate regulators was performed. Expression of candidates was measured in clinical samples and the activation of the TFs was further characterized in C2BBe1 cells upon gliadin challenge. Enrichment analyses of the DCGs identified 10 TFs and five were selected for further investigation. Expression changes related to active CD were detected in four TFs, as well as in several of their in silico predicted targets. The activation of TFs was further characterized in C2BBe1 cells upon gliadin challenge, and an increase in nuclear translocation of CAMP Responsive Element Binding Protein 1 (CREB1) and IFN regulatory factor-1 (IRF1) in response to gliadin was observed. Using transcriptome-wide co-expression analyses we are able to propose novel genes involved in CD pathogenesis that respond upon gliadin stimulation, also in non-celiac models. PMID:29748492

  20. Transcription Factor Binding Site Enrichment Analysis in Co-Expression Modules in Celiac Disease.

    PubMed

    Romero-Garmendia, Irati; Garcia-Etxebarria, Koldo; Hernandez-Vargas, Hector; Santin, Izortze; Jauregi-Miguel, Amaia; Plaza-Izurieta, Leticia; Cros, Marie-Pierre; Legarda, Maria; Irastorza, Iñaki; Herceg, Zdenko; Fernandez-Jimenez, Nora; Bilbao, Jose Ramon

    2018-05-10

    The aim of this study was to construct celiac co-expression patterns at a whole genome level and to identify transcription factors (TFs) that could drive the gliadin-related changes in coordination of gene expression observed in celiac disease (CD). Differential co-expression modules were identified in the acute and chronic responses to gliadin using expression data from a previous microarray study in duodenal biopsies. Transcription factor binding site (TFBS) and Gene Ontology (GO) annotation enrichment analyses were performed in differentially co-expressed genes (DCGs) and selection of candidate regulators was performed. Expression of candidates was measured in clinical samples and the activation of the TFs was further characterized in C2BBe1 cells upon gliadin challenge. Enrichment analyses of the DCGs identified 10 TFs and five were selected for further investigation. Expression changes related to active CD were detected in four TFs, as well as in several of their in silico predicted targets. The activation of TFs was further characterized in C2BBe1 cells upon gliadin challenge, and an increase in nuclear translocation of CAMP Responsive Element Binding Protein 1 (CREB1) and IFN regulatory factor-1 (IRF1) in response to gliadin was observed. Using transcriptome-wide co-expression analyses we are able to propose novel genes involved in CD pathogenesis that respond upon gliadin stimulation, also in non-celiac models.

  1. Propagating annotations of molecular networks using in silico fragmentation

    PubMed Central

    da Silva, Ricardo R.; Wang, Mingxun; Fox, Evan; Balunas, Marcy J.; Klassen, Jonathan L.; Dorrestein, Pieter C.

    2018-01-01

    The annotation of small molecules is one of the most challenging and important steps in untargeted mass spectrometry analysis, as most of our biological interpretations rely on structural annotations. Molecular networking has emerged as a structured way to organize and mine data from untargeted tandem mass spectrometry (MS/MS) experiments and has been widely applied to propagate annotations. However, propagation is done through manual inspection of MS/MS spectra connected in the spectral networks and is only possible when a reference library spectrum is available. One of the alternative approaches used to annotate an unknown fragmentation mass spectrum is through the use of in silico predictions. One of the challenges of in silico annotation is the uncertainty around the correct structure among the predicted candidate lists. Here we show how molecular networking can be used to improve the accuracy of in silico predictions through propagation of structural annotations, even when there is no match to a MS/MS spectrum in spectral libraries. This is accomplished through creating a network consensus of re-ranked structural candidates using the molecular network topology and structural similarity to improve in silico annotations. The Network Annotation Propagation (NAP) tool is accessible through the GNPS web-platform https://gnps.ucsd.edu/ProteoSAFe/static/gnps-theoretical.jsp. PMID:29668671

  2. Propagating annotations of molecular networks using in silico fragmentation.

    PubMed

    da Silva, Ricardo R; Wang, Mingxun; Nothias, Louis-Félix; van der Hooft, Justin J J; Caraballo-Rodríguez, Andrés Mauricio; Fox, Evan; Balunas, Marcy J; Klassen, Jonathan L; Lopes, Norberto Peporine; Dorrestein, Pieter C

    2018-04-01

    The annotation of small molecules is one of the most challenging and important steps in untargeted mass spectrometry analysis, as most of our biological interpretations rely on structural annotations. Molecular networking has emerged as a structured way to organize and mine data from untargeted tandem mass spectrometry (MS/MS) experiments and has been widely applied to propagate annotations. However, propagation is done through manual inspection of MS/MS spectra connected in the spectral networks and is only possible when a reference library spectrum is available. One of the alternative approaches used to annotate an unknown fragmentation mass spectrum is through the use of in silico predictions. One of the challenges of in silico annotation is the uncertainty around the correct structure among the predicted candidate lists. Here we show how molecular networking can be used to improve the accuracy of in silico predictions through propagation of structural annotations, even when there is no match to a MS/MS spectrum in spectral libraries. This is accomplished through creating a network consensus of re-ranked structural candidates using the molecular network topology and structural similarity to improve in silico annotations. The Network Annotation Propagation (NAP) tool is accessible through the GNPS web-platform https://gnps.ucsd.edu/ProteoSAFe/static/gnps-theoretical.jsp.

  3. Identification and characterization of aldehyde oxidases (AOXs) in the cotton bollworm

    NASA Astrophysics Data System (ADS)

    Xu, Wei; Liao, Yalin

    2017-12-01

    Aldehyde oxidases (AOXs) are a family of metabolic enzymes that oxidize aldehydes into carboxylic acids; therefore, they play critical roles in detoxification and degradation of chemicals. By using transcriptomic and genomic approaches, we successfully identified six putative AOX genes (HarmAOX1-6) from cotton bollworm, Helicoverpa armigera (Hübner) (Lepidoptera: Noctuidae). In silico expression profile, reverse transcription (RT)-PCR, and quantitative PCR (qPCR) analyses showed that HarmAOX1 is highly expressed in adult antennae, tarsi, and larval mouthparts, so they may play an important role in degrading plant-derived compounds. HarmAOX2 is highly and specifically expressed in adult antennae, suggesting a candidate pheromone-degrading enzyme (PDE) to inactivate the sex pheromone components (Z)-11-hexadecenal and (Z)-9-hexadecenal. RNA sequencing data further demonstrated that a number of host plants they feed on could significantly upregulate the expression levels of HarmAOX1 in larvae. This study improves our understanding of insect aldehyde oxidases and insect-plant interactions.

  4. In silico discovery of terpenoid metabolism in Cannabis sativa.

    PubMed

    Massimino, Luca

    2017-01-01

    Due to their efficacy, cannabis based therapies are currently being prescribed for the treatment of many different medical conditions. Interestingly, treatments based on the use of cannabis flowers or their derivatives have been shown to be very effective, while therapies based on drugs containing THC alone lack therapeutic value and lead to increased side effects, likely resulting from the absence of other pivotal entourage compounds found in the Phyto-complex. Among these compounds are terpenoids, which are not produced exclusively by cannabis plants, so other plant species must share many of the enzymes involved in their metabolism. In the present work, 23,630 transcripts from the canSat3 reference transcriptome were scanned for evolutionarily conserved protein domains and annotated in accordance with their predicted molecular functions. A total of 215 evolutionarily conserved genes encoding enzymes presumably involved in terpenoid metabolism are described, together with their expression profiles in different cannabis plant tissues at different developmental stages. The resource presented here will aid future investigations on terpenoid metabolism in Cannabis sativa .

  5. Insights into teichoic acid biosynthesis by Bifidobacterium bifidum PRL2010.

    PubMed

    Colagiorgi, Angelo; Turroni, Francesca; Mancabelli, Leonardo; Serafini, Fausta; Secchi, Andrea; van Sinderen, Douwe; Ventura, Marco

    2015-09-01

    Bifidobacteria are colonizers of the human gut, where they are interacting with their host as well as with other members of the intestinal microbiota. Teichoic acids (TAs) have previously been shown to play an important role in modulating microbe-host interactions in the human gut. However, so far, there is a paucity of information regarding the presence of TAs in the cell envelope of bifidobacteria. In silico analyses targeting the chromosomes of all 48 (sub)species that currently represent the genus Bifidobacterium revealed the presence of genes responsible for TA biosynthesis, suggesting that bifidobacteria contain both wall TAs and lipoteichoic acids. Transcriptome analyses of the infant gut commensal Bifidobacterium bifidum PRL2010 highlighted that the transcription of the presumptive TA biosynthetic loci is modulated in response to environmental conditions reflecting those of the human gut. Furthermore, chemical characterization of TAs produced by PRL2010 indicates the presence of lipoteichoic acids. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  6. Nuclear factor-kappaB bioluminescence imaging-guided transcriptomic analysis for the assessment of host-biomaterial interaction in vivo.

    PubMed

    Hsiang, Chien-Yun; Chen, Yueh-Sheng; Ho, Tin-Yun

    2009-06-01

    Establishment of a comprehensive platform for the assessment of host-biomaterial interaction in vivo is an important issue. Nuclear factor-kappaB (NF-kappaB) is an inducible transcription factor that is activated by numerous stimuli. Therefore, NF-kappaB-dependent luminescent signal in transgenic mice carrying the luciferase genes was used as the guide to monitor the biomaterials-affected organs, and transcriptomic analysis was further applied to evaluate the complex host responses in affected organs in this study. In vivo imaging showed that genipin-cross-linked gelatin conduit (GGC) implantation evoked the strong NF-kappaB activity at 6h in the implanted region, and transcriptomic analysis showed that the expressions of interleukin-6 (IL-6), IL-24, and IL-1 family were up-regulated. A strong luminescent signal was observed in spleen on 14 d, suggesting that GGC implantation might elicit the biological events in spleen. Transcriptomic analysis of spleen showed that 13 Kyoto Encyclopedia of Genes and Genomes pathways belonging to cell cycles, immune responses, and metabolism were significantly altered by GGC implants. Connectivity Map analysis suggested that the gene signatures of GGC were similar to those of compounds that affect lipid or glucose metabolism. GeneSetTest analysis further showed that host responses to GGC implants might be related to diseases states, especially the metabolic and cardiovascular diseases. In conclusion, our data provided a concept of molecular imaging-guided transcriptomic platform for the evaluation and the prediction of host-biomaterial interaction in vivo.

  7. Necklace: combining reference and assembled transcriptomes for more comprehensive RNA-Seq analysis.

    PubMed

    Davidson, Nadia M; Oshlack, Alicia

    2018-05-01

    RNA sequencing (RNA-seq) analyses can benefit from performing a genome-guided and de novo assembly, in particular for species where the reference genome or the annotation is incomplete. However, tools for integrating an assembled transcriptome with reference annotation are lacking. Necklace is a software pipeline that runs genome-guided and de novo assembly and combines the resulting transcriptomes with reference genome annotations. Necklace constructs a compact but comprehensive superTranscriptome out of the assembled and reference data. Reads are subsequently aligned and counted in preparation for differential expression testing. Necklace allows a comprehensive transcriptome to be built from a combination of assembled and annotated transcripts, which results in a more comprehensive transcriptome for the majority of organisms. In addition RNA-seq data are mapped back to this newly created superTranscript reference to enable differential expression testing with standard methods.

  8. Cadmium effects on sperm morphology and semenogelin with relates to increased ROS in infertile smokers: An in vitro and in silico approach.

    PubMed

    Ranganathan, Parameswari; Rao, Kamini A; Sudan, Jesu Jaya; Balasundaram, Sridharan

    2018-06-01

    Smoking releases cadmium (Cd), the metal toxicant which causes an imbalance in reactive oxygen species level in seminal plasma. This imbalance is envisaged to impair the sperm DNA morphology and thereby result in male infertility. In order to correlate this association, we performed in vitro and in silico studies and evaluated the influence of reactive oxygen species imbalance on sperm morphology impairments due to smoking. The study included 76 infertile smokers, 72 infertile non-smokers, 68 fertile smokers and 74 fertile non-smokers (control). Semen samples were collected at regular intervals from all the subjects. Semen parameters were examined by computer assisted semen analysis, quantification of metal toxicant by atomic absorption spectrophotometer, assessment of antioxidants through enzymatic and non-enzymatic methods, diagnosis of reactive oxygen species by nitro blue tetrazolium method and Cd influence on sperm protein by in vitro and in silico methods. Our analysis revealed that the levels of cigarette toxicants in semen were high, accompanied by low levels of antioxidants in seminal plasma of infertile smoker subjects. In addition the investigation of Cd treated sperm cells through scanning electronic microscope showed the mid piece damage of spermatozoa. The dispersive X-ray analysis to identify the elemental composition further confirmed the presence of Cd. Finally, the in-silico analysis on semenogelin sequences revealed the D-H-D motif which represents a favourable binding site for Cd coordination. Our findings clearly indicated the influence of Cd on reactive oxygen species leading to impaired sperm morphology leading to male infertility. Copyright © 2018 Society for Biology of Reproduction & the Institute of Animal Reproduction and Food Research of Polish Academy of Sciences in Olsztyn. Published by Elsevier B.V. All rights reserved.

  9. Cell-type- and tissue-specific transcriptomes of the white spruce (Picea glauca) bark unmask fine-scale spatial patterns of constitutive and induced conifer defense.

    PubMed

    Celedon, Jose M; Yuen, Macaire M S; Chiang, Angela; Henderson, Hannah; Reid, Karen E; Bohlmann, Jörg

    2017-11-01

    Plant defenses often involve specialized cells and tissues. In conifers, specialized cells of the bark are important for defense against insects and pathogens. Using laser microdissection, we characterized the transcriptomes of cortical resin duct cells, phenolic cells and phloem of white spruce (Picea glauca) bark under constitutive and methyl jasmonate (MeJa)-induced conditions, and we compared these transcriptomes with the transcriptome of the bark tissue complex. Overall, ~3700 bark transcripts were differentially expressed in response to MeJa. Approximately 25% of transcripts were expressed in only one cell type, revealing cell specialization at the transcriptome level. MeJa caused cell-type-specific transcriptome responses and changed the overall patterns of cell-type-specific transcript accumulation. Comparison of transcriptomes of the conifer bark tissue complex and specialized cells resolved a masking effect inherent to transcriptome analysis of complex tissues, and showed the actual cell-type-specific transcriptome signatures. Characterization of cell-type-specific transcriptomes is critical to reveal the dynamic patterns of spatial and temporal display of constitutive and induced defense systems in a complex plant tissue or organ. This was demonstrated with the improved resolution of spatially restricted expression of sets of genes of secondary metabolism in the specialized cell types. © 2017 The Authors The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.

  10. Transcriptome analysis and gene expression profiling of abortive and developing ovules during fruit development in hazelnut.

    PubMed

    Cheng, Yunqing; Liu, Jianfeng; Zhang, Huidi; Wang, Ju; Zhao, Yixin; Geng, Wanting

    2015-01-01

    A high ratio of blank fruit in hazelnut (Corylus heterophylla Fisch) is a very common phenomenon that causes serious yield losses in northeast China. The development of blank fruit in the Corylus genus is known to be associated with embryo abortion. However, little is known about the molecular mechanisms responsible for embryo abortion during the nut development stage. Genomic information for C. heterophylla Fisch is not available; therefore, data related to transcriptome and gene expression profiling of developing and abortive ovules are needed. In this study, de novo transcriptome sequencing and RNA-seq analysis were conducted using short-read sequencing technology (Illumina HiSeq 2000). The results of the transcriptome assembly analysis revealed genetic information that was associated with the fruit development stage. Two digital gene expression libraries were constructed, one for a full (normally developing) ovule and one for an empty (abortive) ovule. Transcriptome sequencing and assembly results revealed 55,353 unigenes, including 18,751 clusters and 36,602 singletons. These results were annotated using the public databases NR, NT, Swiss-Prot, KEGG, COG, and GO. Using digital gene expression profiling, gene expression differences in developing and abortive ovules were identified. A total of 1,637 and 715 unigenes were significantly upregulated and downregulated, respectively, in abortive ovules, compared with developing ovules. Quantitative real-time polymerase chain reaction analysis was used in order to verify the differential expression of some genes. The transcriptome and digital gene expression profiling data of normally developing and abortive ovules in hazelnut provide exhaustive information that will improve our understanding of the molecular mechanisms of abortive ovule formation in hazelnut.

  11. Transcriptome Network Analysis Reveals Aging-Related Mitochondrial and Proteasomal Dysfunction and Immune Activation in Human Thyroid

    PubMed Central

    Cho, Byuri Angela; Yoo, Seong-Keun; Song, Young Shin; Kim, Su-jin; Lee, Kyu Eun; Shong, Minho

    2018-01-01

    Background: Elucidating aging-related transcriptomic changes in human organs is necessary to understand the aging physiology and mechanisms, but little is known regarding the thyroid gland. We investigated aging-related transcriptomic alterations in the human thyroid gland and characterized the related molecular functions. Methods: Publicly available RNA sequencing data of 322 thyroid tissue samples from the Genotype-Tissue Expression project were analyzed. In addition, our own 64 RNA sequencing data of normal thyroid tissue samples were used as a validation set. To comprehensively evaluate the associations between aging and transcriptomic changes, we performed a weighted gene coexpression network analysis and pathway enrichment analysis. The thyroid differentiation score was then used for further analysis, defining the correlations between thyroid differentiation and aging. Results: The most significant aging-related transcriptomic change in thyroid was the downregulation of genes related to the mitochondrial and proteasomal functions (p = 3 × 10−6). Moreover, genes that are associated with immune processes were significantly upregulated with age (p = 3 × 10−4), and all of them overlapped with the upregulated genes in the thyroid glands affected by lymphocytic thyroiditis. Furthermore, these aging-related changes were not significantly different according to sex, but in terms of the thyroid differentiation, females were more susceptible to aging-related changes (p for trend = 0.03). Conclusions: Aging-related transcriptomic changes in the thyroid gland were associated with mitochondrial and proteasomal dysfunction, loss of differentiation, and activation of autoimmune processes. Our results provide clues to better understanding the age-related decline in thyroid function and higher susceptibility to autoimmune thyroid disease. PMID:29652618

  12. Impact of Transcriptomics on Our Understanding of Pulmonary Fibrosis

    PubMed Central

    Vukmirovic, Milica; Kaminski, Naftali

    2018-01-01

    Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease characterized by aberrant remodeling of the lung parenchyma with extensive changes to the phenotypes of all lung resident cells. The introduction of transcriptomics, genome scale profiling of thousands of RNA transcripts, caused a significant inversion in IPF research. Instead of generating hypotheses based on animal models of disease, or biological plausibility, with limited validation in humans, investigators were able to generate hypotheses based on unbiased molecular analysis of human samples and then use animal models of disease to test their hypotheses. In this review, we describe the insights made from transcriptomic analysis of human IPF samples. We describe how transcriptomic studies led to identification of novel genes and pathways involved in the human IPF lung such as: matrix metalloproteinases, WNT pathway, epithelial genes, role of microRNAs among others, as well as conceptual insights such as the involvement of developmental pathways and deep shifts in epithelial and fibroblast phenotypes. The impact of lung and transcriptomic studies on disease classification, endotype discovery, and reproducible biomarkers is also described in detail. Despite these impressive achievements, the impact of transcriptomic studies has been limited because they analyzed bulk tissue and did not address the cellular and spatial heterogeneity of the IPF lung. We discuss new emerging technologies and applications, such as single-cell RNAseq and microenvironment analysis that may address cellular and spatial heterogeneity. We end by making the point that most current tissue collections and resources are not amenable to analysis using the novel technologies. To take advantage of the new opportunities, we need new efforts of sample collections, this time focused on access to all the microenvironments and cells in the IPF lung. PMID:29670881

  13. Epigenetic transgenerational inheritance of somatic transcriptomes and epigenetic control regions

    PubMed Central

    2012-01-01

    Background Environmentally induced epigenetic transgenerational inheritance of adult onset disease involves a variety of phenotypic changes, suggesting a general alteration in genome activity. Results Investigation of different tissue transcriptomes in male and female F3 generation vinclozolin versus control lineage rats demonstrated all tissues examined had transgenerational transcriptomes. The microarrays from 11 different tissues were compared with a gene bionetwork analysis. Although each tissue transgenerational transcriptome was unique, common cellular pathways and processes were identified between the tissues. A cluster analysis identified gene modules with coordinated gene expression and each had unique gene networks regulating tissue-specific gene expression and function. A large number of statistically significant over-represented clusters of genes were identified in the genome for both males and females. These gene clusters ranged from 2-5 megabases in size, and a number of them corresponded to the epimutations previously identified in sperm that transmit the epigenetic transgenerational inheritance of disease phenotypes. Conclusions Combined observations demonstrate that all tissues derived from the epigenetically altered germ line develop transgenerational transcriptomes unique to the tissue, but common epigenetic control regions in the genome may coordinately regulate these tissue-specific transcriptomes. This systems biology approach provides insight into the molecular mechanisms involved in the epigenetic transgenerational inheritance of a variety of adult onset disease phenotypes. PMID:23034163

  14. ATGC transcriptomics: a web-based application to integrate, explore and analyze de novo transcriptomic data.

    PubMed

    Gonzalez, Sergio; Clavijo, Bernardo; Rivarola, Máximo; Moreno, Patricio; Fernandez, Paula; Dopazo, Joaquín; Paniego, Norma

    2017-02-22

    In the last years, applications based on massively parallelized RNA sequencing (RNA-seq) have become valuable approaches for studying non-model species, e.g., without a fully sequenced genome. RNA-seq is a useful tool for detecting novel transcripts and genetic variations and for evaluating differential gene expression by digital measurements. The large and complex datasets resulting from functional genomic experiments represent a challenge in data processing, management, and analysis. This problem is especially significant for small research groups working with non-model species. We developed a web-based application, called ATGC transcriptomics, with a flexible and adaptable interface that allows users to work with new generation sequencing (NGS) transcriptomic analysis results using an ontology-driven database. This new application simplifies data exploration, visualization, and integration for a better comprehension of the results. ATGC transcriptomics provides access to non-expert computer users and small research groups to a scalable storage option and simple data integration, including database administration and management. The software is freely available under the terms of GNU public license at http://atgcinta.sourceforge.net .

  15. Prioritizing Therapeutics for Lung Cancer: An Integrative Meta-analysis of Cancer Gene Signatures and Chemogenomic Data

    PubMed Central

    Fortney, Kristen; Griesman, Joshua; Kotlyar, Max; Pastrello, Chiara; Angeli, Marc; Sound-Tsao, Ming; Jurisica, Igor

    2015-01-01

    Repurposing FDA-approved drugs with the aid of gene signatures of disease can accelerate the development of new therapeutics. A major challenge to developing reliable drug predictions is heterogeneity. Different gene signatures of the same disease or drug treatment often show poor overlap across studies, as a consequence of both biological and technical variability, and this can affect the quality and reproducibility of computational drug predictions. Existing algorithms for signature-based drug repurposing use only individual signatures as input. But for many diseases, there are dozens of signatures in the public domain. Methods that exploit all available transcriptional knowledge on a disease should produce improved drug predictions. Here, we adapt an established meta-analysis framework to address the problem of drug repurposing using an ensemble of disease signatures. Our computational pipeline takes as input a collection of disease signatures, and outputs a list of drugs predicted to consistently reverse pathological gene changes. We apply our method to conduct the largest and most systematic repurposing study on lung cancer transcriptomes, using 21 signatures. We show that scaling up transcriptional knowledge significantly increases the reproducibility of top drug hits, from 44% to 78%. We extensively characterize drug hits in silico, demonstrating that they slow growth significantly in nine lung cancer cell lines from the NCI-60 collection, and identify CALM1 and PLA2G4A as promising drug targets for lung cancer. Our meta-analysis pipeline is general, and applicable to any disease context; it can be applied to improve the results of signature-based drug repurposing by leveraging the large number of disease signatures in the public domain. PMID:25786242

  16. Toward the Replacement of Animal Experiments through the Bioinformatics-driven Analysis of 'Omics' Data from Human Cell Cultures.

    PubMed

    Grafström, Roland C; Nymark, Penny; Hongisto, Vesa; Spjuth, Ola; Ceder, Rebecca; Willighagen, Egon; Hardy, Barry; Kaski, Samuel; Kohonen, Pekka

    2015-11-01

    This paper outlines the work for which Roland Grafström and Pekka Kohonen were awarded the 2014 Lush Science Prize. The research activities of the Grafström laboratory have, for many years, covered cancer biology studies, as well as the development and application of toxicity-predictive in vitro models to determine chemical safety. Through the integration of in silico analyses of diverse types of genomics data (transcriptomic and proteomic), their efforts have proved to fit well into the recently-developed Adverse Outcome Pathway paradigm. Genomics analysis within state-of-the-art cancer biology research and Toxicology in the 21st Century concepts share many technological tools. A key category within the Three Rs paradigm is the Replacement of animals in toxicity testing with alternative methods, such as bioinformatics-driven analyses of data obtained from human cell cultures exposed to diverse toxicants. This work was recently expanded within the pan-European SEURAT-1 project (Safety Evaluation Ultimately Replacing Animal Testing), to replace repeat-dose toxicity testing with data-rich analyses of sophisticated cell culture models. The aims and objectives of the SEURAT project have been to guide the application, analysis, interpretation and storage of 'omics' technology-derived data within the service-oriented sub-project, ToxBank. Particularly addressing the Lush Science Prize focus on the relevance of toxicity pathways, a 'data warehouse' that is under continuous expansion, coupled with the development of novel data storage and management methods for toxicology, serve to address data integration across multiple 'omics' technologies. The prize winners' guiding principles and concepts for modern knowledge management of toxicological data are summarised. The translation of basic discovery results ranged from chemical-testing and material-testing data, to information relevant to human health and environmental safety. 2015 FRAME.

  17. Quantitative analysis of ChIP-seq data uncovers dynamic and sustained H3K4me3 and H3K27me3 modulation in cancer cells under hypoxia.

    PubMed

    Adriaens, Michiel E; Prickaerts, Peggy; Chan-Seng-Yue, Michelle; van den Beucken, Twan; Dahlmans, Vivian E H; Eijssen, Lars M; Beck, Timothy; Wouters, Bradly G; Voncken, Jan Willem; Evelo, Chris T A

    2016-01-01

    A comprehensive assessment of the epigenetic dynamics in cancer cells is the key to understanding the molecular mechanisms underlying cancer and to improving cancer diagnostics, prognostics and treatment. By combining genome-wide ChIP-seq epigenomics and microarray transcriptomics, we studied the effects of oxygen deprivation and subsequent reoxygenation on histone 3 trimethylation of lysine 4 (H3K4me3) and lysine 27 (H3K27me3) in a breast cancer cell line, serving as a model for abnormal oxygenation in solid tumors. A priori, epigenetic markings and gene expression levels not only are expected to vary greatly between hypoxic and normoxic conditions, but also display a large degree of heterogeneity across the cell population. Where traditionally ChIP-seq data are often treated as dichotomous data, the model and experiment here necessitate a quantitative, data-driven analysis of both datasets. We first identified genomic regions with sustained epigenetic markings, which provided a sample-specific reference enabling quantitative ChIP-seq data analysis. Sustained H3K27me3 marking was located around centromeres and intergenic regions, while sustained H3K4me3 marking is associated with genes involved in RNA binding, translation and protein transport and localization. Dynamic marking with both H3K4me3 and H3K27me3 (hypoxia-induced bivalency) was found in CpG-rich regions at loci encoding factors that control developmental processes, congruent with observations in embryonic stem cells. In silico -identified epigenetically sustained and dynamic genomic regions were confirmed through ChIP-PCR in vitro, and obtained results are corroborated by published data and current insights regarding epigenetic regulation.

  18. Using metabolic flux data to further constrain the metabolic solution space and predict internal flux patterns: the Escherichia coli spectrum.

    PubMed

    Wiback, Sharon J; Mahadevan, Radhakrishnan; Palsson, Bernhard Ø

    2004-05-05

    Constraint-based metabolic modeling has been used to capture the genome-scale, systems properties of an organism's metabolism. The first generation of these models has been built on annotated gene sequence. To further this field, we now need to develop methods to incorporate additional "omic" data types including transcriptomics, metabolomics, and fluxomics to further facilitate the construction, validation, and predictive capabilities of these models. The work herein combines metabolic flux data with an in silico model of central metabolism of Escherichia coli for model centric integration of the flux data. The extreme pathways for this network, which define the allowable solution space for all possible flux distributions, are analyzed using the alpha-spectrum. The alpha-spectrum determines which extreme pathways can and cannot contribute to the metabolic flux distribution for a given condition and gives the allowable range of weightings on each extreme pathway that can contribute. Since many extreme pathways cannot be used under certain conditions, the result is a "condition-specific" solution space that is a subset of the original solution space. The alpha-spectrum results are used to create a "condition-specific" extreme pathway matrix that can be analyzed using singular value decomposition (SVD). The first mode of the SVD analysis characterizes the solution space for a given condition. We show that SVD analysis of the alpha-spectrum extreme pathway matrix that incorporates measured uptake and byproduct secretion rates, can predict internal flux trends for different experimental conditions. These predicted internal flux trends are, in general, consistent with the flux trends measured using experimental metabolic flux analysis techniques. Copyright 2004 Wiley Periodicals, Inc.

  19. Helicobacter pylori HP1512 Is a Nickel-Responsive NikR-Regulated Outer Membrane Protein▿

    PubMed Central

    Davis, Gregg S.; Flannery, Erika L.; Mobley, Harry L. T.

    2006-01-01

    Helicobacter pylori is dependent upon the production of the highly abundant and active metalloenzyme urease for colonization of the human stomach. Thus, H. pylori has an absolute requirement for the transition metal nickel, a required cofactor for urease. To investigate the contribution of genes that are factors in this process, microarray analysis comparing the transcriptome of wild-type H. pylori 26695 cultured in brucella broth containing fetal calf serum (BBF) alone or supplemented with 100 μM NiCl2 suggested that HP1512 is repressed in the presence of 100 μM supplemental nickel. When measured by comparative real-time quantitative PCR (qPCR), HP1512 transcription was reduced 43-fold relative to the value for the wild type when cultured in BBF supplemented with 10 μM NiCl2. When grown in unsupplemented BBF, urease activity of an HP1512::cat mutant was significantly reduced compared to the wild type, 4.9 ± 0.5 μmol/min/mg of protein (n = 7) and 17.1 ± 4.9 μmol/min/mg of protein (n = 13), respectively (P < 0.0001). In silico analysis of the HP1511-HP1512 (HP1511-1512) intergenic region identified a putative NikR operator upstream of HP1512. Gel shift analysis with purified recombinant NikR verified nickel-dependent binding of H. pylori NikR to the HP1511-1512 intergenic region. Furthermore, comparative real-time qPCR of four nickel-related genes suggests that mutation of HP1512 results in reduced intracellular nickel concentration relative to wild-type H. pylori 26695. Taken together, these data suggest that HP1512 encodes a NikR-nickel-regulated outer membrane protein. PMID:17030579

  20. Global Transcriptome Analysis of Staphylococcus aureus Response to Hydrogen Peroxide†

    PubMed Central

    Chang, Wook; Small, David A.; Toghrol, Freshteh; Bentley, William E.

    2006-01-01

    Staphylococcus aureus responds with protective strategies against phagocyte-derived reactive oxidants to infect humans. Herein, we report the transcriptome analysis of the cellular response of S. aureus to hydrogen peroxide-induced oxidative stress. The data indicate that the oxidative response includes the induction of genes involved in virulence, DNA repair, and notably, anaerobic metabolism. PMID:16452450

  1. Abscisic Acid Is a Major Regulator of Grape Berry Ripening Onset: New Insights into ABA Signaling Network

    PubMed Central

    Pilati, Stefania; Bagagli, Giorgia; Sonego, Paolo; Moretto, Marco; Brazzale, Daniele; Castorina, Giulia; Simoni, Laura; Tonelli, Chiara; Guella, Graziano; Engelen, Kristof; Galbiati, Massimo; Moser, Claudio

    2017-01-01

    Grapevine is a world-wide cultivated economically relevant crop. The process of berry ripening is non-climacteric and does not rely on the sole ethylene signal. Abscisic acid (ABA) is recognized as an important hormone of ripening inception and color development in ripening berries. In order to elucidate the effect of this signal at the molecular level, pre-véraison berries were treated ex vivo for 20 h with 0.2 mM ABA and berry skin transcriptional modulation was studied by RNA-seq after the treatment and 24 h later, in the absence of exogenous ABA. This study highlighted that a small amount of ABA triggered its own biosynthesis and had a transcriptome-wide effect (1893 modulated genes) characterized by the amplification of the transcriptional response over time. By comparing this dataset with the many studies on ripening collected within the grapevine transcriptomic compendium Vespucci, an extended overlap between ABA- and ripening modulated gene sets was observed (71% of the genes), underpinning the role of this hormone in the regulation of berry ripening. The signaling network of ABA, encompassing ABA metabolism, transport and signaling cascade, has been analyzed in detail and expanded based on knowledge from other species in order to provide an integrated molecular description of this pathway at berry ripening onset. Expression data analysis was combined with in silico promoter analysis to identify candidate target genes of ABA responsive element binding protein 2 (VvABF2), a key upstream transcription factor of the ABA signaling cascade which is up-regulated at véraison and also by ABA treatments. Two transcription factors, VvMYB143 and VvNAC17, and two genes involved in protein degradation, Armadillo-like and Xerico-like genes, were selected for in vivo validation by VvABF2-mediated promoter trans-activation in tobacco. VvNAC17 and Armadillo-like promoters were induced by ABA via VvABF2, while VvMYB143 responded to ABA in a VvABF2-independent manner. This knowledge of the ABA cascade in berry skin contributes not only to the understanding of berry ripening regulation but might be useful to other areas of viticultural interest, such as bud dormancy regulation and drought stress tolerance. PMID:28680438

  2. Vitiligo blood transcriptomics provides new insights into disease mechanisms and identifies potential novel therapeutic targets.

    PubMed

    Dey-Rao, Rama; Sinha, Animesh A

    2017-01-28

    Significant gaps remain regarding the pathomechanisms underlying the autoimmune response in vitiligo (VL), where the loss of self-tolerance leads to the targeted killing of melanocytes. Specifically, there is incomplete information regarding alterations in the systemic environment that are relevant to the disease state. We undertook a genome-wide profiling approach to examine gene expression in the peripheral blood of VL patients and healthy controls in the context of our previously published VL-skin gene expression profile. We used several in silico bioinformatics-based analyses to provide new insights into disease mechanisms and suggest novel targets for future therapy. Unsupervised clustering methods of the VL-blood dataset demonstrate a "disease-state"-specific set of co-expressed genes. Ontology enrichment analysis of 99 differentially expressed genes (DEGs) uncovers a down-regulated immune/inflammatory response, B-Cell antigen receptor (BCR) pathways, apoptosis and catabolic processes in VL-blood. There is evidence for both type I and II interferon (IFN) playing a role in VL pathogenesis. We used interactome analysis to identify several key blood associated transcriptional factors (TFs) from within (STAT1, STAT6 and NF-kB), as well as "hidden" (CREB1, MYC, IRF4, IRF1, and TP53) from the dataset that potentially affect disease pathogenesis. The TFs overlap with our reported lesional-skin transcriptional circuitry, underscoring their potential importance to the disease. We also identify a shared VL-blood and -skin transcriptional "hot spot" that maps to chromosome 6, and includes three VL-blood dysregulated genes (PSMB8, PSMB9 and TAP1) described as potential VL-associated genetic susceptibility loci. Finally, we provide bioinformatics-based support for prioritizing dysregulated genes in VL-blood or skin as potential therapeutic targets. We examined the VL-blood transcriptome in context with our (previously published) VL-skin transcriptional profile to address a major gap in knowledge regarding the systemic changes underlying skin-specific manifestation of vitiligo. Several transcriptional "hot spots" observed in both environments offer prioritized targets for identifying disease risk genes. Finally, within the transcriptional framework of VL, we identify five novel molecules (STAT1, PRKCD, PTPN6, MYC and FGFR2) that lend themselves to being targeted by drugs for future potential VL-therapy.

  3. Identification of Novel and Conserved miRNAs from Extreme Halophyte, Oryza coarctata, a Wild Relative of Rice.

    PubMed

    Mondal, Tapan Kumar; Ganie, Showkat Ahmad; Debnath, Ananda Bhusan

    2015-01-01

    Oryza coarctata, a halophyte and wild relative of rice, is grown normally in saline water. MicroRNAs (miRNAs) are non-coding RNAs that play pivotal roles in every domain of life including stress response. There are very few reports on the discovery of salt-responsive miRNAs from halophytes. In this study, two small RNA libraries, one each from the control and salt-treated (450 mM NaCl for 24 h) leaves of O. coarctata were sequenced, which yielded 338 known and 95 novel miRNAs. Additionally, we used publicly available transcriptomics data of O. coarctata which led to the discovery of additional 48 conserved miRNAs along with their pre-miRNA sequences through in silico analysis. In total, 36 known and 7 novel miRNAs were up-regulated whereas, 12 known and 7 novel miRNAs were down-regulated under salinity stress. Further, 233 and 154 target genes were predicted for 48 known and 14 novel differentially regulated miRNAs respectively. These targets with the help of gene ontology analysis were found to be involved in several important biological processes that could be involved in salinity tolerance. Relative expression trends of majority of the miRNAs as detected by real time-PCR as well as predicted by Illumina sequencing were found to be coherent. Additionally, expression of most of the target genes was negatively correlated with their corresponding miRNAs. Thus, the present study provides an account of miRNA-target networking that is involved in salinity adaption of O. coarctata.

  4. Transcriptomic Analysis of Phenotypic Changes in Birch (Betula platyphylla) Autotetraploids

    PubMed Central

    Mu, Huai-Zhi; Liu, Zi-Jia; Lin, Lin; Li, Hui-Yu; Jiang, Jing; Liu, Gui-Feng

    2012-01-01

    Plant breeders have focused much attention on polyploid trees because of their importance to forestry. To evaluate the impact of intraspecies genome duplication on the transcriptome, a series of Betula platyphylla autotetraploids and diploids were generated from four full-sib families. The phenotypes and transcriptomes of these autotetraploid individuals were compared with those of diploid trees. Autotetraploids were generally superior in breast-height diameter, volume, leaf, fruit and stoma and were generally inferior in height compared to diploids. Transcriptome data revealed numerous changes in gene expression attributable to autotetraploidization, which resulted in the upregulation of 7052 unigenes and the downregulation of 3658 unigenes. Pathway analysis revealed that the biosynthesis and signal transduction of indoleacetate (IAA) and ethylene were altered after genome duplication, which may have contributed to phenotypic changes. These results shed light on variations in birch autotetraploidization and help identify important genes for the genetic engineering of birch trees. PMID:23202935

  5. De novo Assembly and Analysis of the Chilean Pencil Catfish Trichomycterus areolatus Transcriptome

    PubMed Central

    Schulze, Thomas T.; Ali, Jonathan M.; Bartlett, Maggie L.; McFarland, Madalyn M.; Clement, Emalie J.; Won, Harim I.; Sanford, Austin G.; Monzingo, Elyssa B.; Martens, Matthew C.; Hemsley, Ryan M.; Kumar, Sidharta; Gouin, Nicolas; Kolok, Alan S.; Davis, Paul H.

    2016-01-01

    Trichomycterus areolatus is an endemic species of pencil catfish that inhabits the riffles and rapids of many freshwater ecosystems of Chile. Despite its unique adaptation to Chile's high gradient watersheds and therefore potential application in the investigation of ecosystem integrity and environmental contamination, relatively little is known regarding the molecular biology of this environmental sentinel. Here, we detail the assembly of the Trichomycterus areolatus transcriptome, a molecular resource for the study of this organism and its molecular response to the environment. RNA-Seq reads were obtained by next-generation sequencing with an Illumina® platform and processed using PRINSEQ. The transcriptome assembly was performed using TRINITY assembler. Transcriptome validation was performed by functional characterization with KOG, KEGG, and GO analyses. Additionally, differential expression analysis highlights sex-specific expression patterns, and a list of endocrine and oxidative stress related transcripts are included. PMID:27672404

  6. Transcriptome analysis by strand-specific sequencing of complementary DNA

    PubMed Central

    Parkhomchuk, Dmitri; Borodina, Tatiana; Amstislavskiy, Vyacheslav; Banaru, Maria; Hallen, Linda; Krobitsch, Sylvia; Lehrach, Hans; Soldatov, Alexey

    2009-01-01

    High-throughput complementary DNA sequencing (RNA-Seq) is a powerful tool for whole-transcriptome analysis, supplying information about a transcript's expression level and structure. However, it is difficult to determine the polarity of transcripts, and therefore identify which strand is transcribed. Here, we present a simple cDNA sequencing protocol that preserves information about a transcript's direction. Using Saccharomyces cerevisiae and mouse brain transcriptomes as models, we demonstrate that knowing the transcript's orientation allows more accurate determination of the structure and expression of genes. It also helps to identify new genes and enables studying promoter-associated and antisense transcription. The transcriptional landscapes we obtained are available online. PMID:19620212

  7. Transcriptome analysis by strand-specific sequencing of complementary DNA.

    PubMed

    Parkhomchuk, Dmitri; Borodina, Tatiana; Amstislavskiy, Vyacheslav; Banaru, Maria; Hallen, Linda; Krobitsch, Sylvia; Lehrach, Hans; Soldatov, Alexey

    2009-10-01

    High-throughput complementary DNA sequencing (RNA-Seq) is a powerful tool for whole-transcriptome analysis, supplying information about a transcript's expression level and structure. However, it is difficult to determine the polarity of transcripts, and therefore identify which strand is transcribed. Here, we present a simple cDNA sequencing protocol that preserves information about a transcript's direction. Using Saccharomyces cerevisiae and mouse brain transcriptomes as models, we demonstrate that knowing the transcript's orientation allows more accurate determination of the structure and expression of genes. It also helps to identify new genes and enables studying promoter-associated and antisense transcription. The transcriptional landscapes we obtained are available online.

  8. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.).

    PubMed

    Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D

    2015-02-01

    The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.

  9. Evaluation of Bioinformatic Programmes for the Analysis of Variants within Splice Site Consensus Regions

    PubMed Central

    Tang, Rongying; Prosser, Debra O.; Love, Donald R.

    2016-01-01

    The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609

  10. In silico analysis of cacao (Theobroma cacao L.) genes that involved in pathogen and disease responses

    NASA Astrophysics Data System (ADS)

    Agung, Muhammad Budi; Budiarsa, I. Made; Suwastika, I. Nengah

    2017-02-01

    Cocoa bean is one of the main commodities from Indonesia for the world, which still have problem regarding yield degradation due to pathogens and disease attack. Developing robust cacao plant that genetically resistant to pathogen and disease attack is an ideal solution in over taking on this problem. The aim of this study was to identify Theobroma cacao genes on database of cacao genome that homolog to response genes of pathogen and disease attack in other plant, through in silico analysis. Basic information survey and gene identification were performed in GenBank and The Arabidopsis Information Resource database. The In silico analysis contains protein BLAST, homology test of each gene's protein candidates, and identification of homologue gene in Cacao Genome Database using data source "Theobroma cacao cv. Matina 1-6 v1.1" genome. Identification found that Thecc1EG011959t1 (EDS1), Thecc1EG006803t1 (EDS5), Thecc1EG013842t1 (ICS1), and Thecc1EG015614t1 (BG_PPAP) gene of Cacao Genome Database were Theobroma cacao genes that homolog to plant's resistance genes which highly possible to have similar functions of each gene's homologue gene.

  11. Pyrosequencing the Bemisia tabaci Transcriptome Reveals a Highly Diverse Bacterial Community and a Robust System for Insecticide Resistance

    PubMed Central

    Wu, Qing-jun; Wang, Shao-li; Yang, Xin; Yang, Ni-na; Li, Ru-mei; Jiao, Xiao-guo; Pan, Hui-peng; Liu, Bai-ming; Su, Qi; Xu, Bao-yun; Hu, Song-nian; Zhou, Xu-guo; Zhang, You-jun

    2012-01-01

    Background Bemisia tabaci (Gennadius) is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. Methodology and Principal Findings Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the nohit. A total of 50,835 (28.45%) unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10–5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis indentified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. Conclusions This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the B. tabaci complex. Moreover, current pyrosequencing effort greatly enriched the existing whitefly EST database, and makes RNAseq a viable option for future genomic analysis. PMID:22558125

  12. Comprehensive evaluation of AmpliSeq transcriptome, a novel targeted whole transcriptome RNA sequencing methodology for global gene expression analysis.

    PubMed

    Li, Wenli; Turner, Amy; Aggarwal, Praful; Matter, Andrea; Storvick, Erin; Arnett, Donna K; Broeckel, Ulrich

    2015-12-16

    Whole transcriptome sequencing (RNA-seq) represents a powerful approach for whole transcriptome gene expression analysis. However, RNA-seq carries a few limitations, e.g., the requirement of a significant amount of input RNA and complications led by non-specific mapping of short reads. The Ion AmpliSeq Transcriptome Human Gene Expression Kit (AmpliSeq) was recently introduced by Life Technologies as a whole-transcriptome, targeted gene quantification kit to overcome these limitations of RNA-seq. To assess the performance of this new methodology, we performed a comprehensive comparison of AmpliSeq with RNA-seq using two well-established next-generation sequencing platforms (Illumina HiSeq and Ion Torrent Proton). We analyzed standard reference RNA samples and RNA samples obtained from human induced pluripotent stem cell derived cardiomyocytes (hiPSC-CMs). Using published data from two standard RNA reference samples, we observed a strong concordance of log2 fold change for all genes when comparing AmpliSeq to Illumina HiSeq (Pearson's r = 0.92) and Ion Torrent Proton (Pearson's r = 0.92). We used ROC, Matthew's correlation coefficient and RMSD to determine the overall performance characteristics. All three statistical methods demonstrate AmpliSeq as a highly accurate method for differential gene expression analysis. Additionally, for genes with high abundance, AmpliSeq outperforms the two RNA-seq methods. When analyzing four closely related hiPSC-CM lines, we show that both AmpliSeq and RNA-seq capture similar global gene expression patterns consistent with known sources of variations. Our study indicates that AmpliSeq excels in the limiting areas of RNA-seq for gene expression quantification analysis. Thus, AmpliSeq stands as a very sensitive and cost-effective approach for very large scale gene expression analysis and mRNA marker screening with high accuracy.

  13. Transcriptome-Based Characterization of Interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus in Lactose-Grown Chemostat Cocultures

    PubMed Central

    Mendes, Filipa; Sieuwerts, Sander; de Hulster, Erik; Almering, Marinka J. H.; Luttik, Marijke A. H.; Pronk, Jack T.; Smid, Eddy J.; Bron, Peter A.

    2013-01-01

    Mixed populations of Saccharomyces cerevisiae yeasts and lactic acid bacteria occur in many dairy, food, and beverage fermentations, but knowledge about their interactions is incomplete. In the present study, interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus, two microorganisms that co-occur in kefir fermentations, were studied during anaerobic growth on lactose. By combining physiological and transcriptome analysis of the two strains in the cocultures, five mechanisms of interaction were identified. (i) Lb. delbrueckii subsp. bulgaricus hydrolyzes lactose, which cannot be metabolized by S. cerevisiae, to galactose and glucose. Subsequently, galactose, which cannot be metabolized by Lb. delbrueckii subsp. bulgaricus, is excreted and provides a carbon source for yeast. (ii) In pure cultures, Lb. delbrueckii subsp. bulgaricus grows only in the presence of increased CO2 concentrations. In anaerobic mixed cultures, the yeast provides this CO2 via alcoholic fermentation. (iii) Analysis of amino acid consumption from the defined medium indicated that S. cerevisiae supplied alanine to the bacterium. (iv) A mild but significant low-iron response in the yeast transcriptome, identified by DNA microarray analysis, was consistent with the chelation of iron by the lactate produced by Lb. delbrueckii subsp. bulgaricus. (v) Transcriptome analysis of Lb. delbrueckii subsp. bulgaricus in mixed cultures showed an overrepresentation of transcripts involved in lipid metabolism, suggesting either a competition of the two microorganisms for fatty acids or a response to the ethanol produced by S. cerevisiae. This study demonstrates that chemostat-based transcriptome analysis is a powerful tool to investigate microbial interactions in mixed populations. PMID:23872557

  14. CONVERGENT TRANSCRIPTOMICS AND PROTEOMICS OF ENVIRONMENTAL ENRICHMENT AND COCAINE IDENTIFIES NOVEL THERAPEUTIC STRATEGIES FOR ADDICTION

    PubMed Central

    ZHANG, YAFANG; CROFTON, ELIZABETH J.; FAN, XIUZHEN; LI, DINGGE; KONG, FANPING; SINHA, MALA; LUXON, BRUCE A.; SPRATT, HEIDI M.; LICHTI, CHERYL F.; GREEN, THOMAS A.

    2016-01-01

    Transcriptomic and proteomic approaches have separately proven effective at identifying novel mechanisms affecting addiction-related behavior; however, it is difficult to prioritize the many promising leads from each approach. A convergent secondary analysis of proteomic and transcriptomic results can glean additional information to help prioritize promising leads. The current study is a secondary analysis of the convergence of recently published separate transcriptomic and proteomic analyses of nucleus accumbens (NAc) tissue from rats subjected to environmental enrichment vs. isolation and cocaine self-administration vs. saline. Multiple bioinformatics approaches (e.g. Gene Ontology (GO) analysis, Ingenuity Pathway Analysis (IPA), and Gene Set Enrichment Analysis (GSEA)) were used to interrogate these rich data sets. Although there was little correspondence between mRNA vs. protein at the individual target level, good correspondence was found at the level of gene/protein sets, particularly for the environmental enrichment manipulation. These data identify gene sets where there is a positive relationship between changes in mRNA and protein (e.g. glycolysis, ATP synthesis, translation elongation factor activity, etc.) and gene sets where there is an inverse relationship (e.g. ribosomes, Rho GTPase signaling, protein ubiquitination, etc.). Overall environmental enrichment produced better correspondence than cocaine self-administration. The individual targets contributing to mRNA and protein effects were largely not overlapping. As a whole, these results confirm that robust transcriptomic and proteomic data sets can provide similar results at the gene/protein set level even when there is little correspondence at the individual target level and little overlap in the targets contributing to the effects. PMID:27717806

  15. Transcriptome-based characterization of interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus in lactose-grown chemostat cocultures.

    PubMed

    Mendes, Filipa; Sieuwerts, Sander; de Hulster, Erik; Almering, Marinka J H; Luttik, Marijke A H; Pronk, Jack T; Smid, Eddy J; Bron, Peter A; Daran-Lapujade, Pascale

    2013-10-01

    Mixed populations of Saccharomyces cerevisiae yeasts and lactic acid bacteria occur in many dairy, food, and beverage fermentations, but knowledge about their interactions is incomplete. In the present study, interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus, two microorganisms that co-occur in kefir fermentations, were studied during anaerobic growth on lactose. By combining physiological and transcriptome analysis of the two strains in the cocultures, five mechanisms of interaction were identified. (i) Lb. delbrueckii subsp. bulgaricus hydrolyzes lactose, which cannot be metabolized by S. cerevisiae, to galactose and glucose. Subsequently, galactose, which cannot be metabolized by Lb. delbrueckii subsp. bulgaricus, is excreted and provides a carbon source for yeast. (ii) In pure cultures, Lb. delbrueckii subsp. bulgaricus grows only in the presence of increased CO2 concentrations. In anaerobic mixed cultures, the yeast provides this CO2 via alcoholic fermentation. (iii) Analysis of amino acid consumption from the defined medium indicated that S. cerevisiae supplied alanine to the bacterium. (iv) A mild but significant low-iron response in the yeast transcriptome, identified by DNA microarray analysis, was consistent with the chelation of iron by the lactate produced by Lb. delbrueckii subsp. bulgaricus. (v) Transcriptome analysis of Lb. delbrueckii subsp. bulgaricus in mixed cultures showed an overrepresentation of transcripts involved in lipid metabolism, suggesting either a competition of the two microorganisms for fatty acids or a response to the ethanol produced by S. cerevisiae. This study demonstrates that chemostat-based transcriptome analysis is a powerful tool to investigate microbial interactions in mixed populations.

  16. Microfluidic single-cell whole-transcriptome sequencing.

    PubMed

    Streets, Aaron M; Zhang, Xiannian; Cao, Chen; Pang, Yuhong; Wu, Xinglong; Xiong, Liang; Yang, Lu; Fu, Yusi; Zhao, Liang; Tang, Fuchou; Huang, Yanyi

    2014-05-13

    Single-cell whole-transcriptome analysis is a powerful tool for quantifying gene expression heterogeneity in populations of cells. Many techniques have, thus, been recently developed to perform transcriptome sequencing (RNA-Seq) on individual cells. To probe subtle biological variation between samples with limiting amounts of RNA, more precise and sensitive methods are still required. We adapted a previously developed strategy for single-cell RNA-Seq that has shown promise for superior sensitivity and implemented the chemistry in a microfluidic platform for single-cell whole-transcriptome analysis. In this approach, single cells are captured and lysed in a microfluidic device, where mRNAs with poly(A) tails are reverse-transcribed into cDNA. Double-stranded cDNA is then collected and sequenced using a next generation sequencing platform. We prepared 94 libraries consisting of single mouse embryonic cells and technical replicates of extracted RNA and thoroughly characterized the performance of this technology. Microfluidic implementation increased mRNA detection sensitivity as well as improved measurement precision compared with tube-based protocols. With 0.2 M reads per cell, we were able to reconstruct a majority of the bulk transcriptome with 10 single cells. We also quantified variation between and within different types of mouse embryonic cells and found that enhanced measurement precision, detection sensitivity, and experimental throughput aided the distinction between biological variability and technical noise. With this work, we validated the advantages of an early approach to single-cell RNA-Seq and showed that the benefits of combining microfluidic technology with high-throughput sequencing will be valuable for large-scale efforts in single-cell transcriptome analysis.

  17. Genome-Scale Transcriptome Analysis in Response to Nitric Oxide in Birch Cells: Implications of the Triterpene Biosynthetic Pathway

    PubMed Central

    Zeng, Fansuo; Sun, Fengkun; Li, Leilei; Liu, Kun; Zhan, Yaguang

    2014-01-01

    Evidence supporting nitric oxide (NO) as a mediator of plant biochemistry continues to grow, but its functions at the molecular level remains poorly understood and, in some cases, controversial. To study the role of NO at the transcriptional level in Betula platyphylla cells, we conducted a genome-scale transcriptome analysis of these cells. The transcriptome of untreated birch cells and those treated by sodium nitroprusside (SNP) were analyzed using the Solexa sequencing. Data were collected by sequencing cDNA libraries of birch cells, which had a long period to adapt to the suspension culture conditions before SNP-treated cells and untreated cells were sampled. Among the 34,100 UniGenes detected, BLASTX search revealed that 20,631 genes showed significant (E-values≤10−5) sequence similarity with proteins from the NR-database. Numerous expressed sequence tags (i.e., 1374) were identified as differentially expressed between the 12 h SNP-treated cells and control cells samples: 403 up-regulated and 971 down-regulated. From this, we specifically examined a core set of NO-related transcripts. The altered expression levels of several transcripts, as determined by transcriptome analysis, was confirmed by qRT-PCR. The results of transcriptome analysis, gene expression quantification, the content of triterpenoid and activities of defensive enzymes elucidated NO has a significant effect on many processes including triterpenoid production, carbohydrate metabolism and cell wall biosynthesis. PMID:25551661

  18. TRAM (Transcriptome Mapper): database-driven creation and analysis of transcriptome maps from multiple sources

    PubMed Central

    2011-01-01

    Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005

  19. Genome-scale cold stress response regulatory networks in ten Arabidopsis thaliana ecotypes

    PubMed Central

    2013-01-01

    Background Low temperature leads to major crop losses every year. Although several studies have been conducted focusing on diversity of cold tolerance level in multiple phenotypically divergent Arabidopsis thaliana (A. thaliana) ecotypes, genome-scale molecular understanding is still lacking. Results In this study, we report genome-scale transcript response diversity of 10 A. thaliana ecotypes originating from different geographical locations to non-freezing cold stress (10°C). To analyze the transcriptional response diversity, we initially compared transcriptome changes in all 10 ecotypes using Arabidopsis NimbleGen ATH6 microarrays. In total 6061 transcripts were significantly cold regulated (p < 0.01) in 10 ecotypes, including 498 transcription factors and 315 transposable elements. The majority of the transcripts (75%) showed ecotype specific expression pattern. By using sequence data available from Arabidopsis thaliana 1001 genome project, we further investigated sequence polymorphisms in the core cold stress regulon genes. Significant numbers of non-synonymous amino acid changes were observed in the coding region of the CBF regulon genes. Considering the limited knowledge about regulatory interactions between transcription factors and their target genes in the model plant A. thaliana, we have adopted a powerful systems genetics approach- Network Component Analysis (NCA) to construct an in-silico transcriptional regulatory network model during response to cold stress. The resulting regulatory network contained 1,275 nodes and 7,720 connections, with 178 transcription factors and 1,331 target genes. Conclusions A. thaliana ecotypes exhibit considerable variation in transcriptome level responses to non-freezing cold stress treatment. Ecotype specific transcripts and related gene ontology (GO) categories were identified to delineate natural variation of cold stress regulated differential gene expression in the model plant A. thaliana. The predicted regulatory network model was able to identify new ecotype specific transcription factors and their regulatory interactions, which might be crucial for their local geographic adaptation to cold temperature. Additionally, since the approach presented here is general, it could be adapted to study networks regulating biological process in any biological systems. PMID:24148294

  20. JNK1 induces hedgehog signaling from stellate cells to accelerate liver regeneration in mice.

    PubMed

    Langiewicz, Magda; Graf, Rolf; Humar, Bostjan; Clavien, Pierre A

    2018-04-28

    To improve outcomes of two-staged hepatectomies for large/multiple liver tumors, portal vein ligation (PVL) has been combined with parenchymal transection (associating liver partition and portal vein ligation for staged hepatectomy [coined ALPPS]) to greatly accelerate liver regeneration. In a novel ALPPS mouse model, we have reported paracrine Indian hedgehog (IHH) signaling from stellate cells as an early contributor to augmented regeneration. Here, we sought to identify upstream regulators of IHH. ALPPS in mice was compared against PVL and additional control surgeries. Potential IHH regulators were identified through in silico mining of transcriptomic data. c-Jun N-terminal kinase (JNK1 [Mapk8]) activity was reduced through SP600125 to evaluate its effects on IHH signaling. Recombinant IHH was injected after JNK1 diminution to substantiate their relationship during accelerated liver regeneration. Transcriptomic analysis linked Ihh to Mapk8. JNK1 upregulation after ALPPS was validated and preceded the IHH peak. On immunofluorescence, JNK1 and IHH co-localized in alpha-smooth muscle actin-positive non-parenchymal cells. Inhibition of JNK1 prior to ALPPS surgery reduced liver weight gain to PVL levels and was accompanied by downregulation of hepatocellular proliferation and the IHH-GLI1-CCND1 axis. In JNK1-inhibited mice, recombinant IHH restored ALPPS-like acceleration of regeneration and re-elevated JNK1 activity, suggesting the presence of a positive IHH-JNK1 feedback loop. JNK1-mediated induction of IHH paracrine signaling from hepatic stellate cells is essential for accelerated regeneration of parenchymal mass. The JNK1-IHH axis is a mechanism unique to ALPPS surgery and may point to therapeutic alternatives for patients with insufficient regenerative capacity. Associating liver partition and portal vein ligation for staged hepatectomy (so called ALPPS), is a new two-staged approach to hepatectomy, which induces an unprecedented acceleration of liver regeneration, enabling treatment of patients with liver tumors that would otherwise be considered unresectable. Herein, we demonstrate that JNK1-IHH signaling from stellate cells is a key mechanism underlying the regenerative acceleration that is induced by ALPPS. Copyright © 2018 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.

  1. Comparative transcriptome analysis by RNAseq of necrotic enteritis Clostridium perfringens during in vivo colonization and in vitro conditions.

    PubMed

    Parreira, Valeria R; Russell, Kay; Athanasiadou, Spiridoula; Prescott, John F

    2016-08-12

    Necrotic enteritis (NE) caused by netB-positive type A Clostridium perfringens is an important bacterial disease of poultry. Through its complex regulatory system, C. perfringens orchestrates the expression of a collection of toxins and extracellular enzymes that are crucial for the development of the disease; environmental conditions play an important role in their regulation. In this study, and for the first time, global transcriptomic analysis was performed on ligated intestinal loops in chickens colonized with a netB-positive C. perfringens strain, as well as the same strain propagated in vitro under various nutritional and environmental conditions. Analysis of the respective pathogen transcriptomes revealed up to 673 genes that were significantly expressed in vivo. Gene expression profiles in vivo were most similar to those of C. perfringens grown in nutritionally-deprived conditions. Taken together, our results suggest a bacterial transcriptome responses to the early stages of adaptation, and colonization of, the chicken intestine. Our work also reveals how netB-positive C. perfringens reacts to different environmental conditions including those in the chicken intestine.

  2. Comparative transcriptome analysis of unripe and mid-ripe fruit of Mangifera indica (var. “Dashehari”) unravels ripening associated genes

    PubMed Central

    Srivastava, Smriti; Singh, Rajesh K.; Pathak, Garima; Goel, Ridhi; Asif, Mehar Hasan; Sane, Aniruddha P.; Sane, Vidhu A.

    2016-01-01

    Ripening in mango is under a complex control of ethylene. In an effort to understand the complex spatio-temporal control of ripening we have made use of a popular N. Indian variety “Dashehari” This variety ripens from the stone inside towards the peel outside and forms jelly in the pulp in ripe fruits. Through a combination of 454 and Illumina sequencing, a transcriptomic analysis of gene expression from unripe and midripe stages have been performed in triplicates. Overall 74,312 unique transcripts with ≥1 FPKM were obtained. The transcripts related to 127 pathways were identified in “Dashehari” mango transcriptome by the KEGG analysis. These pathways ranged from detoxification, ethylene biosynthesis, carbon metabolism and aromatic amino acid degradation. The transcriptome study reveals differences not only in expression of softening associated genes but also those that govern ethylene biosynthesis and other nutritional characteristics. This study could help to develop ripening related markers for selective breeding to reduce the problems of excess jelly formation during softening in the “Dashehari” variety. PMID:27586495

  3. Preclinical evaluation of the PI3K/Akt/mTOR pathway in animal models of multiple sclerosis

    PubMed Central

    Mammana, Santa; Bramanti, Placido; Mazzon, Emanuela; Cavalli, Eugenio; Basile, Maria Sofia; Fagone, Paolo; Petralia, Maria Cristina; McCubrey, James Andrew; Nicoletti, Ferdinando; Mangano, Katia

    2018-01-01

    The PI3K/AKT/mTOR pathway is an intracellular signalling pathway that regulates cell activation. proliferation, metabolism and apoptosis. Increasing body of data suggests that alterations in the PI3K/AKT/mTOR pathway may result in an enhanced susceptibility to autoimmunity. Multiple Sclerosis (MS) is one of the most common chronic inflammatory diseases of the central nervous system leading to demyelination and neurodegeneration. In the current study, we have firstly evaluated in silico the involvement of the mTOR network on the generation and progression of MS and on oligodendrocyte function, making use of currently available whole-genome transcriptomic data. Then, the data generated in silico were subjected to an ex-vivo evaluation. To this aim, the involvement of mTOR was validated on a well-known animal model of MS and in vitro on Th17 cells. Our data indicate that there is a significant involvement of the mTOR network in the etiopathogenesis of MS and that Rapamycin treatment may represent a useful therapeutic approach in this clinical setting. On the other hand, our data showed that a significant involvement of the mTOR network could be observed only in the early phases of oligodendrocyte maturation, but not in the maturation process of adult oligodendrocytes and in the process of remyelination following demyelinating injury. Overall, our study suggests that targeting the PI3K/mTOR pathway, although it may not be a useful therapeutic approach to promote remyelination in MS patients, it can be exploited to exert immunomodulation, preventing/delaying relapses, and to treat MS patients in order to slow down the progression of disability. PMID:29492193

  4. Brownian model of transcriptome evolution and phylogenetic network visualization between tissues.

    PubMed

    Gu, Xun; Ruan, Hang; Su, Zhixi; Zou, Yangyun

    2017-09-01

    While phylogenetic analysis of transcriptomes of the same tissue is usually congruent with the species tree, the controversy emerges when multiple tissues are included, that is, whether species from the same tissue are clustered together, or different tissues from the same species are clustered together. Recent studies have suggested that phylogenetic network approach may shed some lights on our understanding of multi-tissue transcriptome evolution; yet the underlying evolutionary mechanism remains unclear. In this paper we develop a Brownian-based model of transcriptome evolution under the phylogenetic network that can statistically distinguish between the patterns of species-clustering and tissue-clustering. Our model can be used as a null hypothesis (neutral transcriptome evolution) for testing any correlation in tissue evolution, can be applied to cancer transcriptome evolution to study whether two tumors of an individual appeared independently or via metastasis, and can be useful to detect convergent evolution at the transcriptional level. Copyright © 2017. Published by Elsevier Inc.

  5. Transcriptomic Studies of the Effect of nod Gene-Inducing Molecules in Rhizobia: Different Weapons, One Purpose

    PubMed Central

    Jiménez-Guerrero, Irene; Acosta-Jurado, Sebastián; Navarro-Gómez, Pilar; López-Baena, Francisco Javier; Ollero, Francisco Javier

    2017-01-01

    Simultaneous quantification of transcripts of the whole bacterial genome allows the analysis of the global transcriptional response under changing conditions. RNA-seq and microarrays are the most used techniques to measure these transcriptomic changes, and both complement each other in transcriptome profiling. In this review, we exhaustively compiled the symbiosis-related transcriptomic reports (microarrays and RNA sequencing) carried out hitherto in rhizobia. This review is specially focused on transcriptomic changes that takes place when five rhizobial species, Bradyrhizobium japonicum (=diazoefficiens) USDA 110, Rhizobium leguminosarum biovar viciae 3841, Rhizobium tropici CIAT 899, Sinorhizobium (=Ensifer) meliloti 1021 and S. fredii HH103, recognize inducing flavonoids, plant-exuded phenolic compounds that activate the biosynthesis and export of Nod factors (NF) in all analysed rhizobia. Interestingly, our global transcriptomic comparison also indicates that each rhizobial species possesses its own arsenal of molecular weapons accompanying the set of NF in order to establish a successful interaction with host legumes. PMID:29267254

  6. The response of Isidorella newcombi to copper exposure: Using an integrated biological framework to interpret transcriptomic responses from RNA-seq analysis.

    PubMed

    Ubrihien, Rodney P; Ezaz, Tariq; Taylor, Anne M; Stevens, Mark M; Krikowa, Frank; Foster, Simon; Maher, William A

    2017-04-01

    This study describes the transcriptomic response of the Australian endemic freshwater gastropod Isidorella newcombi exposed to 80±1μg/L of copper for 3days. Analysis of copper tissue concentration, lysosomal membrane destabilisation and RNA-seq were conducted. Copper tissue concentrations confirmed that copper was bioaccumulated by the snails. Increased lysosomal membrane destabilisation in the copper-exposed snails indicated that the snails were stressed as a result of the exposure. Both copper tissue concentrations and lysosomal destabilisation were significantly greater in snails exposed to copper. In order to interpret the RNA-seq data from an ecotoxicological perspective an integrated biological response model was developed that grouped transcriptomic responses into those associated with copper transport and storage, survival mechanisms and cell death. A conceptual model of expected transcriptomic changes resulting from the copper exposure was developed as a basis to assess transcriptomic responses. Transcriptomic changes were evident at all the three levels of the integrated biological response model. Despite lacking statistical significance, increased expression of the gene encoding copper transporting ATPase provided an indication of increased internal transport of copper. Increased expression of genes associated with endocytosis are associated with increased transport of copper to the lysosome for storage in a detoxified form. Survival mechanisms included metabolic depression and processes associated with cellular repair and recycling. There was transcriptomic evidence of increased cell death by apoptosis in the copper-exposed organisms. Increased apoptosis is supported by the increase in lysosomal membrane destabilisation in the copper-exposed snails. Transcriptomic changes relating to apoptosis, phagocytosis, protein degradation and the lysosome were evident and these processes can be linked to the degradation of post-apoptotic debris. The study identified contaminant specific transcriptomic markers as well as markers of general stress. From an ecotoxicological perspective, the use of a framework to group transcriptomic responses into those associated with copper transport, survival and cell death assisted with the complex process of interpretation of RNA-seq data. The broad adoption of such a framework in ecotoxicology studies would assist in comparison between studies and the identification of reliable transcriptomic markers of contaminant exposure and response. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. RNA-seq Transcriptome Analysis of Panax japonicus, and Its Comparison with Other Panax Species to Identify Potential Genes Involved in the Saponins Biosynthesis

    PubMed Central

    Rai, Amit; Yamazaki, Mami; Takahashi, Hiroki; Nakamura, Michimi; Kojoma, Mareshige; Suzuki, Hideyuki; Saito, Kazuki

    2016-01-01

    The Panax genus has been a source of natural medicine, benefitting human health over the ages, among which the Panax japonicus represents an important species. Our understanding of several key pathways and enzymes involved in the biosynthesis of ginsenosides, a pharmacologically active class of metabolites and a major chemical constituents of the rhizome extracts from the Panax species, are limited. Limited genomic information, and lack of studies on comparative transcriptomics across the Panax species have restricted our understanding of the biosynthetic mechanisms of these and many other important classes of phytochemicals. Herein, we describe Illumina based RNA sequencing analysis to characterize the transcriptome and expression profiles of genes expressed in the five tissues of P. japonicus, and its comparison with other Panax species. RNA sequencing and de novo transcriptome assembly for P. japonicus resulted in a total of 135,235 unigenes with 78,794 (58.24%) unigenes being annotated using NCBI-nr database. Transcriptome profiling, and gene ontology enrichment analysis for five tissues of P. japonicus showed that although overall processes were evenly conserved across all tissues. However, each tissue was characterized by several unique unigenes with the leaves showing the most unique unigenes among the tissues studied. A comparative analysis of the P. japonicus transcriptome assembly with publically available transcripts from other Panax species, namely, P. ginseng, P. notoginseng, and P. quinquefolius also displayed high sequence similarity across all Panax species, with P. japonicus showing highest similarity with P. ginseng. Annotation of P. japonicus transcriptome resulted in the identification of putative genes encoding all enzymes from the triterpene backbone biosynthetic pathways, and identified 24 and 48 unigenes annotated as cytochrome P450 (CYP) and glycosyltransferases (GT), respectively. These CYPs and GTs annotated unigenes were conserved across all Panax species and co-expressed with other the transcripts involved in the triterpenoid backbone biosynthesis pathways. Unigenes identified in this study represent strong candidates for being involved in the triterpenoid saponins biosynthesis, and can serve as a basis for future validation studies. PMID:27148308

  8. Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes

    PubMed Central

    Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise

    2009-01-01

    Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high level of transcripts were analyzed in details. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of this transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. Conversely, only a few proteins encoded by genes having moderate or high level of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885

  9. The First Chameleon Transcriptome: Comparative Genomic Analysis of the OXPHOS System Reveals Loss of COX8 in Iguanian Lizards

    PubMed Central

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system. PMID:24009133

  10. The first Chameleon transcriptome: comparative genomic analysis of the OXPHOS system reveals loss of COX8 in Iguanian lizards.

    PubMed

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system.

  11. Aging-like Changes in the Transcriptome of Irradiated Microglia

    PubMed Central

    Li, Matthew D.; Burns, Terry C.; Kumar, Sunny; Morgan, Alexander A.; Sloan, Steven A.; Palmer, Theo D.

    2014-01-01

    Whole brain irradiation remains important in the management of brain tumors. Although necessary for improving survival outcomes, cranial irradiation also results in cognitive decline in long-term survivors. A chronic inflammatory state characterized by microglial activation has been implicated in radiation-induced brain injury. We here provide the first comprehensive transcriptional profile of irradiated microglia. Fluorescence-activated cell sorting (FACS) was used to isolate CD11b+ microglia from the hippocampi of C57BL/6 and Balb/c mice 1 month after 10Gy cranial irradiation. Affymetrix gene expression profiles were evaluated using linear modeling, rank product analyses. One month after irradiation, a conserved irradiation signature across strains was identified, comprising 448 and 85 differentially up- and down-regulated genes, respectively. Gene set enrichment analysis (GSEA) demonstrated enrichment for inflammation, including M1 macrophage-associated genes, but also an unexpected enrichment for extracellular matrix and blood coagulation-related gene sets, in contrast previously described microglial states. Weighted gene co-expression network analysis (WGCNA) confirmed these findings and further revealed alterations in mitochondrial function. The RNA-seq transcriptome of microglia 24h post-radiation proved similar to the 1-month transcriptome, but additionally featured alterations in apoptotic and lysosomal gene expression. Re-analysis of published aging mouse microglia transcriptome data demonstrated striking similarity to the 1 month irradiated microglia transcriptome, suggesting that shared mechanisms may underlie aging and chronic irradiation-induced cognitive decline. PMID:25690519

  12. Use of prior knowledge for the analysis of high-throughput transcriptomics and metabolomics data

    PubMed Central

    2014-01-01

    Background High-throughput omics technologies have enabled the measurement of many genes or metabolites simultaneously. The resulting high dimensional experimental data poses significant challenges to transcriptomics and metabolomics data analysis methods, which may lead to spurious instead of biologically relevant results. One strategy to improve the results is the incorporation of prior biological knowledge in the analysis. This strategy is used to reduce the solution space and/or to focus the analysis on biological meaningful regions. In this article, we review a selection of these methods used in transcriptomics and metabolomics. We combine the reviewed methods in three groups based on the underlying mathematical model: exploratory methods, supervised methods and estimation of the covariance matrix. We discuss which prior knowledge has been used, how it is incorporated and how it modifies the mathematical properties of the underlying methods. PMID:25033193

  13. Use of homologous and heterologous gene expression profiling tools to characterize transcription dynamics during apple fruit maturation and ripening.

    PubMed

    Costa, Fabrizio; Alba, Rob; Schouten, Henk; Soglio, Valeria; Gianfranceschi, Luca; Serra, Sara; Musacchi, Stefano; Sansavini, Silviero; Costa, Guglielmo; Fei, Zhangjun; Giovannoni, James

    2010-10-25

    Fruit development, maturation and ripening consists of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-methylcyclopropene. To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated.The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the later using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species.

  14. Transcriptomic analysis of flower development in tea (Camellia sinensis (L.)).

    PubMed

    Liu, Feng; Wang, Yu; Ding, Zhaotang; Zhao, Lei; Xiao, Jun; Wang, Linjun; Ding, Shibo

    2017-10-05

    Flowering is a critical and complicated process in plant development, involving interactions of numerous endogenous and environmental factors, but little is known about the complex network regulating flower development in tea plants. In this study, de novo transcriptome assembly and gene expression analysis using Illumina sequencing technology were performed. Transcriptomic analysis assembles gene-related information involved in reproductive growth of C. sinensis. Gene Ontology (GO) analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with metabolic and cellular processes, cell and cell parts, catalytic activity and binding. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis indicated that metabolic pathways, biosynthesis of secondary metabolites, and plant hormone signal transduction were enriched among the DEGs. Furthermore, 207 flowering-associated unigenes were identified from our database. Some transcription factors, such as WRKY, ERF, bHLH, MYB and MADS-box were shown to be up-regulated in floral transition, which might play the role of progression of flowering. Furthermore, 14 genes were selected for confirmation of expression levels using quantitative real-time PCR (qRT-PCR). The comprehensive transcriptomic analysis presents fundamental information on the genes and pathways which are involved in flower development in C. sinensis. Our data also provided a useful database for further research of tea and other species of plants. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Microarray-Based Gene Expression Analysis for Veterinary Pathologists: A Review.

    PubMed

    Raddatz, Barbara B; Spitzbarth, Ingo; Matheis, Katja A; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang; Ulrich, Reiner

    2017-09-01

    High-throughput, genome-wide transcriptome analysis is now commonly used in all fields of life science research and is on the cusp of medical and veterinary diagnostic application. Transcriptomic methods such as microarrays and next-generation sequencing generate enormous amounts of data. The pathogenetic expertise acquired from understanding of general pathology provides veterinary pathologists with a profound background, which is essential in translating transcriptomic data into meaningful biological knowledge, thereby leading to a better understanding of underlying disease mechanisms. The scientific literature concerning high-throughput data-mining techniques usually addresses mathematicians or computer scientists as the target audience. In contrast, the present review provides the reader with a clear and systematic basis from a veterinary pathologist's perspective. Therefore, the aims are (1) to introduce the reader to the necessary methodological background; (2) to introduce the sequential steps commonly performed in a microarray analysis including quality control, annotation, normalization, selection of differentially expressed genes, clustering, gene ontology and pathway analysis, analysis of manually selected genes, and biomarker discovery; and (3) to provide references to publically available and user-friendly software suites. In summary, the data analysis methods presented within this review will enable veterinary pathologists to analyze high-throughput transcriptome data obtained from their own experiments, supplemental data that accompany scientific publications, or public repositories in order to obtain a more in-depth insight into underlying disease mechanisms.

  16. Antimicrobial Peptides of Meat Origin - An In silico and In vitro Analysis.

    PubMed

    Keska, Paulina; Stadnik, Joanna

    2017-01-01

    The aim of this study was to evaluate the antimicrobial activity of meat protein-derived peptides against selected Gram-positive and Gram-negative bacteria. The in silico and in vitro approach was combined to determine the potency of antimicrobial peptides derived from pig (Sus scrofa) and cow (Bos taurus) proteins. The in silico studies consisted of an analysis of the amino acid composition of peptides obtained from the CAMPR database, their molecular weight and other physicochemical properties (isoelectric point, molar extinction coefficient, instability index, aliphatic index, hydropathy index and net charge). The degree of similarity was estimated between the antimicrobial peptide sequences derived from the slaughtered animals and the main meat proteins. Antimicrobial activity of peptides isolated from dry-cured meat products was analysed (in vitro) against two strains of pathogenic bacteria using the disc diffusion method. There was no evidence of growthinhibitory properties of peptides isolated from dry-cured meat products against Escherichia coli K12 ATCC 10798 and Staphylococcus aureus ATCC 25923. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  17. In silico quantitative structure-toxicity relationship study of aromatic nitro compounds.

    PubMed

    Pasha, Farhan Ahmad; Neaz, Mohammad Morshed; Cho, Seung Joo; Ansari, Mohiuddin; Mishra, Sunil Kumar; Tiwari, Sharvan

    2009-05-01

    Small molecules often have toxicities that are a function of molecular structural features. Minor variations in structural features can make large difference in such toxicity. Consequently, in silico techniques may be used to correlate such molecular toxicities with their structural features. Relative to nine different sets of aromatic nitro compounds having known observed toxicities against different targets, we developed ligand-based 2D quantitative structure-toxicity relationship models using 20 selected topological descriptors. The topological descriptors have several advantages such as conformational independency, facile and less time-consuming computation to yield good results. Multiple linear regression analysis was used to correlate variations of toxicity with molecular properties. The information index on molecular size, lopping centric index and Kier flexibility index were identified as fundamental descriptors for different kinds of toxicity, and further showed that molecular size, branching and molecular flexibility might be particularly important factors in quantitative structure-toxicity relationship analysis. This study revealed that topological descriptor-guided quantitative structure-toxicity relationship provided a very useful, cost and time-efficient, in silico tool for describing small-molecule toxicities.

  18. De novo assembling and primary analysis of genome and transcriptome of gray whale Eschrichtius robustus.

    PubMed

    Moskalev, Alexey А; Kudryavtseva, Anna V; Graphodatsky, Alexander S; Beklemisheva, Violetta R; Serdyukova, Natalya A; Krutovsky, Konstantin V; Sharov, Vadim V; Kulakovskiy, Ivan V; Lando, Andrey S; Kasianov, Artem S; Kuzmin, Dmitry A; Putintseva, Yuliya A; Feranchuk, Sergey I; Shaposhnikov, Mikhail V; Fraifeld, Vadim E; Toren, Dmitri; Snezhkina, Anastasia V; Sitnik, Vasily V

    2017-12-28

    Gray whale, Eschrichtius robustus (E. robustus), is a single member of the family Eschrichtiidae, which is considered to be the most primitive in the class Cetacea. Gray whale is often described as a "living fossil". It is adapted to extreme marine conditions and has a high life expectancy (77 years). The assembly of a gray whale genome and transcriptome will allow to carry out further studies of whale evolution, longevity, and resistance to extreme environment. In this work, we report the first de novo assembly and primary analysis of the E. robustus genome and transcriptome based on kidney and liver samples. The presented draft genome assembly is complete by 55% in terms of a total genome length, but only by 24% in terms of the BUSCO complete gene groups, although 10,895 genes were identified. Transcriptome annotation and comparison with other whale species revealed robust expression of DNA repair and hypoxia-response genes, which is expected for whales. This preliminary study of the gray whale genome and transcriptome provides new data to better understand the whale evolution and the mechanisms of their adaptation to the hypoxic conditions.

  19. Tentacle Transcriptome and Venom Proteome of the Pacific Sea Nettle, Chrysaora fuscescens (Cnidaria: Scyphozoa).

    PubMed

    Ponce, Dalia; Brinkman, Diane L; Potriquet, Jeremy; Mulvenna, Jason

    2016-04-05

    Jellyfish venoms are rich sources of toxins designed to capture prey or deter predators, but they can also elicit harmful effects in humans. In this study, an integrated transcriptomic and proteomic approach was used to identify putative toxins and their potential role in the venom of the scyphozoan jellyfish Chrysaora fuscescens. A de novo tentacle transcriptome, containing more than 23,000 contigs, was constructed and used in proteomic analysis of C. fuscescens venom to identify potential toxins. From a total of 163 proteins identified in the venom proteome, 27 were classified as putative toxins and grouped into six protein families: proteinases, venom allergens, C-type lectins, pore-forming toxins, glycoside hydrolases and enzyme inhibitors. Other putative toxins identified in the transcriptome, but not the proteome, included additional proteinases as well as lipases and deoxyribonucleases. Sequence analysis also revealed the presence of ShKT domains in two putative venom proteins from the proteome and an additional 15 from the transcriptome, suggesting potential ion channel blockade or modulatory activities. Comparison of these potential toxins to those from other cnidarians provided insight into their possible roles in C. fuscescens venom and an overview of the diversity of potential toxin families in cnidarian venoms.

  20. Transcriptome assembly, profiling and differential gene expression analysis of the halophyte Suaeda fruticosa provides insights into salt tolerance.

    PubMed

    Diray-Arce, Joann; Clement, Mark; Gul, Bilquees; Khan, M Ajmal; Nielsen, Brent L

    2015-05-06

    Improvement of crop production is needed to feed the growing world population as the amount and quality of agricultural land decreases and soil salinity increases. This has stimulated research on salt tolerance in plants. Most crops tolerate a limited amount of salt to survive and produce biomass, while halophytes (salt-tolerant plants) have the ability to grow with saline water utilizing specific biochemical mechanisms. However, little is known about the genes involved in salt tolerance. We have characterized the transcriptome of Suaeda fruticosa, a halophyte that has the ability to sequester salts in its leaves. Suaeda fruticosa is an annual shrub in the family Chenopodiaceae found in coastal and inland regions of Pakistan and Mediterranean shores. This plant is an obligate halophyte that grows optimally from 200-400 mM NaCl and can grow at up to 1000 mM NaCl. High throughput sequencing technology was performed to provide understanding of genes involved in the salt tolerance mechanism. De novo assembly of the transcriptome and analysis has allowed identification of differentially expressed and unique genes present in this non-conventional crop. Twelve sequencing libraries prepared from control (0 mM NaCl treated) and optimum (300 mM NaCl treated) plants were sequenced using Illumina Hiseq 2000 to investigate differential gene expression between shoots and roots of Suaeda fruticosa. The transcriptome was assembled de novo using Velvet and Oases k-45 and clustered using CDHIT-EST. There are 54,526 unigenes; among these 475 genes are downregulated and 44 are upregulated when samples from plants grown under optimal salt are compared with those grown without salt. BLAST analysis identified the differentially expressed genes, which were categorized in gene ontology terms and their pathways. This work has identified potential genes involved in salt tolerance in Suaeda fruticosa, and has provided an outline of tools to use for de novo transcriptome analysis. The assemblies that were used provide coverage of a considerable proportion of the transcriptome, which allows analysis of differential gene expression and identification of genes that may be involved in salt tolerance. The transcriptome may serve as a reference sequence for study of other succulent halophytes.

  1. Analysis, annotation, and profiling of the oat seed transcriptome

    USDA-ARS?s Scientific Manuscript database

    Novel high-throughput next generation sequencing (NGS) technologies are providing opportunities to explore genomes and transcriptomes in a cost-effective manner. To construct a gene expression atlas of developing oat (Avena sativa) seeds, two software packages specifically designed for RNA-seq (Trin...

  2. A comprehensive analysis of the human placenta transcriptome

    USDA-ARS?s Scientific Manuscript database

    As the conduit for nutrients and growth signals, the placenta is critical to establishing an environment sufficient for fetal growth and development. To better understand the mechanisms regulating placental development and gene expression, we characterized the transcriptome of term placenta from 20 ...

  3. Global analysis of WRKY transcription factor superfamily in Setaria identifies potential candidates involved in abiotic stress signaling

    PubMed Central

    Muthamilarasan, Mehanathan; Bonthala, Venkata S.; Khandelwal, Rohit; Jaishankar, Jananee; Shweta, Shweta; Nawaz, Kashif; Prasad, Manoj

    2015-01-01

    Transcription factors (TFs) are major players in stress signaling and constitute an integral part of signaling networks. Among the major TFs, WRKY proteins play pivotal roles in regulation of transcriptional reprogramming associated with stress responses. In view of this, genome- and transcriptome-wide identification of WRKY TF family was performed in the C4model plants, Setaria italica (SiWRKY) and S. viridis (SvWRKY), respectively. The study identified 105 SiWRKY and 44 SvWRKY proteins that were computationally analyzed for their physicochemical properties. Sequence alignment and phylogenetic analysis classified these proteins into three major groups, namely I, II, and III with majority of WRKY proteins belonging to group II (53 SiWRKY and 23 SvWRKY), followed by group III (39 SiWRKY and 11 SvWRKY) and group I (10 SiWRKY and 6 SvWRKY). Group II proteins were further classified into 5 subgroups (IIa to IIe) based on their phylogeny. Domain analysis showed the presence of WRKY motif and zinc finger-like structures in these proteins along with additional domains in a few proteins. All SiWRKY genes were physically mapped on the S. italica genome and their duplication analysis revealed that 10 and 8 gene pairs underwent tandem and segmental duplications, respectively. Comparative mapping of SiWRKY and SvWRKY genes in related C4 panicoid genomes demonstrated the orthologous relationships between these genomes. In silico expression analysis of SiWRKY and SvWRKY genes showed their differential expression patterns in different tissues and stress conditions. Expression profiling of candidate SiWRKY genes in response to stress (dehydration and salinity) and hormone treatments (abscisic acid, salicylic acid, and methyl jasmonate) suggested the putative involvement of SiWRKY066 and SiWRKY082 in stress and hormone signaling. These genes could be potential candidates for further characterization to delineate their functional roles in abiotic stress signaling. PMID:26635818

  4. Global analysis of WRKY transcription factor superfamily in Setaria identifies potential candidates involved in abiotic stress signaling.

    PubMed

    Muthamilarasan, Mehanathan; Bonthala, Venkata S; Khandelwal, Rohit; Jaishankar, Jananee; Shweta, Shweta; Nawaz, Kashif; Prasad, Manoj

    2015-01-01

    Transcription factors (TFs) are major players in stress signaling and constitute an integral part of signaling networks. Among the major TFs, WRKY proteins play pivotal roles in regulation of transcriptional reprogramming associated with stress responses. In view of this, genome- and transcriptome-wide identification of WRKY TF family was performed in the C4model plants, Setaria italica (SiWRKY) and S. viridis (SvWRKY), respectively. The study identified 105 SiWRKY and 44 SvWRKY proteins that were computationally analyzed for their physicochemical properties. Sequence alignment and phylogenetic analysis classified these proteins into three major groups, namely I, II, and III with majority of WRKY proteins belonging to group II (53 SiWRKY and 23 SvWRKY), followed by group III (39 SiWRKY and 11 SvWRKY) and group I (10 SiWRKY and 6 SvWRKY). Group II proteins were further classified into 5 subgroups (IIa to IIe) based on their phylogeny. Domain analysis showed the presence of WRKY motif and zinc finger-like structures in these proteins along with additional domains in a few proteins. All SiWRKY genes were physically mapped on the S. italica genome and their duplication analysis revealed that 10 and 8 gene pairs underwent tandem and segmental duplications, respectively. Comparative mapping of SiWRKY and SvWRKY genes in related C4 panicoid genomes demonstrated the orthologous relationships between these genomes. In silico expression analysis of SiWRKY and SvWRKY genes showed their differential expression patterns in different tissues and stress conditions. Expression profiling of candidate SiWRKY genes in response to stress (dehydration and salinity) and hormone treatments (abscisic acid, salicylic acid, and methyl jasmonate) suggested the putative involvement of SiWRKY066 and SiWRKY082 in stress and hormone signaling. These genes could be potential candidates for further characterization to delineate their functional roles in abiotic stress signaling.

  5. In silico predicted reproductive endocrine transcriptional regulatory networks during zebrafish (Danio rerio) development.

    PubMed

    Hala, D

    2017-03-21

    The interconnected topology of transcriptional regulatory networks (TRNs) readily lends to mathematical (or in silico) representation and analysis as a stoichiometric matrix. Such a matrix can be 'solved' using the mathematical method of extreme pathway (ExPa) analysis, which identifies uniquely activated genes subject to transcription factor (TF) availability. In this manuscript, in silico multi-tissue TRN models of brain, liver and gonad were used to study reproductive endocrine developmental programming in zebrafish (Danio rerio) from 0.25h post fertilization (hpf; zygote) to 90 days post fertilization (dpf; adult life stage). First, properties of TRN models were studied by sequentially activating all genes in multi-tissue models. This analysis showed the brain to exhibit lowest proportion of co-regulated genes (19%) relative to liver (23%) and gonad (32%). This was surprising given that the brain comprised 75% and 25% more TFs than liver and gonad respectively. Such 'hierarchy' of co-regulatory capability (brain

  6. Genome-wide transcriptome and expression profile analysis of Phalaenopsis during explant browning.

    PubMed

    Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei

    2015-01-01

    Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further functional studies to prevent explant browning.

  7. Genome-Wide Transcriptome and Expression Profile Analysis of Phalaenopsis during Explant Browning

    PubMed Central

    Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei

    2015-01-01

    Background Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. Methodology/Principal Findings We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Conclusions/Significance Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further functional studies to prevent explant browning. PMID:25874455

  8. De novo transcriptomic analysis and development of EST-SSR markers in the Siberian tiger (Panthera tigris altaica).

    PubMed

    Lu, Taofeng; Sun, Yujiao; Ma, Qin; Zhu, Minghao; Liu, Dan; Ma, Jianzhang; Ma, Yuehui; Chen, Hongyan; Guan, Weijun

    2016-12-01

    The Siberian tiger, Panthera tigris altaica, is an endangered species, and much more work is needed to protect this species, which is still vulnerable to extinction. Conservation efforts may be supported by the genetic assessment of wild populations, for which highly specific microsatellite markers are required. However, only a limited amount of genetic sequence data is available for this species. To identify the genes involved in the lung transcriptome and to develop additional simple sequence repeat (SSR) markers for the Siberian tiger, we used high-throughput RNA-Seq to characterize the Siberian tiger transcriptome in lung tissue (designated 'PTA-lung') and a pooled tissue sample (designated 'PTA'). Approximately 47.5 % (33,187/69,836) of the lung transcriptome was annotated in four public databases (Nr, Swiss-Prot, KEGG, and COG). The annotated genes formed a potential pool for gene identification in the tiger. An analysis of the genes differentially expressed in the PTA lung, and PTA samples revealed that the tiger may have suffered a series of diseases before death. In total, 1062 non-redundant SSRs were identified in the Siberian tiger transcriptome. Forty-three primer pairs were randomly selected for amplification reactions, and 26 of the 43 pairs were also used to evaluate the levels of genetic polymorphism. Fourteen primer pairs (32.56 %) amplified products that were polymorphic in size in P. tigris altaica. In conclusion, the transcriptome sequences will provide a valuable genomic resource for genetic research, and these new SSR markers comprise a reasonable number of loci for the genetic analysis of wild and captive populations of P. tigris altaica.

  9. Global transcriptome analysis of the C57BL/6J mouse testis by SAGE: evidence for nonrandom gene order.

    PubMed

    Divina, Petr; Vlcek, Cestmír; Strnad, Petr; Paces, Václav; Forejt, Jirí

    2005-03-05

    We generated the gene expression profile of the total testis from the adult C57BL/6J male mice using serial analysis of gene expression (SAGE). Two high-quality SAGE libraries containing a total of 76 854 tags were constructed. An extensive bioinformatic analysis and comparison of SAGE transcriptomes of the total testis, testicular somatic cells and other mouse tissues was performed and the theory of male-biased gene accumulation on the X chromosome was tested. We sorted out 829 genes predominantly expressed from the germinal part and 944 genes from the somatic part of the testis. The genes preferentially and specifically expressed in total testis and testicular somatic cells were identified by comparing the testis SAGE transcriptomes to the available transcriptomes of seven non-testis tissues. We uncovered chromosomal clusters of adjacent genes with preferential expression in total testis and testicular somatic cells by a genome-wide search and found that the clusters encompassed a significantly higher number of genes than expected by chance. We observed a significant 3.2-fold enrichment of the proportion of X-linked genes specific for testicular somatic cells, while the proportions of X-linked genes specific for total testis and for other tissues were comparable. In contrast to the tissue-specific genes, an under-representation of X-linked genes in the total testis transcriptome but not in the transcriptomes of testicular somatic cells and other tissues was detected. Our results provide new evidence in favor of the theory of male-biased genes accumulation on the X chromosome in testicular somatic cells and indicate the opposite action of the meiotic X-inactivation in testicular germ cells.

  10. Global transcriptome analysis of the C57BL/6J mouse testis by SAGE: evidence for nonrandom gene order

    PubMed Central

    Divina, Petr; Vlček, Čestmír; Strnad, Petr; Pačes, Václav; Forejt, Jiří

    2005-01-01

    Background We generated the gene expression profile of the total testis from the adult C57BL/6J male mice using serial analysis of gene expression (SAGE). Two high-quality SAGE libraries containing a total of 76 854 tags were constructed. An extensive bioinformatic analysis and comparison of SAGE transcriptomes of the total testis, testicular somatic cells and other mouse tissues was performed and the theory of male-biased gene accumulation on the X chromosome was tested. Results We sorted out 829 genes predominantly expressed from the germinal part and 944 genes from the somatic part of the testis. The genes preferentially and specifically expressed in total testis and testicular somatic cells were identified by comparing the testis SAGE transcriptomes to the available transcriptomes of seven non-testis tissues. We uncovered chromosomal clusters of adjacent genes with preferential expression in total testis and testicular somatic cells by a genome-wide search and found that the clusters encompassed a significantly higher number of genes than expected by chance. We observed a significant 3.2-fold enrichment of the proportion of X-linked genes specific for testicular somatic cells, while the proportions of X-linked genes specific for total testis and for other tissues were comparable. In contrast to the tissue-specific genes, an under-representation of X-linked genes in the total testis transcriptome but not in the transcriptomes of testicular somatic cells and other tissues was detected. Conclusion Our results provide new evidence in favor of the theory of male-biased genes accumulation on the X chromosome in testicular somatic cells and indicate the opposite action of the meiotic X-inactivation in testicular germ cells. PMID:15748293

  11. Selenium supplementation prevents metabolic and transcriptomic responses to cadmium in mouse lung.

    PubMed

    Hu, Xin; Chandler, Joshua D; Fernandes, Jolyn; Orr, Michael L; Hao, Li; Uppal, Karan; Neujahr, David C; Jones, Dean P; Go, Young-Mi

    2018-04-12

    The protective effect of selenium (Se) on cadmium (Cd) toxicity is well documented, but underlying mechanisms are unclear. Male mice fed standard diet were given Cd (CdCl 2 , 18 μmol/L) in drinking water with or without Se (Na 2 SeO 4, 20 μmol/L) for 16 weeks. Lungs were analyzed for Cd concentration, transcriptomics and metabolomics. Data were analyzed with biostatistics, bioinformatics, pathway enrichment analysis, and combined transcriptome-metabolome-wide association study. Mice treated with Cd had higher lung Cd content (1.7 ± 0.4 pmol/mg protein) than control mice (0.8 ± 0.3 pmol/mg protein) or mice treated with Cd and Se (0.4 ± 0.1 pmol/mg protein). Gene set enrichment analysis of transcriptomics data showed that Se prevented Cd effects on inflammatory and myogenesis genes and diminished Cd effects on several other pathways. Similarly, Se prevented Cd-disrupted metabolic pathways in amino acid metabolism and urea cycle. Integrated transcriptome and metabolome network analysis showed that Cd treatment had a network structure with fewer gene-metabolite clusters compared to control. Centrality measurements showed that Se counteracted changes in a group of Cd-responsive genes including Zdhhc11, (protein-cysteine S-palmitoyltransferase), Ighg1 (immunoglobulin heavy constant gamma-1) and associated changes in metabolite concentrations. Co-administration of Se with Cd prevented Cd increase in lung and prevented Cd-associated pathway and network responses of the transcriptome and metabolome. Se protection against Cd toxicity in lung involves complex systems responses. Environmental Cd stimulates proinflammatory and profibrotic signaling. The present results indicate that dietary or supplemental Se could be useful to mitigate Cd toxicity. Published by Elsevier B.V.

  12. Determining the optimal number of independent components for reproducible transcriptomic data analysis.

    PubMed

    Kairov, Ulykbek; Cantini, Laura; Greco, Alessandro; Molkenov, Askhat; Czerwinska, Urszula; Barillot, Emmanuel; Zinovyev, Andrei

    2017-09-11

    Independent Component Analysis (ICA) is a method that models gene expression data as an action of a set of statistically independent hidden factors. The output of ICA depends on a fundamental parameter: the number of components (factors) to compute. The optimal choice of this parameter, related to determining the effective data dimension, remains an open question in the application of blind source separation techniques to transcriptomic data. Here we address the question of optimizing the number of statistically independent components in the analysis of transcriptomic data for reproducibility of the components in multiple runs of ICA (within the same or within varying effective dimensions) and in multiple independent datasets. To this end, we introduce ranking of independent components based on their stability in multiple ICA computation runs and define a distinguished number of components (Most Stable Transcriptome Dimension, MSTD) corresponding to the point of the qualitative change of the stability profile. Based on a large body of data, we demonstrate that a sufficient number of dimensions is required for biological interpretability of the ICA decomposition and that the most stable components with ranks below MSTD have more chances to be reproduced in independent studies compared to the less stable ones. At the same time, we show that a transcriptomics dataset can be reduced to a relatively high number of dimensions without losing the interpretability of ICA, even though higher dimensions give rise to components driven by small gene sets. We suggest a protocol of ICA application to transcriptomics data with a possibility of prioritizing components with respect to their reproducibility that strengthens the biological interpretation. Computing too few components (much less than MSTD) is not optimal for interpretability of the results. The components ranked within MSTD range have more chances to be reproduced in independent studies.

  13. Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures

    PubMed Central

    Park, Paul J.; Fuchs, Robert; Wei, Lai; Jorgensen, Brian G.; Redelman, Doug; Ward, Sean M.; Sanders, Kenton M.

    2017-01-01

    Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies. PMID:28426719

  14. Character trees from transcriptome data: Origin and individuation of morphological characters and the so-called "species signal".

    PubMed

    Musser, Jacob M; Wagner, Günter P

    2015-11-01

    We elaborate a framework for investigating the evolutionary history of morphological characters. We argue that morphological character trees generated by phylogenetic analysis of transcriptomes provide a useful tool for identifying causal gene expression differences underlying the development and evolution of morphological characters. They also enable rigorous testing of different models of morphological character evolution and origination, including the hypothesis that characters originate via divergence of repeated ancestral characters. Finally, morphological character trees provide evidence that character transcriptomes undergo concerted evolution. We argue that concerted evolution of transcriptomes can explain the so-called "species signal" found in several recent comparative transcriptome studies. The species signal is the phenomenon that transcriptomes cluster by species rather than character type, even though the characters are older than the respective species. We suggest the species signal is a natural consequence of concerted gene expression evolution resulting from mutations that alter gene regulatory network interactions shared by the characters under comparison. Thus, character trees generated from transcriptomes allow us to investigate the variational independence, or individuation, of morphological characters at the level of genetic programs. © 2015 Wiley Periodicals, Inc.

  15. Analysis of insecticide resistance-related genes of the Carmine spider mite Tetranychus cinnabarinus based on a de novo assembled transcriptome.

    PubMed

    Xu, Zhifeng; Zhu, Wenyi; Liu, Yanchao; Liu, Xing; Chen, Qiushuang; Peng, Miao; Wang, Xiangzun; Shen, Guangmao; He, Lin

    2014-01-01

    The carmine spider mite (CSM), Tetranychus cinnabarinus, is an important pest mite in agriculture, because it can develop insecticide resistance easily. To gain valuable gene information and molecular basis for the future insecticide resistance study of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO). Aligning the transcript to Tetranychus urticae genome, the 19255 (75.45%) of the transcripts had significant (e-value <10-5) matches to T. urticae DNA genome, 19111 sequences matched to T. urticae proteome with an average protein length coverage of 42.55%. Core Eukaryotic Genes Mapping Approach (CEGMA) analysis identified 435 core eukaryotic genes (CEGs) in the CSM dataset corresponding to 95% coverage. Ten gene categories that relate to insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study on the mechanism of adapting environmental stress (including insecticide) in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.

  16. De Novo Transcriptome Assembly and Characterization of Lithospermum officinale to Discover Putative Genes Involved in Specialized Metabolites Biosynthesis.

    PubMed

    Rai, Amit; Nakaya, Taiki; Shimizu, Yohei; Rai, Megha; Nakamura, Michimi; Suzuki, Hideyuki; Saito, Kazuki; Yamazaki, Mami

    2018-05-29

    Lithospermum officinale is a valuable source of bioactive metabolites with medicinal and industrial values. However, little is known about genes involved in the biosynthesis of these metabolites, primarily due to the lack of genome or transcriptome resources. This study presents the first effort to establish and characterize de novo transcriptome assembly resource for L. officinale and expression analysis for three of its tissues, namely leaf, stem, and root. Using over 4Gbps of RNA-sequencing datasets, we obtained de novo transcriptome assembly of L. officinale , consisting of 77,047 unigenes with assembly N50 value as 1524 bps. Based on transcriptome annotation and functional classification, 52,766 unigenes were assigned with putative genes functions, gene ontology terms, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. KEGG pathway and gene ontology enrichment analysis using highly expressed unigenes across three tissues and targeted metabolome analysis showed active secondary metabolic processes enriched specifically in the root of L. officinale . Using co-expression analysis, we also identified 20 and 48 unigenes representing different enzymes of lithospermic/chlorogenic acid and shikonin biosynthesis pathways, respectively. We further identified 15 candidate unigenes annotated as cytochrome P450 with the highest expression in the root of L. officinale as novel genes with a role in key biochemical reactions toward shikonin biosynthesis. Thus, through this study, we not only generated a high-quality genomic resource for L. officinale but also propose candidate genes to be involved in shikonin biosynthesis pathways for further functional characterization. Georg Thieme Verlag KG Stuttgart · New York.

  17. Perigone Lobe Transcriptome Analysis Provides Insights into Rafflesia cantleyi Flower Development.

    PubMed

    Lee, Xin-Wei; Mat-Isa, Mohd-Noor; Mohd-Elias, Nur-Atiqah; Aizat-Juhari, Mohd Afiq; Goh, Hoe-Han; Dear, Paul H; Chow, Keng-See; Haji Adam, Jumaat; Mohamed, Rahmah; Firdaus-Raih, Mohd; Wan, Kiew-Lian

    2016-01-01

    Rafflesia is a biologically enigmatic species that is very rare in occurrence and possesses an extraordinary morphology. This parasitic plant produces a gigantic flower up to one metre in diameter with no leaves, stem or roots. However, little is known about the floral biology of this species especially at the molecular level. In an effort to address this issue, we have generated and characterised the transcriptome of the Rafflesia cantleyi flower, and performed a comparison with the transcriptome of its floral bud to predict genes that are expressed and regulated during flower development. Approximately 40 million sequencing reads were generated and assembled de novo into 18,053 transcripts with an average length of 641 bp. Of these, more than 79% of the transcripts had significant matches to annotated sequences in the public protein database. A total of 11,756 and 7,891 transcripts were assigned to Gene Ontology categories and clusters of orthologous groups respectively. In addition, 6,019 transcripts could be mapped to 129 pathways in Kyoto Encyclopaedia of Genes and Genomes Pathway database. Digital abundance analysis identified 52 transcripts with very high expression in the flower transcriptome of R. cantleyi. Subsequently, analysis of differential expression between developing flower and the floral bud revealed a set of 105 transcripts with potential role in flower development. Our work presents a deep transcriptome resource analysis for the developing flower of R. cantleyi. Genes potentially involved in the growth and development of the R. cantleyi flower were identified and provide insights into biological processes that occur during flower development.

  18. Gene expression profiling of immunomagnetically separated cells directly from stabilized whole blood for multicenter clinical trials

    PubMed Central

    2014-01-01

    Background Clinically useful biomarkers for patient stratification and monitoring of disease progression and drug response are in big demand in drug development and for addressing potential safety concerns. Many diseases influence the frequency and phenotype of cells found in the peripheral blood and the transcriptome of blood cells. Changes in cell type composition influence whole blood gene expression analysis results and thus the discovery of true transcript level changes remains a challenge. We propose a robust and reproducible procedure, which includes whole transcriptome gene expression profiling of major subsets of immune cell cells directly sorted from whole blood. Methods Target cells were enriched using magnetic microbeads and an autoMACS® Pro Separator (Miltenyi Biotec). Flow cytometric analysis for purity was performed before and after magnetic cell sorting. Total RNA was hybridized on HGU133 Plus 2.0 expression microarrays (Affymetrix, USA). CEL files signal intensity values were condensed using RMA and a custom CDF file (EntrezGene-based). Results Positive selection by use of MACS® Technology coupled to transcriptomics was assessed for eight different peripheral blood cell types, CD14+ monocytes, CD3+, CD4+, or CD8+ T cells, CD15+ granulocytes, CD19+ B cells, CD56+ NK cells, and CD45+ pan leukocytes. RNA quality from enriched cells was above a RIN of eight. GeneChip analysis confirmed cell type specific transcriptome profiles. Storing whole blood collected in an EDTA Vacutainer® tube at 4°C followed by MACS does not activate sorted cells. Gene expression analysis supports cell enrichment measurements by MACS. Conclusions The proposed workflow generates reproducible cell-type specific transcriptome data which can be translated to clinical settings and used to identify clinically relevant gene expression biomarkers from whole blood samples. This procedure enables the integration of transcriptomics of relevant immune cell subsets sorted directly from whole blood in clinical trial protocols. PMID:25984272

  19. Transcriptome profiling reveals regulatory mechanisms underlying Corolla Senescence in Petunia

    USDA-ARS?s Scientific Manuscript database

    Genetic regulatory mechanisms that govern petal natural senescence in petunia is complicated and unclear. To identify key genes and pathways that regulate the process, we initiated a transcriptome analysis in petunia petals at four developmental time points, including petal opening without anthesis ...

  20. Placental transcriptome co-expression analysis reveals conserved regulatory program across gestation

    USDA-ARS?s Scientific Manuscript database

    Mammalian development in utero is absolutely dependent on proper placental development, which is ultimately regulated by the placental genome. The regulation of the placental genome can be directly studied by exploring the underlying organization of the placental transcriptome through a systematic a...

  1. De novo Assembly of the Burying Beetle Nicrophorus orbicollis (Coleoptera: Silphidae) Transcriptome Across Developmental Stages with Identification of Key Immune Transcripts

    PubMed Central

    Won, Harim I.; Schulze, Thomas T.; Clement, Emalie J.; Watson, Gabrielle F.; Watson, Sean M.; Warner, Rosalie C.; Ramler, Elizabeth A. M.; Witte, Elias J.; Schoenbeck, Mark A.; Rauter, Claudia M.; Davis, Paul H.

    2018-01-01

    Burying beetles (Nicrophorus spp.) are among the relatively few insects that provide parental care while not belonging to the eusocial insects such as ants or bees. This behavior incurs energy costs as evidenced by immune deficits and shorter life-spans in reproducing beetles. In the absence of an assembled transcriptome, relatively little is known concerning the molecular biology of these beetles. This work details the assembly and analysis of the Nicrophorus orbicollis transcriptome at multiple developmental stages. RNA-Seq reads were obtained by next-generation sequencing and the transcriptome was assembled using the Trinity assembler. Validation of the assembly was performed by functional characterization using Gene Ontology (GO), Eukaryotic Orthologous Groups (KOG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. Differential expression analysis highlights developmental stage-specific expression patterns, and immunity-related transcripts are discussed. The data presented provides a valuable molecular resource to aid further investigation into immunocompetence throughout this organism's sexual development. PMID:29707046

  2. Transcriptional profiling of CD31(+) cells isolated from murine embryonic stem cells.

    PubMed

    Mariappan, Devi; Winkler, Johannes; Chen, Shuhua; Schulz, Herbert; Hescheler, Jürgen; Sachinidis, Agapios

    2009-02-01

    Identification of genes involved in endothelial differentiation is of great interest for the understanding of the cellular and molecular mechanisms involved in the development of new blood vessels. Mouse embryonic stem (mES) cells serve as a potential source of endothelial cells for transcriptomic analysis. We isolated endothelial cells from 8-days old embryoid bodies by immuno-magnetic separation using platelet endothelial cell adhesion molecule-1 (also known as CD31) expressed on both early and mature endothelial cells. CD31(+) cells exhibit endothelial-like behavior by being able to incorporate DiI-labeled acetylated low-density lipoprotein as well as form tubular structures on matrigel. Quantitative and semi-quantitative PCR analysis further demonstrated the increased expression of endothelial transcripts. To ascertain the specific transcriptomic identity of the CD31(+) cells, large-scale microarray analysis was carried out. Comparative bioinformatic analysis reveals an enrichment of the gene ontology categories angiogenesis, blood vessel morphogenesis, vasculogenesis and blood coagulation in the CD31(+) cell population. Based on the transcriptomic signatures of the CD31(+) cells, we conclude that this ES cell-derived population contains endothelial-like cells expressing a mesodermal marker BMP2 and possess an angiogenic potential. The transcriptomic characterization of CD31(+) cells enables an in vitro functional genomic model to identify genes required for angiogenesis.

  3. De novo transcriptome assembly and RNA-Seq expression analysis in blood from beluga whales of Bristol Bay, AK.

    PubMed

    Morey, Jeanine S; Burek Huntington, Kathy A; Campbell, Michelle; Clauss, Tonya M; Goertz, Caroline E; Hobbs, Roderick C; Lunardi, Denise; Moors, Amanda J; Neely, Marion G; Schwacke, Lori H; Van Dolah, Frances M

    2017-10-01

    Assessing the health of marine mammal sentinel species is crucial to understanding the impacts of environmental perturbations on marine ecosystems and human health. In Arctic regions, beluga whales, Delphinapterus leucas, are upper level predators that may serve as a sentinel species, potentially forecasting impacts on human health. While gene expression profiling from blood transcriptomes has widely been used to assess health status and environmental exposures in human and veterinary medicine, its use in wildlife has been limited due to the lack of available genomes and baseline data. To this end we constructed the first beluga whale blood transcriptome de novo from samples collected during annual health assessments of the healthy Bristol Bay, AK stock during 2012-2014 to establish baseline information on the content and variation of the beluga whale blood transcriptome. The Trinity transcriptome assembly from beluga was comprised of 91,325 transcripts that represented a wide array of cellular functions and processes and was extremely similar in content to the blood transcriptome of another cetacean, the bottlenose dolphin. Expression of hemoglobin transcripts was much lower in beluga (25.6% of TPM, transcripts per million) than has been observed in many other mammals. A T12A amino acid substitution in the HBB sequence of beluga whales, but not bottlenose dolphins, was identified and may play a role in low temperature adaptation. The beluga blood transcriptome was extremely stable between sex and year, with no apparent clustering of samples by principle components analysis and <4% of genes differentially expressed (EBseq, FDR<0.05). While the impacts of season, sexual maturity, disease, and geography on the beluga blood transcriptome must be established, the presence of transcripts involved in stress, detoxification, and immune functions indicate that blood gene expression analyses may provide information on health status and exposure. This study provides a wealth of transcriptomic data on beluga whales and provides a sizeable pool of preliminary data for comparison with other studies in beluga whale. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. De novo transcriptome of Ischnura elegans provides insights into sensory biology, colour and vision genes.

    PubMed

    Chauhan, Pallavi; Hansson, Bengt; Kraaijeveld, Ken; de Knijff, Peter; Svensson, Erik I; Wellenreuther, Maren

    2014-09-22

    There is growing interest in odonates (damselflies and dragonflies) as model organisms in ecology and evolutionary biology but the development of genomic resources has been slow. So far only one draft genome (Ladona fulva) and one transcriptome assembly (Enallagma hageni) have been published. Odonates have some of the most advanced visual systems among insects and several species are colour polymorphic, and genomic and transcriptomic data would allow studying the genomic architecture of these interesting traits and make detailed comparative studies between related species possible. Here, we present a comprehensive de novo transcriptome assembly for the blue-tailed damselfly Ischnura elegans (Odonata: Coenagrionidae) built from short-read RNA-seq data. The transcriptome analysis in this paper provides a first step towards identifying genes and pathways underlying the visual and colour systems in this insect group. Illumina RNA sequencing performed on tissues from the head, thorax and abdomen generated 428,744,100 paired-ends reads amounting to 110 Gb of sequence data, which was assembled de novo with Trinity. A transcriptome was produced after filtering and quality checking yielding a final set of 60,232 high quality transcripts for analysis. CEGMA software identified 247 out of 248 ultra-conserved core proteins as 'complete' in the transcriptome assembly, yielding a completeness of 99.6%. BLASTX and InterProScan annotated 55% of the assembled transcripts and showed that the three tissue types differed both qualitatively and quantitatively in I. elegans. Differential expression identified 8,625 transcripts to be differentially expressed in head, thorax and abdomen. Targeted analyses of vision and colour functional pathways identified the presence of four different opsin types and three pigmentation pathways. We also identified transcripts involved in temperature sensitivity, thermoregulation and olfaction. All these traits and their associated transcripts are of considerable ecological and evolutionary interest for this and other insect orders. Our work presents a comprehensive transcriptome resource for the ancient insect order Odonata and provides insight into their biology and physiology. The transcriptomic resource can provide a foundation for future investigations into this diverse group, including the evolution of colour, vision, olfaction and thermal adaptation.

  5. Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.

    PubMed

    Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong

    2015-06-09

    Differentiation of metazoan cells requires execution of different gene expression programs but recent single-cell transcriptome profiling has revealed considerable variation within cells of seeming identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than solely molecular noise.

  6. Transcriptional pathway and de novo network-based approaches to effects-based monitoring in the Great Lakes

    EPA Science Inventory

    Transcriptomics provides unique solutions for understanding the impact of complex mixtures and their components on aquatic systems. Here we describe the application of transcriptomics analysis of in situ fathead minnow exposures for assessing biological impacts of wastewater trea...

  7. Transcriptome and gene expression analysis in cold-acclimated guayule (Parthenium argentatum)rubber-producing tissue

    USDA-ARS?s Scientific Manuscript database

    Natural rubber biosynthesis in guayule (Parthenium argentatum) is associated with moderately cold night temperatures. To begin to dissect the molecular events triggered by cold temperatures that govern rubber synthesis induction in guayule, the transcriptome of bark tissue, where rubber is produced...

  8. Genetic Profiles of Korean Patients With Glucose-6-Phosphate Dehydrogenase Deficiency

    PubMed Central

    Lee, Jaewoong; Choi, Hayoung; Kim, Jiyeon; Kwon, Ahlm; Jang, Woori; Chae, Hyojin; Kim, Myungshin; Kim, Yonggoo; Lee, Jae Wook; Chung, Nack-Gyun

    2017-01-01

    Background We describe the genetic profiles of Korean patients with glucose-6-phosphate dehydrogenase (G6PD) deficiencies and the effects of G6PD mutations on protein stability and enzyme activity on the basis of in silico analysis. Methods In parallel with a genetic analysis, the pathogenicity of G6PD mutations detected in Korean patients was predicted in silico. The simulated effects of G6PD mutations were compared to the WHO classes based on G6PD enzyme activity. Four previously reported mutations and three newly diagnosed patients with missense mutations were estimated. Results One novel mutation (p.Cys385Gly, labeled G6PD Kangnam) and two known mutations [p.Ile220Met (G6PD São Paulo) and p.Glu416Lys (G6PD Tokyo)] were identified in this study. G6PD mutations identified in Koreans were also found in Brazil (G6PD São Paulo), Poland (G6PD Seoul), United States of America (G6PD Riley), Mexico (G6PD Guadalajara), and Japan (G6PD Tokyo). Several mutations occurred at the same nucleotide, but resulted in different amino acid residue changes in different ethnic populations (p.Ile380 variant, G6PD Calvo Mackenna; p.Cys385 variants, Tomah, Madrid, Lynwood; p.Arg387 variant, Beverly Hills; p.Pro396 variant, Bari; and p.Pro396Ala in India). On the basis of the in silico analysis, Class I or II mutations were predicted to be highly deleterious, and the effects of one Class IV mutation were equivocal. Conclusions The genetic profiles of Korean individuals with G6PD mutations indicated that the same mutations may have arisen by independent mutational events, and were not derived from shared ancestral mutations. The in silico analysis provided insight into the role of G6PD mutations in enzyme function and stability. PMID:28028996

  9. Genetic Profiles of Korean Patients With Glucose-6-Phosphate Dehydrogenase Deficiency.

    PubMed

    Lee, Jaewoong; Park, Joonhong; Choi, Hayoung; Kim, Jiyeon; Kwon, Ahlm; Jang, Woori; Chae, Hyojin; Kim, Myungshin; Kim, Yonggoo; Lee, Jae Wook; Chung, Nack Gyun; Cho, Bin

    2017-03-01

    We describe the genetic profiles of Korean patients with glucose-6-phosphate dehydrogenase (G6PD) deficiencies and the effects of G6PD mutations on protein stability and enzyme activity on the basis of in silico analysis. In parallel with a genetic analysis, the pathogenicity of G6PD mutations detected in Korean patients was predicted in silico. The simulated effects of G6PD mutations were compared to the WHO classes based on G6PD enzyme activity. Four previously reported mutations and three newly diagnosed patients with missense mutations were estimated. One novel mutation (p.Cys385Gly, labeled G6PD Kangnam) and two known mutations [p.Ile220Met (G6PD São Paulo) and p.Glu416Lys (G6PD Tokyo)] were identified in this study. G6PD mutations identified in Koreans were also found in Brazil (G6PD São Paulo), Poland (G6PD Seoul), United States of America (G6PD Riley), Mexico (G6PD Guadalajara), and Japan (G6PD Tokyo). Several mutations occurred at the same nucleotide, but resulted in different amino acid residue changes in different ethnic populations (p.Ile380 variant, G6PD Calvo Mackenna; p.Cys385 variants, Tomah, Madrid, Lynwood; p.Arg387 variant, Beverly Hills; p.Pro396 variant, Bari; and p.Pro396Ala in India). On the basis of the in silico analysis, Class I or II mutations were predicted to be highly deleterious, and the effects of one Class IV mutation were equivocal. The genetic profiles of Korean individuals with G6PD mutations indicated that the same mutations may have arisen by independent mutational events, and were not derived from shared ancestral mutations. The in silico analysis provided insight into the role of G6PD mutations in enzyme function and stability.

  10. Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing

    PubMed Central

    2011-01-01

    Background A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis) of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5.165,220 nucleotides (8.27%) were masked by RepeatMasker, the vast majority of which corresponding to class I (retroelements) and class II (DNA transposons) mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlaying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large divergence between A. mexicanus and A. picadoi, and a closer kinship between A. mexicanus and C. godmani. Conclusions Our comparative next-generation sequencing (NGS) analysis reveals taxon-specific trends governing the formulation of the venom arsenal. Knowledge of the venom proteome provides hints on the translation efficiency of toxin-coding transcripts, contributing thereby to a more accurate interpretation of the transcriptome. The application of NGS to the analysis of snake venom transcriptomes, may represent the tool for opening the door to systems venomics. PMID:21605378

  11. A large-scale full-length cDNA analysis to explore the budding yeast transcriptome

    PubMed Central

    Miura, Fumihito; Kawaguchi, Noriko; Sese, Jun; Toyoda, Atsushi; Hattori, Masahira; Morishita, Shinichi; Ito, Takashi

    2006-01-01

    We performed a large-scale cDNA analysis to explore the transcriptome of the budding yeast Saccharomyces cerevisiae. We sequenced two cDNA libraries, one from the cells exponentially growing in a minimal medium and the other from meiotic cells. Both libraries were generated by using a vector-capping method that allows the accurate mapping of transcription start sites (TSSs). Consequently, we identified 11,575 TSSs associated with 3,638 annotated genomic features, including 3,599 ORFs, to suggest that most yeast genes have two or more TSSs. In addition, we identified 45 previously undescribed introns, including those affecting current ORF annotations and those spliced alternatively. Furthermore, the analysis revealed 667 transcription units in the intergenic regions and transcripts derived from antisense strands of 367 known features. We also found that 348 ORFs carry TSSs in their 3′-halves to generate sense transcripts starting from inside the ORFs. These results indicate that the budding yeast transcriptome is considerably more complex than previously thought, and it shares many recently revealed characteristics with the transcriptomes of mammals and other higher eukaryotes. Thus, the genome-wide active transcription that generates novel classes of transcripts appears to be an intrinsic feature of the eukaryotic cells. The budding yeast will serve as a versatile model for the studies on these aspects of transcriptome, and the full-length cDNA clones can function as an invaluable resource in such studies. PMID:17101987

  12. In Silico Constraint-Based Strain Optimization Methods: the Quest for Optimal Cell Factories

    PubMed Central

    Maia, Paulo; Rocha, Miguel

    2015-01-01

    SUMMARY Shifting from chemical to biotechnological processes is one of the cornerstones of 21st century industry. The production of a great range of chemicals via biotechnological means is a key challenge on the way toward a bio-based economy. However, this shift is occurring at a pace slower than initially expected. The development of efficient cell factories that allow for competitive production yields is of paramount importance for this leap to happen. Constraint-based models of metabolism, together with in silico strain design algorithms, promise to reveal insights into the best genetic design strategies, a step further toward achieving that goal. In this work, a thorough analysis of the main in silico constraint-based strain design strategies and algorithms is presented, their application in real-world case studies is analyzed, and a path for the future is discussed. PMID:26609052

  13. The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

    PubMed

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-02-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.

  14. The Perennial Ryegrass GenomeZipper: Targeted Use of Genome Resources for Comparative Grass Genomics1[C][W

    PubMed Central

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F.X.; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-01-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species. PMID:23184232

  15. Using "Omics" and Integrated Multi-Omics Approaches to Guide Probiotic Selection to Mitigate Chytridiomycosis and Other Emerging Infectious Diseases.

    PubMed

    Rebollar, Eria A; Antwis, Rachael E; Becker, Matthew H; Belden, Lisa K; Bletz, Molly C; Brucker, Robert M; Harrison, Xavier A; Hughey, Myra C; Kueneman, Jordan G; Loudon, Andrew H; McKenzie, Valerie; Medina, Daniel; Minbiole, Kevin P C; Rollins-Smith, Louise A; Walke, Jenifer B; Weiss, Sophie; Woodhams, Douglas C; Harris, Reid N

    2016-01-01

    Emerging infectious diseases in wildlife are responsible for massive population declines. In amphibians, chytridiomycosis caused by Batrachochytrium dendrobatidis, Bd, has severely affected many amphibian populations and species around the world. One promising management strategy is probiotic bioaugmentation of antifungal bacteria on amphibian skin. In vivo experimental trials using bioaugmentation strategies have had mixed results, and therefore a more informed strategy is needed to select successful probiotic candidates. Metagenomic, transcriptomic, and metabolomic methods, colloquially called "omics," are approaches that can better inform probiotic selection and optimize selection protocols. The integration of multiple omic data using bioinformatic and statistical tools and in silico models that link bacterial community structure with bacterial defensive function can allow the identification of species involved in pathogen inhibition. We recommend using 16S rRNA gene amplicon sequencing and methods such as indicator species analysis, the Kolmogorov-Smirnov Measure, and co-occurrence networks to identify bacteria that are associated with pathogen resistance in field surveys and experimental trials. In addition to 16S amplicon sequencing, we recommend approaches that give insight into symbiont function such as shotgun metagenomics, metatranscriptomics, or metabolomics to maximize the probability of finding effective probiotic candidates, which can then be isolated in culture and tested in persistence and clinical trials. An effective mitigation strategy to ameliorate chytridiomycosis and other emerging infectious diseases is necessary; the advancement of omic methods and the integration of multiple omic data provide a promising avenue toward conservation of imperiled species.

  16. New inhibitor targeting human transcription factor HSF1: effects on the heat shock response and tumor cell survival

    PubMed Central

    Vilaboa, Nuria; Boré, Alba; Martin-Saavedra, Francisco; Bayford, Melanie; Winfield, Natalie; Firth-Clark, Stuart; Kirton, Stewart B.

    2017-01-01

    Abstract Comparative modeling of the DNA-binding domain of human HSF1 facilitated the prediction of possible binding pockets for small molecules and definition of corresponding pharmacophores. In silico screening of a large library of lead-like compounds identified a set of compounds that satisfied the pharmacophoric criteria, a selection of which compounds was purchased to populate a biased sublibrary. A discriminating cell-based screening assay identified compound 001, which was subjected to systematic analysis of structure–activity relationships, resulting in the development of compound 115 (IHSF115). IHSF115 bound to an isolated HSF1 DNA-binding domain fragment. The compound did not affect heat-induced oligomerization, nuclear localization and specific DNA binding but inhibited the transcriptional activity of human HSF1, interfering with the assembly of ATF1-containing transcription complexes. IHSF115 was employed to probe the human heat shock response at the transcriptome level. In contrast to earlier studies of differential regulation in HSF1-naïve and -depleted cells, our results suggest that a large majority of heat-induced genes is positively regulated by HSF1. That IHSF115 effectively countermanded repression in a significant fraction of heat-repressed genes suggests that repression of these genes is mediated by transcriptionally active HSF1. IHSF115 is cytotoxic for a variety of human cancer cell lines, multiple myeloma lines consistently exhibiting high sensitivity. PMID:28369544

  17. Salicylic acid suppresses jasmonic acid signaling downstream of SCFCOI1-JAZ by targeting GCC promoter motifs via transcription factor ORA59.

    PubMed

    Van der Does, Dieuwertje; Leon-Reyes, Antonio; Koornneef, Annemart; Van Verk, Marcel C; Rodenburg, Nicole; Pauwels, Laurens; Goossens, Alain; Körbes, Ana P; Memelink, Johan; Ritsema, Tita; Van Wees, Saskia C M; Pieterse, Corné M J

    2013-02-01

    Antagonism between the defense hormones salicylic acid (SA) and jasmonic acid (JA) plays a central role in the modulation of the plant immune signaling network, but the molecular mechanisms underlying this phenomenon are largely unknown. Here, we demonstrate that suppression of the JA pathway by SA functions downstream of the E3 ubiquitin-ligase Skip-Cullin-F-box complex SCF(COI1), which targets JASMONATE ZIM-domain transcriptional repressor proteins (JAZs) for proteasome-mediated degradation. In addition, neither the stability nor the JA-induced degradation of JAZs was affected by SA. In silico promoter analysis of the SA/JA crosstalk transcriptome revealed that the 1-kb promoter regions of JA-responsive genes that are suppressed by SA are significantly enriched in the JA-responsive GCC-box motifs. Using GCC:GUS lines carrying four copies of the GCC-box fused to the β-glucuronidase reporter gene, we showed that the GCC-box motif is sufficient for SA-mediated suppression of JA-responsive gene expression. Using plants overexpressing the GCC-box binding APETALA2/ETHYLENE RESPONSE FACTOR (AP2/ERF) transcription factors ERF1 or ORA59, we found that SA strongly reduces the accumulation of ORA59 but not that of ERF1. Collectively, these data indicate that the SA pathway inhibits JA signaling downstream of the SCF(COI1)-JAZ complex by targeting GCC-box motifs in JA-responsive promoters via a negative effect on the transcriptional activator ORA59.

  18. Identification of Two Novel Amalgaviruses in the Common Eelgrass (Zostera marina) and in Silico Analysis of the Amalgavirus +1 Programmed Ribosomal Frameshifting Sites.

    PubMed

    Park, Dongbin; Goh, Chul Jun; Kim, Hyein; Hahn, Yoonsoo

    2018-04-01

    The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass ( Zostera marina ) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae . They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes a RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses.

  19. Identification of Two Novel Amalgaviruses in the Common Eelgrass (Zostera marina) and in Silico Analysis of the Amalgavirus +1 Programmed Ribosomal Frameshifting Sites

    PubMed Central

    Park, Dongbin; Goh, Chul Jun; Kim, Hyein; Hahn, Yoonsoo

    2018-01-01

    The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass (Zostera marina) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae. They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes a RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses. PMID:29628822

  20. Tentacle Transcriptome and Venom Proteome of the Pacific Sea Nettle, Chrysaora fuscescens (Cnidaria: Scyphozoa)

    PubMed Central

    Ponce, Dalia; Brinkman, Diane L.; Potriquet, Jeremy; Mulvenna, Jason

    2016-01-01

    Jellyfish venoms are rich sources of toxins designed to capture prey or deter predators, but they can also elicit harmful effects in humans. In this study, an integrated transcriptomic and proteomic approach was used to identify putative toxins and their potential role in the venom of the scyphozoan jellyfish Chrysaora fuscescens. A de novo tentacle transcriptome, containing more than 23,000 contigs, was constructed and used in proteomic analysis of C. fuscescens venom to identify potential toxins. From a total of 163 proteins identified in the venom proteome, 27 were classified as putative toxins and grouped into six protein families: proteinases, venom allergens, C-type lectins, pore-forming toxins, glycoside hydrolases and enzyme inhibitors. Other putative toxins identified in the transcriptome, but not the proteome, included additional proteinases as well as lipases and deoxyribonucleases. Sequence analysis also revealed the presence of ShKT domains in two putative venom proteins from the proteome and an additional 15 from the transcriptome, suggesting potential ion channel blockade or modulatory activities. Comparison of these potential toxins to those from other cnidarians provided insight into their possible roles in C. fuscescens venom and an overview of the diversity of potential toxin families in cnidarian venoms. PMID:27058558

  1. Discovery of Nuclear-Encoded Genes for the Neurotoxin Saxitoxin in Dinoflagellates

    PubMed Central

    Stüken, Anke; Orr, Russell J. S.; Kellmann, Ralf; Murray, Shauna A.; Neilan, Brett A.; Jakobsen, Kjetill S.

    2011-01-01

    Saxitoxin is a potent neurotoxin that occurs in aquatic environments worldwide. Ingestion of vector species can lead to paralytic shellfish poisoning, a severe human illness that may lead to paralysis and death. In freshwaters, the toxin is produced by prokaryotic cyanobacteria; in marine waters, it is associated with eukaryotic dinoflagellates. However, several studies suggest that saxitoxin is not produced by dinoflagellates themselves, but by co-cultured bacteria. Here, we show that genes required for saxitoxin synthesis are encoded in the nuclear genomes of dinoflagellates. We sequenced >1.2×106 mRNA transcripts from the two saxitoxin-producing dinoflagellate strains Alexandrium fundyense CCMP1719 and A. minutum CCMP113 using high-throughput sequencing technology. In addition, we used in silico transcriptome analyses, RACE, qPCR and conventional PCR coupled with Sanger sequencing. These approaches successfully identified genes required for saxitoxin-synthesis in the two transcriptomes. We focused on sxtA, the unique starting gene of saxitoxin synthesis, and show that the dinoflagellate transcripts of sxtA have the same domain structure as the cyanobacterial sxtA genes. But, in contrast to the bacterial homologs, the dinoflagellate transcripts are monocistronic, have a higher GC content, occur in multiple copies, contain typical dinoflagellate spliced-leader sequences and eukaryotic polyA-tails. Further, we investigated 28 saxitoxin-producing and non-producing dinoflagellate strains from six different genera for the presence of genomic sxtA homologs. Our results show very good agreement between the presence of sxtA and saxitoxin-synthesis, except in three strains of A. tamarense, for which we amplified sxtA, but did not detect the toxin. Our work opens for possibilities to develop molecular tools to detect saxitoxin-producing dinoflagellates in the environment. PMID:21625593

  2. Discovery of nuclear-encoded genes for the neurotoxin saxitoxin in dinoflagellates.

    PubMed

    Stüken, Anke; Orr, Russell J S; Kellmann, Ralf; Murray, Shauna A; Neilan, Brett A; Jakobsen, Kjetill S

    2011-01-01

    Saxitoxin is a potent neurotoxin that occurs in aquatic environments worldwide. Ingestion of vector species can lead to paralytic shellfish poisoning, a severe human illness that may lead to paralysis and death. In freshwaters, the toxin is produced by prokaryotic cyanobacteria; in marine waters, it is associated with eukaryotic dinoflagellates. However, several studies suggest that saxitoxin is not produced by dinoflagellates themselves, but by co-cultured bacteria. Here, we show that genes required for saxitoxin synthesis are encoded in the nuclear genomes of dinoflagellates. We sequenced >1.2×10(6) mRNA transcripts from the two saxitoxin-producing dinoflagellate strains Alexandrium fundyense CCMP1719 and A. minutum CCMP113 using high-throughput sequencing technology. In addition, we used in silico transcriptome analyses, RACE, qPCR and conventional PCR coupled with Sanger sequencing. These approaches successfully identified genes required for saxitoxin-synthesis in the two transcriptomes. We focused on sxtA, the unique starting gene of saxitoxin synthesis, and show that the dinoflagellate transcripts of sxtA have the same domain structure as the cyanobacterial sxtA genes. But, in contrast to the bacterial homologs, the dinoflagellate transcripts are monocistronic, have a higher GC content, occur in multiple copies, contain typical dinoflagellate spliced-leader sequences and eukaryotic polyA-tails. Further, we investigated 28 saxitoxin-producing and non-producing dinoflagellate strains from six different genera for the presence of genomic sxtA homologs. Our results show very good agreement between the presence of sxtA and saxitoxin-synthesis, except in three strains of A. tamarense, for which we amplified sxtA, but did not detect the toxin. Our work opens for possibilities to develop molecular tools to detect saxitoxin-producing dinoflagellates in the environment.

  3. Transcriptome profiling identifies p53 as a key player during calreticulin deficiency: Implications in lipid accumulation.

    PubMed

    Vig, Saurabh; Talwar, Puneet; Kaur, Kirandeep; Srivastava, Rohit; Srivastava, Arvind K; Datta, Malabika

    2015-01-01

    Calreticulin (CRT) is an endoplasmic reticulum (ER) resident calcium binding protein that is involved in several cellular activities. Transcriptome analyses in CRT knockdown HepG2 cells revealed 253 altered unique genes and subsequent in silico protein-protein interaction network and MCODE clustering identified 34 significant clusters, of which p53 occupied the central hub node in the highest node-rich cluster. Toward validation, we show that CRT knockdown leads to inhibition of p53 protein levels. Both, CRT and p53 siRNA promote hepatic lipid accumulation and this was accompanied by elevated SREBP-1c and FAS levels. p53 was identified to bind at -219 bp on the SREBP-1c promoter and in the presence of CRT siRNA, there was decreased occupancy of p53 on this binding element. This was associated with increased SREBP-1c promoter activity and both, mutation in this binding site or p53 over-expression antagonised the effects of CRT knockdown. We, therefore, identify a negatively regulating p53 binding site on the SREBP-1c promoter that is critical during hepatic lipid accumulation. These results were validated in mouse primary hepatocytes and toward a physiological relevance, we report that while the levels of CRT and p53 are reduced in the fatty livers of diabetic db/db mice, SREBP-1c levels are significantly elevated. Our results suggest that decreased CRT levels might be involved in the development of a fatty liver by preventing p53 occupancy on the SREBP-1c promoter and thereby facilitating SREBP-1c up-regulation and consequently, lipid accumulation.

  4. Natural Variation in Fish Transcriptomes: Comparative Analysis of the Fathead Minnow (Pimephales promelas) and Zebrafish (Danio rerio)

    EPA Science Inventory

    Fathead minnow and zebrafish are among the most intensively studied fish species in environmental toxicogenomics. To aid the assessment and interpretation of subtle transcriptomic effects from treatment conditions of interest, there needs to be a better characterization and unde...

  5. Comparative transcriptome analysis in Sclerotinia sclerotiorum and S. trifoliorum by 454 Titanium RNA sequencing

    USDA-ARS?s Scientific Manuscript database

    Sclerotinia sclerotiorum and S. trifoliorum are two closely related devastating plant pathogens. Extensive research has been conducted on S. sclerotiorum and its genome sequences are available. To take advantages of the genomic information of S. sclerotiorum, we compared the transcriptome of S. tr...

  6. Transcriptome analysis of Pseudomonas syringae identifies new genes, ncRNAs, and antisense activity

    USDA-ARS?s Scientific Manuscript database

    To fully understand how bacteria respond to their environment, it is essential to assess genome-wide transcriptional activity. New high throughput sequencing technologies make it possible to query the transcriptome of an organism in an efficient unbiased manner. We applied a strand-specific method t...

  7. Performance of Arma chinensis reared on an artificial diet formulated using transcriptomic methods

    USDA-ARS?s Scientific Manuscript database

    An artificial diet formulated for continuous rearing of the predator Arma chinensis was inferior to natural prey when evaluated using life history parameters. A transcriptome analysis identified differentially expressed genes in diet-fed and prey-fed A. chinensis that were suggestive of molecular me...

  8. Comparative analysis of microarray data in Arabidopsis transcriptome during compatible interactions with plant viruses

    USDA-ARS?s Scientific Manuscript database

    To analyze transcriptome response to virus infection, we have assembled currently available microarray data on changes in gene expression levels in compatible Arabidopsis-virus interactions. We used the mean r (Pearson’s correlation coefficient) for neighboring pairs to estimate pairwise local simil...

  9. Comparative transcriptome analysis of Aspergillus flavus isolates under different oxidative stresses and culture media

    USDA-ARS?s Scientific Manuscript database

    Aspergillus flavus and aflatoxin contamination in the field are known to be influenced by numerous stress factors, particularly drought and heat stress. However, the purpose of aflatoxin production is unknown. Here, we report transcriptome analyses comprised of 282.6 Gb of sequencing data describing...

  10. Additional annotation of the pig transcriptome using integrated Iso-seq and Illumina RNA-seq analysis

    USDA-ARS?s Scientific Manuscript database

    Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...

  11. Identification and characterization of large DNA deletions affecting oil quality traits in soybean seeds through transcriptome sequencing analysis

    USDA-ARS?s Scientific Manuscript database

    Understanding the molecular and genetic mechanisms underlying variation in seed composition and contents among different genotypes is important for soybean oil quality improvement. We designed a bioinformatics approach to compare seed transcriptomes of 9 soybean genotypes varying in oil composition ...

  12. TCW: Transcriptome Computational Workbench

    PubMed Central

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R.

    2013-01-01

    Background The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. Methodology The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. Conclusion It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw. PMID:23874959

  13. TCW: transcriptome computational workbench.

    PubMed

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R

    2013-01-01

    The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw.

  14. Global microRNA profiling of peripheral blood mononuclear cells in patients with Behçet's disease.

    PubMed

    Erre, Gian Luca; Piga, Matteo; Carru, Ciriaco; Angius, Andrea; Carcangiu, Laura; Piras, Marco; Sotgia, Salvatore; Zinellu, Angelo; Mathieu, Alessandro; Passiu, Giuseppe; Pescatori, Mario

    2015-01-01

    To explore the post-transcriptional regulation of the peripheral blood mononuclear cells (PBMCs) transcriptome by microRNAs in Behçet's disease (BD). Using TaqMan Low Density Array-based microRNAs expression profiling, the expression of 750 mature human microRNAs in PBMCs from 5 BD patients and 3 healthy controls (HC) was compared. The expression of deregulated microRNAs was then validated by quantitative real time-polymerase chain reaction (qRT-PCR), in 42 BD patients and 8 HC. In the initial screening, 13 microRNAs appeared deregulated in BD vs HC. Among them, the differential expression of miR-720 and miR-139-3p was confirmed by qRT-PCR, (p<0.05 and FDR<5%). Areas under the receiver operating characteristic curve for miR-139-3p, miR-720 and miR-139-3p+miR-720 in the validation cohort were 0.84, 0.87 and 0.92 respectively, indicating good discrimination between BD patients and HC. Post-hoc analysis showed that 9 out of 13 microRNAs from the discovery phase were significantly upregulated in active vs. quiescent BD, suggesting inflammation as a key regulator of microRNAs machinery in BD. In silico analysis revealed that several BD candidate susceptibility genes are predicted target of significantly deregulated microRNAs in active BD. A significant enrichment in microRNAs targeting elements of the Toll-like receptor (TLR) and T-cell receptor signalling pathways was also assumed. miR199-3p and miR720 deserve further confirmation as biomarkers of BD in larger studies. PBMCs from active BD displayed a unique signature of microRNAs which may be implicated in regulation of innate immunity activation and T-cell function.

  15. mRNA-seq reveals skeletal muscle atrophy in response to handling stress in a marine teleost, the red cusk-eel (Genypterus chilensis).

    PubMed

    Aedo, Jorge E; Maldonado, Jonathan; Aballai, Víctor; Estrada, Juan M; Bastias-Molina, Macarena; Meneses, Claudio; Gallardo-Escarate, Cristian; Silva, Herman; Molina, Alfredo; Valdés, Juan A

    2015-12-01

    Fish reared under intensive conditions are repeatedly exposed to stress, which negatively impacts growth. Although most fish follow a conserved pattern of stress response, with increased concentrations of cortisol, each species presents specificities in the cell response and stress tolerance. Therefore, culturing new species requires a detailed knowledge of these specific responses. The red cusk-eel (Genypterus chilensis) is a new economically important marine species for the Chilean aquaculture industry. However, there is no information on the stress- and cortisol-induced mechanisms that decrease skeletal muscle growth in this teleost. Using Illumina RNA-seq technology, skeletal muscle sequence reads for G. chilensis were generated under control and handling stress conditions. Reads were mapped onto a reference transcriptome, resulting in the in silico identification of 785 up-regulated and 167 down-regulated transcripts. Gene ontology enrichment analysis revealed a significant up-regulation of catabolic genes associated with skeletal muscle atrophy. These results were validated by RT-qPCR analysis for ten candidates genes involved in ubiquitin-mediated proteolysis, autophagy and skeletal muscle growth. Additionally, using a primary culture of fish skeletal muscle cells, the effect of cortisol was evaluated in relation to red cusk-eel skeletal muscle atrophy. The present data demonstrated that handling stress promotes skeletal muscle atrophy in the marine teleost G. chilensis through the expression of components of the ubiquitin-proteasome and autophagy-lysosome systems. Furthermore, cortisol was a powerful inductor of skeletal muscle atrophy in fish myotubes. This study is an important step towards understanding the atrophy system in non-model teleost species and provides novel insights on the cellular and molecular mechanisms that control skeletal muscle growth in early vertebrates.

  16. The Bifidobacterium dentium Bd1 Genome Sequence Reflects Its Genetic Adaptation to the Human Oral Cavity

    PubMed Central

    Ventura, Marco; Turroni, Francesca; Zomer, Aldert; Foroni, Elena; Giubellini, Vanessa; Bottacini, Francesca; Canchaya, Carlos; Claesson, Marcus J.; He, Fei; Mantzourani, Maria; Mulas, Laura; Ferrarini, Alberto; Gao, Beile; Delledonne, Massimo; Henrissat, Bernard; Coutinho, Pedro; Oggioni, Marco; Gupta, Radhey S.; Zhang, Ziding; Beighton, David; Fitzgerald, Gerald F.; O'Toole, Paul W.; van Sinderen, Douwe

    2009-01-01

    Bifidobacteria, one of the relatively dominant components of the human intestinal microbiota, are considered one of the key groups of beneficial intestinal bacteria (probiotic bacteria). However, in addition to health-promoting taxa, the genus Bifidobacterium also includes Bifidobacterium dentium, an opportunistic cariogenic pathogen. The genetic basis for the ability of B. dentium to survive in the oral cavity and contribute to caries development is not understood. The genome of B. dentium Bd1, a strain isolated from dental caries, was sequenced to completion to uncover a single circular 2,636,368 base pair chromosome with 2,143 predicted open reading frames. Annotation of the genome sequence revealed multiple ways in which B. dentium has adapted to the oral environment through specialized nutrient acquisition, defences against antimicrobials, and gene products that increase fitness and competitiveness within the oral niche. B. dentium Bd1 was shown to metabolize a wide variety of carbohydrates, consistent with genome-based predictions, while colonization and persistence factors implicated in tissue adhesion, acid tolerance, and the metabolism of human saliva-derived compounds were also identified. Global transcriptome analysis demonstrated that many of the genes encoding these predicted traits are highly expressed under relevant physiological conditions. This is the first report to identify, through various genomic approaches, specific genetic adaptations of a Bifidobacterium taxon, Bifidobacterium dentium Bd1, to a lifestyle as a cariogenic microorganism in the oral cavity. In silico analysis and comparative genomic hybridization experiments clearly reveal a high level of genome conservation among various B. dentium strains. The data indicate that the genome of this opportunistic cariogen has evolved through a very limited number of horizontal gene acquisition events, highlighting the narrow boundaries that separate commensals from opportunistic pathogens. PMID:20041198

  17. Metabolic analysis of Chlorobium chlorochromatii CaD3 reveals clues of the symbiosis in ‘Chlorochromatium aggregatum'.

    PubMed Central

    Cerqueda-García, Daniel; Martínez-Castilla, León P; Falcón, Luisa I; Delaye, Luis

    2014-01-01

    A symbiotic association occurs in ‘Chlorochromatium aggregatum', a phototrophic consortium integrated by two species of phylogenetically distant bacteria composed by the green-sulfur Chlorobium chlorochromatii CaD3 epibiont that surrounds a central β-proteobacterium. The non-motile chlorobia can perform nitrogen and carbon fixation, using sulfide as electron donors for anoxygenic photosynthesis. The consortium can move due to the flagella present in the central β-protobacterium. Although Chl. chlorochromatii CaD3 is never found as free-living bacteria in nature, previous transcriptomic and proteomic studies have revealed that there are differential transcription patterns between the symbiotic and free-living status of Chl. chlorocromatii CaD3 when grown in laboratory conditions. The differences occur mainly in genes encoding the enzymatic reactions involved in nitrogen and amino acid metabolism. We performed a metabolic reconstruction of Chl. chlorochromatii CaD3 and an in silico analysis of its amino acid metabolism using an elementary flux modes approach (EFM). Our study suggests that in symbiosis, Chl. chlorochromatii CaD3 is under limited nitrogen conditions where the GS/GOGAT (glutamine synthetase/glutamate synthetase) pathway is actively assimilating ammonia obtained via N2 fixation. In contrast, when free-living, Chl. chlorochromatii CaD3 is in a condition of nitrogen excess and ammonia is assimilated by the alanine dehydrogenase (AlaDH) pathway. We postulate that ‘Chlorochromatium aggregatum' originated from a parasitic interaction where the N2 fixation capacity of the chlorobia would be enhanced by injection of 2-oxoglutarate from the β-proteobacterium via the periplasm. This consortium would have the advantage of motility, which is fundamental to a phototrophic bacterium, and the syntrophy of nitrogen and carbon sources. PMID:24285361

  18. The virulence factor ychO has a pleiotropic action in an Avian Pathogenic Escherichia coli (APEC) strain.

    PubMed

    Pilatti, Livia; Boldrin de Paiva, Jacqueline; Rojas, Thaís Cabrera Galvão; Leite, Janaína Luisa; Conceição, Rogério Arcuri; Nakazato, Gerson; Dias da Silveira, Wanderley

    2016-03-10

    Avian pathogenic Escherichia coli strains cause extraintestinal diseases in birds, leading to substantial economic losses to the poultry industry worldwide. Bacteria that invade cells can overcome the host humoral immune response, resulting in a higher pathogenicity potential. Invasins are members of a large family of outer membrane proteins that allow pathogen invasion into host cells by interacting with specific receptors on the cell surface. An in silico analysis of the genome of a septicemic APEC strain (SEPT362) demonstrated the presence of a putative invasin homologous to the ychO gene from E. coli str. K-12 substr. MG1655. In vitro and in vivo assays comparing a mutant strain carrying a null mutation of this gene, a complemented strain, and its counterpart wild-type strain showed that ychO plays a role in the pathogenicity of APEC strain SEPT362. In vitro assays demonstrated that the mutant strain exhibited significant decreases in bacterial adhesiveness and invasiveness in chicken cells and biofilm formation. In vivo assay indicated a decrease in pathogenicity of the mutant strain. Moreover, transcriptome analysis demonstrated that the ychO deletion affected the expression of 426 genes. Among the altered genes, 93.66% were downregulated in the mutant, including membrane proteins and metabolism genes. The results led us to propose that gene ychO contributes to the pathogenicity of APEC strain SEPT362 influencing, in a pleiotropic manner, many biological characteristics, such as adhesion and invasion of in vitro cultured cells, biofilm formation and motility, which could be due to the possible membrane location of this protein. All of these results suggest that the absence of gene ychO would influence the virulence of the APEC strain herein studied.

  19. Transcriptomics in cancer diagnostics: developments in technology, clinical research and commercialization.

    PubMed

    Sager, Monica; Yeat, Nai Chien; Pajaro-Van der Stadt, Stefan; Lin, Charlotte; Ren, Qiuyin; Lin, Jimmy

    2015-01-01

    Transcriptomic technologies are evolving to diagnose cancer earlier and more accurately to provide greater predictive and prognostic utility to oncologists and patients. Digital techniques such as RNA sequencing are replacing still-imaging techniques to provide more detailed analysis of the transcriptome and aberrant expression that causes oncogenesis, while companion diagnostics are developing to determine the likely effectiveness of targeted treatments. This article examines recent advancements in molecular profiling research and technology as applied to cancer diagnosis, clinical applications and predictions for the future of personalized medicine in oncology.

  20. Single prokaryotic cell isolation and total transcript amplification protocol for transcriptomic analysis.

    PubMed

    Kang, Yun; McMillan, Ian; Norris, Michael H; Hoang, Tung T

    2015-07-01

    Until recently, transcriptome analyses of single cells have been confined to eukaryotes. The information obtained from single-cell transcripts can provide detailed insight into spatiotemporal gene expression, and it could be even more valuable if expanded to prokaryotic cells. Transcriptome analysis of single prokaryotic cells is a recently developed and powerful tool. Here we describe a procedure that allows amplification of the total transcript of a single prokaryotic cell for in-depth analysis. This is performed by using a laser-capture microdissection instrument for single-cell isolation, followed by reverse transcription via Moloney murine leukemia virus, degradation of chromosomal DNA with McrBC and DpnI restriction enzymes, single-stranded cDNA (ss-cDNA) ligation using T4 polynucleotide kinase and CircLigase, and polymerization of ss-cDNA to double-stranded cDNA (ds-cDNA) by Φ29 polymerase. This procedure takes ∼5 d, and sufficient amounts of ds-cDNA can be obtained from single-cell RNA template for further microarray analysis.

  1. Cell type-specific responses to salinity - the epidermal bladder cell transcriptome of Mesembryanthemum crystallinum.

    PubMed

    Oh, Dong-Ha; Barkla, Bronwyn J; Vera-Estrella, Rosario; Pantoja, Omar; Lee, Sang-Yeol; Bohnert, Hans J; Dassanayake, Maheshi

    2015-08-01

    Mesembryanthemum crystallinum (ice plant) exhibits extreme tolerance to salt. Epidermal bladder cells (EBCs), developing on the surface of aerial tissues and specialized in sodium sequestration and other protective functions, are critical for the plant's stress adaptation. We present the first transcriptome analysis of EBCs isolated from intact plants, to investigate cell type-specific responses during plant salt adaptation. We developed a de novo assembled, nonredundant EBC reference transcriptome. Using RNAseq, we compared the expression patterns of the EBC-specific transcriptome between control and salt-treated plants. The EBC reference transcriptome consists of 37 341 transcript-contigs, of which 7% showed significantly different expression between salt-treated and control samples. We identified significant changes in ion transport, metabolism related to energy generation and osmolyte accumulation, stress signalling, and organelle functions, as well as a number of lineage-specific genes of unknown function, in response to salt treatment. The salinity-induced EBC transcriptome includes active transcript clusters, refuting the view of EBCs as passive storage compartments in the whole-plant stress response. EBC transcriptomes, differing from those of whole plants or leaf tissue, exemplify the importance of cell type-specific resolution in understanding stress adaptive mechanisms. No claim to original US government works. New Phytologist © 2015 New Phytologist Trust.

  2. Transcriptome analysis of Brassica napus pod using RNA-Seq and identification of lipid-related candidate genes.

    PubMed

    Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi

    2015-10-24

    Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitates the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot on the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. Four hundred sixty five and two thousand, one hundred fourteen candidate DEGs were identified, respectively, between two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which, one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with other relative species leads to efficient identification of most plausible functional genes underlying oil-content related characters, offering valuable resources for bettering breeding program of Brassica napus. This study provided a comprehensive overview on the pod transcriptomes of two varieties with different oil-contents at the three developmental stages.

  3. WEbcoli: an interactive and asynchronous web application for in silico design and analysis of genome-scale E.coli model.

    PubMed

    Jung, Tae-Sung; Yeo, Hock Chuan; Reddy, Satty G; Cho, Wan-Sup; Lee, Dong-Yup

    2009-11-01

    WEbcoli is a WEb application for in silico designing, analyzing and engineering Escherichia coli metabolism. It is devised and implemented using advanced web technologies, thereby leading to enhanced usability and dynamic web accessibility. As a main feature, the WEbcoli system provides a user-friendly rich web interface, allowing users to virtually design and synthesize mutant strains derived from the genome-scale wild-type E.coli model and to customize pathways of interest through a graph editor. In addition, constraints-based flux analysis can be conducted for quantifying metabolic fluxes and charactering the physiological and metabolic states under various genetic and/or environmental conditions. WEbcoli is freely accessible at http://webcoli.org. cheld@nus.edu.sg.

  4. Model-based redesign of global transcription regulation

    PubMed Central

    Carrera, Javier; Rodrigo, Guillermo; Jaramillo, Alfonso

    2009-01-01

    Synthetic biology aims to the design or redesign of biological systems. In particular, one possible goal could be the rewiring of the transcription regulation network by exchanging the endogenous promoters. To achieve this objective, we have adapted current methods to the inference of a model based on ordinary differential equations that is able to predict the network response after a major change in its topology. Our procedure utilizes microarray data for training. We have experimentally validated our inferred global regulatory model in Escherichia coli by predicting transcriptomic profiles under new perturbations. We have also tested our methodology in silico by providing accurate predictions of the underlying networks from expression data generated with artificial genomes. In addition, we have shown the predictive power of our methodology by obtaining the gene profile in experimental redesigns of the E. coli genome, where rewiring the transcriptional network by means of knockouts of master regulators or by upregulating transcription factors controlled by different promoters. Our approach is compatible with most network inference methods, allowing to explore computationally future genome-wide redesign experiments in synthetic biology. PMID:19188257

  5. Update of the Diatom EST Database: a new tool for digital transcriptomics

    PubMed Central

    Maheswari, Uma; Mock, Thomas; Armbrust, E. Virginia; Bowler, Chris

    2009-01-01

    The Diatom Expressed Sequence Tag (EST) Database was constructed to provide integral access to ESTs from these ecologically and evolutionarily interesting microalgae. It has now been updated with 130 000 Phaeodactylum tricornutum ESTs from 16 cDNA libraries and 77 000 Thalassiosira pseudonana ESTs from seven libraries, derived from cells grown in different nutrient and stress regimes. The updated relational database incorporates results from statistical analyses such as log-likelihood ratios and hierarchical clustering, which help to identify differentially expressed genes under different conditions, and allow similarities in gene expression in different libraries to be investigated in a functional context. The database also incorporates links to the recently sequenced genomes of P. tricornutum and T. pseudonana, enabling an easy cross-talk between the expression pattern of diatom orthologs and the genome browsers. These improvements will facilitate exploration of diatom responses to conditions of ecological relevance and will aid gene function identification of diatom-specific genes and in silico gene prediction in this largely unexplored class of eukaryotes. The updated Diatom EST Database is available at http://www.biologie.ens.fr/diatomics/EST3. PMID:19029140

  6. Use of homologous and heterologous gene expression profiling tools to characterize transcription dynamics during apple fruit maturation and ripening

    PubMed Central

    2010-01-01

    Background Fruit development, maturation and ripening consists of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-Methylcyclopropene. Results To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated. The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the later using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. Conclusion Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species. PMID:20973957

  7. In silico characterization of a novel pathogenic deletion mutation identified in XPA gene in a Pakistani family with severe xeroderma pigmentosum.

    PubMed

    Nasir, Muhammad; Ahmad, Nafees; Sieber, Christian M K; Latif, Amir; Malik, Salman Akbar; Hameed, Abdul

    2013-09-24

    Xeroderma Pigmentosum (XP) is a rare skin disorder characterized by skin hypersensitivity to sunlight and abnormal pigmentation. The aim of this study was to investigate the genetic cause of a severe XP phenotype in a consanguineous Pakistani family and in silico characterization of any identified disease-associated mutation. The XP complementation group was assigned by genotyping of family for known XP loci. Genotyping data mapped the family to complementation group A locus, involving XPA gene. Mutation analysis of the candidate XP gene by DNA sequencing revealed a novel deletion mutation (c.654del A) in exon 5 of XPA gene. The c.654del A, causes frameshift, which pre-maturely terminates protein and result into a truncated product of 222 amino acid (aa) residues instead of 273 (p.Lys218AsnfsX5). In silico tools were applied to study the likelihood of changes in structural motifs and thus interaction of mutated protein with binding partners. In silico analysis of mutant protein sequence, predicted to affect the aa residue which attains coiled coil structure. The coiled coil structure has an important role in key cellular interactions, especially with DNA damage-binding protein 2 (DDB2), which has important role in DDB-mediated nucleotide excision repair (NER) system. Our findings support the fact of genetic and clinical heterogeneity in XP. The study also predicts the critical role of DDB2 binding region of XPA protein in NER pathway and opens an avenue for further research to study the functional role of the mutated protein domain.

  8. De novo transcript sequence reconstruction from RNA-Seq: reference generation and analysis with Trinity

    PubMed Central

    Yassour, Moran; Grabherr, Manfred; Blood, Philip D.; Bowden, Joshua; Couger, Matthew Brian; Eccles, David; Li, Bo; Lieber, Matthias; MacManes, Matthew D.; Ott, Michael; Orvis, Joshua; Pochet, Nathalie; Strozzi, Francesco; Weeks, Nathan; Westerman, Rick; William, Thomas; Dewey, Colin N.; Henschel, Robert; LeDuc, Richard D.; Friedman, Nir; Regev, Aviv

    2013-01-01

    De novo assembly of RNA-Seq data allows us to study transcriptomes without the need for a genome sequence, such as in non-model organisms of ecological and evolutionary importance, cancer samples, or the microbiome. In this protocol, we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-Seq data in non-model organisms. We also present Trinity’s supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples, and approaches to identify protein coding genes. In an included tutorial we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sf.net. PMID:23845962

  9. Assessing the hodgepodge of non-mapped reads in bacterial transcriptomes: real or artifactual RNA chimeras?

    PubMed

    Lloréns-Rico, Verónica; Serrano, Luis; Lluch-Senar, Maria

    2014-07-29

    RNA sequencing methods have already altered our view of the extent and complexity of bacterial and eukaryotic transcriptomes, revealing rare transcript isoforms (circular RNAs, RNA chimeras) that could play an important role in their biology. We performed an analysis of chimera formation by four different computational approaches, including a custom designed pipeline, to study the transcriptomes of M. pneumoniae and P. aeruginosa, as well as mixtures of both. We found that rare transcript isoforms detected by conventional pipelines of analysis could be artifacts of the experimental procedure used in the library preparation, and that they are protocol-dependent. By using a customized pipeline we show that optimal library preparation protocol and the pipeline to analyze the results are crucial to identify real chimeric RNAs.

  10. Veterinary Medicine and Multi-Omics Research for Future Nutrition Targets: Metabolomics and Transcriptomics of the Common Degenerative Mitral Valve Disease in Dogs.

    PubMed

    Li, Qinghong; Freeman, Lisa M; Rush, John E; Huggins, Gordon S; Kennedy, Adam D; Labuda, Jeffrey A; Laflamme, Dorothy P; Hannah, Steven S

    2015-08-01

    Canine degenerative mitral valve disease (DMVD) is the most common form of heart disease in dogs. The objective of this study was to identify cellular and metabolic pathways that play a role in DMVD by performing metabolomics and transcriptomics analyses on serum and tissue (mitral valve and left ventricle) samples previously collected from dogs with DMVD or healthy hearts. Gas or liquid chromatography followed by mass spectrophotometry were used to identify metabolites in serum. Transcriptomics analysis of tissue samples was completed using RNA-seq, and selected targets were confirmed by RT-qPCR. Random Forest analysis was used to classify the metabolites that best predicted the presence of DMVD. Results identified 41 known and 13 unknown serum metabolites that were significantly different between healthy and DMVD dogs, representing alterations in fat and glucose energy metabolism, oxidative stress, and other pathways. The three metabolites with the greatest single effect in the Random Forest analysis were γ-glutamylmethionine, oxidized glutathione, and asymmetric dimethylarginine. Transcriptomics analysis identified 812 differentially expressed transcripts in left ventricle samples and 263 in mitral valve samples, representing changes in energy metabolism, antioxidant function, nitric oxide signaling, and extracellular matrix homeostasis pathways. Many of the identified alterations may benefit from nutritional or medical management. Our study provides evidence of the growing importance of integrative approaches in multi-omics research in veterinary and nutritional sciences.

  11. Comparative de novo transcriptome analysis of male and female Sea buckthorn.

    PubMed

    Bansal, Ankush; Salaria, Mehul; Sharma, Tashil; Stobdan, Tsering; Kant, Anil

    2018-02-01

    Sea buckthorn is a dioecious medicinal plant found at high altitude. The plant has both male and female reproductive organs in separate individuals. In this article, whole transcriptome de novo assemblies of male and female flower bud samples were carried out using Illumina NextSeq 500 platform to determine the role of the genes involved in sex determination. Moreover, genes with differential expression in male and female transcriptomes were identified to understand the underlying sex determination mechanism. The current study showed 63,904 and 62,272 coding sequences (CDS) in female and male transcriptome data sets, respectively. 16,831 common CDS were screened out from both transcriptomes, out of which 625 were upregulated and 491 were found to be downregulated. To understand the potential regulatory roles of differentially expressed genes in metabolic networks and biosynthetic pathways: KEGG mapping, gene ontology, and co-expression network analysis were performed. Comparison with Flowering Interactive Database (FLOR-ID) resulted in eight differentially expressed genes viz. CHD3-type chromatin-remodeling factor PICKLE ( PKL ), phytochrome-associated serine/threonine-protein phosphatase ( FYPP ), protein TOPLESS ( TPL ), sensitive to freezing 6 ( SFR6 ), lysine-specific histone demethylase 1 homolog 1 ( LDL1 ), pre-mRNA-processing-splicing factor 8A ( PRP8A ), sucrose synthase 4 ( SUS4 ), ubiquitin carboxyl-terminal hydrolase 12 ( UBP12 ), known to be broadly involved in flowering, photoperiodism, embryo development, and cold response pathways. Male and female flower bud transcriptome data of Sea buckthorn may provide comprehensive information at genomic level for the identification of genetic regulation involved in sex determination.

  12. Transcriptome-wide identification of reference genes for expression analysis of soybean responses to drought stress along the day

    USDA-ARS?s Scientific Manuscript database

    The soybean transcriptome displays strong variation along the day in optimal growth conditions and also in response to adverse circumstances, like drought stress. However, no study conducted to date has presented suitable reference genes, with stable expression along the day, for relative gene expre...

  13. Comparison of ribosomal RNA removal methods for transcriptome sequencing workflows in teleost fish

    USDA-ARS?s Scientific Manuscript database

    RNA sequencing (RNA-Seq) is becoming the standard for transcriptome analysis. Removal of contaminating ribosomal RNA (rRNA) is a priority in the preparation of libraries suitable for sequencing. rRNAs are commonly removed from total RNA via either mRNA selection or rRNA depletion. These methods have...

  14. Transcriptome analysis reveals a comprehensive insect resistance response mechanism in cotton to infestation by the phloem feeding insect Bemisia tabaci (whitefly)

    USDA-ARS?s Scientific Manuscript database

    The whitefly (Bemisia tabaci) causes tremendous damage to cotton production worldwide. However, very limited information is available about how plants perceive and defend themselves from this destructive pest. In this study, the transcriptomics differences between two cotton cultivars that exhibit e...

  15. Transcriptomic analysis reveals numerous diverse protein kinases and transcription factors involved in desiccation tolerance in the resurrection plant Myrothamnus flabellifolia

    USDA-ARS?s Scientific Manuscript database

    The woody resurrection plant Myrothamnus flabellifolia has remarkable tolerance to desiccation. Pyro-sequencing technology permitted us to analyze the transcriptome of M. flabellifolia during both dehydration and rehydration. We identified a total of 8287 and 8542 differentially transcribed genes du...

  16. Comparative transcriptome and secretome analysis of wood decay fungi Postia placenta and Phanerochaete chrysosporium

    Treesearch

    Amber J. Vanden Wymelenberg; Jill Gaskell; Michael Mozuch; Grzegorz Sabat; John Ralph; Oleksandr Skyba; Shawn D Mansfield; Robert A. Blanchette; Diego Martinez; Igor Grigoriev; Philip J Kersten; Daniel Cullen

    2010-01-01

    Cellulose degradation by brown rot fungi, such as Postia placenta, is poorly understood relative to the phylogenetically related white rot basidiomycete, Phanerochaete chrysosporium. To elucidate the number, structure, and regulation of genes involved in lignocellulosic cell wall attack, secretome and transcriptome analyses were performed on both wood decay fungi...

  17. Root herbivory: molecular analysis of the maize transcriptome upon infestation by Southern corn rootworm, Diabrotica undecimpunctata howardi

    USDA-ARS?s Scientific Manuscript database

    While many studies have characterized the transcriptome of plants attacked by herbivorous insect pests, few have undertaken an examination of the genes affected by root pests. We have subjected maize seedlings to infestation by southern corn rootworm (SCR) Diabrotica undecimpunctata howardi and usin...

  18. Mango (Mangifera indica L.) cv. Kent fruit mesocarp de novo transcriptome assembly identifies gene families important for ripening

    USDA-ARS?s Scientific Manuscript database

    Fruit ripening is a physiological and biochemical process genetically programmed to regulate fruit quality parameters like firmness, flavor, odor and color, as well as production of ethylene in climacteric fruit. In this study, a transcriptomic analysis of mango (Mangifera indica L.) mesocarp cv. "K...

  19. Information Theoretical Analysis of a Bovine Gene Atlas Reveals Chromosomal Regions with Tissue Specific Gene Expression.

    USDA-ARS?s Scientific Manuscript database

    An essential step to understanding the genomic biology of any organism is to comprehensively survey its transcriptome. We present the Bovine Gene Atlas (BGA) a compendium of over 7.2 million unique 20 base Illumina DGE tags representing 100 tissue transcriptomes collected primarily from L1 Dominette...

  20. Consensus-phenotype integration of transcriptomic and metabolomic data implies a role for metabolism in the chemosensitivity of tumour cells.

    PubMed

    Cavill, Rachel; Kamburov, Atanas; Ellis, James K; Athersuch, Toby J; Blagrove, Marcus S C; Herwig, Ralf; Ebbels, Timothy M D; Keun, Hector C

    2011-03-01

    Using transcriptomic and metabolomic measurements from the NCI60 cell line panel, together with a novel approach to integration of molecular profile data, we show that the biochemical pathways associated with tumour cell chemosensitivity to platinum-based drugs are highly coincident, i.e. they describe a consensus phenotype. Direct integration of metabolome and transcriptome data at the point of pathway analysis improved the detection of consensus pathways by 76%, and revealed associations between platinum sensitivity and several metabolic pathways that were not visible from transcriptome analysis alone. These pathways included the TCA cycle and pyruvate metabolism, lipoprotein uptake and nucleotide synthesis by both salvage and de novo pathways. Extending the approach across a wide panel of chemotherapeutics, we confirmed the specificity of the metabolic pathway associations to platinum sensitivity. We conclude that metabolic phenotyping could play a role in predicting response to platinum chemotherapy and that consensus-phenotype integration of molecular profiling data is a powerful and versatile tool for both biomarker discovery and for exploring the complex relationships between biological pathways and drug response.

  1. CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

    PubMed

    Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun

    2012-09-15

    To address the impending need for exploring rapidly increased transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/ liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn Supplementary data are available at Bioinformatics online.

  2. Identification of potential transcriptomic markers in developing pediatric sepsis: a weighted gene co-expression network analysis and a case-control validation study.

    PubMed

    Li, Yiping; Li, Yanhong; Bai, Zhenjiang; Pan, Jian; Wang, Jian; Fang, Fang

    2017-12-13

    Sepsis represents a complex disease with the dysregulated inflammatory response and high mortality rate. The goal of this study was to identify potential transcriptomic markers in developing pediatric sepsis by a co-expression module analysis of the transcriptomic dataset. Using the R software and Bioconductor packages, we performed a weighted gene co-expression network analysis to identify co-expression modules significantly associated with pediatric sepsis. Functional interpretation (gene ontology and pathway analysis) and enrichment analysis with known transcription factors and microRNAs of the identified candidate modules were then performed. In modules significantly associated with sepsis, the intramodular analysis was further performed and "hub genes" were identified and validated by quantitative real-time PCR (qPCR) in this study. 15 co-expression modules in total were detected, and four modules ("midnight blue", "cyan", "brown", and "tan") were most significantly associated with pediatric sepsis and suggested as potential sepsis-associated modules. Gene ontology analysis and pathway analysis revealed that these four modules strongly associated with immune response. Three of the four sepsis-associated modules were also enriched with known transcription factors (false discovery rate-adjusted P < 0.05). Hub genes were identified in each of the four modules. Four of the identified hub genes (MYB proto-oncogene like 1, killer cell lectin like receptor G1, stomatin, and membrane spanning 4-domains A4A) were further validated to be differentially expressed between septic children and controls by qPCR. Four pediatric sepsis-associated co-expression modules were identified in this study. qPCR results suggest that hub genes in these modules are potential transcriptomic markers for pediatric sepsis diagnosis. These results provide novel insights into the pathogenesis of pediatric sepsis and promote the generation of diagnostic gene sets.

  3. Biosynthesis of the active compounds of Isatis indigotica based on transcriptome sequencing and metabolites profiling

    PubMed Central

    2013-01-01

    Backgroud Isatis indigotica is a widely used herb for the clinical treatment of colds, fever, and influenza in Traditional Chinese Medicine (TCM). Various structural classes of compounds have been identified as effective ingredients. However, little is known at genetics level about these active metabolites. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive dataset of I. indigotica. Results A database of 36,367 unigenes (average length = 1,115.67 bases) was generated by performing transcriptome sequencing. Based on the gene annotation of the transcriptome, 104 unigenes were identified covering most of the catalytic steps in the general biosynthetic pathways of indole, terpenoid, and phenylpropanoid. Subsequently, the organ-specific expression patterns of the genes involved in these pathways, and their responses to methyl jasmonate (MeJA) induction, were investigated. Metabolites profile of effective phenylpropanoid showed accumulation pattern of secondary metabolites were mostly correlated with the transcription of their biosynthetic genes. According to the analysis of UDP-dependent glycosyltransferases (UGT) family, several flavonoids were indicated to exist in I. indigotica and further identified by metabolic profile using UPLC/Q-TOF. Moreover, applying transcriptome co-expression analysis, nine new, putative UGTs were suggested as flavonol glycosyltransferases and lignan glycosyltransferases. Conclusions This database provides a pool of candidate genes involved in biosynthesis of effective metabolites in I. indigotica. Furthermore, the comprehensive analysis and characterization of the significant pathways are expected to give a better insight regarding the diversity of chemical composition, synthetic characteristics, and the regulatory mechanism which operate in this medical herb. PMID:24308360

  4. Integrative "omic" analysis of experimental bacteremia identifies a metabolic signature that distinguishes human sepsis from systemic inflammatory response syndromes.

    PubMed

    Langley, Raymond J; Tipper, Jennifer L; Bruse, Shannon; Baron, Rebecca M; Tsalik, Ephraim L; Huntley, James; Rogers, Angela J; Jaramillo, Richard J; O'Donnell, Denise; Mega, William M; Keaton, Mignon; Kensicki, Elizabeth; Gazourian, Lee; Fredenburgh, Laura E; Massaro, Anthony F; Otero, Ronny M; Fowler, Vance G; Rivers, Emanuel P; Woods, Chris W; Kingsmore, Stephen F; Sopori, Mohan L; Perrella, Mark A; Choi, Augustine M K; Harrod, Kevin S

    2014-08-15

    Sepsis is a leading cause of morbidity and mortality. Currently, early diagnosis and the progression of the disease are difficult to make. The integration of metabolomic and transcriptomic data in a primate model of sepsis may provide a novel molecular signature of clinical sepsis. To develop a biomarker panel to characterize sepsis in primates and ascertain its relevance to early diagnosis and progression of human sepsis. Intravenous inoculation of Macaca fascicularis with Escherichia coli produced mild to severe sepsis, lung injury, and death. Plasma samples were obtained before and after 1, 3, and 5 days of E. coli challenge and at the time of killing. At necropsy, blood, lung, kidney, and spleen samples were collected. An integrative analysis of the metabolomic and transcriptomic datasets was performed to identify a panel of sepsis biomarkers. The extent of E. coli invasion, respiratory distress, lethargy, and mortality was dependent on the bacterial dose. Metabolomic and transcriptomic changes characterized severe infections and death, and indicated impaired mitochondrial, peroxisomal, and liver functions. Analysis of the pulmonary transcriptome and plasma metabolome suggested impaired fatty acid catabolism regulated by peroxisome-proliferator activated receptor signaling. A representative four-metabolite model effectively diagnosed sepsis in primates (area under the curve, 0.966) and in two human sepsis cohorts (area under the curve, 0.78 and 0.82). A model of sepsis based on reciprocal metabolomic and transcriptomic data was developed in primates and validated in two human patient cohorts. It is anticipated that the identified parameters will facilitate early diagnosis and management of sepsis.

  5. The Transcriptomes of Xiphinema index and Longidorus elongatus Suggest Independent Acquisition of Some Plant Parasitism Genes by Horizontal Gene Transfer in Early-Branching Nematodes

    PubMed Central

    Danchin, Etienne G.J.; Perfus-Barbeoch, Laetitia; Rancurel, Corinne; Thorpe, Peter; Da Rocha, Martine; Bajew, Simon; Neilson, Roy; Sokolova (Guzeeva), Elena; Da Silva, Corinne; Guy, Julie; Labadie, Karine; Esmenjaud, Daniel; Helder, Johannes; Jones, John T.

    2017-01-01

    Nematodes have evolved the ability to parasitize plants on at least four independent occasions, with plant parasites present in Clades 1, 2, 10 and 12 of the phylum. In the case of Clades 10 and 12, horizontal gene transfer of plant cell wall degrading enzymes from bacteria and fungi has been implicated in the evolution of plant parasitism. We have used ribonucleic acid sequencing (RNAseq) to generate reference transcriptomes for two economically important nematode species, Xiphinema index and Longidorus elongatus, representative of two genera within the early-branching Clade 2 of the phylum Nematoda. We used a transcriptome-wide analysis to identify putative horizontal gene transfer events. This represents the first in-depth transcriptome analysis from any plant-parasitic nematode of this clade. For each species, we assembled ~30 million Illumina reads into a reference transcriptome. We identified 62 and 104 transcripts, from X. index and L. elongatus, respectively, that were putatively acquired via horizontal gene transfer. By cross-referencing horizontal gene transfer prediction with a phylum-wide analysis of Pfam domains, we identified Clade 2-specific events. Of these, a GH12 cellulase from X. index was analysed phylogenetically and biochemically, revealing a likely bacterial origin and canonical enzymatic function. Horizontal gene transfer was previously shown to be a phenomenon that has contributed to the evolution of plant parasitism among nematodes. Our findings underline the importance and the extensiveness of this phenomenon in the evolution of plant-parasitic life styles in this speciose and widespread animal phylum. PMID:29065523

  6. The Transcriptomes of Xiphinema index and Longidorus elongatus Suggest Independent Acquisition of Some Plant Parasitism Genes by Horizontal Gene Transfer in Early-Branching Nematodes.

    PubMed

    Danchin, Etienne G J; Perfus-Barbeoch, Laetitia; Rancurel, Corinne; Thorpe, Peter; Da Rocha, Martine; Bajew, Simon; Neilson, Roy; Guzeeva, Elena Sokolova; Da Silva, Corinne; Guy, Julie; Labadie, Karine; Esmenjaud, Daniel; Helder, Johannes; Jones, John T; den Akker, Sebastian Eves-van

    2017-10-23

    Nematodes have evolved the ability to parasitize plants on at least four independent occasions, with plant parasites present in Clades 1, 2, 10 and 12 of the phylum. In the case of Clades 10 and 12, horizontal gene transfer of plant cell wall degrading enzymes from bacteria and fungi has been implicated in the evolution of plant parasitism. We have used ribonucleic acid sequencing (RNAseq) to generate reference transcriptomes for two economically important nematode species, Xiphinema index and Longidorus elongatus , representative of two genera within the early-branching Clade 2 of the phylum Nematoda. We used a transcriptome-wide analysis to identify putative horizontal gene transfer events. This represents the first in-depth transcriptome analysis from any plant-parasitic nematode of this clade. For each species, we assembled ~30 million Illumina reads into a reference transcriptome. We identified 62 and 104 transcripts, from X. index and L. elongatus , respectively, that were putatively acquired via horizontal gene transfer. By cross-referencing horizontal gene transfer prediction with a phylum-wide analysis of Pfam domains, we identified Clade 2-specific events. Of these, a GH12 cellulase from X. index was analysed phylogenetically and biochemically, revealing a likely bacterial origin and canonical enzymatic function. Horizontal gene transfer was previously shown to be a phenomenon that has contributed to the evolution of plant parasitism among nematodes. Our findings underline the importance and the extensiveness of this phenomenon in the evolution of plant-parasitic life styles in this speciose and widespread animal phylum.

  7. Transcriptome profile and unique genetic evolution of positively selected genes in yak lungs.

    PubMed

    Lan, DaoLiang; Xiong, XianRong; Ji, WenHui; Li, Jian; Mipam, Tserang-Donko; Ai, Yi; Chai, ZhiXin

    2018-04-01

    The yak (Bos grunniens), which is a unique bovine breed that is distributed mainly in the Qinghai-Tibetan Plateau, is considered a good model for studying plateau adaptability in mammals. The lungs are important functional organs that enable animals to adapt to their external environment. However, the genetic mechanism underlying the adaptability of yak lungs to harsh plateau environments remains unknown. To explore the unique evolutionary process and genetic mechanism of yak adaptation to plateau environments, we performed transcriptome sequencing of yak and cattle (Bos taurus) lungs using RNA-Seq technology and a subsequent comparison analysis to identify the positively selected genes in the yak. After deep sequencing, a normal transcriptome profile of yak lung that containing a total of 16,815 expressed genes was obtained, and the characteristics of yak lungs transcriptome was described by functional analysis. Furthermore, Ka/Ks comparison statistics result showed that 39 strong positively selected genes are identified from yak lungs. Further GO and KEGG analysis was conducted for the functional annotation of these genes. The results of this study provide valuable data for further explorations of the unique evolutionary process of high-altitude hypoxia adaptation in yaks in the Tibetan Plateau and the genetic mechanism at the molecular level.

  8. Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome

    PubMed Central

    Kim, Gunjune

    2017-01-01

    Contact with poison ivy plants is widely dreaded because they produce a natural product called urushiol that is responsible for allergenic contact delayed-dermatitis symptoms lasting for weeks. For this reason, the catchphrase most associated with poison ivy is “leaves of three, let it be”, which serves the purpose of both identification and an appeal for avoidance. Ironically, despite this notoriety, there is a dearth of specific knowledge about nearly all other aspects of poison ivy physiology and ecology. As a means of gaining a more molecular-oriented understanding of poison ivy physiology and ecology, Next Generation DNA sequencing technology was used to develop poison ivy root and leaf RNA-seq transcriptome resources. De novo assembled transcriptomes were analyzed to generate a core set of high quality expressed transcripts present in poison ivy tissue. The predicted protein sequences were evaluated for similarity to SwissProt homologs and InterProScan domains, as well as assigned both GO terms and KEGG annotations. Over 23,000 simple sequence repeats were identified in the transcriptome, and corresponding oligo nucleotide primer pairs were designed. A pan-transcriptome analysis of existing Anacardiaceae transcriptomes revealed conserved and unique transcripts among these species. PMID:29125533

  9. Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome.

    PubMed

    Weisberg, Alexandra J; Kim, Gunjune; Westwood, James H; Jelesko, John G

    2017-11-10

    Contact with poison ivy plants is widely dreaded because they produce a natural product called urushiol that is responsible for allergenic contact delayed-dermatitis symptoms lasting for weeks. For this reason, the catchphrase most associated with poison ivy is "leaves of three, let it be", which serves the purpose of both identification and an appeal for avoidance. Ironically, despite this notoriety, there is a dearth of specific knowledge about nearly all other aspects of poison ivy physiology and ecology. As a means of gaining a more molecular-oriented understanding of poison ivy physiology and ecology, Next Generation DNA sequencing technology was used to develop poison ivy root and leaf RNA-seq transcriptome resources. De novo assembled transcriptomes were analyzed to generate a core set of high quality expressed transcripts present in poison ivy tissue. The predicted protein sequences were evaluated for similarity to SwissProt homologs and InterProScan domains, as well as assigned both GO terms and KEGG annotations. Over 23,000 simple sequence repeats were identified in the transcriptome, and corresponding oligo nucleotide primer pairs were designed. A pan-transcriptome analysis of existing Anacardiaceae transcriptomes revealed conserved and unique transcripts among these species.

  10. Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing

    PubMed Central

    Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li

    2010-01-01

    Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome. PMID:20392818

  11. Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing.

    PubMed

    Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li

    2010-08-01

    Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome.

  12. Genetic Epidemiology of Glucose-6-Dehydrogenase Deficiency in the Arab World.

    PubMed

    Doss, C George Priya; Alasmar, Dima R; Bux, Reem I; Sneha, P; Bakhsh, Fadheela Dad; Al-Azwani, Iman; Bekay, Rajaa El; Zayed, Hatem

    2016-11-17

    A systematic search was implemented using four literature databases (PubMed, Embase, Science Direct and Web of Science) to capture all the causative mutations of Glucose-6-phosphate dehydrogenase (G6PD) deficiency (G6PDD) in the 22 Arab countries. Our search yielded 43 studies that captured 33 mutations (23 missense, one silent, two deletions, and seven intronic mutations), in 3,430 Arab patients with G6PDD. The 23 missense mutations were then subjected to phenotypic classification using in silico prediction tools, which were compared to the WHO pathogenicity scale as a reference. These in silico tools were tested for their predicting efficiency using rigorous statistical analyses. Of the 23 missense mutations, p.S188F, p.I48T, p.N126D, and p.V68M, were identified as the most common mutations among Arab populations, but were not unique to the Arab world, interestingly, our search strategy found four other mutations (p.N135T, p.S179N, p.R246L, and p.Q307P) that are unique to Arabs. These mutations were exposed to structural analysis and molecular dynamics simulation analysis (MDSA), which predicting these mutant forms as potentially affect the enzyme function. The combination of the MDSA, structural analysis, and in silico predictions and statistical tools we used will provide a platform for future prediction accuracy for the pathogenicity of genetic mutations.

  13. Transcriptome Analysis of Fat Bodies from Two Brown Planthopper (Nilaparvata lugens) Populations with Different Virulence Levels in Rice

    PubMed Central

    Chen, Hongdan; Lai, Wenxiang; Fu, Qiang; Lou, Yonggen

    2014-01-01

    Background The brown planthopper (BPH), Nilaparvata lugens (Stål), one of the most serious rice insect pests in Asia, can quickly overcome rice resistance by evolving new virulent populations. The insect fat body plays essential roles in the life cycles of insects and in plant-insect interactions. However, whether differences in fat body transcriptomes exist between insect populations with different virulence levels and whether the transcriptomic differences are related to insect virulence remain largely unknown. Methodology/Principal Findings In this study, we performed transcriptome-wide analyses on the fat bodies of two BPH populations with different virulence levels in rice. The populations were derived from rice variety TN1 (TN1 population) and Mudgo (M population). In total, 33,776 and 32,332 unigenes from the fat bodies of TN1 and M populations, respectively, were generated using Illumina technology. Gene ontology annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology classifications indicated that genes related to metabolism and immunity were significantly active in the fat bodies. In addition, a total of 339 unigenes showed homology to genes of yeast-like symbionts (YLSs) from 12 genera and endosymbiotic bacteria Wolbachia. A comparative analysis of the two transcriptomes generated 7,860 differentially expressed genes. GO annotations and enrichment analysis of KEGG pathways indicated these differentially expressed transcripts might be involved in metabolism and immunity. Finally, 105 differentially expressed genes from YLSs and Wolbachia were identified, genes which might be associated with the formation of different virulent populations. Conclusions/Significance This study was the first to compare the fat-body transcriptomes of two BPH populations having different virulence traits and to find genes that may be related to this difference. Our findings provide a molecular resource for future investigations of fat bodies and will be useful in examining the interactions between the fat body and virulence variation in the BPH. PMID:24533099

  14. Transcriptome analysis of fat bodies from two brown planthopper (Nilaparvata lugens) populations with different virulence levels in rice.

    PubMed

    Yu, Haixin; Ji, Rui; Ye, Wenfeng; Chen, Hongdan; Lai, Wenxiang; Fu, Qiang; Lou, Yonggen

    2014-01-01

    The brown planthopper (BPH), Nilaparvata lugens (Stål), one of the most serious rice insect pests in Asia, can quickly overcome rice resistance by evolving new virulent populations. The insect fat body plays essential roles in the life cycles of insects and in plant-insect interactions. However, whether differences in fat body transcriptomes exist between insect populations with different virulence levels and whether the transcriptomic differences are related to insect virulence remain largely unknown. In this study, we performed transcriptome-wide analyses on the fat bodies of two BPH populations with different virulence levels in rice. The populations were derived from rice variety TN1 (TN1 population) and Mudgo (M population). In total, 33,776 and 32,332 unigenes from the fat bodies of TN1 and M populations, respectively, were generated using Illumina technology. Gene ontology annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology classifications indicated that genes related to metabolism and immunity were significantly active in the fat bodies. In addition, a total of 339 unigenes showed homology to genes of yeast-like symbionts (YLSs) from 12 genera and endosymbiotic bacteria Wolbachia. A comparative analysis of the two transcriptomes generated 7,860 differentially expressed genes. GO annotations and enrichment analysis of KEGG pathways indicated these differentially expressed transcripts might be involved in metabolism and immunity. Finally, 105 differentially expressed genes from YLSs and Wolbachia were identified, genes which might be associated with the formation of different virulent populations. This study was the first to compare the fat-body transcriptomes of two BPH populations having different virulence traits and to find genes that may be related to this difference. Our findings provide a molecular resource for future investigations of fat bodies and will be useful in examining the interactions between the fat body and virulence variation in the BPH.

  15. Deep sequencing-based transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus reveals insight into the immune-relevant genes in marine fish

    PubMed Central

    2010-01-01

    Background Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE) are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish. Results RNA-seq analysis generated 169,950 non-redundant consensus sequences, among which 48,987 functional transcripts with complete or various length encoding regions were identified. More than 52% of these transcripts are possibly involved in approximately 219 known metabolic or signalling pathways, while 2,673 transcripts were associated with immune-relevant genes. In addition, approximately 8% of the transcripts appeared to be fish-specific genes that have never been described before. DGE analysis revealed that the host transcriptome profile of Vibrio harveyi-challenged L. japonicus is considerably altered, as indicated by the significant up- or down-regulation of 1,224 strong infection-responsive transcripts. Results indicated an overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models. Analysis suggested the acquisition of numerous fish-specific immune system components during early vertebrate evolution. Conclusion This study provided a global survey of host defence gene activities against bacterial challenge in a non-model marine fish. Results can contribute to the in-depth study of candidate genes in marine fish immunity, and help improve current understanding of host-pathogen interactions and evolutionary history of immunogenetics from fish to mammals. PMID:20707909

  16. Global Analysis of Transcriptome Responses and Gene Expression Profiles to Cold Stress of Jatropha curcas L.

    PubMed Central

    Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming

    2013-01-01

    Background Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. Results In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. Conclusions This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance in J. curcas. PMID:24349370

  17. Global analysis of transcriptome responses and gene expression profiles to cold stress of Jatropha curcas L.

    PubMed

    Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming

    2013-01-01

    Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance in J. curcas.

  18. In silico environmental chemical science: properties and processes from statistical and computational modelling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tratnyek, Paul G.; Bylaska, Eric J.; Weber, Eric J.

    2017-01-01

    Quantitative structure–activity relationships (QSARs) have long been used in the environmental sciences. More recently, molecular modeling and chemoinformatic methods have become widespread. These methods have the potential to expand and accelerate advances in environmental chemistry because they complement observational and experimental data with “in silico” results and analysis. The opportunities and challenges that arise at the intersection between statistical and theoretical in silico methods are most apparent in the context of properties that determine the environmental fate and effects of chemical contaminants (degradation rate constants, partition coefficients, toxicities, etc.). The main example of this is the calibration of QSARs usingmore » descriptor variable data calculated from molecular modeling, which can make QSARs more useful for predicting property data that are unavailable, but also can make them more powerful tools for diagnosis of fate determining pathways and mechanisms. Emerging opportunities for “in silico environmental chemical science” are to move beyond the calculation of specific chemical properties using statistical models and toward more fully in silico models, prediction of transformation pathways and products, incorporation of environmental factors into model predictions, integration of databases and predictive models into more comprehensive and efficient tools for exposure assessment, and extending the applicability of all the above from chemicals to biologicals and materials.« less

  19. In vitro, in vivo and in silico analysis of the anticancer and estrogen-like activity of guava leaf extracts.

    PubMed

    Rizzo, L Y; Longato, G B; Ruiz, A Lt G; Tinti, S V; Possenti, A; Vendramini-Costa, D B; Sartoratto, A; Figueira, G M; Silva, F L N; Eberlin, M N; Souza, T A C B; Murakami, M T; Rizzo, E; Foglio, M A; Kiessling, F; Lammers, T; Carvalho, J E

    2014-01-01

    Anticancer drug research based on natural compounds enabled the discovery of many drugs currently used in cancer therapy. Here, we report the in vitro, in vivo and in silico anticancer and estrogen-like activity of Psidium guajava L. (guava) extracts and enriched mixture containing the meroterpenes guajadial, psidial A and psiguadial A and B. All samples were evaluated in vitro for anticancer activity against nine human cancer lines: K562 (leukemia), MCF7 (breast), NCI/ADR-RES (resistant ovarian cancer), NCI-H460 (lung), UACC-62 (melanoma), PC-3 (prostate), HT-29 (colon), OVCAR-3 (ovarian) and 786-0 (kidney). Psidium guajava's active compounds displayed similar physicochemical properties to estradiol and tamoxifen, as in silico molecular docking studies demonstrated that they fit into the estrogen receptors (ERs). The meroterpene-enriched fraction was also evaluated in vivo in a Solid Ehrlich murine breast adenocarcinoma model, and showed to be highly effective in inhibiting tumor growth, also demonstrating uterus increase in comparison to negative controls. The ability of guajadial, psidial A and psiguadials A and B to reduce tumor growth and stimulate uterus proliferation, as well as their in silico docking similarity to tamoxifen, suggest that these compounds may act as Selective Estrogen Receptors Modulators (SERMs), therefore holding significant potential for anticancer therapy.

  20. A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach.

    PubMed

    Melicher, Dacotah; Torson, Alex S; Dworkin, Ian; Bowsher, Julia H

    2014-03-12

    The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.

  1. Sequencing, Annotation and Analysis of the Syrian Hamster (Mesocricetus auratus) Transcriptome

    PubMed Central

    Tchitchek, Nicolas; Safronetz, David; Rasmussen, Angela L.; Martens, Craig; Virtaneva, Kimmo; Porcella, Stephen F.; Feldmann, Heinz

    2014-01-01

    Background The Syrian hamster (golden hamster, Mesocricetus auratus) is gaining importance as a new experimental animal model for multiple pathogens, including emerging zoonotic diseases such as Ebola. Nevertheless there are currently no publicly available transcriptome reference sequences or genome for this species. Results A cDNA library derived from mRNA and snRNA isolated and pooled from the brains, lungs, spleens, kidneys, livers, and hearts of three adult female Syrian hamsters was sequenced. Sequence reads were assembled into 62,482 contigs and 111,796 reads remained unassembled (singletons). This combined contig/singleton dataset, designated as the Syrian hamster transcriptome, represents a total of 60,117,204 nucleotides. Our Mesocricetus auratus Syrian hamster transcriptome mapped to 11,648 mouse transcripts representing 9,562 distinct genes, and mapped to a similar number of transcripts and genes in the rat. We identified 214 quasi-complete transcripts based on mouse annotations. Canonical pathways involved in a broad spectrum of fundamental biological processes were significantly represented in the library. The Syrian hamster transcriptome was aligned to the current release of the Chinese hamster ovary (CHO) cell transcriptome and genome to improve the genomic annotation of this species. Finally, our Syrian hamster transcriptome was aligned against 14 other rodents, primate and laurasiatheria species to gain insights about the genetic relatedness and placement of this species. Conclusions This Syrian hamster transcriptome dataset significantly improves our knowledge of the Syrian hamster's transcriptome, especially towards its future use in infectious disease research. Moreover, this library is an important resource for the wider scientific community to help improve genome annotation of the Syrian hamster and other closely related species. Furthermore, these data provide the basis for development of expression microarrays that can be used in functional genomics studies. PMID:25398096

  2. An integrated in silico approach for functional and structural impact of non- synonymous SNPs in the MYH1 gene in Jeju Native Pigs.

    PubMed

    Ghosh, Mrinmoy; Sodhi, Simrinder Singh; Sharma, Neelesh; Mongre, Raj Kumar; Kim, Nameun; Singh, Amit Kumar; Lee, Sung Jin; Kim, Dae Cheol; Kim, Sung Woo; Lee, Hak Kyo; Song, Ki-Duk; Jeong, Dong Kee

    2016-02-04

    This study was performed to identify the non- synonymous polymorphisms in the myosin heavy chain 1 gene (MYH1) association with skeletal muscle development in economically important Jeju Native Pig (JNP) and Berkshire breeds. Herein, we present an in silico analysis, with a focus on (a) in silico approaches to predict the functional effect of non-synonymous SNP (nsSNP) in MYH1 on growth, and (b) molecular docking and dynamic simulation of MYH1 to predict the effects of those nsSNP on protein-protein association. The NextGENe (V 2.3.4.) tool was used to identify the variants in MYH1 from JNP and Berkshire using RNA seq. Gene ontology analysis of MYH1 revealed significant association with muscle contraction and muscle organ development. The 95 % confidence intervals clearly indicate that the mRNA expression of MYH1 is significantly higher in the Berkshire longissimus dorsi muscle samples than JNP breed. Concordant in silico analysis of MYH1, the open-source software tools identified 4 potential nsSNP (L884T, K972C, N981G, and Q1285C) in JNP and 1 nsSNP (H973G) in Berkshire pigs. Moreover, protein-protein interactions were studied to investigate the effect of MYH1 mutations on association with hub proteins, and MYH1 was found to be closely associated with the protein myosin light chain, phosphorylatable, fast skeletal muscle MYLPF. The results of molecular docking studies on MYH1 (native and 4 mutants) and MYLFP demonstrated that the native complex showed higher electrostatic energy (-466.5 Kcal mol(-1)), van der Walls energy (-87.3 Kcal mol(-1)), and interaction energy (-835.7 Kcal mol(-1)) than the mutant complexes. Furthermore, the molecular dynamic simulation revealed that the native complex yielded a higher root-mean-square deviation (0.2-0.55 nm) and lower root-mean-square fluctuation (approximately 0.08-0.3 nm) as compared to the mutant complexes. The results suggest that the variants at L884T, K972C, N981G, and Q1285C in MYH1 in JNP might represent a cause for the poor growth performance for this breed. This study is a pioneering in-depth in silico analysis of polymorphic MYH1 and will serve as a valuable resource for further targeted molecular diagnosis and population-based studies conducted for improving the growth performance of JNP.

  3. In silico analysis of protein toxin and bacteriocins from Lactobacillus paracasei SD1 genome and available online databases

    PubMed Central

    Surachat, Komwit; Sangket, Unitsa; Deachamag, Panchalika; Chotigeat, Wilaiwan

    2017-01-01

    Lactobacillus paracasei SD1 is a potential probiotic strain due to its ability to survive several conditions in human dental cavities. To ascertain its safety for human use, we therefore performed a comprehensive bioinformatics analysis and characterization of the bacterial protein toxins produced by this strain. We report the complete genome of Lactobacillus paracasei SD1 and its comparison to other Lactobacillus genomes. Additionally, we identify and analyze its protein toxins and antimicrobial proteins using reliable online database resources and establish its phylogenetic relationship with other bacterial genomes. Our investigation suggests that this strain is safe for human use and contains several bacteriocins that confer health benefits to the host. An in silico analysis of protein-protein interactions between the target bacteriocins and the microbial proteins gtfB and luxS of Streptococcus mutans was performed and is discussed here. PMID:28837656

  4. Analysis of the Macaca mulatta transcriptome and the sequence divergence between Macaca and human.

    PubMed

    Magness, Charles L; Fellin, P Campion; Thomas, Matthew J; Korth, Marcus J; Agy, Michael B; Proll, Sean C; Fitzgibbon, Matthew; Scherer, Christina A; Miner, Douglas G; Katze, Michael G; Iadonato, Shawn P

    2005-01-01

    We report the initial sequencing and comparative analysis of the Macaca mulatta transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (M. mulatta, M. fascicularis, and M. nemestrina) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within M. mulatta and sequence divergence among M. fascicularis, M. nemestrina, and M. mulatta are also reported.

  5. Hybrid In Silico/In Vitro Approaches for the Identification of Functional Cholesterol-Binding Domains in Membrane Proteins.

    PubMed

    Di Scala, Coralie; Fantini, Jacques

    2017-01-01

    In eukaryotic cells, cholesterol is an important regulator of a broad range of membrane proteins, including receptors, transporters, and ion channels. Understanding how cholesterol interacts with membrane proteins is a difficult task because structural data of these proteins complexed with cholesterol are scarce. Here, we describe a dual approach based on in silico studies of protein-cholesterol interactions, combined with physico-chemical measurements of protein insertion into cholesterol-containing monolayers. Our algorithm is validated through careful analysis of the effect of key mutations within and outside the predicted cholesterol-binding site. Our method is illustrated by a complete analysis of cholesterol-binding to Alzheimer's β-amyloid peptide, a protein that penetrates the plasma membrane of brain cells through a cholesterol-dependent process.

  6. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests.

    PubMed

    Firmino, Alexandre Augusto Pereira; Fonseca, Fernando Campos de Assis; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; Antonino de Souza, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects.

  7. Transcriptome Analysis in Cotton Boll Weevil (Anthonomus grandis) and RNA Interference in Insect Pests

    PubMed Central

    Coelho, Roberta Ramos; Antonino de Souza Jr, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas-Jr, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families’ data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects. PMID:24386449

  8. Maternal Plane of Nutrition during Late Gestation and Weaning Age Alter Angus × Simmental Offspring Longissimus Muscle Transcriptome and Intramuscular Fat

    PubMed Central

    Moisá, Sonia J.; Shike, Daniel W.; Shoup, Lindsay; Rodriguez-Zas, Sandra L.; Loor, Juan J.

    2015-01-01

    In model organisms both the nutrition of the mother and the young offspring could induce long-lasting transcriptional changes in tissues. In livestock, such changes could have important roles in determining nutrient use and meat quality. The main objective was to evaluate if plane of maternal nutrition during late-gestation and weaning age alter the offspring’s Longissimus muscle (LM) transcriptome, animal performance, and metabolic hormones. Whole-transcriptome microarray analysis was performed on LM samples of early (EW) and normal weaned (NW) Angus × Simmental calves born to grazing cows receiving no supplement [low plane of nutrition (LPN)] or 2.3 kg high-grain mix/day [medium plane of nutrition (MPN)] during the last 105 days of gestation. Biopsies of LM were harvested at 78 (EW), 187 (NW) and 354 (before slaughter) days of age. Despite greater feed intake in MPN offspring, blood insulin was greater in LPN offspring. Carcass intramuscular fat content was greater in EW offspring. Bioinformatics analysis of the transcriptome highlighted a modest overall response to maternal plane of nutrition, resulting in only 35 differentially expressed genes (DEG). However, weaning age and a high-grain diet (EW) strongly impacted the transcriptome (DEG = 167), especially causing a lipogenic program activation. In addition, between 78 and 187 days of age, EW steers had an activation of the innate immune system due presumably to macrophage infiltration of intramuscular fat. Between 187 and 354 days of age (the “finishing” phase), NW steers had an activation of the lipogenic transcriptome machinery, while EW steers had a clear inhibition through the epigenetic control of histone acetylases. Results underscored the need to conduct further studies to understand better the functional outcome of transcriptome changes induced in the offspring by pre- and post-natal nutrition. Additional knowledge on molecular and functional outcomes would help produce more efficient beef cattle. PMID:26153887

  9. De novo transcriptome assembly analysis of weed Apera spica-venti from seven tissues and growth stages.

    PubMed

    Babineau, Marielle; Mahmood, Khalid; Mathiassen, Solvejg K; Kudsk, Per; Kristensen, Michael

    2017-02-06

    Loose silky bentgrass (Apera spica-venti) is an important weed in Europe with a recent increase in herbicide resistance cases. The lack of genetic information about this noxious weed limits its biological understanding such as growth, reproduction, genetic variation, molecular ecology and metabolic herbicide resistance. This study produced a reference transcriptome for A. spica-venti from different tissues (leaf, root, stem) and various growth stages (seed at phenological stages 05, 07, 08, 09). The de novo assembly was performed on individual and combined dataset followed by functional annotations. Individual transcripts and gene families involved in metabolic based herbicide resistance were identified. Eight separate transcriptome assemblies were performed and compared. The combined transcriptome assembly consists of 83,349 contigs with an N50 and average contig length of 762 and 658 bp, respectively. This dataset contains 74,724 transcripts consisting of total 54,846,111 bp. Among them 94% had a homologue to UniProtKB, 73% retrieved a GO mapping, and 50% were functionally annotated. Compared with other grass species, A. spica-venti has 26% proteins in common to Brachypodium distachyon, and 41% to Lolium spp. Glycosyltransferases had the highest number of transcripts in each tissue followed by the cytochrome P450s. The GSTF1 and CYP89A2 transcripts were recovered from the majority of tissues and aligned at a maximum of 66 and 30% to proven herbicide resistant allele from Alopecurus myosuroides and Lolium rigidum, respectively. De novo transcriptome assembly enabled the generation of the first reference transcriptome of A. spica-venti. This can serve as stepping stone for understanding the metabolic herbicide resistance as well as the general biology of this problematic weed. Furthermore, this large-scale sequence data is a valuable scientific resource for comparative transcriptome analysis for Poaceae grasses.

  10. Maternal Plane of Nutrition during Late Gestation and Weaning Age Alter Angus × Simmental Offspring Longissimus Muscle Transcriptome and Intramuscular Fat.

    PubMed

    Moisá, Sonia J; Shike, Daniel W; Shoup, Lindsay; Rodriguez-Zas, Sandra L; Loor, Juan J

    2015-01-01

    In model organisms both the nutrition of the mother and the young offspring could induce long-lasting transcriptional changes in tissues. In livestock, such changes could have important roles in determining nutrient use and meat quality. The main objective was to evaluate if plane of maternal nutrition during late-gestation and weaning age alter the offspring's Longissimus muscle (LM) transcriptome, animal performance, and metabolic hormones. Whole-transcriptome microarray analysis was performed on LM samples of early (EW) and normal weaned (NW) Angus × Simmental calves born to grazing cows receiving no supplement [low plane of nutrition (LPN)] or 2.3 kg high-grain mix/day [medium plane of nutrition (MPN)] during the last 105 days of gestation. Biopsies of LM were harvested at 78 (EW), 187 (NW) and 354 (before slaughter) days of age. Despite greater feed intake in MPN offspring, blood insulin was greater in LPN offspring. Carcass intramuscular fat content was greater in EW offspring. Bioinformatics analysis of the transcriptome highlighted a modest overall response to maternal plane of nutrition, resulting in only 35 differentially expressed genes (DEG). However, weaning age and a high-grain diet (EW) strongly impacted the transcriptome (DEG = 167), especially causing a lipogenic program activation. In addition, between 78 and 187 days of age, EW steers had an activation of the innate immune system due presumably to macrophage infiltration of intramuscular fat. Between 187 and 354 days of age (the "finishing" phase), NW steers had an activation of the lipogenic transcriptome machinery, while EW steers had a clear inhibition through the epigenetic control of histone acetylases. Results underscored the need to conduct further studies to understand better the functional outcome of transcriptome changes induced in the offspring by pre- and post-natal nutrition. Additional knowledge on molecular and functional outcomes would help produce more efficient beef cattle.

  11. Global Landscape of a Co-Expressed Gene Network in Barley and its Application to Gene Discovery in Triticeae Crops

    PubMed Central

    Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

    2011-01-01

    Accumulated transcriptome data can be used to investigate regulatory networks of genes involved in various biological systems. Co-expression analysis data sets generated from comprehensively collected transcriptome data sets now represent efficient resources that are capable of facilitating the discovery of genes with closely correlated expression patterns. In order to construct a co-expression network for barley, we analyzed 45 publicly available experimental series, which are composed of 1,347 sets of GeneChip data for barley. On the basis of a gene-to-gene weighted correlation coefficient, we constructed a global barley co-expression network and classified it into clusters of subnetwork modules. The resulting clusters are candidates for functional regulatory modules in the barley transcriptome. To annotate each of the modules, we performed comparative annotation using genes in Arabidopsis and Brachypodium distachyon. On the basis of a comparative analysis between barley and two model species, we investigated functional properties from the representative distributions of the gene ontology (GO) terms. Modules putatively involved in drought stress response and cellulose biogenesis have been identified. These modules are discussed to demonstrate the effectiveness of the co-expression analysis. Furthermore, we applied the data set of co-expressed genes coupled with comparative analysis in attempts to discover potentially Triticeae-specific network modules. These results demonstrate that analysis of the co-expression network of the barley transcriptome together with comparative analysis should promote the process of gene discovery in barley. Furthermore, the insights obtained should be transferable to investigations of Triticeae plants. The associated data set generated in this analysis is publicly accessible at http://coexpression.psc.riken.jp/barley/. PMID:21441235

  12. Combined venomics, antivenomics and venom gland transcriptome analysis of the monocoled cobra (Naja kaouthia) from China.

    PubMed

    Xu, Ning; Zhao, Hong-Yan; Yin, Yin; Shen, Shan-Shan; Shan, Lin-Lin; Chen, Chuan-Xi; Zhang, Yan-Xia; Gao, Jian-Fang; Ji, Xiang

    2017-04-21

    We conducted an omics-analysis of the venom of Naja kaouthia from China. Proteomics analysis revealed six protein families [three-finger toxins (3-FTx), phospholipase A 2 (PLA 2 ), nerve growth factor, snake venom metalloproteinase (SVMP), cysteine-rich secretory protein and ohanin], and venom-gland transcriptomics analysis revealed 28 protein families from 79 unigenes. 3-FTx (56.5% in proteome/82.0% in transcriptome) and PLA 2 (26.9%/13.6%) were identified as the most abundant families in venom proteome and venom-gland transcriptome. Furthermore, N. kaouthia venom expressed strong lethality (i.p. LD 50 : 0.79μg/g) and myotoxicity (CK: 5939U/l) in mice, and showed notable activity in PLA 2 but weak activity in SVMP, l-amino acid oxidase or 5' nucleotidase. Antivenomic assessment revealed that several venom components (nearly 17.5% of total venom) from N. kaouthia could not be thoroughly immunocaptured by commercial Naja atra antivenom. ELISA analysis revealed that there was no difference in the cross-reaction between N. kaouthia and N. atra venoms against the N. atra antivenom. The use of commercial N. atra antivenom in treatment of snakebites caused by N. kaouthia is reasonable, but design of novel antivenom with the attention on enhancing the immune response of non-immunocaptured components should be encouraged. The venomics, antivenomics and venom-gland transcriptome of the monocoled cobra (Naja kaouthia) from China have been elucidated. Quantitative and qualitative differences are evident when venom proteomic and venom-gland transcriptomic profiles are compared. Two protein families (3-FTx and PLA 2 ) are found to be the predominated components in N. kaouthia venom, and considered as the major players in functional role of venom. Other protein families with relatively low abundance appear to be minor in the functional significance. Antivenomics and ELISA evaluation reveal that the N. kaouthia venom can be effectively immunorecognized by commercial N. atra antivenom, but still a small number of venom components could not be thoroughly immunocaptured. The findings indicate that exploring the precise composition of snake venom should be executed by an integrated omics-approach, and elucidating the venom composition is helpful in understanding composition-function relationships and will facilitate the clinical application of antivenoms. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. How to normalize metatranscriptomic count data for differential expression analysis.

    PubMed

    Klingenberg, Heiner; Meinicke, Peter

    2017-01-01

    Differential expression analysis on the basis of RNA-Seq count data has become a standard tool in transcriptomics. Several studies have shown that prior normalization of the data is crucial for a reliable detection of transcriptional differences. Until now it has not been clear whether and how the transcriptomic approach can be used for differential expression analysis in metatranscriptomics. We propose a model for differential expression in metatranscriptomics that explicitly accounts for variations in the taxonomic composition of transcripts across different samples. As a main consequence the correct normalization of metatranscriptomic count data under this model requires the taxonomic separation of the data into organism-specific bins. Then the taxon-specific scaling of organism profiles yields a valid normalization and allows us to recombine the scaled profiles into a metatranscriptomic count matrix. This matrix can then be analyzed with statistical tools for transcriptomic count data. For taxon-specific scaling and recombination of scaled counts we provide a simple R script. When applying transcriptomic tools for differential expression analysis directly to metatranscriptomic data with an organism-independent (global) scaling of counts the resulting differences may be difficult to interpret. The differences may correspond to changing functional profiles of the contributing organisms but may also result from a variation of taxonomic abundances. Taxon-specific scaling eliminates this variation and therefore the resulting differences actually reflect a different behavior of organisms under changing conditions. In simulation studies we show that the divergence between results from global and taxon-specific scaling can be drastic. In particular, the variation of organism abundances can imply a considerable increase of significant differences with global scaling. Also, on real metatranscriptomic data, the predictions from taxon-specific and global scaling can differ widely. Our studies indicate that in real data applications performed with global scaling it might be impossible to distinguish between differential expression in terms of transcriptomic changes and differential composition in terms of changing taxonomic proportions. As in transcriptomics, a proper normalization of count data is also essential for differential expression analysis in metatranscriptomics. Our model implies a taxon-specific scaling of counts for normalization of the data. The application of taxon-specific scaling consequently removes taxonomic composition variations from functional profiles and therefore provides a clear interpretation of the observed functional differences.

  14. De-novo assembly and characterization of the transcriptome of Metschnikowia fructicola reveals differences in gene expression following interaction with Penicillium digitatum and grapefruit peel

    USDA-ARS?s Scientific Manuscript database

    The yeast, Metschnikowia fructicola, is an antagonist with biological control activity against postharvest diseases of several fruits. We performed a transcriptome analysis, using RNA-Seq technology, to examine the response of M. fructicola with citrus fruit and with the postharvest pathogen, Penic...

  15. Gene Expression Analysis of Copper Tolerance and Wood Decay in the Brown Rot Fungus Fibroporia radiculosa

    Treesearch

    J. D. Tang; L. A. Parker; A. D. Perkins; T. S. Sonstegard; S. G. Schroeder; D. D. Nicholas; S. V. Diehl

    2013-01-01

    High-throughput transcriptomics was used to identify Fibroporia radiculosa genes that were differentially regulated during colonization of wood treated with a copper-based preservative. The transcriptome was profiled at two time points while the fungus was growing on wood treated with micronized copper quat (MCQ). A total of 917 transcripts were...

  16. Reliable transformation system for Microbotryum lychnidis-dioicae informed by genome and transcriptome project.

    PubMed

    Toh, Su San; Treves, David S; Barati, Michelle T; Perlin, Michael H

    2016-10-01

    Microbotryum lychnidis-dioicae is a member of a species complex infecting host plants in the Caryophyllaceae. It is used as a model system in many areas of research, but attempts to make this organism tractable for reverse genetic approaches have not been fruitful. Here, we exploited the recently obtained genome sequence and transcriptome analysis to inform our design of constructs for use in Agrobacterium-mediated transformation techniques currently available for other fungi. Reproducible transformation was demonstrated at the genomic, transcriptional and functional levels. Moreover, these initial proof-of-principle experiments provide evidence that supports the findings from initial global transcriptome analysis regarding expression from the respective promoters under different growth conditions of the fungus. The technique thus provides for the first time the ability to stably introduce transgenes and over-express target M. lychnidis-dioicae genes.

  17. Research Resource: A Reference Transcriptome for Constitutive Androstane Receptor and Pregnane X Receptor Xenobiotic Signaling

    PubMed Central

    Ochsner, Scott A.; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian

    2016-01-01

    The pregnane X receptor (PXR) (PXR/NR1I3) and constitutive androstane receptor (CAR) (CAR/NR1I2) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities. PMID:27409825

  18. Transcriptomic Studies of Malaria: a Paradigm for Investigation of Systemic Host-Pathogen Interactions

    PubMed Central

    2018-01-01

    SUMMARY Transcriptomics, the analysis of genome-wide RNA expression, is a common approach to investigate host and pathogen processes in infectious diseases. Technical and bioinformatic advances have permitted increasingly thorough analyses of the association of RNA expression with fundamental biology, immunity, pathogenesis, diagnosis, and prognosis. Transcriptomic approaches can now be used to realize a previously unattainable goal, the simultaneous study of RNA expression in host and pathogen, in order to better understand their interactions. This exciting prospect is not without challenges, especially as focus moves from interactions in vitro under tightly controlled conditions to tissue- and systems-level interactions in animal models and natural and experimental infections in humans. Here we review the contribution of transcriptomic studies to the understanding of malaria, a parasitic disease which has exerted a major influence on human evolution and continues to cause a huge global burden of disease. We consider malaria a paradigm for the transcriptomic assessment of systemic host-pathogen interactions in humans, because much of the direct host-pathogen interaction occurs within the blood, a readily sampled compartment of the body. We illustrate lessons learned from transcriptomic studies of malaria and how these lessons may guide studies of host-pathogen interactions in other infectious diseases. We propose that the potential of transcriptomic studies to improve the understanding of malaria as a disease remains partly untapped because of limitations in study design rather than as a consequence of technological constraints. Further advances will require the integration of transcriptomic data with analytical approaches from other scientific disciplines, including epidemiology and mathematical modeling. PMID:29695497

  19. Research Resource: A Reference Transcriptome for Constitutive Androstane Receptor and Pregnane X Receptor Xenobiotic Signaling.

    PubMed

    Ochsner, Scott A; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian; McKenna, Neil J

    2016-08-01

    The pregnane X receptor (PXR) (PXR/NR1I3) and constitutive androstane receptor (CAR) (CAR/NR1I2) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities.

  20. Transcriptomic Studies of Malaria: a Paradigm for Investigation of Systemic Host-Pathogen Interactions.

    PubMed

    Lee, Hyun Jae; Georgiadou, Athina; Otto, Thomas D; Levin, Michael; Coin, Lachlan J; Conway, David J; Cunnington, Aubrey J

    2018-06-01

    Transcriptomics, the analysis of genome-wide RNA expression, is a common approach to investigate host and pathogen processes in infectious diseases. Technical and bioinformatic advances have permitted increasingly thorough analyses of the association of RNA expression with fundamental biology, immunity, pathogenesis, diagnosis, and prognosis. Transcriptomic approaches can now be used to realize a previously unattainable goal, the simultaneous study of RNA expression in host and pathogen, in order to better understand their interactions. This exciting prospect is not without challenges, especially as focus moves from interactions in vitro under tightly controlled conditions to tissue- and systems-level interactions in animal models and natural and experimental infections in humans. Here we review the contribution of transcriptomic studies to the understanding of malaria, a parasitic disease which has exerted a major influence on human evolution and continues to cause a huge global burden of disease. We consider malaria a paradigm for the transcriptomic assessment of systemic host-pathogen interactions in humans, because much of the direct host-pathogen interaction occurs within the blood, a readily sampled compartment of the body. We illustrate lessons learned from transcriptomic studies of malaria and how these lessons may guide studies of host-pathogen interactions in other infectious diseases. We propose that the potential of transcriptomic studies to improve the understanding of malaria as a disease remains partly untapped because of limitations in study design rather than as a consequence of technological constraints. Further advances will require the integration of transcriptomic data with analytical approaches from other scientific disciplines, including epidemiology and mathematical modeling. Copyright © 2018 Lee et al.

  1. Digital Marine Bioprospecting: Mining New Neurotoxin Drug Candidates from the Transcriptomes of Cold-Water Sea Anemones

    PubMed Central

    Urbarova, Ilona; Karlsen, Bård Ove; Okkenhaug, Siri; Seternes, Ole Morten; Johansen, Steinar D.; Emblem, Åse

    2012-01-01

    Marine bioprospecting is the search for new marine bioactive compounds and large-scale screening in extracts represents the traditional approach. Here, we report an alternative complementary protocol, called digital marine bioprospecting, based on deep sequencing of transcriptomes. We sequenced the transcriptomes from the adult polyp stage of two cold-water sea anemones, Bolocera tuediae and Hormathia digitata. We generated approximately 1.1 million quality-filtered sequencing reads by 454 pyrosequencing, which were assembled into approximately 120,000 contigs and 220,000 single reads. Based on annotation and gene ontology analysis we profiled the expressed mRNA transcripts according to known biological processes. As a proof-of-concept we identified polypeptide toxins with a potential blocking activity on sodium and potassium voltage-gated channels from digital transcriptome libraries. PMID:23170083

  2. When Genomics Is Not Enough: Experimental Evidence for a Decrease in LINE-1 Activity During the Evolution of Australian Marsupials

    PubMed Central

    Gallus, Susanne; Lammers, Fritjof

    2016-01-01

    The autonomous transposable element LINE-1 is a highly abundant element that makes up between 15% and 20% of therian mammal genomes. Since their origin before the divergence of marsupials and placental mammals, LINE-1 elements have contributed actively to the genome landscape. A previous in silico screen of the Tasmanian devil genome revealed a lack of functional coding LINE-1 sequences. In this study we present the results of an in vitro analysis from a partial LINE-1 reverse transcriptase coding sequence in five marsupial species. Our experimental screen supports the in silico findings of the genome-wide degradation of LINE-1 sequences in the Tasmanian devil, and identifies a high frequency of degraded LINE-1 sequences in other Australian marsupials. The comparison between the experimentally obtained LINE-1 sequences and reference genome assemblies suggests that conclusions from in silico analyses of retrotransposition activity can be influenced by incomplete genome assemblies from short reads. PMID:27389686

  3. In Silico Systems Biology Analysis of Variants of Uncertain Significance in Lynch Syndrome Supports the Prioritization of Functional Molecular Validation.

    PubMed

    Borras, Ester; Chang, Kyle; Pande, Mala; Cuddy, Amanda; Bosch, Jennifer L; Bannon, Sarah A; Mork, Maureen E; Rodriguez-Bigas, Miguel A; Taggart, Melissa W; Lynch, Patrick M; You, Y Nancy; Vilar, Eduardo

    2017-10-01

    Lynch syndrome (LS) is a genetic condition secondary to germline alterations in the DNA mismatch repair (MMR) genes with 30% of changes being variants of uncertain significance (VUS). Our aim was to perform an in silico reclassification of VUS from a large single institutional cohort that will help prioritizing functional validation. A total of 54 VUS were detected with 33 (61%) novel variants. We integrated family history, pathology, and genetic information along with supporting evidence from eight different in silico tools at the RNA and protein level. Our assessment allowed us to reclassify 54% (29/54) of the VUS as probably damaging, 13% (7/54) as possibly damaging, and 28% (15/54) as probably neutral. There are more than 1,000 VUS reported in MMR genes and our approach facilitates the prioritization of further functional efforts to assess the pathogenicity to those classified as probably damaging. Cancer Prev Res; 10(10); 580-7. ©2017 AACR . ©2017 American Association for Cancer Research.

  4. Transcriptome analysis and metabolic profiling of green and red kale (Brassica oleracea var. acephala) seedlings.

    PubMed

    Jeon, Jin; Kim, Jae Kwang; Kim, HyeRan; Kim, Yeon Jeong; Park, Yun Ji; Kim, Sun Ju; Kim, Changsoo; Park, Sang Un

    2018-02-15

    Kale (Brassica oleracea var. acephala) is a rich source of numerous health-benefiting compounds, including vitamins, glucosinolates, phenolic compounds, and carotenoids. However, the genetic resources for exploiting the phyto-nutritional traits of kales are limited. To acquire precise information on secondary metabolites in kales, we performed a comprehensive analysis of the transcriptome and metabolome of green and red kale seedlings. Kale transcriptome datasets revealed 37,149 annotated genes and several secondary metabolite biosynthetic genes. HPLC analysis revealed 14 glucosinolates, 20 anthocyanins, 3 phenylpropanoids, and 6 carotenoids in the kale seedlings that were examined. Red kale contained more glucosinolates, anthocyanins, and phenylpropanoids than green kale, whereas the carotenoid contents were much higher in green kale than in red kale. Ultimately, our data will be a valuable resource for future research on kale bio-engineering and will provide basic information to define gene-to-metabolite networks in kale. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Transcriptome Meta-Analysis of Lung Cancer Reveals Recurrent Aberrations in NRG1 and Hippo Pathway Genes

    PubMed Central

    Dhanasekaran, Saravana M.; Balbin, O. Alejandro; Chen, Guoan; Nadal, Ernest; Kalyana-Sundaram, Shanker; Pan, Jincheng; Veeneman, Brendan; Cao, Xuhong; Malik, Rohit; Vats, Pankaj; Wang, Rui; Huang, Stephanie; Zhong, Jinjie; Jing, Xiaojun; Iyer, Matthew; Wu, Yi-Mi; Harms, Paul W.; Lin, Jules; Reddy, Rishindra; Brennan, Christine; Palanisamy, Nallasivam; Chang, Andrew C.; Truini, Anna; Truini, Mauro; Robinson, Dan R.; Beer, David G.; Chinnaiyan, Arul M.

    2014-01-01

    Lung cancer is emerging as a paradigm for disease molecular subtyping, facilitating targeted therapy based on driving somatic alterations. Here, we perform transcriptome analysis of 153 samples representing lung adenocarcinomas, squamous cell carcinomas, large cell lung cancer, adenoid cystic carcinomas and cell lines. By integrating our data with The Cancer Genome Atlas and published sources, we analyze 753 lung cancer samples for gene fusions and other transcriptomic alterations. We show that higher numbers of gene fusions is an independent prognostic factor for poor survival in lung cancer. Our analysis confirms the recently reported CD74-NRG1 fusion and suggests that NRG1, NF1 and Hippo pathway fusions may play important roles in tumors without known driver mutations. In addition, we observe exon skipping events in c-MET, which are attributable to splice site mutations. These classes of genetic aberrations may play a significant role in the genesis of lung cancers lacking known driver mutations. PMID:25531467

  6. In silico characterization of a novel pathogenic deletion mutation identified in XPA gene in a Pakistani family with severe xeroderma pigmentosum

    PubMed Central

    2013-01-01

    Background Xeroderma Pigmentosum (XP) is a rare skin disorder characterized by skin hypersensitivity to sunlight and abnormal pigmentation. The aim of this study was to investigate the genetic cause of a severe XP phenotype in a consanguineous Pakistani family and in silico characterization of any identified disease-associated mutation. Results The XP complementation group was assigned by genotyping of family for known XP loci. Genotyping data mapped the family to complementation group A locus, involving XPA gene. Mutation analysis of the candidate XP gene by DNA sequencing revealed a novel deletion mutation (c.654del A) in exon 5 of XPA gene. The c.654del A, causes frameshift, which pre-maturely terminates protein and result into a truncated product of 222 amino acid (aa) residues instead of 273 (p.Lys218AsnfsX5). In silico tools were applied to study the likelihood of changes in structural motifs and thus interaction of mutated protein with binding partners. In silico analysis of mutant protein sequence, predicted to affect the aa residue which attains coiled coil structure. The coiled coil structure has an important role in key cellular interactions, especially with DNA damage-binding protein 2 (DDB2), which has important role in DDB-mediated nucleotide excision repair (NER) system. Conclusions Our findings support the fact of genetic and clinical heterogeneity in XP. The study also predicts the critical role of DDB2 binding region of XPA protein in NER pathway and opens an avenue for further research to study the functional role of the mutated protein domain. PMID:24063568

  7. Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.

    PubMed

    Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong

    2014-05-01

    We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.

  8. RNA-seq analysis of Rubus idaeus cv. Nova: transcriptome sequencing and de novo assembly for subsequent functional genomics approaches.

    PubMed

    Hyun, Tae Kyung; Lee, Sarah; Kumar, Dhinesh; Rim, Yeonggil; Kumar, Ritesh; Lee, Sang Yeol; Lee, Choong Hwan; Kim, Jae-Yean

    2014-10-01

    Using Illumina sequencing technology, we have generated the large-scale transcriptome sequencing data containing abundant information on genes involved in the metabolic pathways in R. idaeus cv. Nova fruits. Rubus idaeus (Red raspberry) is one of the important economical crops that possess numerous nutrients, micronutrients and phytochemicals with essential health benefits to human. The molecular mechanism underlying the ripening process and phytochemical biosynthesis in red raspberry is attributed to the changes in gene expression, but very limited transcriptomic and genomic information in public databases is available. To address this issue, we generated more than 51 million sequencing reads from R. idaeus cv. Nova fruit using Illumina RNA-Seq technology. After de novo assembly, we obtained 42,604 unigenes with an average length of 812 bp. At the protein level, Nova fruit transcriptome showed 77 and 68 % sequence similarities with Rubus coreanus and Fragaria versa, respectively, indicating the evolutionary relationship between them. In addition, 69 % of assembled unigenes were annotated using public databases including NCBI non-redundant, Cluster of Orthologous Groups and Gene ontology database, suggesting that our transcriptome dataset provides a valuable resource for investigating metabolic processes in red raspberry. To analyze the relationship between several novel transcripts and the amounts of metabolites such as γ-aminobutyric acid and anthocyanins, real-time PCR and target metabolite analysis were performed on two different ripening stages of Nova. This is the first attempt using Illumina sequencing platform for RNA sequencing and de novo assembly of Nova fruit without reference genome. Our data provide the most comprehensive transcriptome resource available for Rubus fruits, and will be useful for understanding the ripening process and for breeding R. idaeus cultivars with improved fruit quality.

  9. Identification of Immune-Related Genes and Development of SSR/SNP Markers from the Spleen Transcriptome of Schizothorax prenanti.

    PubMed

    Luo, Hui; Xiao, Shijun; Ye, Hua; Zhang, Zhengshi; Lv, Changhuan; Zheng, Shuming; Wang, Zhiyong; Wang, Xiaoqing

    2016-01-01

    Schizothorax prenanti (S. prenanti) is mainly distributed in the upstream regions of the Yangtze River and its tributaries in China. This species is indigenous and commercially important. However, in recent years, wild populations and aquacultures have faced the serious challenges of germplasm variation loss and an increased susceptibility to a range of pathogens. Currently, the genetics and immune mechanisms of S. prenanti are unknown, partly due to a lack of genome and transcriptome information. Here, we sought to identify genes related to immune functions and to identify molecular markers to study the function of these genes and for trait mapping. To this end, the transcriptome from spleen tissues of S. prenanti was analyzed and sequenced. Using paired-end reads from the Illumina Hiseq2500 platform, 48,517 transcripts were isolated from the spleen transcriptome. These transcripts could be clustered into 37,785 unigenes with an N50 length of 2,539 bp. The majority of the unigenes (35,653, 94.4%) were successfully annotated using non-redundant nucleotide sequence analysis (nt), and the non-redundant protein (nr), Swiss-Prot, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. KEGG pathway assignment identified more than 500 immune-related genes. Furthermore, 7,545 putative simple sequence repeats (SSRs), 857,535 single nucleotide polymorphisms (SNPs), and 53,481 insertion/deletion (InDels) were detected from the transcriptome. This is the first reported high-throughput transcriptome analysis of S. prenanti, and it provides valuable genetic resources for the investigation of immune mechanisms, conservation of germplasm, and molecular marker-assisted breeding of S. prenanti.

  10. Mining genes involved in insecticide resistance of Liposcelis bostrychophila Badonnel by transcriptome and expression profile analysis.

    PubMed

    Dou, Wei; Shen, Guang-Mao; Niu, Jin-Zhi; Ding, Tian-Bo; Wei, Dan-Dan; Wang, Jin-Jun

    2013-01-01

    Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina). In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp) by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr) protein database, gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin) exposure using the tag-based digital gene expression (DGE) method. The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids.

  11. Mining Genes Involved in Insecticide Resistance of Liposcelis bostrychophila Badonnel by Transcriptome and Expression Profile Analysis

    PubMed Central

    Dou, Wei; Shen, Guang-Mao; Niu, Jin-Zhi; Ding, Tian-Bo; Wei, Dan-Dan; Wang, Jin-Jun

    2013-01-01

    Background Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. Methodology and Principal Findings In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina). In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp) by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr) protein database, gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin) exposure using the tag-based digital gene expression (DGE) method. Conclusion The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids. PMID:24278202

  12. Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments

    PubMed Central

    Maza, Elie; Frasse, Pierre; Senin, Pavel; Bouzayen, Mondher; Zouine, Mohamed

    2013-01-01

    In recent years, RNA-Seq technologies became a powerful tool for transcriptome studies. However, computational methods dedicated to the analysis of high-throughput sequencing data are yet to be standardized. In particular, it is known that the choice of a normalization procedure leads to a great variability in results of differential gene expression analysis. The present study compares the most widespread normalization procedures and proposes a novel one aiming at removing an inherent bias of studied transcriptomes related to their relative size. Comparisons of the normalization procedures are performed on real and simulated data sets. Real RNA-Seq data sets analyses, performed with all the different normalization methods, show that only 50% of significantly differentially expressed genes are common. This result highlights the influence of the normalization step on the differential expression analysis. Real and simulated data sets analyses give similar results showing 3 different groups of procedures having the same behavior. The group including the novel method named “Median Ratio Normalization” (MRN) gives the lower number of false discoveries. Within this group the MRN method is less sensitive to the modification of parameters related to the relative size of transcriptomes such as the number of down- and upregulated genes and the gene expression levels. The newly proposed MRN method efficiently deals with intrinsic bias resulting from relative size of studied transcriptomes. Validation with real and simulated data sets confirmed that MRN is more consistent and robust than existing methods. PMID:26442135

  13. Gonad Transcriptome Analysis of High-Temperature-Treated Females and High-Temperature-Induced Sex-Reversed Neomales in Nile Tilapia

    PubMed Central

    Sun, Li Xue; Teng, Jian; Zhao, Yan; Li, Ning; Wang, Hui

    2018-01-01

    Background: Nowadays, the molecular mechanisms governing TSD (temperature-dependent sex determination) or GSD + TE (genotypic sex determination + temperature effects) remain a mystery in fish. Methods: We developed three all-female families of Nile tilapia (Oreochromis niloticus), and the family with the highest male ratio after high-temperature treatment was used for transcriptome analysis. Results: First, gonadal histology analysis indicated that the histological morphology of control females (CF) was not significantly different from that of high-temperature-treated females (TF) at various development stages. However, the high-temperature treatment caused a lag of spermatogenesis in high-temperature-induced neomales (IM). Next, we sequenced the transcriptome of CF, TF, and IM Nile tilapia. 79, 11,117, and 11,000 differentially expressed genes (DEGs) were detected in the CF–TF, CF–IM, and TF–IM comparisons, respectively, and 44 DEGs showed identical expression changes in the CF–TF and CF–IM comparisons. Principal component analysis (PCA) indicated that three individuals in CF and three individuals in TF formed a cluster, and three individuals in IM formed a distinct cluster, which confirmed that the gonad transcriptome profile of TF was similar to that of CF and different from that of IM. Finally, six sex-related genes were validated by qRT-PCR. Conclusions: This study identifies a number of genes that may be involved in GSD + TE, which will be useful for investigating the molecular mechanisms of TSD or GSD + TE in fish. PMID:29495590

  14. Gonad Transcriptome Analysis of High-Temperature-Treated Females and High-Temperature-Induced Sex-Reversed Neomales in Nile Tilapia.

    PubMed

    Sun, Li Xue; Teng, Jian; Zhao, Yan; Li, Ning; Wang, Hui; Ji, Xiang Shan

    2018-02-28

    Nowadays, the molecular mechanisms governing TSD (temperature-dependent sex determination) or GSD + TE (genotypic sex determination + temperature effects) remain a mystery in fish. We developed three all-female families of Nile tilapia ( Oreochromis niloticus ), and the family with the highest male ratio after high-temperature treatment was used for transcriptome analysis. First, gonadal histology analysis indicated that the histological morphology of control females (CF) was not significantly different from that of high-temperature-treated females (TF) at various development stages. However, the high-temperature treatment caused a lag of spermatogenesis in high-temperature-induced neomales (IM). Next, we sequenced the transcriptome of CF, TF, and IM Nile tilapia. 79, 11,117, and 11,000 differentially expressed genes (DEGs) were detected in the CF-TF, CF-IM, and TF-IM comparisons, respectively, and 44 DEGs showed identical expression changes in the CF-TF and CF-IM comparisons. Principal component analysis (PCA) indicated that three individuals in CF and three individuals in TF formed a cluster, and three individuals in IM formed a distinct cluster, which confirmed that the gonad transcriptome profile of TF was similar to that of CF and different from that of IM. Finally, six sex-related genes were validated by qRT-PCR. This study identifies a number of genes that may be involved in GSD + TE, which will be useful for investigating the molecular mechanisms of TSD or GSD + TE in fish.

  15. Comparative transcriptome analysis of ginger variety Suprabha from two different agro-climatic zones of Odisha.

    PubMed

    Gaur, Mahendra; Das, Aradhana; Sahoo, Rajesh Kumar; Mohanty, Sujata; Joshi, Raj Kumar; Subudhi, Enketeswara

    2016-09-01

    Ginger (Zingiber officinale Rosc.), a well-known member of family Zingiberaceae, is bestowed with number of medicinal properties which is because of the secondary metabolites, essential oil and oleoresin, it contains in its rhizome. The drug yielding potential is known to depend on agro-climatic conditions prevailing at the place cultivation. Present study deals with comparative transcriptome analysis of two sample of elite ginger variety Suprabha collected from two different agro-climatic zones of Odisha. Transcriptome assembly for both the samples was done using next generation sequencing methodology. The raw data of size 10.8 and 11.8 GB obtained from analysis of two rhizomes S1Z4 and S2Z5 collected from Bhubaneswar and Koraput and are available in NCBI accession number SAMN03761169 and SAMN03761176 respectively. We identified 60,452 and 54,748 transcripts using trinity tool respectively from ginger rhizome of S1Z4 and S2Z5. The transcript length varied from 300 bp to 15,213 bp and 8988 bp and N50 value of 1415 bp and 1334 bp respectively for S1Z4 and S2Z5. To the best of our knowledge, this is the first comparative transcriptome analysis of elite ginger cultivars Suprabha from two different agro-climatic conditions of Odisha, India which will help to understand the effect of agro-climatic conditions on differential expression of secondary metabolites.

  16. Transcriptome Profile Analysis of Breast Muscle Tissues from High or Low Levels of Atmospheric Ammonia Exposed Broilers (Gallus gallus)

    PubMed Central

    Sa, Renna; Zhong, Ruqing; Xing, Huan; Zhang, Hongfu

    2016-01-01

    Atmospheric ammonia is a common problem in poultry industry. High concentrations of aerial ammonia cause great harm to broilers' health and production. For the consideration of human health, the limit exposure concentration of ammonia in houses is set at 25 ppm. Previous reports have shown that 25 ppm is still detrimental to livestock, especially the gastrointestinal tract and respiratory tract, but the negative relationship between ammonia exposure and the tissue of breast muscle of broilers is still unknown. In the present study, 25 ppm ammonia in poultry houses was found to lower slaughter performance and breast yield. Then, high-throughput RNA sequencing was utilized to identify differentially expressed genes in breast muscle of broiler chickens exposed to high (25 ppm) or low (3 ppm) levels of atmospheric ammonia. The transcriptome analysis showed that 163 genes (fold change ≥ 2 or ≤ 0.5; P-value < 0.05) were differentially expressed between Ammonia25 (treatment group) and Ammonia3 (control group), including 96 down-regulated and 67 up-regulated genes. qRT-PCR analysis validated the transcriptomic results of RNA sequencing. Gene Ontology (GO) functional annotation analysis revealed potential genes, processes and pathways with putative involvement in growth and development inhibition of breast muscle in broilers caused by aerial ammonia exposure. This study facilitates understanding of the genetic architecture of the chicken breast muscle transcriptome, and has identified candidate genes for breast muscle response to atmospheric ammonia exposure. PMID:27611572

  17. Gene expression analysis of induced pluripotent stem cells from aneuploid chromosomal syndromes

    PubMed Central

    2013-01-01

    Background Human aneuploidy is the leading cause of early pregnancy loss, mental retardation, and multiple congenital anomalies. Due to the high mortality associated with aneuploidy, the pathophysiological mechanisms of aneuploidy syndrome remain largely unknown. Previous studies focused mostly on whether dosage compensation occurs, and the next generation transcriptomics sequencing technology RNA-seq is expected to eventually uncover the mechanisms of gene expression regulation and the related pathological phenotypes in human aneuploidy. Results Using next generation transcriptomics sequencing technology RNA-seq, we profiled the transcriptomes of four human aneuploid induced pluripotent stem cell (iPSC) lines generated from monosomy × (Turner syndrome), trisomy 8 (Warkany syndrome 2), trisomy 13 (Patau syndrome), and partial trisomy 11:22 (Emanuel syndrome) as well as two umbilical cord matrix iPSC lines as euploid controls to examine how phenotypic abnormalities develop with aberrant karyotype. A total of 466 M (50-bp) reads were obtained from the six iPSC lines, and over 13,000 mRNAs were identified by gene annotation. Global analysis of gene expression profiles and functional analysis of differentially expressed (DE) genes were implemented. Over 5000 DE genes are determined between aneuploidy and euploid iPSCs respectively while 9 KEGG pathways are overlapped enriched in four aneuploidy samples. Conclusions Our results demonstrate that the extra or missing chromosome has extensive effects on the whole transcriptome. Functional analysis of differentially expressed genes reveals that the genes most affected in aneuploid individuals are related to central nervous system development and tumorigenesis. PMID:24564826

  18. Histological and Transcriptomic Analysis during Bulbil Formation in Lilium lancifolium

    PubMed Central

    Yang, Panpan; Xu, Leifeng; Xu, Hua; Tang, Yuchao; He, Guoren; Cao, Yuwei; Feng, Yayan; Yuan, Suxia; Ming, Jun

    2017-01-01

    Aerial bulbils are an important propagative organ, playing an important role in population expansion. However, the detailed gene regulatory patterns and molecular mechanism underlying bulbil formation remain unclear. Triploid Lilium lancifolium, which develops many aerial bulbils on the leaf axils of middle-upper stem, is a useful species for investigating bulbil formation. To investigate the mechanism of bulbil formation in triploid L. lancifolium, we performed histological and transcriptomic analyses using samples of leaf axils located in the upper and lower stem of triploid L. lancifolium during bulbil formation. Histological results indicated that the bulbils of triploid L. lancifolium are derived from axillary meristems that initiate de novo from cells on the adaxial side of the petiole base. Transcriptomic analysis generated ~650 million high-quality reads and 11,871 differentially expressed genes (DEGs). Functional analysis showed that the DEGs were significantly enriched in starch and sucrose metabolism and plant hormone signal transduction. Starch synthesis and accumulation likely promoted the initiation of upper bulbils in triploid L. lancifolium. Hormone-associated pathways exhibited distinct patterns of change in each sample. Auxin likely promoted the initiation of bulbils and then inhibited further bulbil formation. High biosynthesis and low degradation of cytokinin might have led to bulbil formation in the upper leaf axil. The present study achieved a global transcriptomic analysis focused on gene expression changes and pathways' enrichment during upper bulbil formation in triploid L. lancifolium, laying a solid foundation for future molecular studies on bulbil formation. PMID:28912794

  19. Prepartal Energy Intake Alters Blood Polymorphonuclear Leukocyte Transcriptome During the Peripartal Period in Holstein Cows

    PubMed Central

    Agrawal, A; Khan, MJ; Graugnard, DE; Vailati-Riboni, M; Rodriguez-Zas, SL; Osorio, JS; Loor, JJ

    2017-01-01

    In the dairy industry, cow health and farmer profits depend on the balance between diet (ie, nutrient composition, daily intake) and metabolism. This is especially true during the transition period, where dramatic physiological changes foster vulnerability to immunosuppression, negative energy balance, and clinical and subclinical disorders. Using an Agilent microarray platform, this study examined changes in the transcriptome of bovine polymorphonuclear leukocytes (PMNLs) due to prepartal dietary intake. Holstein cows were fed a high-straw, control-energy diet (CON; NEL = 1.34 Mcal/kg) or overfed a moderate-energy diet (OVE; NEL = 1.62 Mcal/kg) during the dry period. Blood for PMNL isolation and metabolite analysis was collected at −14 and +7 days relative to parturition. At an analysis of variance false discovery rate <0.05, energy intake (OVE vs CON) influenced 1806 genes. Dynamic Impact Approach bioinformatics analysis classified treatment effects on Kyoto Encyclopedia of Genes and Genomes pathways, including activated oxidative phosphorylation and biosynthesis of unsaturated fatty acids and inhibited RNA polymerase, proteasome, and toll-like receptor signaling pathway. This analysis indicates that processes critical for energy metabolism and cellular and immune function were affected with mixed results. However, overall interpretation of the transcriptome data agreed in part with literature documenting a potentially detrimental, chronic activation of PMNL in response to overfeeding. The widespread, transcriptome-level changes captured here confirm the importance of dietary energy adjustments around calving on the immune system. PMID:28579762

  20. De Novo Sequencing and Analysis of Lemongrass Transcriptome Provide First Insights into the Essential Oil Biosynthesis of Aromatic Grasses.

    PubMed

    Meena, Seema; Kumar, Sarma R; Venkata Rao, D K; Dwivedi, Varun; Shilpashree, H B; Rastogi, Shubhra; Shasany, Ajit K; Nagegowda, Dinesh A

    2016-01-01

    Aromatic grasses of the genus Cymbopogon (Poaceae family) represent unique group of plants that produce diverse composition of monoterpene rich essential oils, which have great value in flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding the essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition.

  1. Transcriptomic insights on the ABC transporter gene family in the salmon louse Caligus rogercresseyi.

    PubMed

    Valenzuela-Muñoz, Valentina; Sturm, Armin; Gallardo-Escárate, Cristian

    2015-04-09

    ATP-binding cassette (ABC) protein family encode for membrane proteins involved in the transport of various biomolecules through the cellular membrane. These proteins have been identified in all taxa and present important physiological functions, including the process of insecticide detoxification in arthropods. For that reason the ectoparasite Caligus rogercresseyi represents a model species for understanding the molecular underpinnings involved in insecticide drug resistance. llumina sequencing was performed using sea lice exposed to 2 and 3 ppb of deltamethrin and azamethiphos. Contigs obtained from de novo assembly were annotated by Blastx. RNA-Seq analysis was performed and validated by qPCR analysis. From the transcriptome database of C. rogercresseyi, 57 putative members of ABC protein sequences were identified and phylogenetically classified into the eight subfamilies described for ABC transporters in arthropods. Transcriptomic profiles for ABC proteins subfamilies were evaluated throughout C. rogercresseyi development. Moreover, RNA-Seq analysis was performed for adult male and female salmon lice exposed to the delousing drugs azamethiphos and deltamethrin. High transcript levels of the ABCB and ABCC subfamilies were evidenced. Furthermore, SNPs mining was carried out for the ABC proteins sequences, revealing pivotal genomic information. The present study gives a comprehensive transcriptome analysis of ABC proteins from C. rogercresseyi, providing relevant information about transporter roles during ontogeny and in relation to delousing drug responses in salmon lice. This genomic information represents a valuable tool for pest management in the Chilean salmon aquaculture industry.

  2. De Novo Sequencing and Analysis of Lemongrass Transcriptome Provide First Insights into the Essential Oil Biosynthesis of Aromatic Grasses

    PubMed Central

    Meena, Seema; Kumar, Sarma R.; Venkata Rao, D. K.; Dwivedi, Varun; Shilpashree, H. B.; Rastogi, Shubhra; Shasany, Ajit K.; Nagegowda, Dinesh A.

    2016-01-01

    Aromatic grasses of the genus Cymbopogon (Poaceae family) represent unique group of plants that produce diverse composition of monoterpene rich essential oils, which have great value in flavor, fragrance, cosmetic, and aromatherapy industries. Despite the commercial importance of these natural aromatic oils, their biosynthesis at the molecular level remains unexplored. As the first step toward understanding the essential oil biosynthesis, we performed de novo transcriptome assembly and analysis of C. flexuosus (lemongrass) by employing Illumina sequencing. Mining of transcriptome data and subsequent phylogenetic analysis led to identification of terpene synthases, pyrophosphatases, alcohol dehydrogenases, aldo-keto reductases, carotenoid cleavage dioxygenases, alcohol acetyltransferases, and aldehyde dehydrogenases, which are potentially involved in essential oil biosynthesis. Comparative essential oil profiling and mRNA expression analysis in three Cymbopogon species (C. flexuosus, aldehyde type; C. martinii, alcohol type; and C. winterianus, intermediate type) with varying essential oil composition indicated the involvement of identified candidate genes in the formation of alcohols, aldehydes, and acetates. Molecular modeling and docking further supported the role of identified protein sequences in aroma formation in Cymbopogon. Also, simple sequence repeats were found in the transcriptome with many linked to terpene pathway genes including the genes potentially involved in aroma biosynthesis. This work provides the first insights into the essential oil biosynthesis of aromatic grasses, and the identified candidate genes and markers can be a great resource for biotechnological and molecular breeding approaches to modulate the essential oil composition. PMID:27516768

  3. Integrated analysis of whole-exome sequencing and transcriptome profiling in males with autism spectrum disorders.

    PubMed

    Codina-Solà, Marta; Rodríguez-Santiago, Benjamín; Homs, Aïda; Santoyo, Javier; Rigau, Maria; Aznar-Laín, Gemma; Del Campo, Miguel; Gener, Blanca; Gabau, Elisabeth; Botella, María Pilar; Gutiérrez-Arumí, Armand; Antiñolo, Guillermo; Pérez-Jurado, Luis Alberto; Cuscó, Ivon

    2015-01-01

    Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders with high heritability. Recent findings support a highly heterogeneous and complex genetic etiology including rare de novo and inherited mutations or chromosomal rearrangements as well as double or multiple hits. We performed whole-exome sequencing (WES) and blood cell transcriptome by RNAseq in a subset of male patients with idiopathic ASD (n = 36) in order to identify causative genes, transcriptomic alterations, and susceptibility variants. We detected likely monogenic causes in seven cases: five de novo (SCN2A, MED13L, KCNV1, CUL3, and PTEN) and two inherited X-linked variants (MAOA and CDKL5). Transcriptomic analyses allowed the identification of intronic causative mutations missed by the usual filtering of WES and revealed functional consequences of some rare mutations. These included aberrant transcripts (PTEN, POLR3C), deregulated expression in 1.7% of mutated genes (that is, SEMA6B, MECP2, ANK3, CREBBP), allele-specific expression (FUS, MTOR, TAF1C), and non-sense-mediated decay (RIT1, ALG9). The analysis of rare inherited variants showed enrichment in relevant pathways such as the PI3K-Akt signaling and the axon guidance. Integrative analysis of WES and blood RNAseq data has proven to be an efficient strategy to identify likely monogenic forms of ASD (19% in our cohort), as well as additional rare inherited mutations that can contribute to ASD risk in a multifactorial manner. Blood transcriptomic data, besides validating 88% of expressed variants, allowed the identification of missed intronic mutations and revealed functional correlations of genetic variants, including changes in splicing, expression levels, and allelic expression.

  4. Comparative Transcriptome Analysis of the Accessory Sex Gland and Testis from the Chinese Mitten Crab (Eriocheir sinensis)

    PubMed Central

    He, Lin; Jiang, Hui; Cao, Dandan; Liu, Lihua; Hu, Songnian; Wang, Qun

    2013-01-01

    The accessory sex gland (ASG) is an important component of the male reproductive system, which functions to enhance the fertility of spermatozoa during male reproduction. Certain proteins secreted by the ASG are known to bind to the spermatozoa membrane and affect its function. The ASG gene expression profile in Chinese mitten crab (Eriocheir sinensis) has not been extensively studied, and limited genetic research has been conducted on this species. The advent of high-throughput sequencing technologies enables the generation of genomic resources within a short period of time and at minimal cost. In the present study, we performed de novo transcriptome sequencing to produce a comprehensive transcript dataset for the ASG of E. sinensis using Illumina sequencing technology. This analysis yielded a total of 33,221,284 sequencing reads, including 2.6 Gb of total nucleotides. Reads were assembled into 85,913 contigs (average 218 bp), or 58,567 scaffold sequences (average 292 bp), that identified 37,955 unigenes (average 385 bp). We assembled all unigenes and compared them with the published testis transcriptome from E. sinensis. In order to identify which genes may be involved in ASG function, as it pertains to modification of spermatozoa, we compared the ASG and testis transcriptome of E. sinensis. Our analysis identified specific genes with both higher and lower tissue expression levels in the two tissues, and the functions of these genes were analyzed to elucidate their potential roles during maturation of spermatozoa. Availability of detailed transcriptome data from ASG and testis in E. sinensis can assist our understanding of the molecular mechanisms involved with spermatozoa conservation, transport, maturation and capacitation and potentially acrosome activation. PMID:23342039

  5. Identification of Putative Precursor Genes for the Biosynthesis of Cannabinoid-Like Compound in Radula marginata

    PubMed Central

    Hussain, Tajammul; Plunkett, Blue; Ejaz, Mahwish; Espley, Richard V.; Kayser, Oliver

    2018-01-01

    The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.

  6. Transcriptomic responses to wounding: meta-analysis of gene expression microarray data.

    PubMed

    Sass, Piotr Andrzej; Dąbrowski, Michał; Charzyńska, Agata; Sachadyn, Paweł

    2017-11-07

    A vast amount of microarray data on transcriptomic response to injury has been collected so far. We designed the analysis in order to identify the genes displaying significant changes in expression after wounding in different organisms and tissues. This meta-analysis is the first study to compare gene expression profiles in response to wounding in as different tissues as heart, liver, skin, bones, and spinal cord, and species, including rat, mouse and human. We collected available microarray transcriptomic profiles obtained from different tissue injury experiments and selected the genes showing a minimum twofold change in expression in response to wounding in prevailing number of experiments for each of five wound healing stages we distinguished: haemostasis & early inflammation, inflammation, early repair, late repair and remodelling. During the initial phases after wounding, haemostasis & early inflammation and inflammation, the transcriptomic responses showed little consistency between different tissues and experiments. For the later phases, wound repair and remodelling, we identified a number of genes displaying similar transcriptional responses in all examined tissues. As revealed by ontological analyses, activation of certain pathways was rather specific for selected phases of wound healing, such as e.g. responses to vitamin D pronounced during inflammation. Conversely, we observed induction of genes encoding inflammatory agents and extracellular matrix proteins in all wound healing phases. Further, we selected several genes differentially upregulated throughout different stages of wound response, including established factors of wound healing in addition to those previously unreported  in this context such as PTPRC and AQP4. We found that transcriptomic responses to wounding showed similar traits in a diverse selection of tissues including skin, muscles, internal organs and nervous system. Notably, we distinguished transcriptional induction of inflammatory genes not only in the initial response to wounding, but also later, during wound repair and tissue remodelling.

  7. Acclimation of Antarctic Chlamydomonas to the sea-ice environment: a transcriptomic analysis.

    PubMed

    Liu, Chenlin; Wang, Xiuliang; Wang, Xingna; Sun, Chengjun

    2016-07-01

    The Antarctic green alga Chlamydomonas sp. ICE-L was isolated from sea ice. As a psychrophilic microalga, it can tolerate the environmental stress in the sea-ice brine, such as freezing temperature and high salinity. We performed a transcriptome analysis to identify freezing stress responding genes and explore the extreme environmental acclimation-related strategies. Here, we show that many genes in ICE-L transcriptome that encoding PUFA synthesis enzymes, molecular chaperon proteins, and cell membrane transport proteins have high similarity to the gens from Antarctic bacteria. These ICE-L genes are supposed to be acquired through horizontal gene transfer from its symbiotic microbes in the sea-ice brine. The presence of these genes in both sea-ice microalgae and bacteria indicated the biological processes they involved in are possibly contributing to ICE-L success in sea ice. In addition, the biological pathways were compared between ICE-L and its closely related sister species, Chlamydomonas reinhardtii and Volvox carteri. In ICE-L transcripome, many sequences homologous to the plant or bacteria proteins in the post-transcriptional, post-translational modification, and signal-transduction KEGG pathways, are absent in the nonpsychrophilic green algae. These complex structural components might imply enhanced stress adaptation capacity. At last, differential gene expression analysis at the transcriptome level of ICE-L indicated that genes that associated with post-translational modification, lipid metabolism, and nitrogen metabolism are responding to the freezing treatment. In conclusion, the transcriptome of Chlamydomonas sp. ICE-L is very useful for exploring the mutualistic interaction between microalgae and bacteria in sea ice; and discovering the specific genes and metabolism pathways responding to the freezing acclimation in psychrophilic microalgae.

  8. Revisiting venom of the sea anemone Stichodactyla haddoni: Omics techniques reveal the complete toxin arsenal of a well-studied sea anemone genus.

    PubMed

    Madio, Bruno; Undheim, Eivind A B; King, Glenn F

    2017-08-23

    More than a century of research on sea anemone venoms has shown that they contain a diversity of biologically active proteins and peptides. However, recent omics studies have revealed that much of the venom proteome remains unexplored. We used, for the first time, a combination of proteomic and transcriptomic techniques to obtain a holistic overview of the venom arsenal of the well-studied sea anemone Stichodactyla haddoni. A purely search-based approach to identify putative toxins in a transcriptome from tentacles regenerating after venom extraction identified 508 unique toxin-like transcripts grouped into 63 families. However, proteomic analysis of venom revealed that 52 of these toxin families are likely false positives. In contrast, the combination of transcriptomic and proteomic data enabled positive identification of 23 families of putative toxins, 12 of which have no homology known proteins or peptides. Our data highlight the importance of using proteomics of milked venom to correctly identify venom proteins/peptides, both known and novel, while minimizing false positive identifications from non-toxin homologues identified in transcriptomes of venom-producing tissues. This work lays the foundation for uncovering the role of individual toxins in sea anemone venom and how they contribute to the envenomation of prey, predators, and competitors. Proteomic analysis of milked venom combined with analysis of a tentacle transcriptome revealed the full extent of the venom arsenal of the sea anemone Stichodactyla haddoni. This combined approach led to the discovery of 12 entirely new families of disulfide-rich peptides and proteins in a genus of anemones that have been studied for over a century. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Analysis of the Salivary Gland Transcriptome of Frankliniella occidentalis

    PubMed Central

    Stafford-Banks, Candice A.; Rotenberg, Dorith; Johnson, Brian R.; Whitfield, Anna E.; Ullman, Diane E.

    2014-01-01

    Saliva is known to play a crucial role in insect feeding behavior and virus transmission. Currently, little is known about the salivary glands and saliva of thrips, despite the fact that Frankliniella occidentalis (Pergande) (the western flower thrips) is a serious pest due to its destructive feeding, wide host range, and transmission of tospoviruses. As a first step towards characterizing thrips salivary gland functions, we sequenced the transcriptome of the primary salivary glands of F. occidentalis using short read sequencing (Illumina) technology. A de novo-assembled transcriptome revealed 31,392 high quality contigs with an average size of 605 bp. A total of 12,166 contigs had significant BLASTx or tBLASTx hits (E≤1.0E−6) to known proteins, whereas a high percentage (61.24%) of contigs had no apparent protein or nucleotide hits. Comparison of the F. occidentalis salivary gland transcriptome (sialotranscriptome) against a published F. occidentalis full body transcriptome assembled from Roche-454 reads revealed several contigs with putative annotations associated with salivary gland functions. KEGG pathway analysis of the sialotranscriptome revealed that the majority (18 out of the top 20 predicted KEGG pathways) of the salivary gland contig sequences match proteins involved in metabolism. We identified several genes likely to be involved in detoxification and inhibition of plant defense responses including aldehyde dehydrogenase, metalloprotease, glucose oxidase, glucose dehydrogenase, and regucalcin. We also identified several genes that may play a role in the extra-oral digestion of plant structural tissues including β-glucosidase and pectin lyase; and the extra-oral digestion of sugars, including α-amylase, maltase, sucrase, and α-glucosidase. This is the first analysis of a sialotranscriptome for any Thysanopteran species and it provides a foundational tool to further our understanding of how thrips interact with their plant hosts and the viruses they transmit. PMID:24736614

  10. Analysis of the salivary gland transcriptome of Frankliniella occidentalis.

    PubMed

    Stafford-Banks, Candice A; Rotenberg, Dorith; Johnson, Brian R; Whitfield, Anna E; Ullman, Diane E

    2014-01-01

    Saliva is known to play a crucial role in insect feeding behavior and virus transmission. Currently, little is known about the salivary glands and saliva of thrips, despite the fact that Frankliniella occidentalis (Pergande) (the western flower thrips) is a serious pest due to its destructive feeding, wide host range, and transmission of tospoviruses. As a first step towards characterizing thrips salivary gland functions, we sequenced the transcriptome of the primary salivary glands of F. occidentalis using short read sequencing (Illumina) technology. A de novo-assembled transcriptome revealed 31,392 high quality contigs with an average size of 605 bp. A total of 12,166 contigs had significant BLASTx or tBLASTx hits (E≤1.0E-6) to known proteins, whereas a high percentage (61.24%) of contigs had no apparent protein or nucleotide hits. Comparison of the F. occidentalis salivary gland transcriptome (sialotranscriptome) against a published F. occidentalis full body transcriptome assembled from Roche-454 reads revealed several contigs with putative annotations associated with salivary gland functions. KEGG pathway analysis of the sialotranscriptome revealed that the majority (18 out of the top 20 predicted KEGG pathways) of the salivary gland contig sequences match proteins involved in metabolism. We identified several genes likely to be involved in detoxification and inhibition of plant defense responses including aldehyde dehydrogenase, metalloprotease, glucose oxidase, glucose dehydrogenase, and regucalcin. We also identified several genes that may play a role in the extra-oral digestion of plant structural tissues including β-glucosidase and pectin lyase; and the extra-oral digestion of sugars, including α-amylase, maltase, sucrase, and α-glucosidase. This is the first analysis of a sialotranscriptome for any Thysanopteran species and it provides a foundational tool to further our understanding of how thrips interact with their plant hosts and the viruses they transmit.

  11. Transcriptomic immune response of Tenebrio molitor pupae to parasitization by Scleroderma guani.

    PubMed

    Zhu, Jia-Ying; Yang, Pu; Zhang, Zhong; Wu, Guo-Xing; Yang, Bin

    2013-01-01

    Host and parasitoid interaction is one of the most fascinating relationships of insects, which is currently receiving an increasing interest. Understanding the mechanisms evolved by the parasitoids to evade or suppress the host immune system is important for dissecting this interaction, while it was still poorly known. In order to gain insight into the immune response of Tenebrio molitor to parasitization by Scleroderma guani, the transcriptome of T. molitor pupae was sequenced with focus on immune-related gene, and the non-parasitized and parasitized T. molitor pupae were analyzed by digital gene expression (DGE) analysis with special emphasis on parasitoid-induced immune-related genes using Illumina sequencing. In a single run, 264,698 raw reads were obtained. De novo assembly generated 71,514 unigenes with mean length of 424 bp. Of those unigenes, 37,373 (52.26%) showed similarity to the known proteins in the NCBI nr database. Via analysis of the transcriptome data in depth, 430 unigenes related to immunity were identified. DGE analysis revealed that parasitization by S. guani had considerable impacts on the transcriptome profile of T. molitor pupae, as indicated by the significant up- or down-regulation of 3,431 parasitism-responsive transcripts. The expression of a total of 74 unigenes involved in immune response of T. molitor was significantly altered after parasitization. obtained T. molitor transcriptome, in addition to establishing a fundamental resource for further research on functional genomics, has allowed the discovery of a large group of immune genes that might provide a meaningful framework to better understand the immune response in this species and other beetles. The DGE profiling data provides comprehensive T. molitor immune gene expression information at the transcriptional level following parasitization, and sheds valuable light on the molecular understanding of the host-parasitoid interaction.

  12. Trichoderma reesei xylanase 5 is defective in the reference strain QM6a but functional alleles are present in other wild-type strains.

    PubMed

    Ramoni, Jonas; Marchetti-Deschmann, Martina; Seidl-Seiboth, Verena; Seiboth, Bernhard

    2017-05-01

    Trichoderma reesei is a paradigm for the regulation and industrial production of plant cell wall-degrading enzymes. Among these, five xylanases, including the glycoside hydrolase (GH) family 11 XYN1 and XYN2, the GH10 XYN3, and the GH30 XYN4 and XYN6, were described. By genome mining and transcriptome analysis, a further putative xylanase, encoded by xyn5, was identified. Analysis of xyn5 from the genome-sequenced reference strain T. reesei QM6a shows that it encodes a non-functional, truncated form of XYN5. However, non-truncated orthologues are present in other genome sequenced Trichoderma spp., and sequencing of xyn5 in other T. reesei wild-type isolates shows that they harbor a putative functional xyn5 allele. In silico analysis and 3D modeling revealed that the encoded XYN5 has significant structural similarities to xylanases of the GH11 family, including a GH-typical substrate binding groove and a carboxylate pair in the active site. The xyn5 of wild-type strain TUCIM1282 was recombinantly expressed in a T. reesei strain with a (hemi)cellulase-free background and the corresponding protein purified to apparent homogeneity. The pH and temperature optima and the kinetic parameters of the purified XYN5 were pH 4, 50 °C, and V max  = 2646 nkat/mg with a K m of 9.68 mg/ml. This functional xyn5 allele was used to replace the mutated version which led to an overall increase of the xylanolytic activity. These findings are of particular importance as GH11 xylanases are of high biotechnological relevance, and T. reesei is one of the main industrial producers of such lignocellulose-degrading enzymes.

  13. Genome-wide identification of translationally inhibited and degraded miR-155 targets using RNA-interacting protein-IP

    PubMed Central

    Meier, Jan; Hovestadt, Volker; Zapatka, Marc; Pscherer, Armin; Lichter, Peter; Seiffert, Martina

    2013-01-01

    MicroRNAs (miRNAs) are single-stranded, small, non-coding RNAs, which fine-tune protein expression by degrading and/or translationally inhibiting mRNAs. Manipulation of miRNA expression in animal models frequently results in severe phenotypes indicating their relevance in controlling cellular functions, most likely by interacting with multiple targets. To better understand the effect of miRNA activities, genome-wide analysis of their targets are required. MicroRNA profiling as well as transcriptome analysis upon enforced miRNA expression were frequently used to investigate their relevance. However, these approaches often fail to identify relevant miRNAs targets. Therefore, we tested the precision of RNA-interacting protein immunoprecipitation (RIP) using AGO2-specific antibodies, a core component of the “RNA-induced silencing complex” (RISC), followed by RNA sequencing (Seq) in a defined cellular system, the HEK293T cells with stable, ectopic expression of miR-155. Thereby, we identified 100 AGO2-associated mRNAs in miR-155-expressing cells, of which 67 were in silico predicted miR-155 target genes. An integrated analysis of the corresponding expression profiles indicated that these targets were either regulated by mRNA decay or by translational repression. Of the identified miR-155 targets, 17 were related to cell cycle control, suggesting their involvement in the observed increase in cell proliferation of HEK293T cells upon miR-155 expression. Additional, secondary changes within the gene expression profile were detected and might contribute to this phenotype as well. Interestingly, by analyzing RIP-Seq data of HEK-293T cells and two B-cell lines we identified a recurrent disproportional enrichment of several miRNAs, including miR-155 and miRNAs of the miR-17-92 cluster, in the AGO2-associated precipitates, suggesting discrepancies in miRNA expression and activity. PMID:23673373

  14. Knock down of Whitefly Gut Gene Expression and Mortality by Orally Delivered Gut Gene-Specific dsRNAs.

    PubMed

    Vyas, Meenal; Raza, Amir; Ali, Muhammad Yousaf; Ashraf, Muhammad Aleem; Mansoor, Shahid; Shahid, Ahmad Ali; Brown, Judith K

    2017-01-01

    Control of the whitefly Bemisia tabaci (Genn.) agricultural pest and plant virus vector relies on the use of chemical insecticides. RNA-interference (RNAi) is a homology-dependent innate immune response in eukaryotes, including insects, which results in degradation of the corresponding transcript following its recognition by a double-stranded RNA (dsRNA) that shares 100% sequence homology. In this study, six whitefly 'gut' genes were selected from an in silico-annotated transcriptome library constructed from the whitefly alimentary canal or 'gut' of the B biotype of B. tabaci, and tested for knock down efficacy, post-ingestion of dsRNAs that share 100% sequence homology to each respective gene target. Candidate genes were: Acetylcholine receptor subunit α, Alpha glucosidase 1, Aquaporin 1, Heat shock protein 70, Trehalase1, and Trehalose transporter1. The efficacy of RNAi knock down was further tested in a gene-specific functional bioassay, and mortality was recorded in 24 hr intervals, six days, post-treatment. Based on qPCR analysis, all six genes tested showed significantly reduced gene expression. Moderate-to-high whitefly mortality was associated with the down-regulation of osmoregulation, sugar metabolism and sugar transport-associated genes, demonstrating that whitefly survivability was linked with RNAi results. Silenced Acetylcholine receptor subunit α and Heat shock protein 70 genes showed an initial low whitefly mortality, however, following insecticide or high temperature treatments, respectively, significantly increased knockdown efficacy and death was observed, indicating enhanced post-knockdown sensitivity perhaps related to systemic silencing. The oral delivery of gut-specific dsRNAs, when combined with qPCR analysis of gene expression and a corresponding gene-specific bioassay that relates knockdown and mortality, offers a viable approach for functional genomics analysis and the discovery of prospective dsRNA biopesticide targets. The approach can be applied to functional genomics analyses to facilitate, species-specific dsRNA-mediated control of other non-model hemipterans.

  15. In-silico Metabolome Target Analysis Towards PanC-based Antimycobacterial Agent Discovery.

    PubMed

    Khoshkholgh-Sima, Baharak; Sardari, Soroush; Izadi Mobarakeh, Jalal; Khavari-Nejad, Ramezan Ali

    2015-01-01

    Mycobacterium tuberculosis, the main cause of tuberculosis (TB), has still remained a global health crisis especially in developing countries. Tuberculosis treatment is a laborious and lengthy process with high risk of noncompliance, cytotoxicity adverse events and drug resistance in patient. Recently, there has been an alarming rise of drug resistant in TB. In this regard, it is an unmet need to develop novel antitubercular medicines that target new or more effective biochemical pathways to prevent drug resistant Mycobacterium. Integrated study of metabolic pathways through in-silico approach played a key role in antimycobacterial design process in this study. Our results suggest that pantothenate synthetase (PanC), anthranilate phosphoribosyl transferase (TrpD) and 3-isopropylmalate dehydratase (LeuD) might be appropriate drug targets. In the next step, in-silico ligand analysis was used for more detailed study of chemical tractability of targets. This was helpful to identify pantothenate synthetase (PanC, Rv3602c) as the best target for antimycobacterial design procedure. Virtual library screening on the best ligand of PanC was then performed for inhibitory ligand design. At the end, five chemical intermediates showed significant inhibition of Mycobacterium bovis with good selectivity indices (SI) ≥10 according to Tuberculosis Antimicrobial Acquisition & Coordinating Facility of US criteria for antimycobacterial screening programs.

  16. In silico prediction of splice-altering single nucleotide variants in the human genome.

    PubMed

    Jian, Xueqiu; Boerwinkle, Eric; Liu, Xiaoming

    2014-12-16

    In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.

  17. Screening of mutations affecting protein stability and dynamics of FGFR1—A simulation analysis

    PubMed Central

    Doss, C. George Priya; Rajith, B.; Garwasis, Nimisha; Mathew, Pretty Raju; Raju, Anand Solomon; Apoorva, K.; William, Denise; Sadhana, N.R.; Himani, Tanwar; Dike, IP.

    2012-01-01

    Single amino acid substitutions in Fibroblast Growth Factor Receptor 1 (FGFR1) destabilize protein and have been implicated in several genetic disorders like various forms of cancer, Kallamann syndrome, Pfeiffer syndrome, Jackson Weiss syndrome, etc. In order to gain functional insight into mutation caused by amino acid substitution to protein function and expression, special emphasis was laid on molecular dynamics simulation techniques in combination with in silico tools such as SIFT, PolyPhen 2.0, I-Mutant 3.0 and SNAP. It has been estimated that 68% nsSNPs were predicted to be deleterious by I-Mutant, slightly higher than SIFT (37%), PolyPhen 2.0 (61%) and SNAP (58%). From the observed results, P722S mutation was found to be most deleterious by comparing results of all in silico tools. By molecular dynamics approach, we have shown that P722S mutation leads to increase in flexibility, and deviated more from the native structure which was supported by the decrease in the number of hydrogen bonds. In addition, biophysical analysis revealed a clear insight of stability loss due to P722S mutation in FGFR1 protein. Majority of mutations predicted by these in silico tools were in good concordance with the experimental results. PMID:27896051

  18. Screening of mutations affecting protein stability and dynamics of FGFR1-A simulation analysis.

    PubMed

    Doss, C George Priya; Rajith, B; Garwasis, Nimisha; Mathew, Pretty Raju; Raju, Anand Solomon; Apoorva, K; William, Denise; Sadhana, N R; Himani, Tanwar; Dike, I P

    2012-12-01

    Single amino acid substitutions in Fibroblast Growth Factor Receptor 1 ( FGFR1 ) destabilize protein and have been implicated in several genetic disorders like various forms of cancer, Kallamann syndrome, Pfeiffer syndrome, Jackson Weiss syndrome, etc. In order to gain functional insight into mutation caused by amino acid substitution to protein function and expression, special emphasis was laid on molecular dynamics simulation techniques in combination with in silico tools such as SIFT, PolyPhen 2.0, I-Mutant 3.0 and SNAP. It has been estimated that 68% nsSNPs were predicted to be deleterious by I-Mutant, slightly higher than SIFT (37%), PolyPhen 2.0 (61%) and SNAP (58%). From the observed results, P722S mutation was found to be most deleterious by comparing results of all in silico tools. By molecular dynamics approach, we have shown that P722S mutation leads to increase in flexibility, and deviated more from the native structure which was supported by the decrease in the number of hydrogen bonds. In addition, biophysical analysis revealed a clear insight of stability loss due to P722S mutation in FGFR1 protein. Majority of mutations predicted by these in silico tools were in good concordance with the experimental results.

  19. In silico characterization and expression analysis of the multigene family encoding the Bowman-Birk protease inhibitor in soybean.

    PubMed

    de Almeida Barros, Beatriz; da Silva, Wiliane Garcia; Moreira, Maurilio Alves; de Barros, Everaldo Gonçalves

    2012-01-01

    The Bowman-Birk (BBI) protease inhibitors can be used as source of sulfur amino acids, can regulate endogenous protease activity during seed germination and during the defense response of plants to pathogens. In soybean this family has not been fully described. The goal of this work was to characterize in silico and analyze the expression of the members of this family in soybean. We identified 11 potential BBI genes in the soybean genome. In each one of them at least a characteristic BBI conserved domain was detected in addition to a potential signal peptide. The sequences have been positioned in the soybean physical map and the promoter regions were analyzed with respect to known regulatory elements. Elements related to seed-specific expression and also to response to biotic and abiotic stresses have been identified. Based on the in silico analysis and also on quantitative RT-PCR data it was concluded that BBI-A, BBI-CII and BBI-DII are expressed specifically in the seed. The expression profiles of these three genes are similar along seed development. Their expressions reach a maximum in the intermediate stages and decrease as the seed matures. The BBI-DII transcripts are the most abundant ones followed by those of BBI-A and BBI-CII.

  20. Transcriptomic meta-analysis identifies gene expression characteristics in various samples of HIV-infected patients with nonprogressive disease.

    PubMed

    Zhang, Le-Le; Zhang, Zi-Ning; Wu, Xian; Jiang, Yong-Jun; Fu, Ya-Jing; Shang, Hong

    2017-09-12

    A small proportion of HIV-infected patients remain clinically and/or immunologically stable for years, including elite controllers (ECs) who have undetectable viremia (<50 copies/ml) and long-term nonprogressors (LTNPs) who maintain normal CD4 + T cell counts for prolonged periods (>10 years). However, the mechanism of nonprogression needs to be further resolved. In this study, a transcriptome meta-analysis was performed on nonprogressor and progressor microarray data to identify differential transcriptome pathways and potential biomarkers. Using the INMEX (integrative meta-analysis of expression data) program, we performed the meta-analysis to identify consistently differentially expressed genes (DEGs) in nonprogressors and further performed functional interpretation (gene ontology analysis and pathway analysis) of the DEGs identified in the meta-analysis. Five microarray datasets (81 cases and 98 controls in total), including whole blood, CD4 + and CD8 + T cells, were collected for meta-analysis. We determined that nonprogressors have reduced expression of important interferon-stimulated genes (ISGs), CD38, lymphocyte activation gene 3 (LAG-3) in whole blood, CD4 + and CD8 + T cells. Gene ontology (GO) analysis showed a significant enrichment in DEGs that function in the type I interferon signaling pathway. Upregulated pathways, including the PI3K-Akt signaling pathway in whole blood, cytokine-cytokine receptor interaction in CD4 + T cells and the MAPK signaling pathway in CD8 + T cells, were identified in nonprogressors compared with progressors. In each metabolic functional category, the number of downregulated DEGs was more than the upregulated DEGs, and almost all genes were downregulated DEGs in the oxidative phosphorylation (OXPHOS) and tricarboxylic acid (TCA) cycle in the three types of samples. Our transcriptomic meta-analysis provides a comprehensive evaluation of the gene expression profiles in major blood types of nonprogressors, providing new insights in the understanding of HIV pathogenesis and developing strategies to delay HIV disease progression.

Top