Science.gov

Sample records for genome scale transcriptome

  1. Genome scale transcriptomics of baculovirus-insect interactions.

    PubMed

    Nguyen, Quan; Nielsen, Lars K; Reid, Steven

    2013-11-12

    Baculovirus-insect cell technologies are applied in the production of complex proteins, veterinary and human vaccines, gene delivery vectors' and biopesticides. Better understanding of how baculoviruses and insect cells interact would facilitate baculovirus-based production. While complete genomic sequences are available for over 58 baculovirus species, little insect genomic information is known. The release of the Bombyx mori and Plutella xylostella genomes, the accumulation of EST sequences for several Lepidopteran species, and especially the availability of two genome-scale analysis tools, namely oligonucleotide microarrays and next generation sequencing (NGS), have facilitated expression studies to generate a rich picture of insect gene responses to baculovirus infections. This review presents current knowledge on the interaction dynamics of the baculovirus-insect system' which is relatively well studied in relation to nucleocapsid transportation, apoptosis, and heat shock responses, but is still poorly understood regarding responses involved in pro-survival pathways, DNA damage pathways, protein degradation, translation, signaling pathways, RNAi pathways, and importantly metabolic pathways for energy, nucleotide and amino acid production. We discuss how the two genome-scale transcriptomic tools can be applied for studying such pathways and suggest that proteomics and metabolomics can produce complementary findings to transcriptomic studies.

  2. Inferring the choreography of parental genomes during fertilization from ultralarge-scale whole-transcriptome analysis.

    PubMed

    Park, Sung-Joon; Komata, Makiko; Inoue, Fukashi; Yamada, Kaori; Nakai, Kenta; Ohsugi, Miho; Shirahige, Katsuhiko

    2013-12-15

    Fertilization precisely choreographs parental genomes by using gamete-derived cellular factors and activating genome regulatory programs. However, the mechanism remains elusive owing to the technical difficulties of preparing large numbers of high-quality preimplantation cells. Here, we collected >14 × 10(4) high-quality mouse metaphase II oocytes and used these to establish detailed transcriptional profiles for four early embryo stages and parthenogenetic development. By combining these profiles with other public resources, we found evidence that gene silencing appeared to be mediated in part by noncoding RNAs and that this was a prerequisite for post-fertilization development. Notably, we identified 817 genes that were differentially expressed in embryos after fertilization compared with parthenotes. The regulation of these genes was distinctly different from those expressed in parthenotes, suggesting functional specialization of particular transcription factors prior to first cell cleavage. We identified five transcription factors that were potentially necessary for developmental progression: Foxd1, Nkx2-5, Sox18, Myod1, and Runx1. Our very large-scale whole-transcriptome profile of early mouse embryos yielded a novel and valuable resource for studies in developmental biology and stem cell research. The database is available at http://dbtmee.hgc.jp.

  3. Genome-scale transcriptomic insights into early-stage fruit development in woodland strawberry Fragaria vesca.

    PubMed

    Kang, Chunying; Darwish, Omar; Geretz, Aviva; Shahan, Rachel; Alkharouf, Nadim; Liu, Zhongchi

    2013-06-01

    Fragaria vesca, a diploid woodland strawberry with a small and sequenced genome, is an excellent model for studying fruit development. The strawberry fruit is unique in that the edible flesh is actually enlarged receptacle tissue. The true fruit are the numerous dry achenes dotting the receptacle's surface. Auxin produced from the achene is essential for the receptacle fruit set, a paradigm for studying crosstalk between hormone signaling and development. To investigate the molecular mechanism underlying strawberry fruit set, next-generation sequencing was employed to profile early-stage fruit development with five fruit tissue types and five developmental stages from floral anthesis to enlarged fruits. This two-dimensional data set provides a systems-level view of molecular events with precise spatial and temporal resolution. The data suggest that the endosperm and seed coat may play a more prominent role than the embryo in auxin and gibberellin biosynthesis for fruit set. A model is proposed to illustrate how hormonal signals produced in the endosperm and seed coat coordinate seed, ovary wall, and receptacle fruit development. The comprehensive fruit transcriptome data set provides a wealth of genomic resources for the strawberry and Rosaceae communities as well as unprecedented molecular insight into fruit set and early stage fruit development.

  4. Genome-Scale Transcriptome Analysis of the Desert Shrub Artemisia sphaerocephala

    PubMed Central

    Zhang, Lijing; Hu, Xiaowei; Miao, Xiumei; Chen, Xiaolong; Nan, Shuzhen; Fu, Hua

    2016-01-01

    Background Artemisia sphaerocephala, a semi-shrub belonging to the Artemisia genus of the Compositae family, is an important pioneer plant that inhabits moving and semi-stable sand dunes in the deserts and steppes of northwest and north-central China. It is very resilient in extreme environments. Additionally, its seeds have excellent nutritional value, and the abundant lipids and polysaccharides in the seeds make this plant a potential valuable source of bio-energy. However, partly due to the scarcity of genetic information, the genetic mechanisms controlling the traits and environmental adaptation capacity of A. sphaerocephala are unknown. Results Here, we present the first in-depth transcriptomic analysis of A. sphaerocephala. To maximize the representation of conditional transcripts, mRNA was obtained from 17 samples, including living tissues of desert-growing A. sphaerocephala, seeds germinated in the laboratory, and calli subjected to no stress (control) and high and low temperature, high and low osmotic, and salt stresses. De novo transcriptome assembly performed using an Illumina HiSeq 2500 platform resulted in the generation of 68,373 unigenes. We analyzed the key genes involved in the unsaturated fatty acid synthesis pathway and identified 26 A. sphaerocephala fad2 genes, which is the largest fad2 gene family reported to date. Furthermore, a set of genes responsible for resistance to extreme temperatures, salt, drought and a combination of stresses was identified. Conclusion The present work provides abundant genomic information for functional dissection of the important traits of A. sphaerocephala and contributes to the current understanding of molecular adaptive mechanisms of A. sphaerocephala in the desert environment. Identification of the key genes in the unsaturated fatty acid synthesis pathway could increase understanding of the biological regulatory mechanisms of fatty acid composition traits in plants and facilitate genetic manipulation of the

  5. Genome-scale transcriptome analysis of the desert poplar, Populus euphratica.

    PubMed

    Qiu, Qiang; Ma, Tao; Hu, Quanjun; Liu, Bingbing; Wu, Yuxia; Zhou, Haihong; Wang, Qian; Wang, Juan; Liu, Jianquan

    2011-04-01

    Populus euphratica is well-adapted to extreme desert environments and is an important model species for studying the effects of abiotic stresses on trees. Here we present the first deep transcriptomic analysis of this species. To maximize representation of conditional transcripts, mRNA was obtained from living tissues of desert-grown trees and two types of callus (salt-stressed and unstressed). De novo assembly generated 86,777 Unigenes using Solexa sequence data. These sequences covered 92% of previously reported P. euphratica expressed sequence tags (ESTs) and 90% of the TIGR poplar ESTs, and a total of 58,499 high-quality unique sequences were annotated by BLAST similarity searches against public databases. We found that 27% of the total Unigenes were differentially expressed (up- or down-regulated) in response to salt stress in P. euphratica callus. These differentially expressed genes are mainly involved in transport, transcription, cellular communication and metabolism. In addition, we found that numerous putative genes involved in ABA regulation and biosynthesis were also differentially regulated. This study represents the deepest transcriptomic and gene-annotation analysis of P. euphratica to date. The genetic knowledge acquired should be very useful for future studies of the molecular adaptation of this tree species to abiotic stress and facilitate genetic manipulation of other poplar species.

  6. Illuminating the Transcriptome through the Genome.

    PubMed

    Elliott, David J

    2014-03-14

    Sequencing the human genome was a huge milestone in genetic research that revealed almost the total DNA sequence required to create a human being. However, in order to function, the DNA genome needs to be expressed as an RNA transcriptome. This article reviews how knowledge of genome sequence information has led to fundamental discoveries in how the transcriptome is processed, with a focus on new system-wide insights into how pre-mRNAs that are encoded by split genes in the genome are rearranged by splicing into functional mRNAs. These advances have been made possible by the development of new post-genome technologies to probe splicing patterns. Transcriptome-wide approaches have characterised a "splicing code" that is embedded within and has a significant role in deciphering the genome, and is deciphered by RNA binding proteins. These analyses have also found that most human genes encode multiple mRNA isoforms, and in some cases proteins, leading in turn to a re-assessment of what exactly a gene is. Analysis of the transcriptome has given insights into how the genome is packaged and transcribed, and is helping to explain important aspects of genome evolution.

  7. Illuminating the Transcriptome through the Genome

    PubMed Central

    Elliott, David J.

    2014-01-01

    Sequencing the human genome was a huge milestone in genetic research that revealed almost the total DNA sequence required to create a human being. However, in order to function, the DNA genome needs to be expressed as an RNA transcriptome. This article reviews how knowledge of genome sequence information has led to fundamental discoveries in how the transcriptome is processed, with a focus on new system-wide insights into how pre-mRNAs that are encoded by split genes in the genome are rearranged by splicing into functional mRNAs. These advances have been made possible by the development of new post-genome technologies to probe splicing patterns. Transcriptome-wide approaches have characterised a “splicing code” that is embedded within and has a significant role in deciphering the genome, and is deciphered by RNA binding proteins. These analyses have also found that most human genes encode multiple mRNA isoforms, and in some cases proteins, leading in turn to a re-assessment of what exactly a gene is. Analysis of the transcriptome has given insights into how the genome is packaged and transcribed, and is helping to explain important aspects of genome evolution. PMID:24705295

  8. Identification of candidate network hubs involved in metabolic adjustments of rice under drought stress by integrating transcriptome data and genome-scale metabolic network.

    PubMed

    Mohanty, Bijayalaxmi; Kitazumi, Ai; Cheung, C Y Maurice; Lakshmanan, Meiyappan; de los Reyes, Benildo G; Jang, In-Cheol; Lee, Dong-Yup

    2016-01-01

    In this study, we have integrated a rice genome-scale metabolic network and the transcriptome of a drought-tolerant rice line, DK151, to identify the major transcriptional regulators involved in metabolic adjustments necessary for adaptation to drought. This was achieved by examining the differential expressions of transcription factors and metabolic genes in leaf, root and young panicle of rice plants subjected to drought stress during tillering, booting and panicle elongation stages. Critical transcription factors such as AP2/ERF, bZIP, MYB and NAC that control the important nodes in the gene regulatory pathway were identified through correlative analysis of the patterns of spatio-temporal expression and cis-element enrichment. We showed that many of the candidate transcription factors involved in metabolic adjustments were previously linked to phenotypic variation for drought tolerance. This approach represents the first attempt to integrate models of transcriptional regulation and metabolic pathways for the identification of candidate regulatory genes for targeted selection in rice breeding.

  9. Optimal Scaling of Digital Transcriptomes

    PubMed Central

    Glusman, Gustavo; Caballero, Juan; Robinson, Max; Kutlu, Burak; Hood, Leroy

    2013-01-01

    Deep sequencing of transcriptomes has become an indispensable tool for biology, enabling expression levels for thousands of genes to be compared across multiple samples. Since transcript counts scale with sequencing depth, counts from different samples must be normalized to a common scale prior to comparison. We analyzed fifteen existing and novel algorithms for normalizing transcript counts, and evaluated the effectiveness of the resulting normalizations. For this purpose we defined two novel and mutually independent metrics: (1) the number of “uniform” genes (genes whose normalized expression levels have a sufficiently low coefficient of variation), and (2) low Spearman correlation between normalized expression profiles of gene pairs. We also define four novel algorithms, one of which explicitly maximizes the number of uniform genes, and compared the performance of all fifteen algorithms. The two most commonly used methods (scaling to a fixed total value, or equalizing the expression of certain ‘housekeeping’ genes) yielded particularly poor results, surpassed even by normalization based on randomly selected gene sets. Conversely, seven of the algorithms approached what appears to be optimal normalization. Three of these algorithms rely on the identification of “ubiquitous” genes: genes expressed in all the samples studied, but never at very high or very low levels. We demonstrate that these include a “core” of genes expressed in many tissues in a mutually consistent pattern, which is suitable for use as an internal normalization guide. The new methods yield robustly normalized expression values, which is a prerequisite for the identification of differentially expressed and tissue-specific genes as potential biomarkers. PMID:24223126

  10. International Standards for Genomes, Transcriptomes, and Metagenomes.

    PubMed

    Mason, Christopher E; Afshinnekoo, Ebrahim; Tighe, Scott; Wu, Shixiu; Levy, Shawn

    2017-03-17

    Challenges and biases in preparing, characterizing, and sequencing DNA and RNA can have significant impacts on research in genomics across all kingdoms of life, including experiments in single-cells, RNA profiling, and metagenomics (across multiple genomes). Technical artifacts and contamination can arise at each point of sample manipulation, extraction, sequencing, and analysis. Thus, the measurement and benchmarking of these potential sources of error are of paramount importance as next-generation sequencing (NGS) projects become more global and ubiquitous. Fortunately, a variety of methods, standards, and technologies have recently emerged that improve measurements in genomics and sequencing, from the initial input material to the computational pipelines that process and annotate the data. Here we review current standards and their applications in genomics, including whole genomes, transcriptomes, mixed genomic samples (metagenomes), and the modified bases within each (epigenomes and epitranscriptomes). These standards, tools, and metrics are critical for quantifying the accuracy of NGS methods, which will be essential for robust approaches in clinical genomics and precision medicine.

  11. International Standards for Genomes, Transcriptomes, and Metagenomes

    PubMed Central

    Mason, Christopher E.; Afshinnekoo, Ebrahim; Tighe, Scott; Wu, Shixiu; Levy, Shawn

    2017-01-01

    Challenges and biases in preparing, characterizing, and sequencing DNA and RNA can have significant impacts on research in genomics across all kingdoms of life, including experiments in single-cells, RNA profiling, and metagenomics (across multiple genomes). Technical artifacts and contamination can arise at each point of sample manipulation, extraction, sequencing, and analysis. Thus, the measurement and benchmarking of these potential sources of error are of paramount importance as next-generation sequencing (NGS) projects become more global and ubiquitous. Fortunately, a variety of methods, standards, and technologies have recently emerged that improve measurements in genomics and sequencing, from the initial input material to the computational pipelines that process and annotate the data. Here we review current standards and their applications in genomics, including whole genomes, transcriptomes, mixed genomic samples (metagenomes), and the modified bases within each (epigenomes and epitranscriptomes). These standards, tools, and metrics are critical for quantifying the accuracy of NGS methods, which will be essential for robust approaches in clinical genomics and precision medicine. PMID:28337071

  12. Multi-Scale Genomic, Transcriptomic and Proteomic Analysis of Colorectal Cancer Cell Lines to Identify Novel Biomarkers

    PubMed Central

    Briffa, Romina; Um, Inhwa; Faratian, Dana; Zhou, Ying; Turnbull, Arran K.; Langdon, Simon P.; Harrison, David J.

    2015-01-01

    Selecting colorectal cancer (CRC) patients likely to respond to therapy remains a clinical challenge. The objectives of this study were to establish which genes were differentially expressed with respect to treatment sensitivity and relate this to copy number in a panel of 15 CRC cell lines. Copy number variations of the identified genes were assessed in a cohort of CRCs. IC50’s were measured for 5-fluorouracil, oxaliplatin, and BEZ-235, a PI3K/mTOR inhibitor. Cell lines were profiled using array comparative genomic hybridisation, Illumina gene expression analysis, reverse phase protein arrays, and targeted sequencing of KRAS hotspot mutations. Frequent gains were observed at 2p, 3q, 5p, 7p, 7q, 8q, 12p, 13q, 14q, and 17q and losses at 2q, 3p, 5q, 8p, 9p, 9q, 14q, 18q, and 20p. Frequently gained regions contained EGFR, PIK3CA, MYC, SMO, TRIB1, FZD1, and BRCA2, while frequently lost regions contained FHIT and MACROD2. TRIB1 was selected for further study. Gene enrichment analysis showed that differentially expressed genes with respect to treatment response were involved in Wnt signalling, EGF receptor signalling, apoptosis, cell cycle, and angiogenesis. Stepwise integration of copy number and gene expression data yielded 47 candidate genes that were significantly correlated. PDCD6 was differentially expressed in all three treatment responses. Tissue microarrays were constructed for a cohort of 118 CRC patients and TRIB1 and MYC amplifications were measured using fluorescence in situ hybridisation. TRIB1 and MYC were amplified in 14.5% and 7.4% of the cohort, respectively, and these amplifications were significantly correlated (p≤0.0001). TRIB1 protein expression in the patient cohort was significantly correlated with pERK, Akt, and Caspase 3 expression. In conclusion, a set of candidate predictive biomarkers for 5-fluorouracil, oxaliplatin, and BEZ235 are described that warrant further study. Amplification of the putative oncogene TRIB1 has been described for

  13. Genomic and Transcriptomic Analyses of Foodborne Bacterial Pathogens

    NASA Astrophysics Data System (ADS)

    Zhang, Wei; Dudley, Edward G.; Wade, Joseph T.

    DNA microarrays (often interchangeably called DNA chips or DNA arrays) are among the most popular analytical tools for high-throughput comparative genomic and transcriptomic analyses of foodborne bacterial pathogens. A typical DNA microarray contains hundreds to millions of small DNA probes that are chemically attached (or "printed") onto the surface of a microscopic glass slide. Depending on the specific "printing" and probe synthesis technologies for different microarray platforms, such DNA probes can be PCR amplicons or in situ synthesized short oligonucleotides. DNA microarray technologies have revolutionized the way that we investigate the biology of foodborne bacterial pathogens. The major advantage of these technologies is that DNA microarrays allow comparison of subtle genomic or transcriptomic variations between two bacterial samples, such as genomic variations between two different bacterial strains or transcriptomic alterations of same bacterial strain under two different treatments. Some applications of comparative genomic hybridization microarrays and global gene expression microarrays have been covered in previous chapters of this book.

  14. Translating Cancer Genomes and Transcriptomes for Precision Oncology

    PubMed Central

    Roychowdhury, Sameek; Chinnaiyan, Arul M.

    2015-01-01

    Understanding the molecular landscape of cancer has facilitated the development of diagnostic, prognostic, and predictive biomarkers for clinical oncology. Developments in next generation DNA sequencing technologies have increased the speed and reduced the cost of sequencing the nucleic acids of cancer cells. This has unlocked opportunities to characterize the genomic and transcriptomic landscapes of cancer for basic science research through projects such as The Cancer Genome Atlas. The cancer genome includes DNA-based alterations such as point mutations or gene duplications. The cancer transcriptome involves RNA-based alterations including changes in messenger RNAs. Together the genome and transcriptome can provide a comprehensive view of an individual patient’s cancer and is beginning to impact real-time clinical decision-making. We discuss several opportunities for translating this basic science knowledge into clinical practice including a molecular classification of cancer, heritable risk of cancer, eligibility for targeted therapies, and the development of innovative genomic-based clinical trials. In this review, we outline key applications and new directions for translating the cancer genome and transcriptome into patient care in the clinic. PMID:26528881

  15. Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics

    PubMed Central

    Tzika, Athanasia C.; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C.

    2015-01-01

    Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the “Reptilian Transcriptomes Database 2.0,” which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. PMID:26133641

  16. Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.

    PubMed

    Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong

    2014-05-01

    We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.

  17. Genome interplay in the grain transcriptome of hexaploid bread wheat.

    PubMed

    Pfeifer, Matthias; Kugler, Karl G; Sandve, Simen R; Zhan, Bujie; Rudi, Heidi; Hvidsten, Torgeir R; Mayer, Klaus F X; Olsen, Odd-Arne

    2014-07-18

    Allohexaploid bread wheat (Triticum aestivum L.) provides approximately 20% of calories consumed by humans. Lack of genome sequence for the three homeologous and highly similar bread wheat genomes (A, B, and D) has impeded expression analysis of the grain transcriptome. We used previously unknown genome information to analyze the cell type-specific expression of homeologous genes in the developing wheat grain and identified distinct co-expression clusters reflecting the spatiotemporal progression during endosperm development. We observed no global but cell type- and stage-dependent genome dominance, organization of the wheat genome into transcriptionally active chromosomal regions, and asymmetric expression in gene families related to baking quality. Our findings give insight into the transcriptional dynamics and genome interplay among individual grain cell types in a polyploid cereal genome.

  18. InsectBase: a resource for insect genomes and transcriptomes

    PubMed Central

    Yin, Chuanlin; Shen, Gengyu; Guo, Dianhao; Wang, Shuping; Ma, Xingzhou; Xiao, Huamei; Liu, Jinding; Zhang, Zan; Liu, Ying; Zhang, Yiqun; Yu, Kaixiang; Huang, Shuiqing; Li, Fei

    2016-01-01

    The genomes and transcriptomes of hundreds of insects have been sequenced. However, insect community lacks an integrated, up-to-date collection of insect gene data. Here, we introduce the first release of InsectBase, available online at http://www.insect-genome.com. The database encompasses 138 insect genomes, 116 insect transcriptomes, 61 insect gene sets, 36 gene families of 60 insects, 7544 miRNAs of 69 insects, 96 925 piRNAs of Drosophila melanogaster and Chilo suppressalis, 2439 lncRNA of Nilaparvata lugens, 22 536 pathways of 78 insects, 678 881 untranslated regions (UTR) of 84 insects and 160 905 coding sequences (CDS) of 70 insects. This release contains over 12 million sequences and provides search functionality, a BLAST server, GBrowse, insect pathway construction, a Facebook-like network for the insect community (iFacebook), and phylogenetic analysis of selected genes. PMID:26578584

  19. InsectBase: a resource for insect genomes and transcriptomes.

    PubMed

    Yin, Chuanlin; Shen, Gengyu; Guo, Dianhao; Wang, Shuping; Ma, Xingzhou; Xiao, Huamei; Liu, Jinding; Zhang, Zan; Liu, Ying; Zhang, Yiqun; Yu, Kaixiang; Huang, Shuiqing; Li, Fei

    2016-01-04

    The genomes and transcriptomes of hundreds of insects have been sequenced. However, insect community lacks an integrated, up-to-date collection of insect gene data. Here, we introduce the first release of InsectBase, available online at http://www.insect-genome.com. The database encompasses 138 insect genomes, 116 insect transcriptomes, 61 insect gene sets, 36 gene families of 60 insects, 7544 miRNAs of 69 insects, 96,925 piRNAs of Drosophila melanogaster and Chilo suppressalis, 2439 lncRNA of Nilaparvata lugens, 22,536 pathways of 78 insects, 678,881 untranslated regions (UTR) of 84 insects and 160,905 coding sequences (CDS) of 70 insects. This release contains over 12 million sequences and provides search functionality, a BLAST server, GBrowse, insect pathway construction, a Facebook-like network for the insect community (iFacebook), and phylogenetic analysis of selected genes.

  20. Genomic and transcriptomic resources for assassin flies including the complete genome sequence of Proctacanthus coquilletti (Insecta: Diptera: Asilidae) and 16 representative transcriptomes.

    PubMed

    Dikow, Rebecca B; Frandsen, Paul B; Turcatel, Mauren; Dikow, Torsten

    2017-01-01

    A high-quality draft genome for Proctacanthus coquilletti (Insecta: Diptera: Asilidae) is presented along with transcriptomes for 16 Diptera species from five families: Asilidae, Apioceridae, Bombyliidae, Mydidae, and Tabanidae. Genome sequencing reveals that P. coquilletti has a genome size of approximately 210 Mbp and remarkably low heterozygosity (0.47%) and few repeats (15%). These characteristics helped produce a highly contiguous (N50 = 862 kbp) assembly, particularly given that only a single 2 × 250 bp PCR-free Illumina library was sequenced. A phylogenomic hypothesis is presented based on thousands of putative orthologs across the 16 transcriptomes. Phylogenetic relationships support the sister group relationship of Apioceridae + Mydidae to Asilidae. A time-calibrated phylogeny is also presented, with seven fossil calibration points, which suggests an older age of the split among Apioceridae, Asilidae, and Mydidae (158 mya) and Apioceridae and Mydidae (135 mya) than proposed in the AToL FlyTree project. Future studies will be able to take advantage of the resources presented here in order to produce large scale phylogenomic and evolutionary studies of assassin fly phylogeny, life histories, or venom. The bioinformatics tools and workflow presented here will be useful to others wishing to generate de novo genomic resources in species-rich taxa without a closely-related reference genome.

  1. Genomic and transcriptomic resources for assassin flies including the complete genome sequence of Proctacanthus coquilletti (Insecta: Diptera: Asilidae) and 16 representative transcriptomes

    PubMed Central

    Frandsen, Paul B.; Turcatel, Mauren

    2017-01-01

    A high-quality draft genome for Proctacanthus coquilletti (Insecta: Diptera: Asilidae) is presented along with transcriptomes for 16 Diptera species from five families: Asilidae, Apioceridae, Bombyliidae, Mydidae, and Tabanidae. Genome sequencing reveals that P. coquilletti has a genome size of approximately 210 Mbp and remarkably low heterozygosity (0.47%) and few repeats (15%). These characteristics helped produce a highly contiguous (N50 = 862 kbp) assembly, particularly given that only a single 2 × 250 bp PCR-free Illumina library was sequenced. A phylogenomic hypothesis is presented based on thousands of putative orthologs across the 16 transcriptomes. Phylogenetic relationships support the sister group relationship of Apioceridae + Mydidae to Asilidae. A time-calibrated phylogeny is also presented, with seven fossil calibration points, which suggests an older age of the split among Apioceridae, Asilidae, and Mydidae (158 mya) and Apioceridae and Mydidae (135 mya) than proposed in the AToL FlyTree project. Future studies will be able to take advantage of the resources presented here in order to produce large scale phylogenomic and evolutionary studies of assassin fly phylogeny, life histories, or venom. The bioinformatics tools and workflow presented here will be useful to others wishing to generate de novo genomic resources in species-rich taxa without a closely-related reference genome. PMID:28168115

  2. Bacillus anthracis genome organization in light of whole transcriptome sequencing

    SciTech Connect

    Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.; Bergman, Nicholas; Borodovsky, Mark

    2010-03-22

    Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computational predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.

  3. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree

    PubMed Central

    Rangiah, Kannan; Mahesh, HB; Rajamani, Anantharamanan; Shirke, Meghana D.; Russiachand, Heikham; Loganathan, Ramya Malarini; Shankara Lingu, Chandana; Siddappa, Shilpa; Ramamurthy, Aishwarya; Sathyanarayana, BN

    2015-01-01

    Neem (Azadirachta indica A. Juss) is one of the most versatile tropical evergreen tree species known in India since the Vedic period (1500 BC–600 BC). Neem tree is a rich source of limonoids, having a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of estimated size of neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. Neem genome consists about 32.5% (87 Mb) of repetitive DNA elements. Neem tree is phylogenetically related to citrus, Citrus sinensis. Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromomes. Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WCGNA) of expressed genes and metabolites resulted in identification of possible candidate genes involved in azadirachtin biosynthesis pathway. This study provides genomic, transcriptomic and quantity of top three neem metabolites resource, which will accelerate basic research in neem to understand biochemical pathways. PMID:26290780

  4. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree.

    PubMed

    Kuravadi, Nagesh A; Yenagi, Vijay; Rangiah, Kannan; Mahesh, H B; Rajamani, Anantharamanan; Shirke, Meghana D; Russiachand, Heikham; Loganathan, Ramya Malarini; Shankara Lingu, Chandana; Siddappa, Shilpa; Ramamurthy, Aishwarya; Sathyanarayana, B N; Gowda, Malali

    2015-01-01

    Neem (Azadirachta indica A. Juss) is one of the most versatile tropical evergreen tree species known in India since the Vedic period (1500 BC-600 BC). Neem tree is a rich source of limonoids, having a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of estimated size of neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. Neem genome consists about 32.5% (87 Mb) of repetitive DNA elements. Neem tree is phylogenetically related to citrus, Citrus sinensis. Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromomes. Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WCGNA) of expressed genes and metabolites resulted in identification of possible candidate genes involved in azadirachtin biosynthesis pathway. This study provides genomic, transcriptomic and quantity of top three neem metabolites resource, which will accelerate basic research in neem to understand biochemical pathways.

  5. Genomics and transcriptomics in drug discovery.

    PubMed

    Dopazo, Joaquin

    2014-02-01

    The popularization of genomic high-throughput technologies is causing a revolution in biomedical research and, particularly, is transforming the field of drug discovery. Systems biology offers a framework to understand the extensive human genetic heterogeneity revealed by genomic sequencing in the context of the network of functional, regulatory and physical protein-drug interactions. Thus, approaches to find biomarkers and therapeutic targets will have to take into account the complex system nature of the relationships of the proteins with the disease. Pharmaceutical companies will have to reorient their drug discovery strategies considering the human genetic heterogeneity. Consequently, modeling and computational data analysis will have an increasingly important role in drug discovery.

  6. The past, present, and future of Leishmania genomics and transcriptomics

    PubMed Central

    Cantacessi, Cinzia; Dantas-Torres, Filipe; Nolan, Matthew J.; Otranto, Domenico

    2015-01-01

    It has been nearly 10 years since the completion of the first entire genome sequence of a Leishmania parasite. Genomic and transcriptomic analyses have advanced our understanding of the biology of Leishmania, and shed new light on the complex interactions occurring within the parasite–host–vector triangle. Here, we review these advances and examine potential avenues for translation of these discoveries into treatment and control programs. In addition, we argue for a strong need to explore how disease in dogs relates to that in humans, and how an improved understanding in line with the ‘One Health’ concept may open new avenues for the control of these devastating diseases. PMID:25638444

  7. The past, present, and future of Leishmania genomics and transcriptomics.

    PubMed

    Cantacessi, Cinzia; Dantas-Torres, Filipe; Nolan, Matthew J; Otranto, Domenico

    2015-03-01

    It has been nearly 10 years since the completion of the first entire genome sequence of a Leishmania parasite. Genomic and transcriptomic analyses have advanced our understanding of the biology of Leishmania, and shed new light on the complex interactions occurring within the parasite-host-vector triangle. Here, we review these advances and examine potential avenues for translation of these discoveries into treatment and control programs. In addition, we argue for a strong need to explore how disease in dogs relates to that in humans, and how an improved understanding in line with the 'One Health' concept may open new avenues for the control of these devastating diseases.

  8. Single Cell Genomics and Transcriptomics for Unicellular Eukaryotes

    SciTech Connect

    Ciobanu, Doina; Clum, Alicia; Singh, Vasanth; Salamov, Asaf; Han, James; Copeland, Alex; Grigoriev, Igor; James, Timothy; Singer, Steven; Woyke, Tanja; Malmstrom, Rex; Cheng, Jan-Fang

    2014-03-14

    Despite their small size, unicellular eukaryotes have complex genomes with a high degree of plasticity that allow them to adapt quickly to environmental changes. Unicellular eukaryotes live with prokaryotes and higher eukaryotes, frequently in symbiotic or parasitic niches. To this day their contribution to the dynamics of the environmental communities remains to be understood. Unfortunately, the vast majority of eukaryotic microorganisms are either uncultured or unculturable, making genome sequencing impossible using traditional approaches. We have developed an approach to isolate unicellular eukaryotes of interest from environmental samples, and to sequence and analyze their genomes and transcriptomes. We have tested our methods with six species: an uncharacterized protist from cellulose-enriched compost identified as Platyophrya, a close relative of P. vorax; the fungus Metschnikowia bicuspidate, a parasite of water flea Daphnia; the mycoparasitic fungi Piptocephalis cylindrospora, a parasite of Cokeromyces and Mucor; Caulochytrium protosteloides, a parasite of Sordaria; Rozella allomycis, a parasite of the water mold Allomyces; and the microalgae Chlamydomonas reinhardtii. Here, we present the four components of our approach: pre-sequencing methods, sequence analysis for single cell genome assembly, sequence analysis of single cell transcriptomes, and genome annotation. This technology has the potential to uncover the complexity of single cell eukaryotes and their role in the environmental samples.

  9. Bamboo Flowering from the Perspective of Comparative Genomics and Transcriptomics.

    PubMed

    Biswas, Prasun; Chakraborty, Sukanya; Dutta, Smritikana; Pal, Amita; Das, Malay

    2016-01-01

    Bamboos are an important member of the subfamily Bambusoideae, family Poaceae. The plant group exhibits wide variation with respect to the timing (1-120 years) and nature (sporadic vs. gregarious) of flowering among species. Usually flowering in woody bamboos is synchronous across culms growing over a large area, known as gregarious flowering. In many monocarpic bamboos this is followed by mass death and seed setting. While in sporadic flowering an isolated wild clump may flower, set little or no seed and remain alive. Such wide variation in flowering time and extent means that the plant group serves as repositories for genes and expression patterns that are unique to bamboo. Due to the dearth of available genomic and transcriptomic resources, limited studies have been undertaken to identify the potential molecular players in bamboo flowering. The public release of the first bamboo genome sequence Phyllostachys heterocycla, availability of related genomes Brachypodium distachyon and Oryza sativa provide us the opportunity to study this long-standing biological problem in a comparative and functional genomics framework. We identified bamboo genes homologous to those of Oryza and Brachypodium that are involved in established pathways such as vernalization, photoperiod, autonomous, and hormonal regulation of flowering. Additionally, we investigated triggers like stress (drought), physiological maturity and micro RNAs that may play crucial roles in flowering. We also analyzed available transcriptome datasets of different bamboo species to identify genes and their involvement in bamboo flowering. Finally, we summarize potential research hurdles that need to be addressed in future research.

  10. Bamboo Flowering from the Perspective of Comparative Genomics and Transcriptomics

    PubMed Central

    Biswas, Prasun; Chakraborty, Sukanya; Dutta, Smritikana; Pal, Amita; Das, Malay

    2016-01-01

    Bamboos are an important member of the subfamily Bambusoideae, family Poaceae. The plant group exhibits wide variation with respect to the timing (1–120 years) and nature (sporadic vs. gregarious) of flowering among species. Usually flowering in woody bamboos is synchronous across culms growing over a large area, known as gregarious flowering. In many monocarpic bamboos this is followed by mass death and seed setting. While in sporadic flowering an isolated wild clump may flower, set little or no seed and remain alive. Such wide variation in flowering time and extent means that the plant group serves as repositories for genes and expression patterns that are unique to bamboo. Due to the dearth of available genomic and transcriptomic resources, limited studies have been undertaken to identify the potential molecular players in bamboo flowering. The public release of the first bamboo genome sequence Phyllostachys heterocycla, availability of related genomes Brachypodium distachyon and Oryza sativa provide us the opportunity to study this long-standing biological problem in a comparative and functional genomics framework. We identified bamboo genes homologous to those of Oryza and Brachypodium that are involved in established pathways such as vernalization, photoperiod, autonomous, and hormonal regulation of flowering. Additionally, we investigated triggers like stress (drought), physiological maturity and micro RNAs that may play crucial roles in flowering. We also analyzed available transcriptome datasets of different bamboo species to identify genes and their involvement in bamboo flowering. Finally, we summarize potential research hurdles that need to be addressed in future research. PMID:28018419

  11. Transcriptome and Genome Size Analysis of the Venus Flytrap

    PubMed Central

    Bressendorff, Simon; Seguin-Orlando, Andaine; Petersen, Morten; Sicheritz-Pontén, Thomas; Mundy, John

    2015-01-01

    The insectivorous Venus flytrap (Dionaea muscipula) is renowned from Darwin’s studies of plant carnivory and the origins of species. To provide tools to analyze the evolution and functional genomics of D. muscipula, we sequenced a normalized cDNA library synthesized from mRNA isolated from D. muscipula flowers and traps. Using the Oases transcriptome assembler 79,165,657 quality trimmed reads were assembled into 80,806 cDNA contigs, with an average length of 679 bp and an N50 length of 1,051 bp. A total of 17,047 unique proteins were identified, and assigned to Gene Ontology (GO) and classified into functional categories. A total of 15,547 full-length cDNA sequences were identified, from which open reading frames were detected in 10,941. Comparative GO analyses revealed that D. muscipula is highly represented in molecular functions related to catalytic, antioxidant, and electron carrier activities. Also, using a single copy sequence PCR-based method, we estimated that the genome size of D. muscipula is approx. 3 Gb. Our genome size estimate and transcriptome analyses will contribute to future research on this fascinating, monotypic species and its heterotrophic adaptations. PMID:25886597

  12. Transcriptome and genome size analysis of the Venus flytrap.

    PubMed

    Jensen, Michael Krogh; Vogt, Josef Korbinian; Bressendorff, Simon; Seguin-Orlando, Andaine; Petersen, Morten; Sicheritz-Pontén, Thomas; Mundy, John

    2015-01-01

    The insectivorous Venus flytrap (Dionaea muscipula) is renowned from Darwin's studies of plant carnivory and the origins of species. To provide tools to analyze the evolution and functional genomics of D. muscipula, we sequenced a normalized cDNA library synthesized from mRNA isolated from D. muscipula flowers and traps. Using the Oases transcriptome assembler 79,165,657 quality trimmed reads were assembled into 80,806 cDNA contigs, with an average length of 679 bp and an N50 length of 1,051 bp. A total of 17,047 unique proteins were identified, and assigned to Gene Ontology (GO) and classified into functional categories. A total of 15,547 full-length cDNA sequences were identified, from which open reading frames were detected in 10,941. Comparative GO analyses revealed that D. muscipula is highly represented in molecular functions related to catalytic, antioxidant, and electron carrier activities. Also, using a single copy sequence PCR-based method, we estimated that the genome size of D. muscipula is approx. 3 Gb. Our genome size estimate and transcriptome analyses will contribute to future research on this fascinating, monotypic species and its heterotrophic adaptations.

  13. The draft genome and transcriptome of Cannabis sativa

    PubMed Central

    2011-01-01

    Background Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. Results We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. Conclusions The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics. PMID:22014239

  14. Genomics and transcriptomics across the diversity of the Nematoda.

    PubMed

    Blaxter, M; Kumar, S; Kaur, G; Koutsovoulos, G; Elsworth, B

    2012-01-01

    The diversity of biology in nematodes is reflected in the diversity of their genomes. Parasitic species in particular have evolved mechanisms to invade and outwit their hosts, and these offer opportunities for the development of control measures. Genomic analyses can reveal the molecular underpinnings of phenotypes such as parasitism and thus, initiate and support research programmes that explore the manipulation of host and parasite physiologies to achieve favourable outcomes. Wide sampling across nematode diversity allows phylogenetically informed formulation of research hypotheses, identification of core features shared by all species or important evolutionary novelties present in isolated clades. Many nematode species have been investigated through the use of the expressed sequence tag approach, which samples from the transcribed genome. Gene catalogues generated in this way can be explored to reveal the patterns of expression associated with parasitism and candidates for testing as drug targets or vaccine components. Analysis environments, such as NEMBASE facilitate exploitation of these data. The development of new high-throughput DNA-sequencing technologies has facilitated transcriptomic and genomic approaches to parasite biology. Whole genome sequencing offers more complete catalogues of genes and assists a systems approach to phenotype dissection. These efforts are being coordinated through the 959 Nematode Genomes initiative.

  15. Cajal body function in genome organization and transcriptome diversity.

    PubMed

    Sawyer, Iain A; Sturgill, David; Sung, Myong-Hee; Hager, Gordon L; Dundr, Miroslav

    2016-12-01

    Nuclear bodies contribute to non-random organization of the human genome and nuclear function. Using a major prototypical nuclear body, the Cajal body, as an example, we suggest that these structures assemble at specific gene loci located across the genome as a result of high transcriptional activity. Subsequently, target genes are physically clustered in close proximity in Cajal body-containing cells. However, Cajal bodies are observed in only a limited number of human cell types, including neuronal and cancer cells. Ultimately, Cajal body depletion perturbs splicing kinetics by reducing target small nuclear RNA (snRNA) transcription and limiting the levels of spliceosomal snRNPs, including their modification and turnover following each round of RNA splicing. As such, Cajal bodies are capable of shaping the chromatin interaction landscape and the transcriptome by influencing spliceosome kinetics. Future studies should concentrate on characterizing the direct influence of Cajal bodies upon snRNA gene transcriptional dynamics. Also see the video abstract here.

  16. The capsicum transcriptome DB: a "hot" tool for genomic research.

    PubMed

    Góngora-Castillo, Elsa; Fajardo-Jaime, Rubén; Fernández-Cortes, Araceli; Jofre-Garfias, Alba E; Lozoya-Gloria, Edmundo; Martínez, Octavio; Ochoa-Alejo, Neftalí; Rivera-Bustamante, Rafael

    2012-01-01

    Chili pepper (Capsicum annuum) is an economically important crop with no available public genome sequence. We describe a genomic resource to facilitate Capsicum annuum research. A collection of Expressed Sequence Tags (ESTs) derived from five C. annuum organs (root, stem, leaf, flower and fruit) were sequenced using the Sanger method and multiple leaf transcriptomes were deeply sampled using with GS-pyrosequencing. A hybrid assembly of 1,324,516 raw reads yielded 32,314 high quality contigs as validated by coverage and identity analysis with existing pepper sequences. Overall, 75.5% of the contigs had significant sequence similarity to entries in nucleic acid and protein databases; 23% of the sequences have not been previously reported for C. annuum and expand sequence resources for this species. A MySQL database and a user-friendly Web interface were constructed with search-tools that permit queries of the ESTs including sequence, functional annotation, Gene Ontology classification, metabolic pathways, and assembly information. The Capsicum Transcriptome DB is free available from http://www.bioingenios.ira.cinvestav.mx:81/Joomla/

  17. LegumeIP: an integrative database for comparative genomics and transcriptomics of model legumes.

    PubMed

    Li, Jun; Dai, Xinbin; Liu, Tingsong; Zhao, Patrick Xuechun

    2012-01-01

    Legumes play a vital role in maintaining the nitrogen cycle of the biosphere. They conduct symbiotic nitrogen fixation through endosymbiotic relationships with bacteria in root nodules. However, this and other characteristics of legumes, including mycorrhization, compound leaf development and profuse secondary metabolism, are absent in the typical model plant Arabidopsis thaliana. We present LegumeIP (http://plantgrn.noble.org/LegumeIP/), an integrative database for comparative genomics and transcriptomics of model legumes, for studying gene function and genome evolution in legumes. LegumeIP compiles gene and gene family information, syntenic and phylogenetic context and tissue-specific transcriptomic profiles. The database holds the genomic sequences of three model legumes, Medicago truncatula, Glycine max and Lotus japonicus plus two reference plant species, A. thaliana and Populus trichocarpa, with annotations based on UniProt, InterProScan, Gene Ontology and the Kyoto Encyclopedia of Genes and Genomes databases. LegumeIP also contains large-scale microarray and RNA-Seq-based gene expression data. Our new database is capable of systematic synteny analysis across M. truncatula, G. max, L. japonicas and A. thaliana, as well as construction and phylogenetic analysis of gene families across the five hosted species. Finally, LegumeIP provides comprehensive search and visualization tools that enable flexible queries based on gene annotation, gene family, synteny and relative gene expression.

  18. Comparative Genomics and Transcriptomics Analyses Reveal Divergent Lifestyle Features of Nematode Endoparasitic Fungus Hirsutella minnesotensis

    PubMed Central

    Lai, Yiling; Liu, Keke; Zhang, Xinyu; Zhang, Xiaoling; Li, Kuan; Wang, Niuniu; Shu, Chi; Wu, Yunpeng; Wang, Chengshu; Bushley, Kathryn E.; Xiang, Meichun; Liu, Xingzhong

    2014-01-01

    Hirsutella minnesotensis [Ophiocordycipitaceae (Hypocreales, Ascomycota)] is a dominant endoparasitic fungus by using conidia that adhere to and penetrate the secondary stage juveniles of soybean cyst nematode. Its genome was de novo sequenced and compared with five entomopathogenic fungi in the Hypocreales and three nematode-trapping fungi in the Orbiliales (Ascomycota). The genome of H. minnesotensis is 51.4 Mb and encodes 12,702 genes enriched with transposable elements up to 32%. Phylogenomic analysis revealed that H. minnesotensis was diverged from entomopathogenic fungi in Hypocreales. Genome of H. minnesotensis is similar to those of entomopathogenic fungi to have fewer genes encoding lectins for adhesion and glycoside hydrolases for cellulose degradation, but is different from those of nematode-trapping fungi to possess more genes for protein degradation, signal transduction, and secondary metabolism. Those results indicate that H. minnesotensis has evolved different mechanism for nematode endoparasitism compared with nematode-trapping fungi. Transcriptomics analyses for the time-scale parasitism revealed the upregulations of lectins, secreted proteases and the genes for biosynthesis of secondary metabolites that could be putatively involved in host surface adhesion, cuticle degradation, and host manipulation. Genome and transcriptome analyses provided comprehensive understanding of the evolution and lifestyle of nematode endoparasitism. PMID:25359922

  19. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  20. Transcriptome-wide investigation of genomic imprinting in chicken.

    PubMed

    Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique

    2014-04-01

    Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken.

  1. Transcriptome-wide investigation of genomic imprinting in chicken

    PubMed Central

    Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique

    2014-01-01

    Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken. PMID:24452801

  2. Comprehensive Annotation of the Parastagonospora nodorum Reference Genome Using Next-Generation Genomics, Transcriptomics and Proteogenomics.

    PubMed

    Syme, Robert A; Tan, Kar-Chun; Hane, James K; Dodhia, Kejal; Stoll, Thomas; Hastie, Marcus; Furuki, Eiko; Ellwood, Simon R; Williams, Angela H; Tan, Yew-Foon; Testa, Alison C; Gorman, Jeffrey J; Oliver, Richard P

    2016-01-01

    Parastagonospora nodorum, the causal agent of Septoria nodorum blotch (SNB), is an economically important pathogen of wheat (Triticum spp.), and a model for the study of necrotrophic pathology and genome evolution. The reference P. nodorum strain SN15 was the first Dothideomycete with a published genome sequence, and has been used as the basis for comparison within and between species. Here we present an updated reference genome assembly with corrections of SNP and indel errors in the underlying genome assembly from deep resequencing data as well as extensive manual annotation of gene models using transcriptomic and proteomic sources of evidence (https://github.com/robsyme/Parastagonospora_nodorum_SN15). The updated assembly and annotation includes 8,366 genes with modified protein sequence and 866 new genes. This study shows the benefits of using a wide variety of experimental methods allied to expert curation to generate a reliable set of gene models.

  3. A comparative genomics approach to identifying the plasticity transcriptome

    PubMed Central

    Pfenning, Andreas R; Schwartz, Russell; Barth, Alison L

    2007-01-01

    Background Neuronal activity regulates gene expression to control learning and memory, homeostasis of neuronal function, and pathological disease states such as epilepsy. A great deal of experimental evidence supports the involvement of two particular transcription factors in shaping the genomic response to neuronal activity and mediating plasticity: CREB and zif268 (egr-1, krox24, NGFI-A). The gene targets of these two transcription factors are of considerable interest, since they may help develop hypotheses about how neural activity is coupled to changes in neural function. Results We have developed a computational approach for identifying binding sites for these transcription factors within the promoter regions of annotated genes in the mouse, rat, and human genomes. By combining a robust search algorithm to identify discrete binding sites, a comparison of targets across species, and an analysis of binding site locations within promoter regions, we have defined a group of candidate genes that are strong CREB- or zif268 targets and are thus regulated by neural activity. Our analysis revealed that CREB and zif268 share a disproportionate number of targets in common and that these common targets are dominated by transcription factors. Conclusion These observations may enable a more detailed understanding of the regulatory networks that are induced by neural activity and contribute to the plasticity transcriptome. The target genes identified in this study will be a valuable resource for investigators who hope to define the functions of specific genes that underlie activity-dependent changes in neuronal properties. PMID:17355637

  4. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants

    PubMed Central

    2010-01-01

    Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution. PMID:20565927

  5. INTEGRATE: gene fusion discovery using whole genome and transcriptome data

    PubMed Central

    Zhang, Jin; White, Nicole M.; Schmidt, Heather K.; Fulton, Robert S.; Tomlinson, Chad; Warren, Wesley C.; Wilson, Richard K.; Maher, Christopher A.

    2016-01-01

    While next-generation sequencing (NGS) has become the primary technology for discovering gene fusions, we are still faced with the challenge of ensuring that causative mutations are not missed while minimizing false positives. Currently, there are many computational tools that predict structural variations (SV) and gene fusions using whole genome (WGS) and transcriptome sequencing (RNA-seq) data separately. However, as both WGS and RNA-seq have their limitations when used independently, we hypothesize that the orthogonal validation from integrating both data could generate a sensitive and specific approach for detecting high-confidence gene fusion predictions. Fortunately, decreasing NGS costs have resulted in a growing quantity of patients with both data available. Therefore, we developed a gene fusion discovery tool, INTEGRATE, that leverages both RNA-seq and WGS data to reconstruct gene fusion junctions and genomic breakpoints by split-read mapping. To evaluate INTEGRATE, we compared it with eight additional gene fusion discovery tools using the well-characterized breast cell line HCC1395 and peripheral blood lymphocytes derived from the same patient (HCC1395BL). The predictions subsequently underwent a targeted validation leading to the discovery of 131 novel fusions in addition to the seven previously reported fusions. Overall, INTEGRATE only missed six out of the 138 validated fusions and had the highest accuracy of the nine tools evaluated. Additionally, we applied INTEGRATE to 62 breast cancer patients from The Cancer Genome Atlas (TCGA) and found multiple recurrent gene fusions including a subset involving estrogen receptor. Taken together, INTEGRATE is a highly sensitive and accurate tool that is freely available for academic use. PMID:26556708

  6. INTEGRATE: gene fusion discovery using whole genome and transcriptome data.

    PubMed

    Zhang, Jin; White, Nicole M; Schmidt, Heather K; Fulton, Robert S; Tomlinson, Chad; Warren, Wesley C; Wilson, Richard K; Maher, Christopher A

    2016-01-01

    While next-generation sequencing (NGS) has become the primary technology for discovering gene fusions, we are still faced with the challenge of ensuring that causative mutations are not missed while minimizing false positives. Currently, there are many computational tools that predict structural variations (SV) and gene fusions using whole genome (WGS) and transcriptome sequencing (RNA-seq) data separately. However, as both WGS and RNA-seq have their limitations when used independently, we hypothesize that the orthogonal validation from integrating both data could generate a sensitive and specific approach for detecting high-confidence gene fusion predictions. Fortunately, decreasing NGS costs have resulted in a growing quantity of patients with both data available. Therefore, we developed a gene fusion discovery tool, INTEGRATE, that leverages both RNA-seq and WGS data to reconstruct gene fusion junctions and genomic breakpoints by split-read mapping. To evaluate INTEGRATE, we compared it with eight additional gene fusion discovery tools using the well-characterized breast cell line HCC1395 and peripheral blood lymphocytes derived from the same patient (HCC1395BL). The predictions subsequently underwent a targeted validation leading to the discovery of 131 novel fusions in addition to the seven previously reported fusions. Overall, INTEGRATE only missed six out of the 138 validated fusions and had the highest accuracy of the nine tools evaluated. Additionally, we applied INTEGRATE to 62 breast cancer patients from The Cancer Genome Atlas (TCGA) and found multiple recurrent gene fusions including a subset involving estrogen receptor. Taken together, INTEGRATE is a highly sensitive and accurate tool that is freely available for academic use.

  7. The genome- and transcriptome-wide analysis of innate immunity in the brown planthopper, Nilaparvata lugens

    PubMed Central

    2013-01-01

    Background The brown planthopper (Nilaparvata lugens) is one of the most serious rice plant pests in Asia. N. lugens causes extensive rice damage by sucking rice phloem sap, which results in stunted plant growth and the transmission of plant viruses. Despite the importance of this insect pest, little is known about the immunological mechanisms occurring in this hemimetabolous insect species. Results In this study, we performed a genome- and transcriptome-wide analysis aiming at the immune-related genes. The transcriptome datasets include the N. lugens intestine, the developmental stage, wing formation, and sex-specific expression information that provided useful gene expression sequence data for the genome-wide analysis. As a result, we identified a large number of genes encoding N. lugens pattern recognition proteins, modulation proteins in the prophenoloxidase (proPO) activating cascade, immune effectors, and the signal transduction molecules involved in the immune pathways, including the Toll, Immune deficiency (Imd) and Janus kinase signal transducers and activators of transcription (JAK-STAT) pathways. The genome scale analysis revealed detailed information of the gene structure, distribution and transcription orientations in scaffolds. A comparison of the genome-available hemimetabolous and metabolous insect species indicate the differences in the immune-related gene constitution. We investigated the gene expression profiles with regards to how they responded to bacterial infections and tissue, as well as development and sex expression specificity. Conclusions The genome- and transcriptome-wide analysis of immune-related genes including pattern recognition and modulation molecules, immune effectors, and the signal transduction molecules involved in the immune pathways is an important step in determining the overall architecture and functional network of the immune components in N. lugens. Our findings provide the comprehensive gene sequence resource and

  8. A genome survey and postharvest transcriptome analysis in Lentinula edodes.

    PubMed

    Sakamoto, Yuichi; Nakade, Keiko; Sato, Shiho; Yoshida, Kentaro; Miyazaki, Kazuhiro; Natsume, Satoshi; Konno, Naotake

    2017-03-17

    Lentinula edodes is a popular cultivated edible and medicinal mushroom. Lentinula edodes is susceptible to postharvest problems such as gill browning, fruiting body softening, and lentinan degradation. We constructed a de novo assembly draft genome sequence and performed gene prediction of Lentinula edodesDe novo assembly was carried out using short reads from paired-end and mate-paired libraries and long reads by PacBio, resulting in a contig number of 1951 and an N50 of 1 Mb. Further, we predicted genes by Augustus using RNA-seq data from the whole life cycle of Lentinula edodes, resulting in 12,959 predicted genes. This analysis revealed that Lentinula edodes lacks lignin peroxidase. To reveal genes involved in Lentinula edodes postharvest fruiting body quality loss, transcriptome analysis was carried out using Super-SAGE. This analysis revealed that many cell wall-related enzymes are upregulated after harvest, such as β-1,3-1,6-glucan-degrading enzymes in glycoside hydrolase (GH) families 5, 16, 30, 55, 128, and thaumatin-like proteins. In addition, we found several chitin-related genes are upregulated, such as putative chitinases in GH family18, exo-chitinases in GH 20, and a putative chitosanase in GH 75. The results suggest that cell wall-degrading enzymes synergistically cooperate for rapid fruiting body autolysis. Many putative transcription factor genes were upregulated postharvest, such as genes containing high mobility group (HMG) domains and zinc finger domains. Several cell death-related proteins were also upregulated postharvest.Importance Our data collectively suggest that there is a rapid fruiting body autolysis system in Lentinula edodes The genes for postharvest quality loss newly found in this research will be targets for future breeding of strains that can keep freshness longer than present strains. De novo Lentinula edodes genome assembly data will be used for construction of the complete Lentinula edodes chromosome map for the future

  9. Metaplastic breast carcinomas display genomic and transcriptomic heterogeneity [corrected]. .

    PubMed

    Weigelt, Britta; Ng, Charlotte K Y; Shen, Ronglai; Popova, Tatiana; Schizas, Michail; Natrajan, Rachael; Mariani, Odette; Stern, Marc-Henri; Norton, Larry; Vincent-Salomon, Anne; Reis-Filho, Jorge S

    2015-03-01

    features of metaplastic breast carcinomas is reflected at the transcriptomic level, and an association between molecular subtypes and histology was observed. BRCA1-like genomic profiles were found only in a subset (31%) of metaplastic breast cancers, and were not associated with a specific molecular or histologic subtype.

  10. Annotation of the zebrafish genome through an integrated transcriptomic and proteomic analysis.

    PubMed

    Kelkar, Dhanashree S; Provost, Elayne; Chaerkady, Raghothama; Muthusamy, Babylakshmi; Manda, Srikanth S; Subbannayya, Tejaswini; Selvan, Lakshmi Dhevi N; Wang, Chieh-Huei; Datta, Keshava K; Woo, Sunghee; Dwivedi, Sutopa B; Renuse, Santosh; Getnet, Derese; Huang, Tai-Chung; Kim, Min-Sik; Pinto, Sneha M; Mitchell, Christopher J; Madugundu, Anil K; Kumar, Praveen; Sharma, Jyoti; Advani, Jayshree; Dey, Gourav; Balakrishnan, Lavanya; Syed, Nazia; Nanjappa, Vishalakshi; Subbannayya, Yashwanth; Goel, Renu; Prasad, T S Keshava; Bafna, Vineet; Sirdeshmukh, Ravi; Gowda, Harsha; Wang, Charles; Leach, Steven D; Pandey, Akhilesh

    2014-11-01

    Accurate annotation of protein-coding genes is one of the primary tasks upon the completion of whole genome sequencing of any organism. In this study, we used an integrated transcriptomic and proteomic strategy to validate and improve the existing zebrafish genome annotation. We undertook high-resolution mass-spectrometry-based proteomic profiling of 10 adult organs, whole adult fish body, and two developmental stages of zebrafish (SAT line), in addition to transcriptomic profiling of six organs. More than 7,000 proteins were identified from proteomic analyses, and ∼ 69,000 high-confidence transcripts were assembled from the RNA sequencing data. Approximately 15% of the transcripts mapped to intergenic regions, the majority of which are likely long non-coding RNAs. These high-quality transcriptomic and proteomic data were used to manually reannotate the zebrafish genome. We report the identification of 157 novel protein-coding genes. In addition, our data led to modification of existing gene structures including novel exons, changes in exon coordinates, changes in frame of translation, translation in annotated UTRs, and joining of genes. Finally, we discovered four instances of genome assembly errors that were supported by both proteomic and transcriptomic data. Our study shows how an integrative analysis of the transcriptome and the proteome can extend our understanding of even well-annotated genomes.

  11. Combined genome and transcriptome analysis of single disseminated cancer cells from bone marrow of prostate cancer patients reveals unexpected transcriptomes.

    PubMed

    Gužvić, Miodrag; Braun, Bernhard; Ganzer, Roman; Burger, Maximilian; Nerlich, Michael; Winkler, Sebastian; Werner-Klein, Melanie; Czyż, Zbigniew T; Polzer, Bernhard; Klein, Christoph A

    2014-12-15

    Bone is the most frequent site of metastasis in prostate cancer and patients with bone metastases are deemed incurable. Targeting prostate cancer cells that disseminated to the bone marrow before surgery and before metastatic outgrowth may therefore prevent lethal metastasis. This prompted us to directly analyze the transcriptome of disseminated cancer cells (DCC) isolated from patients with nonmetastatic (UICC stage M0) prostate cancer. We screened 105 bone marrow samples of patients with M0-stage prostate cancer and 18 bone marrow samples of patients without malignancy for the presence of EpCAM(+) single cells. In total, we isolated 270 cells from both groups by micromanipulation and globally amplified their mRNA. We used targeted transcriptional profiling to unambiguously identify DCCs for subsequent in-depth analysis. Transcriptomes of all cells were examined for the expression of EPCAM, KRT8, KRT18, KRT19, KRT14, KRT6a, KRT5, KLK3 (PSA), MAGEA2, MAGEA4, PTPRC (CD45), CD33, CD34, CD19, GYPC, SCL4A1 (band 3), and HBA2. Using these transcripts, we found it impossible to reliably identify true DCCs. We then applied combined genome and transcriptome analysis of single cells and found that EpCAM(+) cells from controls expressed transcripts thought to be epithelial-specific, whereas true DCCs may express hematopoietic transcripts. These results point to an unexpected transcriptome plasticity of epithelial cancer cells in bone marrow and question common transcriptional criteria to identify DCCs.

  12. Genomic and transcriptomic alterations following hybridisation and genome doubling in trigenomic allohexaploid Brassica carinata × Brassica rapa.

    PubMed

    Xu, Y; Zhao, Q; Mei, S; Wang, J

    2012-09-01

    Allopolyploidisation is a prominent evolutionary force that involves two major events: interspecific hybridisation and genome doubling. Both events have important functional consequences in shaping the genomic architecture of the neo-allopolyploids. The respective effects of hybridisation and genome doubling upon genomic and transcriptomic changes in Brassica allopolyploids are unresolved. In this study, amplified fragment length polymorphism (AFLP), methylation-sensitive amplification polymorphism (MSAP) and cDNA-AFLP approaches were used to track genetic, epigenetic and transcriptional changes in both allohexaploid Brassica (ArArBcBcCcCc genome) and triploid hybrids (ArBcCc genome). Results from these groups were compared with each other and also to their parents Brassica carinata (BBCC genome) and Brassica rapa (AA genome). Rapid and dramatic genetic, DNA methylation and gene expression changes were detected in the triploid hybrids. During the shift from triploidy to allohexaploidy, some of the hybridisation-induced alterations underwent reversion. Additionally, novel genetic, epigenetic and transcriptional alterations were also detected. The proportions of A-genome-specific DNA methylation and gene expression alterations were significantly greater than those of BC-genome-specific alterations in the triploid hybrids. However, the two parental genomes were equally affected during the ploidy shift. Hemi-CCG methylation changes induced by hybridisation were recovered after genome doubling. Full-CG methylation changes were a more general process initiated in the hybrid and continued after genome doubling. These results indicate that genome doubling could ameliorate genomic and transcriptomic alterations induced by hybridisation and instigate additional alterations in trigenomic Brassica allohexaploids. Moreover, genome doubling also modified hybridisation-induced progenitor genome-biased alterations and epigenetic alteration characteristics.

  13. Whole-genome duplication and molecular evolution in Cornus L. (Cornaceae) – Insights from transcriptome sequences

    PubMed Central

    Yu, Yan; Xiang, Qiuyun; Manos, Paul S.; Soltis, Douglas E.; Soltis, Pamela S.; Song, Bao-Hua; Cheng, Shifeng; Liu, Xin; Wong, Gane

    2017-01-01

    The pattern and rate of genome evolution have profound consequences in organismal evolution. Whole-genome duplication (WGD), or polyploidy, has been recognized as an important evolutionary mechanism of plant diversification. However, in non-model plants the molecular signals of genome duplications have remained largely unexplored. High-throughput transcriptome data from next-generation sequencing have set the stage for novel investigations of genome evolution using new bioinformatic and methodological tools in a phylogenetic framework. Here we compare ten de novo-assembled transcriptomes representing the major lineages of the angiosperm genus Cornus (dogwood) and relevant outgroups using a customized pipeline for analyses. Using three distinct approaches, molecular dating of orthologous genes, analyses of the distribution of synonymous substitutions between paralogous genes, and examination of substitution rates through time, we detected a shared WGD event in the late Cretaceous across all taxa sampled. The inferred doubling event coincides temporally with the paleoclimatic changes associated with the initial divergence of the genus into three major lineages. Analyses also showed an acceleration of rates of molecular evolution after WGD. The highest rates of molecular evolution were observed in the transcriptome of the herbaceous lineage, C. canadensis, a species commonly found at higher latitudes, including the Arctic. Our study demonstrates the value of transcriptome data for understanding genome evolution in closely related species. The results suggest dramatic increase in sea surface temperature in the late Cretaceous may have contributed to the evolution and diversification of flowering plants. PMID:28225773

  14. Primary analysis of repeat elements of the Asian seabass (Lates calcarifer) transcriptome and genome

    PubMed Central

    Kuznetsova, Inna S.; Thevasagayam, Natascha M.; Sridatta, Prakki S. R.; Komissarov, Aleksey S.; Saju, Jolly M.; Ngoh, Si Y.; Jiang, Junhui; Shen, Xueyan; Orbán, László

    2014-01-01

    As part of our Asian seabass genome project, we are generating an inventory of repeat elements in the genome and transcriptome. The karyotype showed a diploid number of 2n = 24 chromosomes with a variable number of B-chromosomes. The transcriptome and genome of Asian seabass were searched for repetitive elements with experimental and bioinformatics tools. Six different types of repeats constituting 8–14% of the genome were characterized. Repetitive elements were clustered in the pericentromeric heterochromatin of all chromosomes, but some of them were preferentially accumulated in pretelomeric and pericentromeric regions of several chromosomes pairs and have chromosomes specific arrangement. From the dispersed class of fish-specific non-LTR retrotransposon elements Rex1 and MAUI-like repeats were analyzed. They were wide-spread both in the genome and transcriptome, accumulated on the pericentromeric and peritelomeric areas of all chromosomes. Every analyzed repeat was represented in the Asian seabass transcriptome, some showed differential expression between the gonads. The other group of repeats analyzed belongs to the rRNA multigene family. FISH signal for 5S rDNA was located on a single pair of chromosomes, whereas that for 18S rDNA was found on two pairs. A BAC-derived contig containing rDNA was sequenced and assembled into a scaffold containing incomplete fragments of 18S rDNA. Their assembly and chromosomal position revealed that this part of Asian seabass genome is extremely rich in repeats containing evolutionarily conserved and novel sequences. In summary, transcriptome assemblies and cDNA data are suitable for the identification of repetitive DNA from unknown genomes and for comparative investigation of conserved elements between teleosts and other vertebrates. PMID:25120555

  15. Identifying characteristic scales in the human genome

    NASA Astrophysics Data System (ADS)

    Carpena, P.; Bernaola-Galván, P.; Coronado, A. V.; Hackenberg, M.; Oliver, J. L.

    2007-03-01

    The scale-free, long-range correlations detected in DNA sequences contrast with characteristic lengths of genomic elements, being particularly incompatible with the isochores (long, homogeneous DNA segments). By computing the local behavior of the scaling exponent α of detrended fluctuation analysis (DFA), we discriminate between sequences with and without true scaling, and we find that no single scaling exists in the human genome. Instead, human chromosomes show a common compositional structure with two characteristic scales, the large one corresponding to the isochores and the other to small and medium scale genomic elements.

  16. VESPA: Software to Facilitate Genomic Annotation of Prokaryotic Organisms Through Integration of Proteomic and Transcriptomic Data

    SciTech Connect

    Peterson, Elena S.; McCue, Lee Ann; Rutledge, Alexandra C.; Jensen, Jeffrey L.; Walker, Julia; Kobold, Mark A.; Webb, Samantha R.; Payne, Samuel H.; Ansong, Charles; Adkins, Joshua N.; Cannon, William R.; Webb-Robertson, Bobbie-Jo M.

    2012-04-25

    Visual Exploration and Statistics to Promote Annotation (VESPA) is an interactive visual analysis software tool that facilitates the discovery of structural mis-annotations in prokaryotic genomes. VESPA integrates high-throughput peptide-centric proteomics data and oligo-centric or RNA-Seq transcriptomics data into a genomic context. The data may be interrogated via visual analysis across multiple levels of genomic resolution, linked searches, exports and interaction with BLAST to rapidly identify location of interest within the genome and evaluate potential mis-annotations.

  17. Genomic, transcriptomic and phenomic variation reveals the complex adaptation to stress response of modern maize breeding

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Early maize adaptation to different agricultural environments was an important process associated with the creation of a stable food supply that allowed the evolution of human civilization in the Americas. To explore the mechanisms of maize adaptation, genomic, transcriptomic and phenomic data were ...

  18. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes.

    PubMed

    Macaulay, Iain C; Haerty, Wilfried; Kumar, Parveen; Li, Yang I; Hu, Tim Xiaoming; Teng, Mabel J; Goolam, Mubeen; Saurat, Nathalie; Coupland, Paul; Shirley, Lesley M; Smith, Miriam; Van der Aa, Niels; Banerjee, Ruby; Ellis, Peter D; Quail, Michael A; Swerdlow, Harold P; Zernicka-Goetz, Magdalena; Livesey, Frederick J; Ponting, Chris P; Voet, Thierry

    2015-06-01

    The simultaneous sequencing of a single cell's genome and transcriptome offers a powerful means to dissect genetic variation and its effect on gene expression. Here we describe G&T-seq, a method for separating and sequencing genomic DNA and full-length mRNA from single cells. By applying G&T-seq to over 220 single cells from mice and humans, we discovered cellular properties that could not be inferred from DNA or RNA sequencing alone.

  19. Comparative genomics and transcriptomics of lineages I, II, and III strains of Listeria monocytogenes

    PubMed Central

    2012-01-01

    Background Listeria monocytogenes is a food-borne pathogen that causes infections with a high-mortality rate and has served as an invaluable model for intracellular parasitism. Here, we report complete genome sequences for two L. monocytogenes strains belonging to serotype 4a (L99) and 4b (CLIP80459), and transcriptomes of representative strains from lineages I, II, and III, thereby permitting in-depth comparison of genome- and transcriptome -based data from three lineages of L. monocytogenes. Lineage III, represented by the 4a L99 genome is known to contain strains less virulent for humans. Results The genome analysis of the weakly pathogenic L99 serotype 4a provides extensive evidence of virulence gene decay, including loss of several important surface proteins. The 4b CLIP80459 genome, unlike the previously sequenced 4b F2365 genome harbours an intact inlB invasion gene. These lineage I strains are characterized by the lack of prophage genes, as they share only a single prophage locus with other L. monocytogenes genomes 1/2a EGD-e and 4a L99. Comparative transcriptome analysis during intracellular growth uncovered adaptive expression level differences in lineages I, II and III of Listeria, notable amongst which was a strong intracellular induction of flagellar genes in strain 4a L99 compared to the other lineages. Furthermore, extensive differences between strains are manifest at levels of metabolic flux control and phosphorylated sugar uptake. Intriguingly, prophage gene expression was found to be a hallmark of intracellular gene expression. Deletion mutants in the single shared prophage locus of lineage II strain EGD-e 1/2a, the lma operon, revealed severe attenuation of virulence in a murine infection model. Conclusion Comparative genomics and transcriptome analysis of L. monocytogenes strains from three lineages implicate prophage genes in intracellular adaptation and indicate that gene loss and decay may have led to the emergence of attenuated lineages

  20. Genomic resources for a model in adaptation and speciation research: characterization of the Poecilia mexicana transcriptome

    PubMed Central

    2012-01-01

    Background Elucidating the genomic basis of adaptation and speciation is a major challenge in natural systems with large quantities of environmental and phenotypic data, mostly because of the scarcity of genomic resources for non-model organisms. The Atlantic molly (Poecilia mexicana, Poeciliidae) is a small livebearing fish that has been extensively studied for evolutionary ecology research, particularly because this species has repeatedly colonized extreme environments in the form of caves and toxic hydrogen sulfide containing springs. In such extreme environments, populations show strong patterns of adaptive trait divergence and the emergence of reproductive isolation. Here, we used RNA-sequencing to assemble and annotate the first transcriptome of P. mexicana to facilitate ecological genomics studies in the future and aid the identification of genes underlying adaptation and speciation in the system. Description We provide the first annotated reference transcriptome of P. mexicana. Our transcriptome shows high congruence with other published fish transcriptomes, including that of the guppy, medaka, zebrafish, and stickleback. Transcriptome annotation uncovered the presence of candidate genes relevant in the study of adaptation to extreme environments. We describe general and oxidative stress response genes as well as genes involved in pathways induced by hypoxia or involved in sulfide metabolism. To facilitate future comparative analyses, we also conducted quantitative comparisons between P. mexicana from different river drainages. 106,524 single nucleotide polymorphisms were detected in our dataset, including potential markers that are putatively fixed across drainages. Furthermore, specimens from different drainages exhibited some consistent differences in gene regulation. Conclusions Our study provides a valuable genomic resource to study the molecular underpinnings of adaptation to extreme environments in replicated sulfide spring and cave environments. In

  1. Transcriptome sequencing and whole genome expression profiling of chrysanthemum under dehydration stress

    PubMed Central

    2013-01-01

    Background Chrysanthemum is one of the most important ornamental crops in the world and drought stress seriously limits its production and distribution. In order to generate a functional genomics resource and obtain a deeper understanding of the molecular mechanisms regarding chrysanthemum responses to dehydration stress, we performed large-scale transcriptome sequencing of chrysanthemum plants under dehydration stress using the Illumina sequencing technology. Results Two cDNA libraries constructed from mRNAs of control and dehydration-treated seedlings were sequenced by Illumina technology. A total of more than 100 million reads were generated and de novo assembled into 98,180 unique transcripts which were further extensively annotated by comparing their sequencing to different protein databases. Biochemical pathways were predicted from these transcript sequences. Furthermore, we performed gene expression profiling analysis upon dehydration treatment in chrysanthemum and identified 8,558 dehydration-responsive unique transcripts, including 307 transcription factors and 229 protein kinases and many well-known stress responsive genes. Gene ontology (GO) term enrichment and biochemical pathway analyses showed that dehydration stress caused changes in hormone response, secondary and amino acid metabolism, and light and photoperiod response. These findings suggest that drought tolerance of chrysanthemum plants may be related to the regulation of hormone biosynthesis and signaling, reduction of oxidative damage, stabilization of cell proteins and structures, and maintenance of energy and carbon supply. Conclusions Our transcriptome sequences can provide a valuable resource for chrysanthemum breeding and research and novel insights into chrysanthemum responses to dehydration stress and offer candidate genes or markers that can be used to guide future studies attempting to breed drought tolerant chrysanthemum cultivars. PMID:24074255

  2. Genome Annotation and Transcriptomics of Oil-Producing Algae

    DTIC Science & Technology

    2015-03-16

    accumulation in Chlamydomonas. We used RNA -Seq on the Illumina platform for quantitative determination of the Chlamydomonas transcriptome. Deep...signal and to unequivocally identify relevant changes in RNA abundance. This led to the functional identification of 8 acyltransferases, several of...which were documented to have a role in TAG accumulation in stressed algal cells, and candidate regulators of the N starvation response. We used our RNA

  3. Genome wide transcriptome profiling reveals differential gene expression in secondary metabolite pathway of Cymbopogon winterianus

    PubMed Central

    Devi, Kamalakshi; Mishra, Surajit K.; Sahu, Jagajjit; Panda, Debashis; Modi, Mahendra K.; Sen, Priyabrata

    2016-01-01

    Advances in transcriptome sequencing provide fast, cost-effective and reliable approach to generate large expression datasets especially suitable for non-model species to identify putative genes, key pathway and regulatory mechanism. Citronella (Cymbopogon winterianus) is an aromatic medicinal grass used for anti-tumoral, antibacterial, anti-fungal, antiviral, detoxifying and natural insect repellent properties. Despite of having number of utilities, the genes involved in terpenes biosynthetic pathway is not yet clearly elucidated. The present study is a pioneering attempt to generate an exhaustive molecular information of secondary metabolite pathway and to increase genomic resources in Citronella. Using high-throughput RNA-Seq technology, root and leaf transcriptome was analysed at an unprecedented depth (11.7 Gb). Targeted searches identified majority of the genes associated with metabolic pathway and other natural product pathway viz. antibiotics synthesis along with many novel genes. Terpenoid biosynthesis genes comparative expression results were validated for 15 unigenes by RT-PCR and qRT-PCR. Thus the coverage of these transcriptome is comprehensive enough to discover all known genes of major metabolic pathways. This transcriptome dataset can serve as important public information for gene expression, genomics and function genomics studies in Citronella and shall act as a benchmark for future improvement of the crop. PMID:26877149

  4. Genome and transcriptome analysis of surfactin biosynthesis in Bacillus amyloliquefaciens MT45

    PubMed Central

    Zhi, Yan; Wu, Qun; Xu, Yan

    2017-01-01

    Natural Bacillus isolates generate limited amounts of surfactin (<10% of their biomass), which functions as an antibiotic or signalling molecule in inter-/intra-specific interactions. However, overproduction of surfactin in Bacillus amyloliquefaciens MT45 was observed at a titre of 2.93 g/l, which is equivalent to half of the maximum biomass. To systemically unravel this efficient biosynthetic process, the genome and transcriptome of this bacterium were compared with those of B. amyloliquefaciens type strain DSM7T. MT45 possesses a smaller genome while containing more unique transporters and resistance-associated genes. Comparative transcriptome analysis revealed notable enrichment of the surfactin synthesis pathway in MT45, including central carbon metabolism and fatty acid biosynthesis to provide sufficient quantities of building precursors. Most importantly, the modular surfactin synthase overexpressed (9 to 49-fold) in MT45 compared to DSM7T suggested efficient surfactin assembly and resulted in the overproduction of surfactin. Furthermore, based on the expression trends observed in the transcriptome, there are multiple potential regulatory genes mediating the expression of surfactin synthase. Thus, the results of the present study provide new insights regarding the synthesis and regulation of surfactin in high-producing strain and enrich the genomic and transcriptomic resources available for B. amyloliquefaciens. PMID:28112210

  5. Genome wide transcriptome profiling reveals differential gene expression in secondary metabolite pathway of Cymbopogon winterianus.

    PubMed

    Devi, Kamalakshi; Mishra, Surajit K; Sahu, Jagajjit; Panda, Debashis; Modi, Mahendra K; Sen, Priyabrata

    2016-02-15

    Advances in transcriptome sequencing provide fast, cost-effective and reliable approach to generate large expression datasets especially suitable for non-model species to identify putative genes, key pathway and regulatory mechanism. Citronella (Cymbopogon winterianus) is an aromatic medicinal grass used for anti-tumoral, antibacterial, anti-fungal, antiviral, detoxifying and natural insect repellent properties. Despite of having number of utilities, the genes involved in terpenes biosynthetic pathway is not yet clearly elucidated. The present study is a pioneering attempt to generate an exhaustive molecular information of secondary metabolite pathway and to increase genomic resources in Citronella. Using high-throughput RNA-Seq technology, root and leaf transcriptome was analysed at an unprecedented depth (11.7 Gb). Targeted searches identified majority of the genes associated with metabolic pathway and other natural product pathway viz. antibiotics synthesis along with many novel genes. Terpenoid biosynthesis genes comparative expression results were validated for 15 unigenes by RT-PCR and qRT-PCR. Thus the coverage of these transcriptome is comprehensive enough to discover all known genes of major metabolic pathways. This transcriptome dataset can serve as important public information for gene expression, genomics and function genomics studies in Citronella and shall act as a benchmark for future improvement of the crop.

  6. Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools

    PubMed Central

    Deshmukh, Rupesh K.; Sonah, Humira; Bélanger, Richard R.

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is

  7. CaPSID: A bioinformatics platform for computational pathogen sequence identification in human genomes and transcriptomes

    PubMed Central

    2012-01-01

    Background It is now well established that nearly 20% of human cancers are caused by infectious agents, and the list of human oncogenic pathogens will grow in the future for a variety of cancer types. Whole tumor transcriptome and genome sequencing by next-generation sequencing technologies presents an unparalleled opportunity for pathogen detection and discovery in human tissues but requires development of new genome-wide bioinformatics tools. Results Here we present CaPSID (Computational Pathogen Sequence IDentification), a comprehensive bioinformatics platform for identifying, querying and visualizing both exogenous and endogenous pathogen nucleotide sequences in tumor genomes and transcriptomes. CaPSID includes a scalable, high performance database for data storage and a web application that integrates the genome browser JBrowse. CaPSID also provides useful metrics for sequence analysis of pre-aligned BAM files, such as gene and genome coverage, and is optimized to run efficiently on multiprocessor computers with low memory usage. Conclusions To demonstrate the usefulness and efficiency of CaPSID, we carried out a comprehensive analysis of both a simulated dataset and transcriptome samples from ovarian cancer. CaPSID correctly identified all of the human and pathogen sequences in the simulated dataset, while in the ovarian dataset CaPSID’s predictions were successfully validated in vitro. PMID:22901030

  8. Comprehensive identification and quantification of microbial transcriptomes by genome-wide unbiased methods.

    PubMed

    Mäder, Ulrike; Nicolas, Pierre; Richard, Hugues; Bessières, Philippe; Aymerich, Stéphane

    2011-02-01

    Genomic tiling array transcriptomics and RNA-seq are two powerful and rapidly developing approaches for unbiased transcriptome analysis. Providing comprehensive identification and quantification of transcripts with an unprecedented resolution, they are leading to major breakthroughs in systems biology. Here we review each step of the analysis from library preparation to the interpretation of the data, with particular attention paid to the possible sources of artifacts. Methodological requirements and statistical frameworks are often similar in both the approaches despite differences in the nature of the data. Tiling array analysis does not require rRNA depletion and benefits from a more mature computational workflow, whereas RNA-Seq has a clear lead in terms of background noise and dynamic range with a considerable potential for evolution with the improvements of sequencing technologies. Being independent of prior sequence knowledge, RNA-seq will boost metatranscriptomics and evolutionary transcriptomics applications.

  9. Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence.

    PubMed

    Góngora-Castillo, Elsa; Buell, C Robin

    2013-04-01

    Plant natural product research can be facilitated through genome and transcriptome sequencing approaches that generate informative sequence and expression datasets that enable characterization of biochemical pathways of interest. As the overwhelming majority of plant-derived natural products are derived from species with little, if any, sequence and/or genomic resources, the ability to perform whole genome shotgun sequencing and assembly has been and will continue to be transformative as access to a genome sequence provides molecular resources and a context for discovery and characterization of biosynthetic pathways. Due to the reduced size and complexity of the transcriptome relative to the genome, transcriptome sequencing provides a rapid, inexpensive approach to access gene sequences, gene expression abundances, and gene expression patterns in any species, including those that lack a reference genome sequence. To date, successful applications of RNA sequencing in conjunction with de novo transcriptome assembly has enabled identification of new genes in an array of biochemical pathways in plants. While sequencing technologies are well developed, challenges remain in the handling and analysis of transcriptome sequences. In this Highlight article, we provide an overview of the bioinformatics challenges associated with transcriptome analyses using short read sequences and how to address these issues in plant species that lack a reference genome.

  10. Benchmarking next-generation transcriptome sequencing for functional and evolutionary genomics.

    PubMed

    Gibbons, John G; Janson, Eric M; Hittinger, Chris Todd; Johnston, Mark; Abbot, Patrick; Rokas, Antonis

    2009-12-01

    Next-generation sequencing has opened the door to genomic analysis of nonmodel organisms. Technologies generating long-sequence reads (200-400 bp) are increasingly used in evolutionary studies of nonmodel organisms, but the short-sequence reads (30-50 bp) that can be produced at lower cost are thought to be of limited utility for de novo sequencing applications. Here, we tested this assumption by short-read sequencing the transcriptomes of the tropical disease vectors Aedes aegypti and Anopheles gambiae, for which complete genome sequences are available. Comparison of our results to the reference genomes allowed us to accurately evaluate the quantity, quality, and functional and evolutionary information content of our "test" data. We produced more than 0.7 billion nucleotides of sequenced data per species that assembled into more than 21,000 test contigs larger than 100 bp per species and covered approximately 27% of the Aedes reference transcriptome. Remarkably, the substitution error rate in the test contigs was approximately 0.25% per site, with very few indels or assembly errors. Test contigs of both species were enriched for genes involved in energy production and protein synthesis and underrepresented in genes involved in transcription and differentiation. Ortholog prediction using the test contigs was accurate across hundreds of millions of years of evolution. Our results demonstrate the considerable utility of short-read transcriptome sequencing for genomic studies of nonmodel organisms and suggest an approach for assessing the information content of next-generation data for evolutionary studies.

  11. The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery

    PubMed Central

    2013-01-01

    Background The small ruminant parasite Haemonchus contortus is the most widely used parasitic nematode in drug discovery, vaccine development and anthelmintic resistance research. Its remarkable propensity to develop resistance threatens the viability of the sheep industry in many regions of the world and provides a cautionary example of the effect of mass drug administration to control parasitic nematodes. Its phylogenetic position makes it particularly well placed for comparison with the free-living nematode Caenorhabditis elegans and the most economically important parasites of livestock and humans. Results Here we report the detailed analysis of a draft genome assembly and extensive transcriptomic dataset for H. contortus. This represents the first genome to be published for a strongylid nematode and the most extensive transcriptomic dataset for any parasitic nematode reported to date. We show a general pattern of conservation of genome structure and gene content between H. contortus and C. elegans, but also a dramatic expansion of important parasite gene families. We identify genes involved in parasite-specific pathways such as blood feeding, neurological function, and drug metabolism. In particular, we describe complete gene repertoires for known drug target families, providing the most comprehensive understanding yet of the action of several important anthelmintics. Also, we identify a set of genes enriched in the parasitic stages of the lifecycle and the parasite gut that provide a rich source of vaccine and drug target candidates. Conclusions The H. contortus genome and transcriptome provide an essential platform for postgenomic research in this and other important strongylid parasites. PMID:23985316

  12. A new RNASeq-based reference transcriptome for sugar beet and its application in transcriptome-scale analysis of vernalization and gibberellin responses

    PubMed Central

    2012-01-01

    Background Sugar beet (Beta vulgaris sp. vulgaris) crops account for about 30% of world sugar. Sugar yield is compromised by reproductive growth hence crops must remain vegetative until harvest. Prolonged exposure to cold temperature (vernalization) in the range 6°C to 12°C induces reproductive growth, leading to bolting (rapid elongation of the main stem) and flowering. Spring cultivation of crops in cool temperate climates makes them vulnerable to vernalization and hence bolting, which is initiated in the apical shoot meristem in processes involving interaction between gibberellin (GA) hormones and vernalization. The underlying mechanisms are unknown and genome scale next generation sequencing approaches now offer comprehensive strategies to investigate them; enabling the identification of novel targets for bolting control in sugar beet crops. In this study, we demonstrate the application of an mRNA-Seq based strategy for this purpose. Results There is no sugar beet reference genome, or public expression array platforms. We therefore used RNA-Seq to generate the first reference transcriptome. We next performed digital gene expression profiling using shoot apex mRNA from two sugar beet cultivars with and without applied GA, and also a vernalized cultivar with and without applied GA. Subsequent bioinformatics analyses identified transcriptional changes associated with genotypic difference and experimental treatments. Analysis of expression profiles in response to vernalization and GA treatment suggested previously unsuspected roles for a RAV1-like AP2/B3 domain protein in vernalization and efflux transporters in the GA response. Conclusions Next generation RNA-Seq enabled the generation of the first reference transcriptome for sugar beet and the study of global transcriptional responses in the shoot apex to vernalization and GA treatment, without the need for a reference genome or established array platforms. Comprehensive bioinformatic analysis identified

  13. Applications of genome-scale metabolic reconstructions

    PubMed Central

    Oberhardt, Matthew A; Palsson, Bernhard Ø; Papin, Jason A

    2009-01-01

    The availability and utility of genome-scale metabolic reconstructions have exploded since the first genome-scale reconstruction was published a decade ago. Reconstructions have now been built for a wide variety of organisms, and have been used toward five major ends: (1) contextualization of high-throughput data, (2) guidance of metabolic engineering, (3) directing hypothesis-driven discovery, (4) interrogation of multi-species relationships, and (5) network property discovery. In this review, we examine the many uses and future directions of genome-scale metabolic reconstructions, and we highlight trends and opportunities in the field that will make the greatest impact on many fields of biology. PMID:19888215

  14. Genome reannotation of the lizard Anolis carolinensis based on 14 adult and embryonic deep transcriptomes

    PubMed Central

    2013-01-01

    Background The green anole lizard, Anolis carolinensis, is a key species for both laboratory and field-based studies of evolutionary genetics, development, neurobiology, physiology, behavior, and ecology. As the first non-avian reptilian genome sequenced, A. carolinesis is also a prime reptilian model for comparison with other vertebrate genomes. The public databases of Ensembl and NCBI have provided a first generation gene annotation of the anole genome that relies primarily on sequence conservation with related species. A second generation annotation based on tissue-specific transcriptomes would provide a valuable resource for molecular studies. Results Here we provide an annotation of the A. carolinensis genome based on de novo assembly of deep transcriptomes of 14 adult and embryonic tissues. This revised annotation describes 59,373 transcripts, compared to 16,533 and 18,939 currently for Ensembl and NCBI, and 22,962 predicted protein-coding genes. A key improvement in this revised annotation is coverage of untranslated region (UTR) sequences, with 79% and 59% of transcripts containing 5’ and 3’ UTRs, respectively. Gaps in genome sequence from the current A. carolinensis build (Anocar2.0) are highlighted by our identification of 16,542 unmapped transcripts, representing 6,695 orthologues, with less than 70% genomic coverage. Conclusions Incorporation of tissue-specific transcriptome sequence into the A. carolinensis genome annotation has markedly improved its utility for comparative and functional studies. Increased UTR coverage allows for more accurate predicted protein sequence and regulatory analysis. This revised annotation also provides an atlas of gene expression specific to adult and embryonic tissues. PMID:23343042

  15. The channel catfish genome sequence provides insights into the evolution of scale formation in teleosts

    PubMed Central

    Liu, Zhanjiang; Liu, Shikai; Yao, Jun; Bao, Lisui; Zhang, Jiaren; Li, Yun; Jiang, Chen; Sun, Luyang; Wang, Ruijia; Zhang, Yu; Zhou, Tao; Zeng, Qifan; Fu, Qiang; Gao, Sen; Li, Ning; Koren, Sergey; Jiang, Yanliang; Zimin, Aleksey; Xu, Peng; Phillippy, Adam M.; Geng, Xin; Song, Lin; Sun, Fanyue; Li, Chao; Wang, Xiaozhu; Chen, Ailu; Jin, Yulin; Yuan, Zihao; Yang, Yujia; Tan, Suxu; Peatman, Eric; Lu, Jianguo; Qin, Zhenkui; Dunham, Rex; Li, Zhaoxia; Sonstegard, Tad; Feng, Jianbin; Danzmann, Roy G.; Schroeder, Steven; Scheffler, Brian; Duke, Mary V.; Ballard, Linda; Kucuktas, Huseyin; Kaltenboeck, Ludmilla; Liu, Haixia; Armbruster, Jonathan; Xie, Yangjie; Kirby, Mona L.; Tian, Yi; Flanagan, Mary Elizabeth; Mu, Weijie; Waldbieser, Geoffrey C.

    2016-01-01

    Catfish represent 12% of teleost or 6.3% of all vertebrate species, and are of enormous economic value. Here we report a high-quality reference genome sequence of channel catfish (Ictalurus punctatus), the major aquaculture species in the US. The reference genome sequence was validated by genetic mapping of 54,000 SNPs, and annotated with 26,661 predicted protein-coding genes. Through comparative analysis of genomes and transcriptomes of scaled and scaleless fish and scale regeneration experiments, we address the genomic basis for the most striking physical characteristic of catfish, the evolutionary loss of scales and provide evidence that lack of secretory calcium-binding phosphoproteins accounts for the evolutionary loss of scales in catfish. The channel catfish reference genome sequence, along with two additional genome sequences and transcriptomes of scaled catfishes, provide crucial resources for evolutionary and biological studies. This work also demonstrates the power of comparative subtraction of candidate genes for traits of structural significance. PMID:27249958

  16. Oil Accumulation by the Oleaginous Diatom Fistulifera solaris as Revealed by the Genome and Transcriptome

    PubMed Central

    Veluchamy, Alaguraj; Tanaka, Michihiro; Abida, Heni; Maréchal, Eric; Bowler, Chris; Muto, Masaki; Sunaga, Yoshihiko; Tanaka, Masayoshi; Taniguchi, Takeaki; Fukuda, Yorikane; Nemoto, Michiko; Matsumoto, Mitsufumi; Wong, Pui Shan; Aburatani, Sachiyo; Fujibuchi, Wataru

    2015-01-01

    Oleaginous photosynthetic organisms such as microalgae are promising sources for biofuel production through the generation of carbon-neutral sustainable energy. However, the metabolic mechanisms driving high-rate lipid production in these oleaginous organisms remain unclear, thus impeding efforts to improve productivity through genetic modifications. We analyzed the genome and transcriptome of the oleaginous diatom Fistulifera solaris JPCC DA0580. Next-generation sequencing technology provided evidence of an allodiploid genome structure, suggesting unorthodox molecular evolutionary and genetic regulatory systems for reinforcing metabolic efficiencies. Although major metabolic pathways were shared with nonoleaginous diatoms, transcriptome analysis revealed unique expression patterns, such as concomitant upregulation of fatty acid/triacylglycerol biosynthesis and fatty acid degradation (β-oxidation) in concert with ATP production. This peculiar pattern of gene expression may account for the simultaneous growth and oil accumulation phenotype and may inspire novel biofuel production technology based on this oleaginous microalga. PMID:25634988

  17. Genomic and transcriptomic characterization of skull base chordoma

    PubMed Central

    Sa, Jason K.; Lee, In-Hee; Hong, Sang Duk; Kong, Doo-Sik; Nam, Do-Hyun

    2017-01-01

    Skull base chordoma is a primary rare malignant bone-origin tumor showing relatively slow growth pattern and locally destructive lesions, which can only be characterized by histologic components. There is no available prognostic or therapeutic biomarker to predict clinical outcome or treatment response and the molecular mechanisms underlying chordoma development still remain unexplored. Therefore, we sought out to identify novel somatic variations that are associated with chordoma progression and potentially employed as therapeutic targets. Thirteen skull base chordomas were subjected for whole-exome and/or whole-transcriptome sequencing. In process, we have identified chromosomal aberration in 1p, 7, 10, 13 and 17q, high frequency of functional germline SNP of the T gene, rs2305089 (P = 0.0038) and several recurrent alterations including MUC4, NBPF1, NPIPB15 mutations and novel gene fusion of SAMD5-SASH1 for the first time in skull base chordoma. PMID:27901492

  18. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics

    PubMed Central

    Patel, Ravi K.; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB), which provides the comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database features many tools for similarity search, functional annotation (putative function, PFAM domain and gene ontology) search and comparative gene expression analysis. The current release of CTDB (v2.0) hosts transcriptome datasets with high quality functional annotation from cultivated (desi and kabuli types) and wild chickpea. A catalog of transcription factor families and their expression profiles in chickpea are available in the database. The gene expression data have been integrated to study the expression profiles of chickpea transcripts in major tissues/organs and various stages of flower development. The utilities, such as similarity search, ortholog identification and comparative gene expression have also been implemented in the database to facilitate comparative genomic studies among different legumes and Arabidopsis. Furthermore, the CTDB represents a resource for the discovery of functional molecular markers (microsatellites and single nucleotide polymorphisms) between different chickpea types. We anticipate that integrated information content of this database will accelerate the functional and applied genomic research for improvement of chickpea. The CTDB web service is freely available at http://nipgr.res.in/ctdb.html. PMID:26322998

  19. The Draft Genome and Transcriptome of the Atlantic Horseshoe Crab, Limulus polyphemus

    PubMed Central

    Ramsdell, Jordan S.; Watson III, Winsor H.; Chabot, Christopher C.

    2017-01-01

    The horseshoe crab, Limulus polyphemus, exhibits robust circadian and circatidal rhythms, but little is known about the molecular mechanisms underlying those rhythms. In this study, horseshoe crabs were collected during the day and night as well as high and low tides, and their muscle and central nervous system tissues were processed for genome and transcriptome sequencing, respectively. The genome assembly resulted in 7.4 × 105 contigs with N50 of 4,736, while the transcriptome assembly resulted in 9.3 × 104 contigs and N50 of 3,497. Analysis of functional completeness by the identification of putative universal orthologs suggests that the transcriptome has three times more total expected orthologs than the genome. Interestingly, RNA-Seq analysis indicated no statistically significant changes in expression level for any circadian core or accessory gene, but there was significant cycling of several noncircadian transcripts. Overall, these assemblies provide a resource to investigate the Limulus clock systems and provide a large dataset for further exploration into the taxonomy and biology of the Atlantic horseshoe crab. PMID:28265565

  20. Comparative Transcriptome and Chloroplast Genome Analyses of Two Related Dipteronia Species

    PubMed Central

    Zhou, Tao; Chen, Chen; Wei, Yue; Chang, Yongxia; Bai, Guoqing; Li, Zhonghu; Kanwal, Nazish; Zhao, Guifang

    2016-01-01

    Dipteronia (order Sapindales) is an endangered genus endemic to China and has two living species, D.sinensis and D. dyeriana. The plants are closely related to the genus Acer, which is also classified in the order Sapindales. Evolutionary studies on Dipteronia have been hindered by the paucity of information on their genomes and plastids. Here, we used next generation sequencing to characterize the transcriptomes and complete chloroplast genomes of both Dipteronia species. A comparison of the transcriptomes of both species identified a total of 7814 orthologs. Estimation of selection pressures using Ka/Ks ratios showed that only 30 of 5435 orthologous pairs had a ratio significantly >1, i.e., showing positive selection. However, 4041 orthologs had a Ka/Ks < 0.5 (p < 0.05), suggesting that most genes had likely undergone purifying selection. Based on orthologous unigenes, 314 single copy nuclear genes (SCNGs) were identified. Through a combination of de novo and reference guided assembly, plastid genomes were obtained; that of D. sinensis was 157,080 bp and that of D. dyeriana was 157,071 bp. Both plastid genomes encoded 87 protein coding genes, 40 tRNAs, and 8 rRNAs; no significant differences were detected in the size, gene content, and organization of the two plastomes. We used the whole chloroplast genomes to determine the phylogeny of D. sinensis and D. dyeriana and confirmed that the two species were highly divergent. Overall, our study provides comprehensive transcriptomic and chloroplast genomic resources, which will be valuable for future evolutionary studies of Dipteronia. PMID:27790228

  1. Genome scale engineering techniques for metabolic engineering.

    PubMed

    Liu, Rongming; Bassalo, Marcelo C; Zeitoun, Ramsey I; Gill, Ryan T

    2015-11-01

    Metabolic engineering has expanded from a focus on designs requiring a small number of genetic modifications to increasingly complex designs driven by advances in genome-scale engineering technologies. Metabolic engineering has been generally defined by the use of iterative cycles of rational genome modifications, strain analysis and characterization, and a synthesis step that fuels additional hypothesis generation. This cycle mirrors the Design-Build-Test-Learn cycle followed throughout various engineering fields that has recently become a defining aspect of synthetic biology. This review will attempt to summarize recent genome-scale design, build, test, and learn technologies and relate their use to a range of metabolic engineering applications.

  2. Discovery of germline-related genes in Cephalochordate amphioxus: A genome wide survey using genome annotation and transcriptome data.

    PubMed

    Yue, Jia-Xing; Li, Kun-Lung; Yu, Jr-Kai

    2015-12-01

    The generation of germline cells is a critical process in the reproduction of multicellular organisms. Studies in animal models have identified a common repertoire of genes that play essential roles in primordial germ cell (PGC) formation. However, comparative studies also indicate that the timing and regulation of this core genetic program vary considerably in different animals, raising the intriguing questions regarding the evolution of PGC developmental mechanisms in metazoans. Cephalochordates (commonly called amphioxus or lancelets) represent one of the invertebrate chordate groups and can provide important information about the evolution of developmental mechanisms in the chordate lineage. In this study, we used genome and transcriptome data to identify germline-related genes in two distantly related cephalochordate species, Branchiostoma floridae and Asymmetron lucayanum. Branchiostoma and Asymmetron diverged more than 120 MYA, and the most conspicuous difference between them is their gonadal morphology. We used important germline developmental genes in several model animals to search the amphioxus genome and transcriptome dataset for conserved homologs. We also annotated the assembled transcriptome data using Gene Ontology (GO) terms to facilitate the discovery of putative genes associated with germ cell development and reproductive functions in amphioxus. We further confirmed the expression of 14 genes in developing oocytes or mature eggs using whole mount in situ hybridization, suggesting their potential functions in amphioxus germ cell development. The results of this global survey provide a useful resource for testing potential functions of candidate germline-related genes in cephalochordates and for investigating differences in gonad developmental mechanisms between Branchiostoma and Asymmetron species.

  3. The genome and developmental transcriptome of the strongylid nematode Haemonchus contortus

    PubMed Central

    2013-01-01

    Background The barber's pole worm, Haemonchus contortus, is one of the most economically important parasites of small ruminants worldwide. Although this parasite can be controlled using anthelmintic drugs, resistance against most drugs in common use has become a widespread problem. We provide a draft of the genome and the transcriptomes of all key developmental stages of H. contortus to support biological and biotechnological research areas of this and related parasites. Results The draft genome of H. contortus is 320 Mb in size and encodes 23,610 protein-coding genes. On a fundamental level, we elucidate transcriptional alterations taking place throughout the life cycle, characterize the parasite's gene silencing machinery, and explore molecules involved in development, reproduction, host-parasite interactions, immunity, and disease. The secretome of H. contortus is particularly rich in peptidases linked to blood-feeding activity and interactions with host tissues, and a diverse array of molecules is involved in complex immune responses. On an applied level, we predict drug targets and identify vaccine molecules. Conclusions The draft genome and developmental transcriptome of H. contortus provide a major resource to the scientific community for a wide range of genomic, genetic, proteomic, metabolomic, evolutionary, biological, ecological, and epidemiological investigations, and a solid foundation for biotechnological outcomes, including new anthelmintics, vaccines and diagnostic tests. This first draft genome of any strongylid nematode paves the way for a rapid acceleration in our understanding of a wide range of socioeconomically important parasites of one of the largest nematode orders. PMID:23985341

  4. Dataset for distribution of SIDER2 elements in the Leishmania major genome and transcriptome.

    PubMed

    Requena, Jose M; Rastrojo, Alberto; Garde, Esther; López, Manuel C; Thomas, M Carmen; Aguado, Begoña

    2017-04-01

    This paper contains data related to the research article entitled "Genomic cartography and proposal of nomenclature for the repeated, interspersed elements of the Leishmania major SIDER2 family and identification of SIDER2-containing transcripts" [1]. SIDER2 elements are repeated sequences, derived from, nowadays, extinct retrotransposons, that populate the genomes of protist of the genera Leishmania. This dataset (Supplementary file 1), an inventory of 1100 SIDER2 elements, was generated by surveying the L. major complete genome using bioinformatics tools with further manual refinements. In addition to the genomic distribution of these elements (summarized in Fig. 1), this dataset contains information regarding their association with specific transcripts, based on the recently established transcriptome for L. major[2].

  5. Trade-off between Transcriptome Plasticity and Genome Evolution in Cephalopods.

    PubMed

    Liscovitch-Brauer, Noa; Alon, Shahar; Porath, Hagit T; Elstein, Boaz; Unger, Ron; Ziv, Tamar; Admon, Arie; Levanon, Erez Y; Rosenthal, Joshua J C; Eisenberg, Eli

    2017-04-06

    RNA editing, a post-transcriptional process, allows the diversification of proteomes beyond the genomic blueprint; however it is infrequently used among animals for this purpose. Recent reports suggesting increased levels of RNA editing in squids thus raise the question of the nature and effects of these events. We here show that RNA editing is particularly common in behaviorally sophisticated coleoid cephalopods, with tens of thousands of evolutionarily conserved sites. Editing is enriched in the nervous system, affecting molecules pertinent for excitability and neuronal morphology. The genomic sequence flanking editing sites is highly conserved, suggesting that the process confers a selective advantage. Due to the large number of sites, the surrounding conservation greatly reduces the number of mutations and genomic polymorphisms in protein-coding regions. This trade-off between genome evolution and transcriptome plasticity highlights the importance of RNA recoding as a strategy for diversifying proteins, particularly those associated with neural function. PAPERCLIP.

  6. Genome-scale functional profiling of the mammalian AP-1 signaling pathway

    PubMed Central

    Chanda, Sumit K.; White, Suhaila; Orth, Anthony P.; Reisdorph, Richard; Miraglia, Loren; Thomas, Russell S.; DeJesus, Paul; Mason, Daniel E.; Huang, Qihong; Vega, Raquel; Yu, De-Hua; Nelson, Christian G.; Smith, Brendan M.; Terry, Robert; Linford, Alicia S.; Yu, Yang; Chirn, Gung-wei; Song, Chuanzheng; Labow, Mark A.; Cohen, Dalia; King, Frederick J.; Peters, Eric C.; Schultz, Peter G.; Vogt, Peter K.; Hogenesch, John B.; Caldwell, Jeremy S.

    2003-01-01

    Large-scale functional genomics approaches are fundamental to the characterization of mammalian transcriptomes annotated by genome sequencing projects. Although current high-throughput strategies systematically survey either transcriptional or biochemical networks, analogous genome-scale investigations that analyze gene function in mammalian cells have yet to be fully realized. Through transient overexpression analysis, we describe the parallel interrogation of ≈20,000 sequence annotated genes in cancer-related signaling pathways. For experimental validation of these genome data, we apply an integrative strategy to characterize previously unreported effectors of activator protein-1 (AP-1) mediated growth and mitogenic response pathways. These studies identify the ADP-ribosylation factor GTPase-activating protein Centaurin α1 and a Tudor domain-containing hypothetical protein as putative AP-1 regulatory oncogenes. These results provide insight into the composition of the AP-1 signaling machinery and validate this approach as a tractable platform for genome-wide functional analysis. PMID:14514886

  7. Transcriptome-scale homoeolog-specific transcript assemblies of bread wheat

    PubMed Central

    2012-01-01

    Background Bread wheat is one of the world’s most important food crops and considerable efforts have been made to develop genomic resources for this species. This includes an on-going project by the International Wheat Genome Sequencing Consortium to assemble its large and complex genome, which is hexaploid and contains three closely related ‘homoeologous’ copies for each chromosome. This multi-national effort avoids the complications polyploidy entails for correct assembly of the genome by sequencing flow-sorted chromosome arms one at a time. Here we report on an alternate approach, a direct homoeolog-specific assembly of the expressed portion of the genome, the transcriptome. Results After assessment of the ability of various assemblers to generate homoeolog-specific assemblies, we employed a two-stage assembly process to produce a high-quality assembly of the transcriptome of hexaploid wheat from Roche-454 and Illumina GAIIx paired-end sequence reads. The assembly process made use of a rapid partitioning of expressed sequences into homoeologous clusters, followed by a parallel high-fidelity assembly of each cluster on a 1150-processor compute cloud. We assessed assembly quality through comparison to known wheat gene sequences and found that in ca. 98.5% of cases the assembly was sufficiently accurate for homoeologous triplets to be cleanly separated into either two or three separate contigs. Comparison to publicly available transcript collections suggests that the assembly covers ~75-80% of the complete transcriptome. Conclusions This work therefore describes the first homoeolog-specific sequence assembly of the wheat transcriptome and provides a reference transcriptome for future wheat research. Furthermore, our assembly methodology is transferable to other polyploid organisms. PMID:22989011

  8. The genomic and transcriptomic landscape of a HeLa cell line.

    PubMed

    Landry, Jonathan J M; Pyl, Paul Theodor; Rausch, Tobias; Zichner, Thomas; Tekkedil, Manu M; Stütz, Adrian M; Jauch, Anna; Aiyar, Raeka S; Pau, Gregoire; Delhomme, Nicolas; Gagneur, Julien; Korbel, Jan O; Huber, Wolfgang; Steinmetz, Lars M

    2013-08-07

    HeLa is the most widely used model cell line for studying human cellular and molecular biology. To date, no genomic reference for this cell line has been released, and experiments have relied on the human reference genome. Effective design and interpretation of molecular genetic studies performed using HeLa cells require accurate genomic information. Here we present a detailed genomic and transcriptomic characterization of a HeLa cell line. We performed DNA and RNA sequencing of a HeLa Kyoto cell line and analyzed its mutational portfolio and gene expression profile. Segmentation of the genome according to copy number revealed a remarkably high level of aneuploidy and numerous large structural variants at unprecedented resolution. Some of the extensive genomic rearrangements are indicative of catastrophic chromosome shattering, known as chromothripsis. Our analysis of the HeLa gene expression profile revealed that several pathways, including cell cycle and DNA repair, exhibit significantly different expression patterns from those in normal human tissues. Our results provide the first detailed account of genomic variants in the HeLa genome, yielding insight into their impact on gene expression and cellular function as well as their origins. This study underscores the importance of accounting for the strikingly aberrant characteristics of HeLa cells when designing and interpreting experiments, and has implications for the use of HeLa as a model of human biology.

  9. Phenotypic, genomic, transcriptomic and proteomic changes in Bacillus cereus after a short-term space flight

    NASA Astrophysics Data System (ADS)

    Su, Longxiang; Zhou, Lisha; Liu, Jinwen; Cen, Zhong; Wu, Chunyan; Wang, Tong; Zhou, Tao; Chang, De; Guo, Yinghua; Fang, Xiangqun; Wang, Junfeng; Li, Tianzhi; Yin, Sanjun; Dai, Wenkui; Zhou, Yuping; Zhao, Jiao; Fang, Chengxiang; Yang, Ruifu; Liu, Changting

    2014-01-01

    The environment in space could affect microorganisms by changing a variety of features, including proliferation rate, cell physiology, cell metabolism, biofilm production, virulence, and drug resistance. However, the relevant mechanisms remain unclear. To explore the effect of a space environment on Bacillus cereus, a strain of B. cereus was sent to space for 398 h by ShenZhou VIII from November 1, 2011 to November 17, 2011. A ground simulation with similar temperature conditions was simultaneously performed as a control. After the flight, the flight and control strains were further analyzed using phenotypic, genomic, transcriptomic and proteomic techniques to explore the divergence of B. cereus in a space environment. The flight strains exhibited a significantly slower growth rate, a significantly higher amikacin resistance level, and changes in metabolism relative to the ground control strain. After the space flight, three polymorphic loci were found in the flight strains LCT-BC25 and LCT-BC235. A combined transcriptome and proteome analysis was performed, and this analysis revealed that the flight strains had changes in genes/proteins relevant to metabolism. In addition, certain genes/proteins that are relevant to structural function, gene expression modification and translation, and virulence were also altered. Our study represents the first documented analysis of the phenotypic, genomic, transcriptomic, and proteomic changes that occur in B. cereus during space flight, and our results could be beneficial to the field of space microbiology.

  10. The duck genome and transcriptome provide insight into an avian influenza virus reservoir species

    PubMed Central

    Chen, Hualan; Zhang, Yong; Qian, Wubin; Kim, Heebal; Gan, Shangquan; Zhao, Yiqiang; Li, Jianwen; Yi, Kang; Feng, Huapeng; Zhu, Pengyang; Li, Bo; Liu, Qiuyue; Fairley, Suan; Magor, Katharine E; Du, Zhenlin; Hu, Xiaoxiang; Goodman, Laurie; Tafer, Hakim; Vignal, Alain; Lee, Taeheon; Kim, Kyu-Won; Sheng, Zheya; An, Yang; Searle, Steve; Herrero, Javier; Groenen, Martien A M; Crooijmans, Richard P M A; Faraut, Thomas; Cai, Qingle; Webster, Robert G; Aldridge, Jerry R; Warren, Wesley C; Bartschat, Sebastian; Kehr, Stephanie; Marz, Manja; Stadler, Peter F; Smith, Jacqueline; Kraus, Robert H S; Zhao, Yaofeng; Ren, Liming; Fei, Jing; Morisson, Mireille; Kaiser, Pete; Griffin, Darren K; Rao, Man; Pitel, Frederique; Wang, Jun; Li, Ning

    2014-01-01

    The duck (Anas platyrhynchos) is one of the principal natural hosts of influenza A viruses. We present the duck genome sequence and perform deep transcriptome analyses to investigate immune-related genes. Our data indicate that the duck possesses a contractive immune gene repertoire, as in chicken and zebra finch, and this repertoire has been shaped through lineage-specific duplications. We identify genes that are responsive to influenza A viruses using the lung transcriptomes of control ducks and ones that were infected with either a highly pathogenic (A/duck/Hubei/49/05) or a weakly pathogenic (A/goose/Hubei/65/05) H5N1 virus. Further, we show how the duck’s defense mechanisms against influenza infection have been optimized through the diversification of its β-defensin and butyrophilin-like repertoires. These analyses, in combination with the genomic and transcriptomic data, provide a resource for characterizing the interaction between host and influenza viruses. PMID:23749191

  11. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

    PubMed

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

    2015-06-08

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae).

  12. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution

    PubMed Central

    Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.

    2015-01-01

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392

  13. Defining the transcriptome assembly and its use for genome dynamics and transcriptome profiling studies in pigeonpea (Cajanus cajan L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This study reports generation of large-scale genomic resources for pigeonpea, a so-called ‘orphan crop species’ of the semi-arid tropic regions. Roche FLX/454 sequencing was carried out on a normalized cDNA pool prepared from 31 tissues produced 494,353 short transcript reads (STRs). Cluster analysi...

  14. Genome and transcriptome of the regeneration-competent flatworm, Macrostomum lignano

    PubMed Central

    Wasik, Kaja; Gurtowski, James; Zhou, Xin; Ramos, Olivia Mendivil; Delás, M. Joaquina; Battistoni, Giorgia; El Demerdash, Osama; Falciatori, Ilaria; Vizoso, Dita B.; Smith, Andrew D.; Ladurner, Peter; Schärer, Lukas; McCombie, W. Richard; Hannon, Gregory J.; Schatz, Michael

    2015-01-01

    The free-living flatworm, Macrostomum lignano has an impressive regenerative capacity. Following injury, it can regenerate almost an entirely new organism because of the presence of an abundant somatic stem cell population, the neoblasts. This set of unique properties makes many flatworms attractive organisms for studying the evolution of pathways involved in tissue self-renewal, cell-fate specification, and regeneration. The use of these organisms as models, however, is hampered by the lack of a well-assembled and annotated genome sequences, fundamental to modern genetic and molecular studies. Here we report the genomic sequence of M. lignano and an accompanying characterization of its transcriptome. The genome structure of M. lignano is remarkably complex, with ∼75% of its sequence being comprised of simple repeats and transposon sequences. This has made high-quality assembly from Illumina reads alone impossible (N50 = 222 bp). We therefore generated 130× coverage by long sequencing reads from the Pacific Biosciences platform to create a substantially improved assembly with an N50 of 64 Kbp. We complemented the reference genome with an assembled and annotated transcriptome, and used both of these datasets in combination to probe gene-expression patterns during regeneration, examining pathways important to stem cell function. PMID:26392545

  15. Role of genomics and transcriptomics in selection of reintroduction source populations.

    PubMed

    He, Xiaoping; Johansson, Mattias L; Heath, Daniel D

    2016-10-01

    The use and importance of reintroduction as a conservation tool to return a species to its historical range from which it has been extirpated will increase as climate change and human development accelerate habitat loss and population extinctions. Although the number of reintroduction attempts has increased rapidly over the past 2 decades, the success rate is generally low. As a result of population differences in fitness-related traits and divergent responses to environmental stresses, population performance upon reintroduction is highly variable, and it is generally agreed that selecting an appropriate source population is a critical component of a successful reintroduction. Conservation genomics is an emerging field that addresses long-standing challenges in conservation, and the potential for using novel molecular genetic approaches to inform and improve conservation efforts is high. Because the successful establishment and persistence of reintroduced populations is highly dependent on the functional genetic variation and environmental stress tolerance of the source population, we propose the application of conservation genomics and transcriptomics to guide reintroduction practices. Specifically, we propose using genome-wide functional loci to estimate genetic variation of source populations. This estimate can then be used to predict the potential for adaptation. We also propose using transcriptional profiling to measure the expression response of fitness-related genes to environmental stresses as a proxy for acclimation (tolerance) capacity. Appropriate application of conservation genomics and transcriptomics has the potential to dramatically enhance reintroduction success in a time of rapidly declining biodiversity and accelerating environmental change.

  16. RSIADB, a collective resource for genome and transcriptome analyses in Rhizoctonia solani AG1 IA.

    PubMed

    Chen, Lei; Ai, Peng; Zhang, Jinfeng; Deng, Qiming; Wang, Shiquan; Li, Shuangcheng; Zhu, Jun; Li, Ping; Zheng, Aiping

    2016-01-01

    Rice [Oryza sativa (L.)] feeds more than half of the world's population. Rhizoctonia solaniis a major fungal pathogen of rice causing extreme crop losses in all rice-growing regions of the world. R. solani AG1 IA is a major cause of sheath blight in rice. In this study, we constructed a comprehensive and user-friendly web-based database, RSIADB, to analyse its draft genome and transcriptome. The database was built using the genome sequence (10,489 genes) and annotation information for R. solani AG1 IA. A total of six RNAseq samples of R. solani AG1 IA were also analysed, corresponding to 10, 18, 24, 32, 48 and 72 h after infection of rice leaves. The RSIADB database enables users to search, browse, and download gene sequences for R. solani AG1 IA, and mine the data using BLAST, Sequence Extractor, Browse and Construction Diagram tools that were integrated into the database. RSIADB is an important genomic resource for scientists working with R. solani AG1 IA and will assist researchers in analysing the annotated genome and transcriptome of this pathogen. This resource will facilitate studies on gene function, pathogenesis factors and secreted proteins, as well as provide an avenue for comparative analyses of genes expressed during different stages of infection. Database URL:http://genedenovoweb.ticp.net:81/rsia/index.php.

  17. Microbial genomics, transcriptomics and proteomics: new discoveries in decomposition research using complementary methods.

    PubMed

    Baldrian, Petr; López-Mondéjar, Rubén

    2014-02-01

    Molecular methods for the analysis of biomolecules have undergone rapid technological development in the last decade. The advent of next-generation sequencing methods and improvements in instrumental resolution enabled the analysis of complex transcriptome, proteome and metabolome data, as well as a detailed annotation of microbial genomes. The mechanisms of decomposition by model fungi have been described in unprecedented detail by the combination of genome sequencing, transcriptomics and proteomics. The increasing number of available genomes for fungi and bacteria shows that the genetic potential for decomposition of organic matter is widespread among taxonomically diverse microbial taxa, while expression studies document the importance of the regulation of expression in decomposition efficiency. Importantly, high-throughput methods of nucleic acid analysis used for the analysis of metagenomes and metatranscriptomes indicate the high diversity of decomposer communities in natural habitats and their taxonomic composition. Today, the metaproteomics of natural habitats is of interest. In combination with advanced analytical techniques to explore the products of decomposition and the accumulation of information on the genomes of environmentally relevant microorganisms, advanced methods in microbial ecophysiology should increase our understanding of the complex processes of organic matter transformation.

  18. The Genome and Development-Dependent Transcriptomes of Pyronema confluens: A Window into Fungal Evolution

    PubMed Central

    Traeger, Stefanie; Altegoer, Florian; Freitag, Michael; Gabaldon, Toni; Kempken, Frank; Kumar, Abhishek; Marcet-Houben, Marina; Pöggeler, Stefanie; Stajich, Jason E.; Nowrousian, Minou

    2013-01-01

    Fungi are a large group of eukaryotes found in nearly all ecosystems. More than 250 fungal genomes have already been sequenced, greatly improving our understanding of fungal evolution, physiology, and development. However, for the Pezizomycetes, an early-diverging lineage of filamentous ascomycetes, there is so far only one genome available, namely that of the black truffle, Tuber melanosporum, a mycorrhizal species with unusual subterranean fruiting bodies. To help close the sequence gap among basal filamentous ascomycetes, and to allow conclusions about the evolution of fungal development, we sequenced the genome and assayed transcriptomes during development of Pyronema confluens, a saprobic Pezizomycete with a typical apothecium as fruiting body. With a size of 50 Mb and ∼13,400 protein-coding genes, the genome is more characteristic of higher filamentous ascomycetes than the large, repeat-rich truffle genome; however, some typical features are different in the P. confluens lineage, e.g. the genomic environment of the mating type genes that is conserved in higher filamentous ascomycetes, but only partly conserved in P. confluens. On the other hand, P. confluens has a full complement of fungal photoreceptors, and expression studies indicate that light perception might be similar to distantly related ascomycetes and, thus, represent a basic feature of filamentous ascomycetes. Analysis of spliced RNA-seq sequence reads allowed the detection of natural antisense transcripts for 281 genes. The P. confluens genome contains an unusually high number of predicted orphan genes, many of which are upregulated during sexual development, consistent with the idea of rapid evolution of sex-associated genes. Comparative transcriptomics identified the transcription factor gene pro44 that is upregulated during development in P. confluens and the Sordariomycete Sordaria macrospora. The P. confluens pro44 gene (PCON_06721) was used to complement the S. macrospora pro44 deletion

  19. Human genomics. Effect of predicted protein-truncating genetic variants on the human transcriptome.

    PubMed

    Rivas, Manuel A; Pirinen, Matti; Conrad, Donald F; Lek, Monkol; Tsang, Emily K; Karczewski, Konrad J; Maller, Julian B; Kukurba, Kimberly R; DeLuca, David S; Fromer, Menachem; Ferreira, Pedro G; Smith, Kevin S; Zhang, Rui; Zhao, Fengmei; Banks, Eric; Poplin, Ryan; Ruderfer, Douglas M; Purcell, Shaun M; Tukiainen, Taru; Minikel, Eric V; Stenson, Peter D; Cooper, David N; Huang, Katharine H; Sullivan, Timothy J; Nedzel, Jared; Bustamante, Carlos D; Li, Jin Billy; Daly, Mark J; Guigo, Roderic; Donnelly, Peter; Ardlie, Kristin; Sammeth, Michael; Dermitzakis, Emmanouil T; McCarthy, Mark I; Montgomery, Stephen B; Lappalainen, Tuuli; MacArthur, Daniel G

    2015-05-08

    Accurate prediction of the functional effect of genetic variation is critical for clinical genome interpretation. We systematically characterized the transcriptome effects of protein-truncating variants, a class of variants expected to have profound effects on gene function, using data from the Genotype-Tissue Expression (GTEx) and Geuvadis projects. We quantitated tissue-specific and positional effects on nonsense-mediated transcript decay and present an improved predictive model for this decay. We directly measured the effect of variants both proximal and distal to splice junctions. Furthermore, we found that robustness to heterozygous gene inactivation is not due to dosage compensation. Our results illustrate the value of transcriptome data in the functional interpretation of genetic variants.

  20. Transcriptome and Functional Genomics Reveal the Participation of Adenine Phosphoribosyltransferase in Trypanosoma cruzi Resistance to Benznidazole.

    PubMed

    García-Huertas, Paola; Mejía-Jaramillo, Ana María; González, Laura; Triana Chávez, Omar

    2017-03-09

    Currently, the only available treatments for Trypanosoma cruzi are benznidazole (Bz) and nifurtimox (Nfx). The mechanisms of action and resistance to these drugs in this parasite are not complete known. In order to identify differentially expressed transcripts between sensitive and resistant parasites, a massive pyrosequencing of the T. cruzi transcriptome was carried out. Additionally, the 2D gel electrophoresis profile of sensitive and resistant parasites was analyzed and the data were supported with functional genomics. The results showed 133 differentially expressed genes in resistant parasites. The transcriptome analysis revealed the regulation of different genes with several functions and metabolic pathways, which could suggest that resistance in T. cruzi is a multigenic process. Additionally, using transcriptomics, one gene, adenine phosphoribosyltransferase (APRT), was found to be down-regulated in the resistant parasites and its expression profile was confirmed by 2D electrophoresis analysis. The role of this gene in the resistance to Bz was confirmed overexpressing it in sensitive and resistant parasites. Interestingly, both parasites became more sensitive to Bz and H2 O2 . This is the first RNA-seq study to identify regulated genes in T. cruzi associated with Bz resistance and to show the role of APRT in T. cruzi resistance. Although T. cruzi regulation is mainly post-transcriptional, the transcriptome analysis, supported by 2D gel analysis and functional genomic, provides an overall idea of the expression profiles of genes under resistance conditions. These results contribute essential information to further the understanding of the mechanisms of action and resistance to Bz in T. cruzi. This article is protected by copyright. All rights reserved.

  1. Databases and information integration for the Medicago truncatula genome and transcriptome.

    PubMed

    Cannon, Steven B; Crow, John A; Heuer, Michael L; Wang, Xiaohong; Cannon, Ethalinda K S; Dwan, Christopher; Lamblin, Anne-Francoise; Vasdewani, Jayprakash; Mudge, Joann; Cook, Andrew; Gish, John; Cheung, Foo; Kenton, Steve; Kunau, Timothy M; Brown, Douglas; May, Gregory D; Kim, Dongjin; Cook, Douglas R; Roe, Bruce A; Town, Chris D; Young, Nevin D; Retzel, Ernest F

    2005-05-01

    An international consortium is sequencing the euchromatic genespace of Medicago truncatula. Extensive bioinformatic and database resources support the marker-anchored bacterial artificial chromosome (BAC) sequencing strategy. Existing physical and genetic maps and deep BAC-end sequencing help to guide the sequencing effort, while EST databases provide essential resources for genome annotation as well as transcriptome characterization and microarray design. Finished BAC sequences are joined into overlapping sequence assemblies and undergo an automated annotation process that integrates ab initio predictions with EST, protein, and other recognizable features. Because of the sequencing project's international and collaborative nature, data production, storage, and visualization tools are broadly distributed. This paper describes databases and Web resources for the project, which provide support for physical and genetic maps, genome sequence assembly, gene prediction, and integration of EST data. A central project Web site at medicago.org/genome provides access to genome viewers and other resources project-wide, including an Ensembl implementation at medicago.org, physical map and marker resources at mtgenome.ucdavis.edu, and genome viewers at the University of Oklahoma (www.genome.ou.edu), the Institute for Genomic Research (www.tigr.org), and Munich Information for Protein Sequences Center (mips.gsf.de).

  2. Breeding in peach, cherry and plum: from a tissue culture, genetic, transcriptomic and genomic perspective.

    PubMed

    Carrasco, Basilio; Meisel, Lee; Gebauer, Marlene; Garcia-Gonzales, Rolando; Silva, Herman

    2013-01-01

    This review is an overview of traditional and modern breeding methodologies being used to develop new Prunus cultivars (stone fruits) with major emphasis on peach, sweet cherry and Japanese plum. To this end, common breeding tools used to produce seedlings, including in vitro culture tools, are discussed. Additionally, the mechanisms of inheritance of many important agronomical traits are described. Recent advances in stone fruit transcriptomics and genomic resources are providing an understanding of the molecular basis of phenotypic variability as well as the identification of allelic variants and molecular markers. These have potential applications for understanding the genetic diversity of the Prunus species, molecular marker-assisted selection and transgenesis. Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNPs) molecular markers are described as useful tools to describe genetic diversity in peach, sweet cherry and Japanese plum. Additionally, the recently sequenced peach genome and the public release of the sweet cherry genome are discussed in terms of their applicability to breeding programs.

  3. Genome-Wide Transcriptome Analysis of Cadmium Stress in Rice

    PubMed Central

    Oono, Youko; Yazawa, Takayuki; Kanamori, Hiroyuki; Sasaki, Harumi; Mori, Satomi; Handa, Hirokazu; Matsumoto, Takashi

    2016-01-01

    Rice growth is severely affected by toxic concentrations of the nonessential heavy metal cadmium (Cd). To elucidate the molecular basis of the response to Cd stress, we performed mRNA sequencing of rice following our previous study on exposure to high concentrations of Cd (Oono et al., 2014). In this study, rice plants were hydroponically treated with low concentrations of Cd and approximately 211 million sequence reads were mapped onto the IRGSP-1.0 reference rice genome sequence. Many genes, including some identified under high Cd concentration exposure in our previous study, were found to be responsive to low Cd exposure, with an average of about 11,000 transcripts from each condition. However, genes expressed constitutively across the developmental course responded only slightly to low Cd concentrations, in contrast to their clear response to high Cd concentration, which causes fatal damage to rice seedlings according to phenotypic changes. The expression of metal ion transporter genes tended to correlate with Cd concentration, suggesting the potential of the RNA-Seq strategy to reveal novel Cd-responsive transporters by analyzing gene expression under different Cd concentrations. This study could help to develop novel strategies for improving tolerance to Cd exposure in rice and other cereal crops. PMID:27034955

  4. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

    PubMed

    Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

    2015-02-01

    WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during fiber development process. In conclusion, this study provides important information on the evolution and function of WRKY gene family in cotton species.

  5. Mapping the C. elegans noncoding transcriptome with a whole-genome tiling microarray.

    PubMed

    He, Housheng; Wang, Jie; Liu, Tao; Liu, X Shirley; Li, Tiantian; Wang, Yunfei; Qian, Zuwei; Zheng, Haixia; Zhu, Xiaopeng; Wu, Tao; Shi, Baochen; Deng, Wei; Zhou, Wei; Skogerbø, Geir; Chen, Runsheng

    2007-10-01

    The number of annotated protein coding genes in the genome of Caenorhabditis elegans is similar to that of other animals, but the extent of its non-protein-coding transcriptome remains unknown. Expression profiling on whole-genome tiling microarrays applied to a mixed-stage C. elegans population verified the expression of 71% of all annotated exons. Only a small fraction (11%) of the polyadenylated transcription is non-annotated and appears to consist of approximately 3200 missed or alternative exons and 7800 small transcripts of unknown function (TUFs). Almost half (44%) of the detected transcriptional output is non-polyadenylated and probably not protein coding, and of this, 70% overlaps the boundaries of protein-coding genes in a complex manner. Specific analysis of small non-polyadenylated transcripts verified 97% of all annotated small ncRNAs and suggested that the transcriptome contains approximately 1200 small (<500 nt) unannotated noncoding loci. After combining overlapping transcripts, we estimate that at least 70% of the total C. elegans genome is transcribed.

  6. Transcriptome characterization for genome annotation and functional genomics in Theobroma cacao

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Evidence from leaf transcriptome sequencing using two technology platforms, in combination with protein homology and trained ab initio predictions, previously enabled us to build 35,000 gene models in T. cacao (www.cacaogenomedb.org). Here we review the contribution of each data type to cacao gene a...

  7. Integrated clinical, whole-genome, and transcriptome analysis of multisampled lethal metastatic prostate cancer

    PubMed Central

    Bova, G. Steven; Kallio, Heini M.L.; Annala, Matti; Kivinummi, Kati; Högnäs, Gunilla; Häyrynen, Sergei; Rantapero, Tommi; Kivinen, Virpi; Isaacs, William B.; Tolonen, Teemu; Nykter, Matti; Visakorpi, Tapio

    2016-01-01

    We report the first combined analysis of whole-genome sequence, detailed clinical history, and transcriptome sequence of multiple prostate cancer metastases in a single patient (A21). Whole-genome and transcriptome sequence was obtained from nine anatomically separate metastases, and targeted DNA sequencing was performed in cancerous and noncancerous foci within the primary tumor specimen removed 5 yr before death. Transcriptome analysis revealed increased expression of androgen receptor (AR)-regulated genes in liver metastases that harbored an AR p.L702H mutation, suggesting a dominant effect by the mutation despite being present in only one of an estimated 16 copies per cell. The metastases harbored several alterations to the PI3K/AKT pathway, including a clonal truncal mutation in PIK3CG and present in all metastatic sites studied. The list of truncal genomic alterations shared by all metastases included homozygous deletion of TP53, hemizygous deletion of RB1 and CHD1, and amplification of FGFR1. If the patient were treated today, given this knowledge, the use of second-generation androgen-directed therapies, cessation of glucocorticoid administration, and therapeutic inhibition of the PI3K/AKT pathway or FGFR1 receptor could provide personalized benefit. Three previously unreported truncal clonal missense mutations (ABCC4 p.R891L, ALDH9A1 p.W89R, and ASNA1 p.P75R) were expressed at the RNA level and assessed as druggable. The truncal status of mutations may be critical for effective actionability and merit further study. Our findings suggest that a large set of deeply analyzed cases could serve as a powerful guide to more effective prostate cancer basic science and personalized cancer medicine clinical trials. PMID:27148588

  8. Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing

    PubMed Central

    2011-01-01

    Background The garden pea, Pisum sativum, is among the best-investigated legume plants and of significant agro-commercial relevance. Pisum sativum has a large and complex genome and accordingly few comprehensive genomic resources exist. Results We analyzed the pea transcriptome at the highest possible amount of accuracy by current technology. We used next generation sequencing with the Roche/454 platform and evaluated and compared a variety of approaches, including diverse tissue libraries, normalization, alternative sequencing technologies, saturation estimation and diverse assembly strategies. We generated libraries from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings, comprising a total of 450 megabases. Libraries were assembled into 324,428 unigenes in a first pass assembly. A second pass assembly reduced the amount to 81,449 unigenes but caused a significant number of chimeras. Analyses of the assemblies identified the assembly step as a major possibility for improvement. By recording frequencies of Arabidopsis orthologs hit by randomly drawn reads and fitting parameters of the saturation curve we concluded that sequencing was exhaustive. For leaf libraries we found normalization allows partial recovery of expression strength aside the desired effect of increased coverage. Based on theoretical and biological considerations we concluded that the sequence reads in the database tagged the vast majority of transcripts in the aerial tissues. A pathway representation analysis showed the merits of sampling multiple aerial tissues to increase the number of tagged genes. All results have been made available as a fully annotated database in fasta format. Conclusions We conclude that the approach taken resulted in a high quality - dataset which serves well as a first comprehensive reference set for the model legume pea. We suggest future deep sequencing transcriptome projects of species lacking a genomics backbone will

  9. Integrated clinical, whole-genome, and transcriptome analysis of multisampled lethal metastatic prostate cancer.

    PubMed

    Bova, G Steven; Kallio, Heini M L; Annala, Matti; Kivinummi, Kati; Högnäs, Gunilla; Häyrynen, Sergei; Rantapero, Tommi; Kivinen, Virpi; Isaacs, William B; Tolonen, Teemu; Nykter, Matti; Visakorpi, Tapio

    2016-05-01

    We report the first combined analysis of whole-genome sequence, detailed clinical history, and transcriptome sequence of multiple prostate cancer metastases in a single patient (A21). Whole-genome and transcriptome sequence was obtained from nine anatomically separate metastases, and targeted DNA sequencing was performed in cancerous and noncancerous foci within the primary tumor specimen removed 5 yr before death. Transcriptome analysis revealed increased expression of androgen receptor (AR)-regulated genes in liver metastases that harbored an AR p.L702H mutation, suggesting a dominant effect by the mutation despite being present in only one of an estimated 16 copies per cell. The metastases harbored several alterations to the PI3K/AKT pathway, including a clonal truncal mutation in PIK3CG and present in all metastatic sites studied. The list of truncal genomic alterations shared by all metastases included homozygous deletion of TP53, hemizygous deletion of RB1 and CHD1, and amplification of FGFR1. If the patient were treated today, given this knowledge, the use of second-generation androgen-directed therapies, cessation of glucocorticoid administration, and therapeutic inhibition of the PI3K/AKT pathway or FGFR1 receptor could provide personalized benefit. Three previously unreported truncal clonal missense mutations (ABCC4 p.R891L, ALDH9A1 p.W89R, and ASNA1 p.P75R) were expressed at the RNA level and assessed as druggable. The truncal status of mutations may be critical for effective actionability and merit further study. Our findings suggest that a large set of deeply analyzed cases could serve as a powerful guide to more effective prostate cancer basic science and personalized cancer medicine clinical trials.

  10. Orthology Inference in Nonmodel Organisms Using Transcriptomes and Low-Coverage Genomes: Improving Accuracy and Matrix Occupancy for Phylogenomics

    PubMed Central

    Yang, Ya; Smith, Stephen A.

    2014-01-01

    Orthology inference is central to phylogenomic analyses. Phylogenomic data sets commonly include transcriptomes and low-coverage genomes that are incomplete and contain errors and isoforms. These properties can severely violate the underlying assumptions of orthology inference with existing heuristics. We present a procedure that uses phylogenies for both homology and orthology assignment. The procedure first uses similarity scores to infer putative homologs that are then aligned, constructed into phylogenies, and pruned of spurious branches caused by deep paralogs, misassembly, frameshifts, or recombination. These final homologs are then used to identify orthologs. We explore four alternative tree-based orthology inference approaches, of which two are new. These accommodate gene and genome duplications as well as gene tree discordance. We demonstrate these methods in three published data sets including the grape family, Hymenoptera, and millipedes with divergence times ranging from approximately 100 to over 400 Ma. The procedure significantly increased the completeness and accuracy of the inferred homologs and orthologs. We also found that data sets that are more recently diverged and/or include more high-coverage genomes had more complete sets of orthologs. To explicitly evaluate sources of conflicting phylogenetic signals, we applied serial jackknife analyses of gene regions keeping each locus intact. The methods described here can scale to over 100 taxa. They have been implemented in python with independent scripts for each step, making it easy to modify or incorporate them into existing pipelines. All scripts are available from https://bitbucket.org/yangya/phylogenomic_dataset_construction. PMID:25158799

  11. Optimizing de novo transcriptome assembly and extending genomic resources for striped catfish (Pangasianodon hypophthalmus).

    PubMed

    Thanh, Nguyen Minh; Jung, Hyungtaek; Lyons, Russell E; Njaci, Isaac; Yoon, Byoung-Ha; Chand, Vincent; Tuan, Nguyen Viet; Thu, Vo Thi Minh; Mather, Peter

    2015-10-01

    Striped catfish (Pangasianodon hypophthalmus) is a commercially important freshwater fish used in inland aquaculture in the Mekong Delta, Vietnam. The culture industry is facing a significant challenge however from saltwater intrusion into many low topographical coastal provinces across the Mekong Delta as a result of predicted climate change impacts. Developing genomic resources for this species can facilitate the production of improved culture lines that can withstand raised salinity conditions, and so we have applied high-throughput Ion Torrent sequencing of transcriptome libraries from six target osmoregulatory organs from striped catfish as a genomic resource for use in future selection strategies. We obtained 12,177,770 reads after trimming and processing with an average length of 97bp. De novo assemblies were generated using CLC Genomic Workbench, Trinity and Velvet/Oases with the best overall contig performance resulting from the CLC assembly. De novo assembly using CLC yielded 66,451 contigs with an average length of 478bp and N50 length of 506bp. A total of 37,969 contigs (57%) possessed significant similarity with proteins in the non-redundant database. Comparative analyses revealed that a significant number of contigs matched sequences reported in other teleost fishes, ranging in similarity from 45.2% with Atlantic cod to 52% with zebrafish. In addition, 28,879 simple sequence repeats (SSRs) and 55,721 single nucleotide polymorphisms (SNPs) were detected in the striped catfish transcriptome. The sequence collection generated in the current study represents the most comprehensive genomic resource for P. hypophthalmus available to date. Our results illustrate the utility of next-generation sequencing as an efficient tool for constructing a large genomic database for marker development in non-model species.

  12. Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea

    PubMed Central

    2014-01-01

    Background Brassica oleracea is a valuable vegetable species that has contributed to human health and nutrition for hundreds of years and comprises multiple distinct cultivar groups with diverse morphological and phytochemical attributes. In addition to this phenotypic wealth, B. oleracea offers unique insights into polyploid evolution, as it results from multiple ancestral polyploidy events and a final Brassiceae-specific triplication event. Further, B. oleracea represents one of the diploid genomes that formed the economically important allopolyploid oilseed, Brassica napus. A deeper understanding of B. oleracea genome architecture provides a foundation for crop improvement strategies throughout the Brassica genus. Results We generate an assembly representing 75% of the predicted B. oleracea genome using a hybrid Illumina/Roche 454 approach. Two dense genetic maps are generated to anchor almost 92% of the assembled scaffolds to nine pseudo-chromosomes. Over 50,000 genes are annotated and 40% of the genome predicted to be repetitive, thus contributing to the increased genome size of B. oleracea compared to its close relative B. rapa. A snapshot of both the leaf transcriptome and methylome allows comparisons to be made across the triplicated sub-genomes, which resulted from the most recent Brassiceae-specific polyploidy event. Conclusions Differential expression of the triplicated syntelogs and cytosine methylation levels across the sub-genomes suggest residual marks of the genome dominance that led to the current genome architecture. Although cytosine methylation does not correlate with individual gene dominance, the independent methylation patterns of triplicated copies suggest epigenetic mechanisms play a role in the functional diversification of duplicate genes. PMID:24916971

  13. Genomic and transcriptomic evidence for scavenging of diverse organic compounds by widespread deep-sea archaea

    PubMed Central

    Li, Meng; Baker, Brett J.; Anantharaman, Karthik; Jain, Sunit; Breier, John A.; Dick, Gregory J.

    2015-01-01

    Microbial activity is one of the most important processes to mediate the flux of organic carbon from the ocean surface to the seafloor. However, little is known about the microorganisms that underpin this key step of the global carbon cycle in the deep oceans. Here we present genomic and transcriptomic evidence that five ubiquitous archaeal groups actively use proteins, carbohydrates, fatty acids and lipids as sources of carbon and energy at depths ranging from 800 to 4,950 m in hydrothermal vent plumes and pelagic background seawater across three different ocean basins. Genome-enabled metabolic reconstructions and gene expression patterns show that these marine archaea are motile heterotrophs with extensive mechanisms for scavenging organic matter. Our results shed light on the ecological and physiological properties of ubiquitous marine archaea and highlight their versatile metabolic strategies in deep oceans that might play a critical role in global carbon cycling. PMID:26573375

  14. The genome and transcriptome of the zoonotic hookworm Ancylostoma ceylanicum identify infection-specific gene families.

    PubMed

    Schwarz, Erich M; Hu, Yan; Antoshechkin, Igor; Miller, Melanie M; Sternberg, Paul W; Aroian, Raffi V

    2015-04-01

    Hookworms infect over 400 million people, stunting and impoverishing them. Sequencing hookworm genomes and finding which genes they express during infection should help in devising new drugs or vaccines against hookworms. Unlike other hookworms, Ancylostoma ceylanicum infects both humans and other mammals, providing a laboratory model for hookworm disease. We determined an A. ceylanicum genome sequence of 313 Mb, with transcriptomic data throughout infection showing expression of 30,738 genes. Approximately 900 genes were upregulated during early infection in vivo, including ASPRs, a cryptic subfamily of activation-associated secreted proteins (ASPs). Genes downregulated during early infection included ion channels and G protein-coupled receptors; this downregulation was observed in both parasitic and free-living nematodes. Later, at the onset of heavy blood feeding, C-lectin genes were upregulated along with genes for secreted clade V proteins (SCVPs), encoding a previously undescribed protein family. These findings provide new drug and vaccine targets and should help elucidate hookworm pathogenesis.

  15. Amplified fragment length polymorphism: an invaluable fingerprinting technique for genomic, transcriptomic, and epigenetic studies.

    PubMed

    Paun, Ovidiu; Schönswetter, Peter

    2012-01-01

    Amplified fragment length polymorphism (AFLP) is a PCR-based technique that uses selective amplification of a subset of digested DNA fragments to generate and compare unique fingerprints for genomes of interest. The power of this method relies mainly in that it does not require prior information regarding the targeted genome, as well as in its high reproducibility and sensitivity for detecting polymorphism at the level of DNA sequence. Widely used for plant and microbial studies, AFLP is employed for a variety of applications, such as to assess genetic diversity within species or among closely related species, to infer population-level phylogenies and biogeographic patterns, to generate genetic maps, and to determine relatedness among cultivars. Variations of standard AFLP methodology have been also developed for targeting additional levels of diversity, such as transcriptomic variation and DNA methylation polymorphism.

  16. Genomic and transcriptomic evidence for scavenging of diverse organic compounds by widespread deep-sea archaea.

    PubMed

    Li, Meng; Baker, Brett J; Anantharaman, Karthik; Jain, Sunit; Breier, John A; Dick, Gregory J

    2015-11-17

    Microbial activity is one of the most important processes to mediate the flux of organic carbon from the ocean surface to the seafloor. However, little is known about the microorganisms that underpin this key step of the global carbon cycle in the deep oceans. Here we present genomic and transcriptomic evidence that five ubiquitous archaeal groups actively use proteins, carbohydrates, fatty acids and lipids as sources of carbon and energy at depths ranging from 800 to 4,950 m in hydrothermal vent plumes and pelagic background seawater across three different ocean basins. Genome-enabled metabolic reconstructions and gene expression patterns show that these marine archaea are motile heterotrophs with extensive mechanisms for scavenging organic matter. Our results shed light on the ecological and physiological properties of ubiquitous marine archaea and highlight their versatile metabolic strategies in deep oceans that might play a critical role in global carbon cycling.

  17. Sequencing Overview of Ewing Sarcoma: A Journey across Genomic, Epigenomic and Transcriptomic Landscapes

    PubMed Central

    Sand, Laurens G. L.; Szuhai, Karoly; Hogendoorn, Pancras C. W.

    2015-01-01

    Ewing sarcoma is an aggressive neoplasm occurring predominantly in adolescent Caucasians. At the genome level, a pathognomonic EWSR1-ETS translocation is present. The resulting fusion protein acts as a molecular driver in the tumor development and interferes, amongst others, with endogenous transcription and splicing. The Ewing sarcoma cell shows a poorly differentiated, stem-cell like phenotype. Consequently, the cellular origin of Ewing sarcoma is still a hot discussed topic. To further characterize Ewing sarcoma and to further elucidate the role of EWSR1-ETS fusion protein multiple genome, epigenome and transcriptome level studies were performed. In this review, the data from these studies were combined into a comprehensive overview. Presently, classical morphological predictive markers are used in the clinic and the therapy is dominantly based on systemic chemotherapy in combination with surgical interventions. Using sequencing, novel predictive markers and candidates for immuno- and targeted therapy were identified which were summarized in this review. PMID:26193259

  18. Dictyocaulus viviparus genome, variome and transcriptome elucidate lungworm biology and support future intervention

    PubMed Central

    McNulty, Samantha N.; Strübe, Christina; Rosa, Bruce A.; Martin, John C.; Tyagi, Rahul; Choi, Young-Jun; Wang, Qi; Hallsworth Pepin, Kymberlie; Zhang, Xu; Ozersky, Philip; Wilson, Richard K.; Sternberg, Paul W.; Gasser, Robin B.; Mitreva, Makedonka

    2016-01-01

    The bovine lungworm, Dictyocaulus viviparus (order Strongylida), is an important parasite of livestock that causes substantial economic and production losses worldwide. Here we report the draft genome, variome, and developmental transcriptome of D. viviparus. The genome (161 Mb) is smaller than those of related bursate nematodes and encodes fewer proteins (14,171 total). In the first genome-wide assessment of genomic variation in any parasitic nematode, we found a high degree of sequence variability in proteins predicted to be involved host-parasite interactions. Next, we used extensive RNA sequence data to track gene transcription across the life cycle of D. viviparus, and identified genes that might be important in nematode development and parasitism. Finally, we predicted genes that could be vital in host-parasite interactions, genes that could serve as drug targets, and putative RNAi effectors with a view to developing functional genomic tools. This extensive, well-curated dataset should provide a basis for developing new anthelmintics, vaccines, and improved diagnostic tests and serve as a platform for future investigations of drug resistance and epidemiology of the bovine lungworm and related nematodes. PMID:26856411

  19. De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis).

    PubMed

    Lok, Si; Paton, Tara A; Wang, Zhuozhi; Kaur, Gaganjot; Walker, Susan; Yuen, Ryan K C; Sung, Wilson W L; Whitney, Joseph; Buchanan, Janet A; Trost, Brett; Singh, Naina; Apresto, Beverly; Chen, Nan; Coole, Matthew; Dawson, Travis J; Ho, Karen; Hu, Zhizhou; Pullenayegum, Sanjeev; Samler, Kozue; Shipstone, Arun; Tsoi, Fiona; Wang, Ting; Pereira, Sergio L; Rostami, Pirooz; Ryan, Carol Ann; Tong, Amy Hin Yan; Ng, Karen; Sundaravadanam, Yogi; Simpson, Jared T; Lim, Burton K; Engstrom, Mark D; Dutton, Christopher J; Kerr, Kevin C R; Franke, Maria; Rapley, William; Wintle, Richard F; Scherer, Stephen W

    2017-02-09

    The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology.

  20. De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis)

    PubMed Central

    Lok, Si; Paton, Tara A.; Wang, Zhuozhi; Kaur, Gaganjot; Walker, Susan; Yuen, Ryan K. C.; Sung, Wilson W. L.; Whitney, Joseph; Buchanan, Janet A.; Trost, Brett; Singh, Naina; Apresto, Beverly; Chen, Nan; Coole, Matthew; Dawson, Travis J.; Ho, Karen; Hu, Zhizhou; Pullenayegum, Sanjeev; Samler, Kozue; Shipstone, Arun; Tsoi, Fiona; Wang, Ting; Pereira, Sergio L.; Rostami, Pirooz; Ryan, Carol Ann; Tong, Amy Hin Yan; Ng, Karen; Sundaravadanam, Yogi; Simpson, Jared T.; Lim, Burton K.; Engstrom, Mark D.; Dutton, Christopher J.; Kerr, Kevin C. R.; Franke, Maria; Rapley, William; Wintle, Richard F.; Scherer, Stephen W.

    2017-01-01

    The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon–gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. PMID:28087693

  1. Development of Genomic Resources for Pacific Herring through Targeted Transcriptome Pyrosequencing

    PubMed Central

    Roberts, Steven B.; Hauser, Lorenz; Seeb, Lisa W.; Seeb, James E.

    2012-01-01

    Pacific herring (Clupea pallasii) support commercially and culturally important fisheries but have experienced significant additional pressure from a variety of anthropogenic and environmental sources. In order to provide genomic resources to facilitate organismal and population level research, high-throughput pyrosequencing (Roche 454) was carried out on transcriptome libraries from liver and testes samples taken in Prince William Sound, the Bering Sea, and the Gulf of Alaska. Over 40,000 contigs were identified with an average length of 728 bp. We describe an annotated transcriptome as well as a workflow for single nucleotide polymorphism (SNP) discovery and validation. A subset of 96 candidate SNPs chosen from 10,933 potential SNPs, were tested using a combination of Sanger sequencing and high-resolution melt-curve analysis. Five SNPs supported between-ocean-basin differentiation, while one SNP associated with immune function provided high differentiation between Prince William Sound and Kodiak Island within the Gulf of Alaska. These genomic resources provide a basis for environmental physiology studies and opportunities for marker development and subsequent population structure analysis. PMID:22383979

  2. Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

    PubMed

    Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

    2011-09-01

    The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods.

  3. The capsicum transcriptome DB: a “hot” tool for genomic research

    PubMed Central

    Góngora-Castillo, Elsa; Fajardo-Jaime, Rubén; Fernández-Cortes, Araceli; Jofre-Garfias, Alba E; Lozoya-Gloria, Edmundo; Martínez, Octavio; Ochoa-Alejo, Neftalí; Rivera-Bustamante, Rafael

    2012-01-01

    Chili pepper (Capsicum annuum) is an economically important crop with no available public genome sequence. We describe a genomic resource to facilitate Capsicum annuum research. A collection of Expressed Sequence Tags (ESTs) derived from five C. annuum organs (root, stem, leaf, flower and fruit) were sequenced using the Sanger method and multiple leaf transcriptomes were deeply sampled using with GS-pyrosequencing. A hybrid assembly of 1,324,516 raw reads yielded 32,314 high quality contigs as validated by coverage and identity analysis with existing pepper sequences. Overall, 75.5% of the contigs had significant sequence similarity to entries in nucleic acid and protein databases; 23% of the sequences have not been previously reported for C. annuum and expand sequence resources for this species. A MySQL database and a user-friendly Web interface were constructed with search-tools that permit queries of the ESTs including sequence, functional annotation, Gene Ontology classification, metabolic pathways, and assembly information. The Capsicum Transcriptome DB is free available from http://www.bioingenios.ira.cinvestav.mx:81/Joomla/ PMID:22359434

  4. Genome and Transcriptome Sequences Reveal the Specific Parasitism of the Nematophagous Purpureocillium lilacinum 36-1

    PubMed Central

    Xie, Jialian; Li, Shaojun; Mo, Chenmi; Xiao, Xueqiong; Peng, Deliang; Wang, Gaofeng; Xiao, Yannong

    2016-01-01

    Purpureocillium lilacinum is a promising nematophagous ascomycete able to adapt diverse environments and it is also an opportunistic fungus that infects humans. A microbial inoculant of P. lilacinum has been registered to control plant parasitic nematodes. However, the molecular mechanism of the toxicological processes is still unclear because of the relatively few reports on the subject. In this study, using Illumina paired-end sequencing, the draft genome sequence and the transcriptome of P. lilacinum strain 36-1 infecting nematode-eggs were determined. Whole genome alignment indicated that P. lilacinum 36-1 possessed a more dynamic genome in comparison with P. lilacinum India strain. Moreover, a phylogenetic analysis showed that the P. lilacinum 36-1 had a closer relation to entomophagous fungi. The protein-coding genes in P. lilacinum 36-1 occurred much more frequently than they did in other fungi, which was a result of the depletion of repeat-induced point mutations (RIP). Comparative genome and transcriptome analyses revealed the genes that were involved in pathogenicity, particularly in the recognition, adhesion of nematode-eggs, downstream signal transduction pathways and hydrolase genes. By contrast, certain numbers of cellulose and xylan degradation genes and a lack of polysaccharide lyase genes showed the potential of P. lilacinum 36-1 as an endophyte. Notably, the expression of appressorium-formation and antioxidants-related genes exhibited similar infection patterns in P. lilacinum strain 36-1 to those of the model entomophagous fungi Metarhizium spp. These results uncovered the specific parasitism of P. lilacinum and presented the genes responsible for the infection of nematode-eggs. PMID:27486440

  5. Genotyping-by-Sequencing SNP Identification for Crops without a Reference Genome: Using Transcriptome Based Mapping as an Alternative Strategy

    PubMed Central

    Berthouly-Salazar, Cécile; Mariac, Cédric; Couderc, Marie; Pouzadoux, Juliette; Floc’h, Jean-Baptiste; Vigouroux, Yves

    2016-01-01

    Next-generation sequencing opens the way for genomic studies of diversity even for non-model crops and animals. Genome reduction techniques are becoming progressively more popular as they allow a fraction of the genome to be sequenced for multiple individuals and/or populations. These techniques are an efficient way to explore genome diversity in non-model crops and animals for which no reference genome is available. Genome reduction techniques emerged with the development of specific pipelines such as UNEAK (Universal Network Enabled Analysis Kit) and Stacks. However, even for non-model crops and animals, transcriptomes are easier to obtain, thereby making it possible to directly map reads. We investigate the direct use of transcriptome as an alternative strategy. Our specific objective was to compare SNPs obtained from the UNEAK pipeline as well as SNPs obtained by directly mapping genotyping-by-sequencing reads on a transcriptome. We assessed the feasibility of both SNP datasets, UNEAK and transcriptome mapping, to investigate the diversity of 91 samples of wild pearl millet sampled across its distribution area. Both approaches produced several tens of thousands of single nucleotide variants, but differed in the way the variants were identified, leading to differences in the frequency spectrum associated with marked differences in the assessment of diversity. Difference in the frequency spectrum significantly biased a large set of diversity analyses as well as detection of selection approaches. However, whatever the approach, we found very similar inference of genetic structure, with three major genetic groups from West, Central, and East Africa. For non-model crops, using transcriptome data as a reference is thus a particularly promising way to obtain a more thorough analysis of datasets generated using genome reduction techniques. PMID:27379109

  6. Genotyping-by-Sequencing SNP Identification for Crops without a Reference Genome: Using Transcriptome Based Mapping as an Alternative Strategy.

    PubMed

    Berthouly-Salazar, Cécile; Mariac, Cédric; Couderc, Marie; Pouzadoux, Juliette; Floc'h, Jean-Baptiste; Vigouroux, Yves

    2016-01-01

    Next-generation sequencing opens the way for genomic studies of diversity even for non-model crops and animals. Genome reduction techniques are becoming progressively more popular as they allow a fraction of the genome to be sequenced for multiple individuals and/or populations. These techniques are an efficient way to explore genome diversity in non-model crops and animals for which no reference genome is available. Genome reduction techniques emerged with the development of specific pipelines such as UNEAK (Universal Network Enabled Analysis Kit) and Stacks. However, even for non-model crops and animals, transcriptomes are easier to obtain, thereby making it possible to directly map reads. We investigate the direct use of transcriptome as an alternative strategy. Our specific objective was to compare SNPs obtained from the UNEAK pipeline as well as SNPs obtained by directly mapping genotyping-by-sequencing reads on a transcriptome. We assessed the feasibility of both SNP datasets, UNEAK and transcriptome mapping, to investigate the diversity of 91 samples of wild pearl millet sampled across its distribution area. Both approaches produced several tens of thousands of single nucleotide variants, but differed in the way the variants were identified, leading to differences in the frequency spectrum associated with marked differences in the assessment of diversity. Difference in the frequency spectrum significantly biased a large set of diversity analyses as well as detection of selection approaches. However, whatever the approach, we found very similar inference of genetic structure, with three major genetic groups from West, Central, and East Africa. For non-model crops, using transcriptome data as a reference is thus a particularly promising way to obtain a more thorough analysis of datasets generated using genome reduction techniques.

  7. Transcriptome profiling of the demosponge Amphimedon queenslandica reveals genome-wide events that accompany major life cycle transitions

    PubMed Central

    2012-01-01

    Background The biphasic life cycle with pelagic larva and benthic adult stages is widely observed in the animal kingdom, including the Porifera (sponges), which are the earliest branching metazoans. The demosponge, Amphimedon queenslandica, undergoes metamorphosis from a free-swimming larva into a sessile adult that bears no morphological resemblance to other animals. While the genome of A. queenslandica contains an extensive repertoire of genes very similar to that of complex bilaterians, it is as yet unclear how this is drawn upon to coordinate changing morphological features and ecological demands throughout the sponge life cycle. Results To identify genome-wide events that accompany the pelagobenthic transition in A. queenslandica, we compared global gene expression profiles at four key developmental stages by sequencing the poly(A) transcriptome using SOLiD technology. Large-scale changes in transcription were observed as sponge larvae settled on the benthos and began metamorphosis. Although previous systematics suggest that the only clear homology between Porifera and other animals is in the embryonic and larval stages, we observed extensive use of genes involved in metazoan-associated cellular processes throughout the sponge life cycle. Sponge-specific transcripts are not over-represented in the morphologically distinct adult; rather, many genes that encode typical metazoan features, such as cell adhesion and immunity, are upregulated. Our analysis further revealed gene families with candidate roles in competence, settlement, and metamorphosis in the sponge, including transcription factors, G-protein coupled receptors and other signaling molecules. Conclusions This first genome-wide study of the developmental transcriptome in an early branching metazoan highlights major transcriptional events that accompany the pelagobenthic transition and point to a network of regulatory mechanisms that coordinate changes in morphology with shifting environmental demands

  8. Genomic and transcriptomic insights into the efficient entomopathogenicity of Bacillus thuringiensis.

    PubMed

    Zhu, Lei; Peng, Donghai; Wang, Yueying; Ye, Weixing; Zheng, Jinshui; Zhao, Changming; Han, Dongmei; Geng, Ce; Ruan, Lifang; He, Jin; Yu, Ziniu; Sun, Ming

    2015-09-28

    Bacillus thuringiensis has been globally used as a microbial pesticide for over 70 years. However, information regarding its various adaptions and virulence factors and their roles in the entomopathogenic process remains limited. In this work, we present the complete genomes of two industrially patented Bacillus thuringiensis strains (HD-1 and YBT-1520). A comparative genomic analysis showed a larger and more complicated genome constitution that included novel insecticidal toxicity-related genes (ITRGs). All of the putative ITRGs were summarized according to the steps of infection. A comparative genomic analysis showed that highly toxic strains contained significantly more ITRGs, thereby providing additional strategies for infection, immune evasion, and cadaver utilization. Furthermore, a comparative transcriptomic analysis suggested that a high expression of these ITRGs was a key factor in efficient entomopathogenicity. We identified an active extra urease synthesis system in the highly toxic strains that may aid B. thuringiensis survival in insects (similar to previous results with well-known pathogens). Taken together, these results explain the efficient entomopathogenicity of B. thuringiensis. It provides novel insights into the strategies used by B. thuringiensis to resist and overcome host immune defenses and helps identify novel toxicity factors.

  9. Discover hidden splicing variations by mapping personal transcriptomes to personal genomes

    PubMed Central

    Stein, Shayna; Lu, Zhi-xiang; Bahrami-Samani, Emad; Park, Juw Won; Xing, Yi

    2015-01-01

    RNA-seq has become a popular technology for studying genetic variation of pre-mRNA alternative splicing. Commonly used RNA-seq aligners rely on the consensus splice site dinucleotide motifs to map reads across splice junctions. Consequently, genomic variants that create novel splice site dinucleotides may produce splice junction RNA-seq reads that cannot be mapped to the reference genome. We developed and evaluated an approach to identify ‘hidden’ splicing variations in personal transcriptomes, by mapping personal RNA-seq data to personal genomes. Computational analysis and experimental validation indicate that this approach identifies personal specific splice junctions at a low false positive rate. Applying this approach to an RNA-seq data set of 75 individuals, we identified 506 personal specific splice junctions, among which 437 were novel splice junctions not documented in current human transcript annotations. 94 splice junctions had splice site SNPs associated with GWAS signals of human traits and diseases. These involve genes whose splicing variations have been implicated in diseases (such as OAS1), as well as novel associations between alternative splicing and diseases (such as ICA1). Collectively, our work demonstrates that the personal genome approach to RNA-seq read alignment enables the discovery of a large but previously unknown catalog of splicing variations in human populations. PMID:26578562

  10. Genomic and transcriptomic insights into the efficient entomopathogenicity of Bacillus thuringiensis

    PubMed Central

    Zhu, Lei; Peng, Donghai; Wang, Yueying; Ye, Weixing; Zheng, Jinshui; Zhao, Changming; Han, Dongmei; Geng, Ce; Ruan, Lifang; He, Jin; Yu, Ziniu; Sun, Ming

    2015-01-01

    Bacillus thuringiensis has been globally used as a microbial pesticide for over 70 years. However, information regarding its various adaptions and virulence factors and their roles in the entomopathogenic process remains limited. In this work, we present the complete genomes of two industrially patented Bacillus thuringiensis strains (HD-1 and YBT-1520). A comparative genomic analysis showed a larger and more complicated genome constitution that included novel insecticidal toxicity-related genes (ITRGs). All of the putative ITRGs were summarized according to the steps of infection. A comparative genomic analysis showed that highly toxic strains contained significantly more ITRGs, thereby providing additional strategies for infection, immune evasion, and cadaver utilization. Furthermore, a comparative transcriptomic analysis suggested that a high expression of these ITRGs was a key factor in efficient entomopathogenicity. We identified an active extra urease synthesis system in the highly toxic strains that may aid B. thuringiensis survival in insects (similar to previous results with well-known pathogens). Taken together, these results explain the efficient entomopathogenicity of B. thuringiensis. It provides novel insights into the strategies used by B. thuringiensis to resist and overcome host immune defenses and helps identify novel toxicity factors. PMID:26411888

  11. Extensive Transcriptomic and Genomic Analysis Provides New Insights about Luminal Breast Cancers

    PubMed Central

    Tishchenko, Inna; Milioli, Heloisa Helena; Riveros, Carlos; Moscato, Pablo

    2016-01-01

    Despite constituting approximately two thirds of all breast cancers, the luminal A and B tumours are poorly classified at both clinical and molecular levels. There are contradictory reports on the nature of these subtypes: some define them as intrinsic entities, others as a continuum. With the aim of addressing these uncertainties and identifying molecular signatures of patients at risk, we conducted a comprehensive transcriptomic and genomic analysis of 2,425 luminal breast cancer samples. Our results indicate that the separation between the molecular luminal A and B subtypes—per definition—is not associated with intrinsic characteristics evident in the differentiation between other subtypes. Moreover, t-SNE and MST-kNN clustering approaches based on 10,000 probes, associated with luminal tumour initiation and/or development, revealed the close connections between luminal A and B tumours, with no evidence of a clear boundary between them. Thus, we considered all luminal tumours as a single heterogeneous group for analysis purposes. We first stratified luminal tumours into two distinct groups by their HER2 gene cluster co-expression: HER2-amplified luminal and ordinary-luminal. The former group is associated with distinct transcriptomic and genomic profiles, and poor prognosis; it comprises approximately 8% of all luminal cases. For the remaining ordinary-luminal tumours we further identified the molecular signature correlated with disease outcomes, exhibiting an approximately continuous gene expression range from low to high risk. Thus, we employed four virtual quantiles to segregate the groups of patients. The clinico-pathological characteristics and ratios of genomic aberrations are concordant with the variations in gene expression profiles, hinting at a progressive staging. The comparison with the current separation into luminal A and B subtypes revealed a substantially improved survival stratification. Concluding, we suggest a review of the definition of

  12. Genome and transcriptome sequencing of the halophilic fungus Wallemia ichthyophaga: haloadaptations present and absent

    PubMed Central

    2013-01-01

    Background The basidomycete Wallemia ichthyophaga from the phylogenetically distinct class Wallemiomycetes is the most halophilic fungus known to date. It requires at least 10% NaCl and thrives in saturated salt solution. To investigate the genomic basis of this exceptional phenotype, we obtained a de-novo genome sequence of the species type-strain and analysed its transcriptomic response to conditions close to the limits of its lower and upper salinity range. Results The unusually compact genome is 9.6 Mb large and contains 1.67% repetitive sequences. Only 4884 predicted protein coding genes cover almost three quarters of the sequence. Of 639 differentially expressed genes, two thirds are more expressed at lower salinity. Phylogenomic analysis based on the largest dataset used to date (whole proteomes) positions Wallemiomycetes as a 250-million-year-old sister group of Agaricomycotina. Contrary to the closely related species Wallemia sebi, W. ichthyophaga appears to have lost the ability for sexual reproduction. Several protein families are significantly expanded or contracted in the genome. Among these, there are the P-type ATPase cation transporters, but not the sodium/ hydrogen exchanger family. Transcription of all but three cation transporters is not salt dependent. The analysis also reveals a significant enrichment in hydrophobins, which are cell-wall proteins with multiple cellular functions. Half of these are differentially expressed, and most contain an unusually large number of acidic amino acids. This discovery is of particular interest due to the numerous applications of hydrophobines from other fungi in industry, pharmaceutics and medicine. Conclusions W. ichthyophaga is an extremophilic specialist that shows only low levels of adaptability and genetic recombination. This is reflected in the characteristics of its genome and its transcriptomic response to salt. No unusual traits were observed in common salt-tolerance mechanisms, such as transport of

  13. Analysis of the Legionella longbeachae genome and transcriptome uncovers unique strategies to cause Legionnaires' disease.

    PubMed

    Cazalet, Christel; Gomez-Valero, Laura; Rusniok, Christophe; Lomma, Mariella; Dervins-Ravault, Delphine; Newton, Hayley J; Sansom, Fiona M; Jarraud, Sophie; Zidane, Nora; Ma, Laurence; Bouchier, Christiane; Etienne, Jerôme; Hartland, Elizabeth L; Buchrieser, Carmen

    2010-02-19

    Legionella pneumophila and L. longbeachae are two species of a large genus of bacteria that are ubiquitous in nature. L. pneumophila is mainly found in natural and artificial water circuits while L. longbeachae is mainly present in soil. Under the appropriate conditions both species are human pathogens, capable of causing a severe form of pneumonia termed Legionnaires' disease. Here we report the sequencing and analysis of four L. longbeachae genomes, one complete genome sequence of L. longbeachae strain NSW150 serogroup (Sg) 1, and three draft genome sequences another belonging to Sg1 and two to Sg2. The genome organization and gene content of the four L. longbeachae genomes are highly conserved, indicating strong pressure for niche adaptation. Analysis and comparison of L. longbeachae strain NSW150 with L. pneumophila revealed common but also unexpected features specific to this pathogen. The interaction with host cells shows distinct features from L. pneumophila, as L. longbeachae possesses a unique repertoire of putative Dot/Icm type IV secretion system substrates, eukaryotic-like and eukaryotic domain proteins, and encodes additional secretion systems. However, analysis of the ability of a dotA mutant of L. longbeachae NSW150 to replicate in the Acanthamoeba castellanii and in a mouse lung infection model showed that the Dot/Icm type IV secretion system is also essential for the virulence of L. longbeachae. In contrast to L. pneumophila, L. longbeachae does not encode flagella, thereby providing a possible explanation for differences in mouse susceptibility to infection between the two pathogens. Furthermore, transcriptome analysis revealed that L. longbeachae has a less pronounced biphasic life cycle as compared to L. pneumophila, and genome analysis and electron microscopy suggested that L. longbeachae is encapsulated. These species-specific differences may account for the different environmental niches and disease epidemiology of these two Legionella

  14. Identification of G protein coupled receptors for opsines and neurohormones in Rhodnius prolixus. Genomic and transcriptomic analysis.

    PubMed

    Ons, Sheila; Lavore, Andrés; Sterkel, Marcos; Wulff, Juan Pedro; Sierra, Ivana; Martínez-Barnetche, Jesús; Rodriguez, Mario Henry; Rivera-Pomar, Rolando

    2016-02-01

    The importance of Chagas disease motivated the scientific effort to obtain the complete genomic sequence of the vector species Rhodnius prolixus, this information is also relevant to the understanding of triatomine biology in general. The central nervous system is the key regulator of insect physiology and behavior. Neurohormones (neuropeptides and biogenic amines) are the chemical messengers involved in the regulation and integration of neuroendocrine signals. In insects, this signaling is mainly mediated by the interaction of neurohormone ligands with G protein coupled receptors (GPCRs). The recently sequenced R. prolixus genome provides us with the opportunity to analyze this important family of genes in triatomines, supplying relevant information for further functional studies. Next-generation sequencing methods offer an excellent opportunity for transcriptomic exploration in key organs and tissues in the presence of a reference genome as well as when a reference genome is not available. We undertook a genomic analysis to obtain a genome-wide inventory of opsines and the GPCRs for neurohormones in R. prolixus. Furthermore, we performed a transcriptomic analysis of R. prolixus central nervous system, focusing on neuropeptide precursor genes and neurohormone and opsines GPCRs. In addition, we mined the whole transcriptomes of Triatoma dimidiata, Triatoma infestans and Triatoma pallidipennis - three sanitary relevant triatomine species - to identify neuropeptide precursors and GPCRs genes. Our study reveals a high degree of sequence conservation in the molecular components of the neuroendocrine system of triatomines.

  15. Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network

    PubMed Central

    Martín-Jiménez, Cynthia A.; Salazar-Barreto, Diego; Barreto, George E.; González, Janneth

    2017-01-01

    Astrocytes are the most abundant cells of the central nervous system; they have a predominant role in maintaining brain metabolism. In this sense, abnormal metabolic states have been found in different neuropathological diseases. Determination of metabolic states of astrocytes is difficult to model using current experimental approaches given the high number of reactions and metabolites present. Thus, genome-scale metabolic networks derived from transcriptomic data can be used as a framework to elucidate how astrocytes modulate human brain metabolic states during normal conditions and in neurodegenerative diseases. We performed a Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network with the purpose of elucidating a significant portion of the metabolic map of the astrocyte. This is the first global high-quality, manually curated metabolic reconstruction network of a human astrocyte. It includes 5,007 metabolites and 5,659 reactions distributed among 8 cell compartments, (extracellular, cytoplasm, mitochondria, endoplasmic reticle, Golgi apparatus, lysosome, peroxisome and nucleus). Using the reconstructed network, the metabolic capabilities of human astrocytes were calculated and compared both in normal and ischemic conditions. We identified reactions activated in these two states, which can be useful for understanding the astrocytic pathways that are affected during brain disease. Additionally, we also showed that the obtained flux distributions in the model, are in accordance with literature-based findings. Up to date, this is the most complete representation of the human astrocyte in terms of inclusion of genes, proteins, reactions and metabolic pathways, being a useful guide for in-silico analysis of several metabolic behaviors of the astrocyte during normal and pathologic states. PMID:28243200

  16. Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network.

    PubMed

    Martín-Jiménez, Cynthia A; Salazar-Barreto, Diego; Barreto, George E; González, Janneth

    2017-01-01

    Astrocytes are the most abundant cells of the central nervous system; they have a predominant role in maintaining brain metabolism. In this sense, abnormal metabolic states have been found in different neuropathological diseases. Determination of metabolic states of astrocytes is difficult to model using current experimental approaches given the high number of reactions and metabolites present. Thus, genome-scale metabolic networks derived from transcriptomic data can be used as a framework to elucidate how astrocytes modulate human brain metabolic states during normal conditions and in neurodegenerative diseases. We performed a Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network with the purpose of elucidating a significant portion of the metabolic map of the astrocyte. This is the first global high-quality, manually curated metabolic reconstruction network of a human astrocyte. It includes 5,007 metabolites and 5,659 reactions distributed among 8 cell compartments, (extracellular, cytoplasm, mitochondria, endoplasmic reticle, Golgi apparatus, lysosome, peroxisome and nucleus). Using the reconstructed network, the metabolic capabilities of human astrocytes were calculated and compared both in normal and ischemic conditions. We identified reactions activated in these two states, which can be useful for understanding the astrocytic pathways that are affected during brain disease. Additionally, we also showed that the obtained flux distributions in the model, are in accordance with literature-based findings. Up to date, this is the most complete representation of the human astrocyte in terms of inclusion of genes, proteins, reactions and metabolic pathways, being a useful guide for in-silico analysis of several metabolic behaviors of the astrocyte during normal and pathologic states.

  17. Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis

    PubMed Central

    Yamashita, Riu; Sathira, Nuankanya P.; Kanai, Akinori; Tanimoto, Kousuke; Arauchi, Takako; Tanaka, Yoshiaki; Hashimoto, Shin-ichi; Sugano, Sumio; Nakai, Kenta; Suzuki, Yutaka

    2011-01-01

    We performed a genome-wide analysis of transcriptional start sites (TSSs) in human genes by multifaceted use of a massively parallel sequencer. By analyzing 800 million sequences that were obtained from various types of transcriptome analyses, we characterized 140 million TSS tags in 12 human cell types. Despite the large number of TSS clusters (TSCs), the number of TSCs was observed to decrease sharply with increasing expression levels. Highly expressed TSCs exhibited several characteristic features: Nucleosome-seq analysis revealed highly ordered nucleosome structures, ChIP-seq analysis detected clear RNA polymerase II binding signals in their surrounding regions, evaluations of previously sequenced and newly shotgun-sequenced complete cDNA sequences showed that they encode preferable transcripts for protein translation, and RNA-seq analysis of polysome-incorporated RNAs yielded direct evidence that those transcripts are actually translated into proteins. We also demonstrate that integrative interpretation of transcriptome data is essential for the selection of putative alternative promoter TSCs, two of which also have protein consequences. Furthermore, discriminative chromatin features that separate TSCs at different expression levels were found for both genic TSCs and intergenic TSCs. The collected integrative information should provide a useful basis for future biological characterization of TSCs. PMID:21372179

  18. Comparative transcriptome analysis reveals insights into the streamlined genomes of haplosclerid demosponges

    NASA Astrophysics Data System (ADS)

    Guzman, Christine; Conaco, Cecilia

    2016-01-01

    Sponges (Porifera) are one of the most ancestral metazoan groups. They are characterized by a simple body plan lacking the true tissues and organ systems found in other animals. Members of this phylum display a remarkable diversity of form and function and yet little is known about the composition and complexity of their genomes. In this study, we sequenced the transcriptomes of two marine haplosclerid sponges belonging to Demospongiae, the largest and most diverse class within phylum Porifera, and compared their gene content with members of other sponge classes. We recovered 44,693 and 50,067 transcripts expressed in adult tissues of Haliclona amboinensis and Haliclona tubifera, respectively. These transcripts translate into 20,280 peptides in H. amboinensis and 18,000 peptides in H. tubifera. Genes associated with important signaling and metabolic pathways, regulatory networks, as well as genes that may be important in the organismal stress response, were identified in the transcriptomes. Futhermore, lineage-specific innovations were identified that may be correlated with observed sponge characters and ecological adaptations. The core gene complement expressed within the tissues of adult haplosclerid demosponges may represent a streamlined and flexible genetic toolkit that underlies the ecological success and resilience of sponges to environmental stress.

  19. Genome-wide transcriptome analysis revealed organelle specific responses to temperature variations in algae

    PubMed Central

    Shin, HyeonSeok; Hong, Seong-Joo; Yoo, Chan; Han, Mi-Ae; Lee, Hookeun; Choi, Hyung-Kyoon; Cho, Suhyung; Lee, Choul-Gyun; Cho, Byung-Kwan

    2016-01-01

    Temperature is a critical environmental factor that affects microalgal growth. However, microalgal coping mechanisms for temperature variations are unclear. Here, we determined changes in transcriptome, total carbohydrate, total fatty acid methyl ester, and fatty acid composition of Tetraselmis sp. KCTC12432BP, a strain with a broad temperature tolerance range, to elucidate the tolerance mechanisms in response to large temperature variations. Owing to unavailability of genome sequence information, de novo transcriptome assembly coupled with BLAST analysis was performed using strand specific RNA-seq data. This resulted in 26,245 protein-coding transcripts, of which 83.7% could be annotated to putative functions. We identified more than 681 genes differentially expressed, suggesting an organelle-specific response to temperature variation. Among these, the genes related to the photosynthetic electron transfer chain, which are localized in the plastid thylakoid membrane, were upregulated at low temperature. However, the transcripts related to the electron transport chain and biosynthesis of phosphatidylethanolamine localized in mitochondria were upregulated at high temperature. These results show that the low energy uptake by repressed photosynthesis under low and high temperature conditions is compensated by different mechanisms, including photosystem I and mitochondrial oxidative phosphorylation, respectively. This study illustrates that microalgae tolerate different temperature conditions through organelle specific mechanisms. PMID:27883062

  20. Comparative transcriptome analysis reveals insights into the streamlined genomes of haplosclerid demosponges

    PubMed Central

    Guzman, Christine; Conaco, Cecilia

    2016-01-01

    Sponges (Porifera) are one of the most ancestral metazoan groups. They are characterized by a simple body plan lacking the true tissues and organ systems found in other animals. Members of this phylum display a remarkable diversity of form and function and yet little is known about the composition and complexity of their genomes. In this study, we sequenced the transcriptomes of two marine haplosclerid sponges belonging to Demospongiae, the largest and most diverse class within phylum Porifera, and compared their gene content with members of other sponge classes. We recovered 44,693 and 50,067 transcripts expressed in adult tissues of Haliclona amboinensis and Haliclona tubifera, respectively. These transcripts translate into 20,280 peptides in H. amboinensis and 18,000 peptides in H. tubifera. Genes associated with important signaling and metabolic pathways, regulatory networks, as well as genes that may be important in the organismal stress response, were identified in the transcriptomes. Futhermore, lineage-specific innovations were identified that may be correlated with observed sponge characters and ecological adaptations. The core gene complement expressed within the tissues of adult haplosclerid demosponges may represent a streamlined and flexible genetic toolkit that underlies the ecological success and resilience of sponges to environmental stress. PMID:26738846

  1. Genome-based analysis of the transcriptome from mature chickpea root nodules

    PubMed Central

    Afonso-Grunz, Fabian; Molina, Carlos; Hoffmeier, Klaus; Rycak, Lukas; Kudapa, Himabindu; Varshney, Rajeev K.; Drevon, Jean-Jacques; Winter, Peter; Kahl, Günter

    2014-01-01

    Symbiotic nitrogen fixation (SNF) in root nodules of grain legumes such as chickpea is a highly complex process that drastically affects the gene expression patterns of both the prokaryotic as well as eukaryotic interacting cells. A successfully established symbiotic relationship requires mutual signaling mechanisms and a continuous adaptation of the metabolism of the involved cells to varying environmental conditions. Although some of these processes are well understood today many of the molecular mechanisms underlying SNF, especially in chickpea, remain unclear. Here, we reannotated our previously published transcriptome data generated by deepSuperSAGE (Serial Analysis of Gene Expression) to the recently published draft genome of chickpea to assess the root- and nodule-specific transcriptomes of the eukaryotic host cells. The identified gene expression patterns comprise up to 71 significantly differentially expressed genes and the expression of twenty of these was validated by quantitative real-time PCR with the tissues from five independent biological replicates. Many of the differentially expressed transcripts were found to encode proteins implicated in sugar metabolism, antioxidant defense as well as biotic and abiotic stress responses of the host cells, and some of them were already known to contribute to SNF in other legumes. The differentially expressed genes identified in this study represent candidates that can be used for further characterization of the complex molecular mechanisms underlying SNF in chickpea. PMID:25071808

  2. Integrated Transcriptomic-Proteomic Analysis Using a Proteogenomic Workflow Refines Rat Genome Annotation.

    PubMed

    Kumar, Dhirendra; Yadav, Amit Kumar; Jia, Xinying; Mulvenna, Jason; Dash, Debasis

    2016-01-01

    Proteogenomic re-annotation and mRNA splicing information can lead to the discovery of various protein forms for eukaryotic model organisms like rat. However, detection of novel proteoforms using mass spectrometry proteomics data remains a formidable challenge. We developed EuGenoSuite, an open source multiple algorithmic proteomic search tool and utilized it in our in-house integrated transcriptomic-proteomic pipeline to facilitate automated proteogenomic analysis. Using four proteogenomic pipelines (integrated transcriptomic-proteomic, Peppy, Enosi, and ProteoAnnotator) on publicly available RNA-sequence and MS proteomics data, we discovered 363 novel peptides in rat brain microglia representing novel proteoforms for 249 gene loci in the rat genome. These novel peptides aided in the discovery of novel exons, translation of annotated untranslated regions, pseudogenes, and splice variants for various loci; many of which have known disease associations, including neurological disorders like schizophrenia, amyotrophic lateral sclerosis, etc. Novel isoforms were also discovered for genes implicated in cardiovascular diseases and breast cancer for which rats are considered model organisms. Our integrative multi-omics data analysis not only enables the discovery of new proteoforms but also generates an improved reference for human disease studies in the rat model.

  3. The expanding transcriptome: the genome as the ‘Book of Sand'

    PubMed Central

    Mendes Soares, Luis M; Valcárcel, Juan

    2006-01-01

    The central dogma of molecular biology inspired by classical work in prokaryotic organisms accounts for only part of the genetic agenda of complex eukaryotes. First, post-transcriptional events lead to the generation of multiple mRNAs, proteins and functions from a single primary transcript, revealing regulatory networks distinct in mechanism and biological function from those controlling RNA transcription. Second, a variety of populous families of small RNAs (small nuclear RNAs, small nucleolar RNAs, microRNAs, siRNAs and shRNAs) assemble on ribonucleoprotein complexes and regulate virtually all aspects of the gene expression pathway, with profound biological consequences. Third, high-throughput methods of genomic analysis reveal that RNAs other than non-protein-coding RNAs (ncRNAs) represent a major component of the transcriptome that may perform novel functions in gene regulation and beyond. Post-transcriptional regulation, small RNAs and ncRNAs provide an expanding picture of the transcriptome that enriches our views of what genes are, how they operate, evolve and are regulated. PMID:16511566

  4. Genomic and Transcriptomic Features of Response to Anti-PD-1 Therapy in Metastatic Melanoma.

    PubMed

    Hugo, Willy; Zaretsky, Jesse M; Sun, Lu; Song, Chunying; Moreno, Blanca Homet; Hu-Lieskovan, Siwen; Berent-Maoz, Beata; Pang, Jia; Chmielowski, Bartosz; Cherry, Grace; Seja, Elizabeth; Lomeli, Shirley; Kong, Xiangju; Kelley, Mark C; Sosman, Jeffrey A; Johnson, Douglas B; Ribas, Antoni; Lo, Roger S

    2016-03-24

    PD-1 immune checkpoint blockade provides significant clinical benefits for melanoma patients. We analyzed the somatic mutanomes and transcriptomes of pretreatment melanoma biopsies to identify factors that may influence innate sensitivity or resistance to anti-PD-1 therapy. We find that overall high mutational loads associate with improved survival, and tumors from responding patients are enriched for mutations in the DNA repair gene BRCA2. Innately resistant tumors display a transcriptional signature (referred to as the IPRES, or innate anti-PD-1 resistance), indicating concurrent up-expression of genes involved in the regulation of mesenchymal transition, cell adhesion, extracellular matrix remodeling, angiogenesis, and wound healing. Notably, mitogen-activated protein kinase (MAPK)-targeted therapy (MAPK inhibitor) induces similar signatures in melanoma, suggesting that a non-genomic form of MAPK inhibitor resistance mediates cross-resistance to anti-PD-1 therapy. Validation of the IPRES in other independent tumor cohorts defines a transcriptomic subset across distinct types of advanced cancer. These findings suggest that attenuating the biological processes that underlie IPRES may improve anti-PD-1 response in melanoma and other cancer types.

  5. Genomics of Compositae crops: reference transcriptome assemblies and evidence of hybridization with wild relatives.

    PubMed

    Hodgins, Kathryn A; Lai, Zhao; Oliveira, Luiz O; Still, David W; Scascitelli, Moira; Barker, Michael S; Kane, Nolan C; Dempewolf, Hannes; Kozik, Alex; Kesseli, Richard V; Burke, John M; Michelmore, Richard W; Rieseberg, Loren H

    2014-01-01

    Although the Compositae harbours only two major food crops, sunflower and lettuce, many other species in this family are utilized by humans and have experienced various levels of domestication. Here, we have used next-generation sequencing technology to develop 15 reference transcriptome assemblies for Compositae crops or their wild relatives. These data allow us to gain insight into the evolutionary and genomic consequences of plant domestication. Specifically, we performed Illumina sequencing of Cichorium endivia, Cichorium intybus, Echinacea angustifolia, Iva annua, Helianthus tuberosus, Dahlia hybrida, Leontodon taraxacoides and Glebionis segetum, as well 454 sequencing of Guizotia scabra, Stevia rebaudiana, Parthenium argentatum and Smallanthus sonchifolius. Illumina reads were assembled using Trinity, and 454 reads were assembled using MIRA and CAP3. We evaluated the coverage of the transcriptomes using BLASTX analysis of a set of ultra-conserved orthologs (UCOs) and recovered most of these genes (88-98%). We found a correlation between contig length and read length for the 454 assemblies, and greater contig lengths for the 454 compared with the Illumina assemblies. This suggests that longer reads can aid in the assembly of more complete transcripts. Finally, we compared the divergence of orthologs at synonymous sites (Ks) between Compositae crops and their wild relatives and found greater divergence when the progenitors were self-incompatible. We also found greater divergence between pairs of taxa that had some evidence of postzygotic isolation. For several more distantly related congeners, such as chicory and endive, we identified a signature of introgression in the distribution of Ks values.

  6. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.

    PubMed

    Curtis, Christina; Shah, Sohrab P; Chin, Suet-Feung; Turashvili, Gulisa; Rueda, Oscar M; Dunning, Mark J; Speed, Doug; Lynch, Andy G; Samarajiwa, Shamith; Yuan, Yinyin; Gräf, Stefan; Ha, Gavin; Haffari, Gholamreza; Bashashati, Ali; Russell, Roslin; McKinney, Steven; Langerød, Anita; Green, Andrew; Provenzano, Elena; Wishart, Gordon; Pinder, Sarah; Watson, Peter; Markowetz, Florian; Murphy, Leigh; Ellis, Ian; Purushotham, Arnie; Børresen-Dale, Anne-Lise; Brenton, James D; Tavaré, Simon; Caldas, Carlos; Aparicio, Samuel

    2012-04-18

    The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.

  7. An evaluation of transcriptome-based exon capture for frog phylogenomics across multiple scales of divergence (Class: Amphibia, Order: Anura).

    PubMed

    Portik, Daniel M; Smith, Lydia L; Bi, Ke

    2016-09-01

    Custom sequence capture experiments are becoming an efficient approach for gathering large sets of orthologous markers in nonmodel organisms. Transcriptome-based exon capture utilizes transcript sequences to design capture probes, typically using a reference genome to identify intron-exon boundaries to exclude shorter exons (<200 bp). Here, we test directly using transcript sequences for probe design, which are often composed of multiple exons of varying lengths. Using 1260 orthologous transcripts, we conducted sequence captures across multiple phylogenetic scales for frogs, including outgroups ~100 Myr divergent from the ingroup. We recovered a large phylogenomic data set consisting of sequence alignments for 1047 of the 1260 transcriptome-based loci (~561 000 bp) and a large quantity of highly variable regions flanking the exons in transcripts (~70 000 bp), the latter improving substantially by only including ingroup species (~797 000 bp). We recovered both shorter (<100 bp) and longer exons (>200 bp), with no major reduction in coverage towards the ends of exons. We observed significant differences in the performance of blocking oligos for target enrichment and nontarget depletion during captures, and differences in PCR duplication rates resulting from the number of individuals pooled for capture reactions. We explicitly tested the effects of phylogenetic distance on capture sensitivity, specificity, and missing data, and provide a baseline estimate of expectations for these metrics based on a priori knowledge of nuclear pairwise differences among samples. We provide recommendations for transcriptome-based exon capture design based on our results, cost estimates and offer multiple pipelines for data assembly and analysis.

  8. Genome, Transcriptome, and Functional Analyses of Penicillium expansum Provide New Insights Into Secondary Metabolism and Pathogenicity.

    PubMed

    Ballester, Ana-Rosa; Marcet-Houben, Marina; Levin, Elena; Sela, Noa; Selma-Lázaro, Cristina; Carmona, Lourdes; Wisniewski, Michael; Droby, Samir; González-Candelas, Luis; Gabaldón, Toni

    2015-03-01

    The relationship between secondary metabolism and infection in pathogenic fungi has remained largely elusive. The genus Penicillium comprises a group of plant pathogens with varying host specificities and with the ability to produce a wide array of secondary metabolites. The genomes of three Penicillium expansum strains, the main postharvest pathogen of pome fruit, and one Pencillium italicum strain, a postharvest pathogen of citrus fruit, were sequenced and compared with 24 other fungal species. A genomic analysis of gene clusters responsible for the production of secondary metabolites was performed. Putative virulence factors in P. expansum were identified by means of a transcriptomic analysis of apple fruits during the course of infection. Despite a major genome contraction, P. expansum is the Penicillium species with the largest potential for the production of secondary metabolites. Results using knockout mutants clearly demonstrated that neither patulin nor citrinin are required by P. expansum to successfully infect apples. Li et al. ( MPMI-12-14-0398-FI ) reported similar results and conclusions in their recently accepted paper.

  9. Genome and Transcriptome of Clostridium phytofermentans, Catalyst for the Direct Conversion of Plant Feedstocks to Fuels

    PubMed Central

    Petit, Elsa; Coppi, Maddalena V.; Hayes, James C.; Tolonen, Andrew C.; Warnick, Thomas; Latouf, William G.; Amisano, Danielle; Biddle, Amy; Mukherjee, Supratim; Ivanova, Natalia; Lykidis, Athanassios; Land, Miriam; Hauser, Loren; Kyrpides, Nikos; Henrissat, Bernard; Lau, Joanne; Schnell, Danny J.; Church, George M.; Leschine, Susan B.; Blanchard, Jeffrey L.

    2015-01-01

    Clostridium phytofermentans was isolated from forest soil and is distinguished by its capacity to directly ferment plant cell wall polysaccharides into ethanol as the primary product, suggesting that it possesses unusual catabolic pathways. The objective of the present study was to understand the molecular mechanisms of biomass conversion to ethanol in a single organism, Clostridium phytofermentans, by analyzing its complete genome and transcriptome during growth on plant carbohydrates. The saccharolytic versatility of C. phytofermentans is reflected in a diversity of genes encoding ATP-binding cassette sugar transporters and glycoside hydrolases, many of which may have been acquired through horizontal gene transfer. These genes are frequently organized as operons that may be controlled individually by the many transcriptional regulators identified in the genome. Preferential ethanol production may be due to high levels of expression of multiple ethanol dehydrogenases and additional pathways maximizing ethanol yield. The genome also encodes three different proteinaceous bacterial microcompartments with the capacity to compartmentalize pathways that divert fermentation intermediates to various products. These characteristics make C. phytofermentans an attractive resource for improving the efficiency and speed of biomass conversion to biofuels. PMID:26035711

  10. Genome and transcriptome sequencing of lung cancers reveal diverse mutational and splicing events

    PubMed Central

    Liu, Jinfeng; Lee, William; Jiang, Zhaoshi; Chen, Zhongqiang; Jhunjhunwala, Suchit; Haverty, Peter M.; Gnad, Florian; Guan, Yinghui; Gilbert, Houston N.; Stinson, Jeremy; Klijn, Christiaan; Guillory, Joseph; Bhatt, Deepali; Vartanian, Steffan; Walter, Kimberly; Chan, Jocelyn; Holcomb, Thomas; Dijkgraaf, Peter; Johnson, Stephanie; Koeman, Julie; Minna, John D.; Gazdar, Adi F.; Stern, Howard M.; Hoeflich, Klaus P.; Wu, Thomas D.; Settleman, Jeff; de Sauvage, Frederic J.; Gentleman, Robert C.; Neve, Richard M.; Stokoe, David; Modrusan, Zora; Seshagiri, Somasekar; Shames, David S.; Zhang, Zemin

    2012-01-01

    Lung cancer is a highly heterogeneous disease in terms of both underlying genetic lesions and response to therapeutic treatments. We performed deep whole-genome sequencing and transcriptome sequencing on 19 lung cancer cell lines and three lung tumor/normal pairs. Overall, our data show that cell line models exhibit similar mutation spectra to human tumor samples. Smoker and never-smoker cancer samples exhibit distinguishable patterns of mutations. A number of epigenetic regulators, including KDM6A, ASH1L, SMARCA4, and ATAD2, are frequently altered by mutations or copy number changes. A systematic survey of splice-site mutations identified 106 splice site mutations associated with cancer specific aberrant splicing, including mutations in several known cancer-related genes. RAC1b, an isoform of the RAC1 GTPase that includes one additional exon, was found to be preferentially up-regulated in lung cancer. We further show that its expression is significantly associated with sensitivity to a MAP2K (MEK) inhibitor PD-0325901. Taken together, these data present a comprehensive genomic landscape of a large number of lung cancer samples and further demonstrate that cancer-specific alternative splicing is a widespread phenomenon that has potential utility as therapeutic biomarkers. The detailed characterizations of the lung cancer cell lines also provide genomic context to the vast amount of experimental data gathered for these lines over the decades, and represent highly valuable resources for cancer biology. PMID:23033341

  11. Genome and Transcriptome of Clostridium phytofermentans, Catalyst for the Direct Conversion of Plant Feedstocks to Fuels

    SciTech Connect

    Petit, Elsa; Coppi, Maddalena V.; Hayes, James C.; Tolonen, Andrew C.; Warnick, Thomas; Latouf, William G.; Amisano, Danielle; Biddle, Amy; Mukherjee, Supratim; Ivanova, Natalia; Lykidis, Athanassios; Land, Miriam; Hauser, Loren; Kyrpides, Nikos; Henrissat, Bernard; Lau, Joanne; Schnell, Danny J.; Church, George M.; Leschine, Susan B.; Blanchard, Jeffrey L.

    2015-06-02

    Clostridium phytofermentans was isolated from forest soil and is distinguished by its capacity to directly ferment plant cell wall polysaccharides into ethanol as the primary product, suggesting that it possesses unusual catabolic pathways. The objective of our present study was to understand the molecular mechanisms of biomass conversion to ethanol in a single organism, Clostridium phytofermentans, by analyzing its complete genome and transcriptome during growth on plant carbohydrates. The saccharolytic versatility of C. phytofermentans is reflected in a diversity of genes encoding ATP-binding cassette sugar transporters and glycoside hydrolases, many of which may have been acquired through horizontal gene transfer. These genes are frequently organized as operons that may be controlled individually by the many transcriptional regulators identified in the genome. Preferential ethanol production may be due to high levels of expression of multiple ethanol dehydrogenases and additional pathways maximizing ethanol yield. The genome also encodes three different proteinaceous bacterial microcompartments with the capacity to compartmentalize pathways that divert fermentation intermediates to various products. Lastly, these characteristics make C. phytofermentans an attractive resource for improving the efficiency and speed of biomass conversion to biofuels.

  12. Genome and Transcriptome of Clostridium phytofermentans, Catalyst for the Direct Conversion of Plant Feedstocks to Fuels

    DOE PAGES

    Petit, Elsa; Coppi, Maddalena V.; Hayes, James C.; ...

    2015-06-02

    Clostridium phytofermentans was isolated from forest soil and is distinguished by its capacity to directly ferment plant cell wall polysaccharides into ethanol as the primary product, suggesting that it possesses unusual catabolic pathways. The objective of our present study was to understand the molecular mechanisms of biomass conversion to ethanol in a single organism, Clostridium phytofermentans, by analyzing its complete genome and transcriptome during growth on plant carbohydrates. The saccharolytic versatility of C. phytofermentans is reflected in a diversity of genes encoding ATP-binding cassette sugar transporters and glycoside hydrolases, many of which may have been acquired through horizontal gene transfer.more » These genes are frequently organized as operons that may be controlled individually by the many transcriptional regulators identified in the genome. Preferential ethanol production may be due to high levels of expression of multiple ethanol dehydrogenases and additional pathways maximizing ethanol yield. The genome also encodes three different proteinaceous bacterial microcompartments with the capacity to compartmentalize pathways that divert fermentation intermediates to various products. Lastly, these characteristics make C. phytofermentans an attractive resource for improving the efficiency and speed of biomass conversion to biofuels.« less

  13. Comprehensive Comparative Genomic and Transcriptomic Analyses of the Legume Genes Controlling the Nodulation Process

    PubMed Central

    Qiao, Zhenzhen; Pingault, Lise; Nourbakhsh-Rey, Mehrnoush; Libault, Marc

    2016-01-01

    Nitrogen is one of the most essential plant nutrients and one of the major factors limiting crop productivity. Having the goal to perform a more sustainable agriculture, there is a need to maximize biological nitrogen fixation, a feature of legumes. To enhance our understanding of the molecular mechanisms controlling the interaction between legumes and rhizobia, the symbiotic partner fixing and assimilating the atmospheric nitrogen for the plant, researchers took advantage of genetic and genomic resources developed across different legume models (e.g., Medicago truncatula, Lotus japonicus, Glycine max, and Phaseolus vulgaris) to identify key regulatory protein coding genes of the nodulation process. In this study, we are presenting the results of a comprehensive comparative genomic analysis to highlight orthologous and paralogous relationships between the legume genes controlling nodulation. Mining large transcriptomic datasets, we also identified several orthologous and paralogous genes characterized by the induction of their expression during nodulation across legume plant species. This comprehensive study prompts new insights into the evolution of the nodulation process in legume plant and will benefit the scientific community interested in the transfer of functional genomic information between species. PMID:26858743

  14. Comprehensive transcriptome and improved genome annotation of Bacillus licheniformis WX-02.

    PubMed

    Guo, Jing; Cheng, Gang; Gou, Xiang-Yong; Xing, Feng; Li, Sen; Han, Yi-Chao; Wang, Long; Song, Jia-Ming; Shu, Cheng-Cheng; Chen, Shou-Wen; Chen, Ling-Ling

    2015-08-19

    The updated genome of Bacillus licheniformis WX-02 comprises a circular chromosome of 4286821 base-pairs containing 4512 protein-coding genes. We applied strand-specific RNA-sequencing to explore the transcriptome profiles of B. licheniformis WX-02 under normal and high-salt conditions (NaCl 6%). We identified 2381 co-expressed gene pairs constituting 871 operon structures. In addition, 1169 antisense transcripts and 90 small RNAs were detected. Systematic comparison of differentially expressed genes under different conditions revealed that genes involved in multiple functions were significantly repressed in long-term high salt adaptation process. Genes related to promotion of glutamic acid synthesis were activated by 6% NaCl, potentially explaining the high yield of γ-PGA under salt condition. This study will be useful for the optimization of crucial metabolic activities in this bacterium.

  15. The genome and transcriptome of Japanese flounder provide insights into flatfish asymmetry.

    PubMed

    Shao, Changwei; Bao, Baolong; Xie, Zhiyuan; Chen, Xinye; Li, Bo; Jia, Xiaodong; Yao, Qiulin; Ortí, Guillermo; Li, Wenhui; Li, Xihong; Hamre, Kristin; Xu, Juan; Wang, Lei; Chen, Fangyuan; Tian, Yongsheng; Schreiber, Alex M; Wang, Na; Wei, Fen; Zhang, Jilin; Dong, Zhongdian; Gao, Lei; Gai, Junwei; Sakamoto, Takashi; Mo, Sudong; Chen, Wenjun; Shi, Qiong; Li, Hui; Xiu, Yunji; Li, Yangzhen; Xu, Wenteng; Shi, Zhiyi; Zhang, Guojie; Power, Deborah M; Wang, Qingyin; Schartl, Manfred; Chen, Songlin

    2017-01-01

    Flatfish have the most extreme asymmetric body morphology of vertebrates. During metamorphosis, one eye migrates to the contralateral side of the skull, and this migration is accompanied by extensive craniofacial transformations and simultaneous development of lopsided body pigmentation. The evolution of this developmental and physiological innovation remains enigmatic. Comparative genomics of two flatfish and transcriptomic analyses during metamorphosis point to a role for thyroid hormone and retinoic acid signaling, as well as phototransduction pathways. We demonstrate that retinoic acid is critical in establishing asymmetric pigmentation and, via cross-talk with thyroid hormones, in modulating eye migration. The unexpected expression of the visual opsins from the phototransduction pathway in the skin translates illumination differences and generates retinoic acid gradients that underlie the generation of asymmetry. Identifying the genetic underpinning of this unique developmental process answers long-standing questions about the evolutionary origin of asymmetry, but it also provides insight into the mechanisms that control body shape in vertebrates.

  16. Molecular Subtyping of Pancreatic Cancer: Translating Genomics and Transcriptomics into the Clinic

    PubMed Central

    Du, Yongxing; Zhao, Bangbo; Liu, Ziwen; Ren, Xiaoxia; Zhao, Wenjing; Li, Zongze; You, Lei; Zhao, Yupei

    2017-01-01

    Pancreatic cancer remains one of the most lethal malignancies, and insights into both personalized diagnosis and intervention of this disease are urgently needed. The rapid development of sequencing technologies has enabled the successive completion of a series of genetic and epigenetic sequencing studies of pancreatic cancer. The mutational landscape of pancreatic cancer is generally portrayed in terms of somatic mutations, structural variations, epigenetic alterations and the core signaling pathways. In recent years, four significant molecular subtype classifications of pancreatic cancer have been proposed based on the expression of transcription factors and downstream targets or the distribution of structural rearrangements. Increasing researches focus on the identification of somatic mutations and other genetic aberrations that drive pancreatic cancer has led to a new era of precision medicine based on molecular subtyping. However, few known molecular classifications are used to guide clinical strategies. Specific scientific, regulatory and ethical challenges must be overcome before genomic and transcriptomic discoveries can be translated into the clinic. PMID:28367231

  17. An integrated genomic and transcriptomic survey of mucormycosis-causing fungi

    PubMed Central

    Chibucos, Marcus C.; Soliman, Sameh; Gebremariam, Teclegiorgis; Lee, Hongkyu; Daugherty, Sean; Orvis, Joshua; Shetty, Amol C.; Crabtree, Jonathan; Hazen, Tracy H.; Etienne, Kizee A.; Kumari, Priti; O'Connor, Timothy D.; Rasko, David A.; Filler, Scott G.; Fraser, Claire M.; Lockhart, Shawn R.; Skory, Christopher D.; Ibrahim, Ashraf S.; Bruno, Vincent M.

    2016-01-01

    Mucormycosis is a life-threatening infection caused by Mucorales fungi. Here we sequence 30 fungal genomes, and perform transcriptomics with three representative Rhizopus and Mucor strains and with human airway epithelial cells during fungal invasion, to reveal key host and fungal determinants contributing to pathogenesis. Analysis of the host transcriptional response to Mucorales reveals platelet-derived growth factor receptor B (PDGFRB) signaling as part of a core response to divergent pathogenic fungi; inhibition of PDGFRB reduces Mucorales-induced damage to host cells. The unique presence of CotH invasins in all invasive Mucorales, and the correlation between CotH gene copy number and clinical prevalence, are consistent with an important role for these proteins in mucormycosis pathogenesis. Our work provides insight into the evolution of this medically and economically important group of fungi, and identifies several molecular pathways that might be exploited as potential therapeutic targets. PMID:27447865

  18. Cryptococcus gattii comparative genomics and transcriptomics: a NIH/NIAID White Paper.

    PubMed

    Chaturvedi, V; Nierman, W C

    2012-06-01

    Cryptococcus gattii is an emerging global pathogen. Recent reports suggest that C. gattii cryptococcosis is more common in immunocompetent as well as HIV-infected AIDS patients than earlier estimated. An ongoing outbreak of C. gattii in Vancouver, Canada, and the US Pacific Northwest has heightened public health awareness in North America. We have few clues as to what causes emergence or re-emergence of highly pathogenic strains, why C. gattii split up from its sibling pathogen C. neoformans, why it thrives in trees instead, and why immunocompetent individuals are vulnerable to this pathogen? C. gattii comprises of four distinct lineages, but the information on the genome of C. gattii is inadequate and unrepresentative as it is limited to two strains, R265 and WM276, which are MATα, serotype B, genotype VGII/VGI from Canada and Australia, respectively. There is a wide gap in knowledge about the genomes of VGIII and VGIV strains, serotype C strains, and MATa strains. The geographical representation is inadequate in the absence of strains from California, South America, Asia, and Africa. Additional obstacles to work with this pathogen are the following: (a) complex molecular typing schemes and (b) lack of functional genomics analyses. We propose to complete genome sequencing of 12 reference strains by next-generation sequencing technology and to map their transcriptomes by RNA-Seq technology. This effort would lead to new resources for the scientific community including (1) insight from additional C. gattii genomes to anchor future research studies, (2) validation of single-nucleotide polymorphisms (SNPs) for molecular typing to improve epidemiology studies, and (3) transcript analyses from strains under relevant pathogenic and non-pathogenic conditions to accelerate the discovery of proteins for diagnostics, drug targets, and vaccines.

  19. Transcriptome Sequencing of Two Phenotypic Mosaic Eucalyptus Trees Reveals Large Scale Transcriptome Re-Modelling

    PubMed Central

    Padovan, Amanda; Patel, Hardip R.; Chuah, Aaron; Huttley, Gavin A.; Krause, Sandra T.; Degenhardt, Jörg; Foley, William J.; Külheim, Carsten

    2015-01-01

    Phenotypic mosaic trees offer an ideal system for studying differential gene expression. We have investigated two mosaic eucalypt trees from two closely related species (Eucalyptus melliodora and E. sideroxylon), which each support two types of leaves: one part of the canopy is resistant to insect herbivory and the remaining leaves are susceptible. Driving this ecological distinction are differences in plant secondary metabolites. We used these phenotypic mosaics to investigate genome wide patterns of foliar gene expression with the aim of identifying patterns of differential gene expression and the somatic mutation(s) that lead to this phenotypic mosaicism. We sequenced the mRNA pool from leaves of the resistant and susceptible ecotypes from both mosaic eucalypts using the Illumina HiSeq 2000 platform. We found large differences in pathway regulation and gene expression between the ecotypes of each mosaic. The expression of the genes in the MVA and MEP pathways is reflected by variation in leaf chemistry, however this is not the case for the terpene synthases. Apart from the terpene biosynthetic pathway, there are several other metabolic pathways that are differentially regulated between the two ecotypes, suggesting there is much more phenotypic diversity than has been described. Despite the close relationship between the two species, they show large differences in the global patterns of gene and pathway regulation. PMID:25978451

  20. Genome-Wide Transcriptome and Proteome Analysis on Different Developmental Stages of Cordyceps militaris

    PubMed Central

    Yin, Yalin; Yu, Guojun; Chen, Yijie; Jiang, Shuai; Wang, Man; Jin, Yanxia; Lan, Xianqing; Liang, Yi; Sun, Hui

    2012-01-01

    Background Cordyceps militaris, an ascomycete caterpillar fungus, has been used as a traditional Chinese medicine for many years owing to its anticancer and immunomodulatory activities. Currently, artificial culturing of this beneficial fungus has been widely used and can meet the market, but systematic molecular studies on the developmental stages of cultured C. militaris at transcriptional and translational levels have not been determined. Methodology/Principal Findings We utilized high-throughput Illumina sequencing to obtain the transcriptomes of C. militaris mycelium and fruiting body. All clean reads were mapped to C. militaris genome and most of the reads showed perfect coverage. Alternative splicing and novel transcripts were predicted to enrich the database. Gene expression analysis revealed that 2,113 genes were up-regulated in mycelium and 599 in fruiting body. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were performed to analyze the genes with expression differences. Moreover, the putative cordycepin metabolism difference between different developmental stages was studied. In addition, the proteome data of mycelium and fruiting body were obtained by one-dimensional gel electrophoresis (1-DGE) coupled with nano-electrospray ionization liquid chromatography tandem mass spectrometry (nESI-LC-MS/MS). 359 and 214 proteins were detected from mycelium and fruiting body respectively. GO, KEGG and Cluster of Orthologous Groups (COG) analysis were further conducted to better understand their difference. We analyzed the amounts of some noteworthy proteins in these two samples including lectin, superoxide dismutase, glycoside hydrolase and proteins involved in cordycepin metabolism, providing important information for further protein studies. Conclusions/Significance The results reveal the difference in gene expression between the mycelium and fruiting body of artificially cultivated C. militaris by transcriptome and proteome

  1. Comparative genomics and transcriptome analysis of Aspergillus niger and metabolic engineering for citrate production

    PubMed Central

    Yin, Xian; Shin, Hyun-dong; Li, Jianghua; Du, Guocheng; Liu, Long; Chen, Jian

    2017-01-01

    Despite a long and successful history of citrate production in Aspergillus niger, the molecular mechanism of citrate accumulation is only partially understood. In this study, we used comparative genomics and transcriptome analysis of citrate-producing strains—namely, A. niger H915-1 (citrate titer: 157 g L−1), A1 (117 g L−1), and L2 (76 g L−1)—to gain a genome-wide view of the mechanism of citrate accumulation. Compared with A. niger A1 and L2, A. niger H915-1 contained 92 mutated genes, including a succinate-semialdehyde dehydrogenase in the γ-aminobutyric acid shunt pathway and an aconitase family protein involved in citrate synthesis. Furthermore, transcriptome analysis of A. niger H915-1 revealed that the transcription levels of 479 genes changed between the cell growth stage (6 h) and the citrate synthesis stage (12 h, 24 h, 36 h, and 48 h). In the glycolysis pathway, triosephosphate isomerase was up-regulated, whereas pyruvate kinase was down-regulated. Two cytosol ATP-citrate lyases, which take part in the cycle of citrate synthesis, were up-regulated, and may coordinate with the alternative oxidases in the alternative respiratory pathway for energy balance. Finally, deletion of the oxaloacetate acetylhydrolase gene in H915-1 eliminated oxalate formation but neither influence on pH decrease nor difference in citrate production were observed. PMID:28106122

  2. Genome-wide profiling of untranslated regions by paired-end ditag sequencing reveals unexpected transcriptome complexity in yeast.

    PubMed

    Kang, Ya-Ni; Lai, Deng-Pan; Ooi, Hong Sain; Shen, Ting-Ting; Kou, Yao; Tian, Jing; Czajkowsky, Daniel M; Shao, Zhifeng; Zhao, Xiaodong

    2015-02-01

    The identification of structural and functional elements encoded in a genome is a challenging task. Although the transcriptome of budding yeast has been extensively analyzed, the boundaries and untranslated regions of yeast genes remain elusive. To address this least-explored field of yeast genomics, we performed a transcript profiling analysis through paired-end ditag (PET) approach coupled with deep sequencing. With 562,133 PET sequences we accurately defined the boundaries and untranslated regions of 3,409 ORFs, suggesting many yeast genes have multiple transcription start sites (TSSs). We also identified 85 previously uncharacterized transcripts either in intergenic regions or from the opposite strand of reported genomic features. Furthermore, our data revealed the extensive 3' end heterogeneity of yeast genes and identified a novel putative motif for polyadenylation. Our results indicate the yeast transcriptome is more complex than expected. This study would serve as an invaluable resource for elucidating the regulation and evolution of yeast genes.

  3. Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm

    SciTech Connect

    Seaver, Samuel M.D.; Bradbury, Louis M.T.; Frelin, Océane; Zarecki, Raphy; Ruppin, Eytan; Hanson, Andrew D.; Henry, Christopher S.

    2015-03-10

    There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions and possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes.

  4. Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm.

    PubMed

    Seaver, Samuel M D; Bradbury, Louis M T; Frelin, Océane; Zarecki, Raphy; Ruppin, Eytan; Hanson, Andrew D; Henry, Christopher S

    2015-01-01

    There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions and possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes.

  5. Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm

    PubMed Central

    Seaver, Samuel M. D.; Bradbury, Louis M. T.; Frelin, Océane; Zarecki, Raphy; Ruppin, Eytan; Hanson, Andrew D.; Henry, Christopher S.

    2015-01-01

    There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions and possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes. PMID:25806041

  6. Improved evidence-based genome-scale metabolic models for maize leaf, embryo, and endosperm

    DOE PAGES

    Seaver, Samuel M.D.; Bradbury, Louis M.T.; Frelin, Océane; ...

    2015-03-10

    There is a growing demand for genome-scale metabolic reconstructions for plants, fueled by the need to understand the metabolic basis of crop yield and by progress in genome and transcriptome sequencing. Methods are also required to enable the interpretation of plant transcriptome data to study how cellular metabolic activity varies under different growth conditions or even within different organs, tissues, and developmental stages. Such methods depend extensively on the accuracy with which genes have been mapped to the biochemical reactions in the plant metabolic pathways. Errors in these mappings lead to metabolic reconstructions with an inflated number of reactions andmore » possible generation of unreliable metabolic phenotype predictions. Here we introduce a new evidence-based genome-scale metabolic reconstruction of maize, with significant improvements in the quality of the gene-reaction associations included within our model. We also present a new approach for applying our model to predict active metabolic genes based on transcriptome data. This method includes a minimal set of reactions associated with low expression genes to enable activity of a maximum number of reactions associated with high expression genes. We apply this method to construct an organ-specific model for the maize leaf, and tissue specific models for maize embryo and endosperm cells. We validate our models using fluxomics data for the endosperm and embryo, demonstrating an improved capacity of our models to fit the available fluxomics data. All models are publicly available via the DOE Systems Biology Knowledgebase and PlantSEED, and our new method is generally applicable for analysis transcript profiles from any plant, paving the way for further in silico studies with a wide variety of plant genomes.« less

  7. Genome and transcriptome analysis of the basidiomycetous yeast Pseudozyma antarctica producing extracellular glycolipids, mannosylerythritol lipids.

    PubMed

    Morita, Tomotake; Koike, Hideaki; Hagiwara, Hiroko; Ito, Emi; Machida, Masayuki; Sato, Shun; Habe, Hiroshi; Kitamoto, Dai

    2014-01-01

    Pseudozyma antarctica is a non-pathogenic phyllosphere yeast known as an excellent producer of mannosylerythritol lipids (MELs), multi-functional extracellular glycolipids, from vegetable oils. To clarify the genetic characteristics of P. antarctica, we analyzed the 18 Mb genome of P. antarctica T-34. On the basis of KOG analysis, the number of genes (219 genes) categorized into lipid transport and metabolism classification in P. antarctica was one and a half times larger than that of yeast Saccharomyces cerevisiae (140 genes). The gene encoding an ATP/citrate lyase (ACL) related to acetyl-CoA synthesis conserved in oleaginous strains was found in P. antarctica genome: the single ACL gene possesses the four domains identical to that of the human gene, whereas the other oleaginous ascomycetous species have the two genes covering the four domains. P. antarctica genome exhibited a remarkable degree of synteny to U. maydis genome, however, the comparison of the gene expression profiles under the culture on the two carbon sources, glucose and soybean oil, by the DNA microarray method revealed that transcriptomes between the two species were significantly different. In P. antarctica, expression of the gene sets relating fatty acid metabolism were markedly up-regulated under the oily conditions compared with glucose. Additionally, MEL biosynthesis cluster of P. antarctica was highly expressed regardless of the carbon source as compared to U. maydis. These results strongly indicate that P. antarctica has an oleaginous nature which is relevant to its non-pathogenic and MEL-overproducing characteristics. The analysis and dataset contribute to stimulate the development of improved strains with customized properties for high yield production of functional bio-based materials.

  8. The draft genome and transcriptome of Amaranthus hypochondriacus: a C4 dicot producing high-lysine edible pseudo-cereal.

    PubMed

    Sunil, Meeta; Hariharan, Arun K; Nayak, Soumya; Gupta, Saurabh; Nambisan, Suran R; Gupta, Ravi P; Panda, Binay; Choudhary, Bibha; Srinivasan, Subhashini

    2014-12-01

    Grain amaranths, edible C4 dicots, produce pseudo-cereals high in lysine. Lysine being one of the most limiting essential amino acids in cereals and C4 photosynthesis being one of the most sought-after phenotypes in protein-rich legume crops, the genome of one of the grain amaranths is likely to play a critical role in crop research. We have sequenced the genome and transcriptome of Amaranthus hypochondriacus, a diploid (2n = 32) belonging to the order Caryophyllales with an estimated genome size of 466 Mb. Of the 411 linkage single-nucleotide polymorphisms (SNPs) reported for grain amaranths, 355 SNPs (86%) are represented in the scaffolds and 74% of the 8.6 billion bases of the sequenced transcriptome map to the genomic scaffolds. The genome of A. hypochondriacus, codes for at least 24,829 proteins, shares the paleohexaploidy event with species under the superorders Rosids and Asterids, harbours 1 SNP in 1,000 bases, and contains 13.76% of repeat elements. Annotation of all the genes in the lysine biosynthetic pathway using comparative genomics and expression analysis offers insights into the high-lysine phenotype. As the first grain species under Caryophyllales and the first C4 dicot genome reported, the work presented here will be beneficial in improving crops and in expanding our understanding of angiosperm evolution.

  9. The Draft Genome and Transcriptome of Amaranthus hypochondriacus: A C4 Dicot Producing High-Lysine Edible Pseudo-Cereal

    PubMed Central

    Sunil, Meeta; Hariharan, Arun K.; Nayak, Soumya; Gupta, Saurabh; Nambisan, Suran R.; Gupta, Ravi P.; Panda, Binay; Choudhary, Bibha; Srinivasan, Subhashini

    2014-01-01

    Grain amaranths, edible C4 dicots, produce pseudo-cereals high in lysine. Lysine being one of the most limiting essential amino acids in cereals and C4 photosynthesis being one of the most sought-after phenotypes in protein-rich legume crops, the genome of one of the grain amaranths is likely to play a critical role in crop research. We have sequenced the genome and transcriptome of Amaranthus hypochondriacus, a diploid (2n = 32) belonging to the order Caryophyllales with an estimated genome size of 466 Mb. Of the 411 linkage single-nucleotide polymorphisms (SNPs) reported for grain amaranths, 355 SNPs (86%) are represented in the scaffolds and 74% of the 8.6 billion bases of the sequenced transcriptome map to the genomic scaffolds. The genome of A. hypochondriacus, codes for at least 24,829 proteins, shares the paleohexaploidy event with species under the superorders Rosids and Asterids, harbours 1 SNP in 1,000 bases, and contains 13.76% of repeat elements. Annotation of all the genes in the lysine biosynthetic pathway using comparative genomics and expression analysis offers insights into the high-lysine phenotype. As the first grain species under Caryophyllales and the first C4 dicot genome reported, the work presented here will be beneficial in improving crops and in expanding our understanding of angiosperm evolution. PMID:25071079

  10. Large-Scale Sequencing: The Future of Genomic Sciences Colloquium

    SciTech Connect

    Margaret Riley; Merry Buckley

    2009-01-01

    Genetic sequencing and the various molecular techniques it has enabled have revolutionized the field of microbiology. Examining and comparing the genetic sequences borne by microbes - including bacteria, archaea, viruses, and microbial eukaryotes - provides researchers insights into the processes microbes carry out, their pathogenic traits, and new ways to use microorganisms in medicine and manufacturing. Until recently, sequencing entire microbial genomes has been laborious and expensive, and the decision to sequence the genome of an organism was made on a case-by-case basis by individual researchers and funding agencies. Now, thanks to new technologies, the cost and effort of sequencing is within reach for even the smallest facilities, and the ability to sequence the genomes of a significant fraction of microbial life may be possible. The availability of numerous microbial genomes will enable unprecedented insights into microbial evolution, function, and physiology. However, the current ad hoc approach to gathering sequence data has resulted in an unbalanced and highly biased sampling of microbial diversity. A well-coordinated, large-scale effort to target the breadth and depth of microbial diversity would result in the greatest impact. The American Academy of Microbiology convened a colloquium to discuss the scientific benefits of engaging in a large-scale, taxonomically-based sequencing project. A group of individuals with expertise in microbiology, genomics, informatics, ecology, and evolution deliberated on the issues inherent in such an effort and generated a set of specific recommendations for how best to proceed. The vast majority of microbes are presently uncultured and, thus, pose significant challenges to such a taxonomically-based approach to sampling genome diversity. However, we have yet to even scratch the surface of the genomic diversity among cultured microbes. A coordinated sequencing effort of cultured organisms is an appropriate place to begin

  11. Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize[OPEN

    PubMed Central

    Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G.; Wright, Chris L.; Koehler, Klaus; Buell, C. Robin; de Leon, Natalia

    2016-01-01

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools. PMID:27803309

  12. Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize.

    PubMed

    Hirsch, Candice N; Hirsch, Cory D; Brohammer, Alex B; Bowman, Megan J; Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G; Fields, Christopher J; Wright, Chris L; Koehler, Klaus; Springer, Nathan M; Buckler, Edward; Buell, C Robin; de Leon, Natalia; Kaeppler, Shawn M; Childs, Kevin L; Mikel, Mark A

    2016-11-01

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.

  13. Genomic and transcriptomic analysis of NDM-1 Klebsiella pneumoniae in spaceflight reveal mechanisms underlying environmental adaptability.

    PubMed

    Li, Jia; Liu, Fei; Wang, Qi; Ge, Pupu; Woo, Patrick C Y; Yan, Jinghua; Zhao, Yanlin; Gao, George F; Liu, Cui Hua; Liu, Changting

    2014-08-28

    The emergence and rapid spread of New Delhi Metallo-beta-lactamase-1 (NDM-1)-producing Klebsiella pneumoniae strains has caused a great concern worldwide. To better understand the mechanisms underlying environmental adaptation of those highly drug-resistant K. pneumoniae strains, we took advantage of the China's Shenzhou 10 spacecraft mission to conduct comparative genomic and transcriptomic analysis of a NDM-1 K. pneumoniae strain (ATCC BAA-2146) being cultivated under different conditions. The samples were recovered from semisolid medium placed on the ground (D strain), in simulated space condition (M strain), or in Shenzhou 10 spacecraft (T strain) for analysis. Our data revealed multiple variations underlying pathogen adaptation into different environments in terms of changes in morphology, H2O2 tolerance and biofilm formation ability, genomic stability and regulation of metabolic pathways. Additionally, we found a few non-coding RNAs to be differentially regulated. The results are helpful for better understanding the adaptive mechanisms of drug-resistant bacterial pathogens.

  14. A Genomic, Transcriptomic and Proteomic Look at the GE2270 Producer Planobispora rosea, an Uncommon Actinomycete

    PubMed Central

    Gallo, Giuseppe; Petiti, Luca; Corti, Giorgio; Alt, Silke; Cruz, Joao C. S.; Salzano, Anna Maria; Scaloni, Andrea; Puglia, Anna Maria; De Bellis, Gianluca; Peano, Clelia; Donadio, Stefano; Sosio, Margherita

    2015-01-01

    We report the genome sequence of Planobispora rosea ATCC 53733, a mycelium-forming soil-dweller belonging to one of the lesser studied genera of Actinobacteria and producing the thiopeptide GE2270. The P. rosea genome presents considerable convergence in gene organization and function with other members in the family Streptosporangiaceae, with a significant number (44%) of shared orthologs. Patterns of gene expression in P. rosea cultures during exponential and stationary phase have been analyzed using whole transcriptome shotgun sequencing and by proteome analysis. Among the differentially abundant proteins, those involved in protein metabolism are particularly represented, including the GE2270-insensitive EF-Tu. Two proteins from the pbt cluster, directing GE2270 biosynthesis, slightly increase their abundance values over time. While GE2270 production starts during the exponential phase, most pbt genes, as analyzed by qRT-PCR, are down-regulated. The exception is represented by pbtA, encoding the precursor peptide of the ribosomally synthesized GE2270, whose expression reached the highest level at the entry into stationary phase. PMID:26207753

  15. Heavy metals induce oxidative stress and genome-wide modulation in transcriptome of rice root.

    PubMed

    Dubey, Sonali; Shri, Manju; Misra, Prashant; Lakhwani, Deepika; Bag, Sumit Kumar; Asif, Mehar H; Trivedi, Prabodh Kumar; Tripathi, Rudro Deo; Chakrabarty, Debasis

    2014-06-01

    Industrial growth, ecological disturbances and agricultural practices have contaminated the soil and water with many harmful compounds, including heavy metals. These heavy metals affect growth and development of plants as well as cause severe human health hazards through food chain contamination. In past, studies have been made to identify biochemical and molecular networks associated with heavy metal toxicity and uptake in plants. Studies suggested that most of the physiological and molecular processes affected by different heavy metals are similar to those affected by other abiotic stresses. To identify common and unique responses by different metals, we have studied biochemical and genome-wide modulation in transcriptome of rice (IR-64 cultivar) root after exposure to cadmium (Cd), arsenate [As(V)], lead (Pb) and chromium [Cr(VI)] in hydroponic condition. We observed that root tissue shows variable responses for antioxidant enzyme system for different heavy metals. Genome-wide expression analysis suggests variable number of genes differentially expressed in root in response to As(V), Cd, Pb and Cr(VI) stresses. In addition to unique genes, each heavy metal modulated expression of a large number of common genes. Study also identified cis-acting regions of the promoters which can be determinants for the modulated expression of the genes in response to different heavy metals. Our study advances understanding related to various processes and networks which might be responsible for heavy metal stresses, accumulation and detoxification.

  16. The genome and transcriptome of the zoonotic hookworm Ancylostoma ceylanicum identify infection-specific gene families

    PubMed Central

    Schwarz, Erich M; Hu, Yan; Antoshechkin, Igor; Miller, Melanie M; Sternberg, Paul W; Aroian, Raffi V

    2015-01-01

    Hookworms infect over 400 million people, stunting and impoverishing them1–3. Sequencing hookworm genomes and finding which genes they express during infection should help in devising new drugs or vaccines against hookworms4,5. Unlike other hookworms, Ancylostoma ceylanicum infects both humans and other mammals, providing a laboratory model for hookworm disease6,7. We determined an A. ceylanicum genome sequence of 313 Mb, with transcriptomic data throughout infection showing expression of 30,738 genes. Approximately 900 genes were upregulated during early infection in vivo, including ASPRs, a cryptic subfamily of activation-associated secreted proteins (ASPs)8. Genes downregulated during early infection included ion channels and G protein–coupled receptors; this downregulation was observed in both parasitic and free-living nematodes. Later, at the onset of heavy blood feeding, C-lectin genes were upregulated along with genes for secreted clade V proteins (SCVPs), encoding a previously undescribed protein family. These findings provide new drug and vaccine targets and should help elucidate hookworm pathogenesis. PMID:25730766

  17. Genomic and transcriptomic analysis of NDM-1 Klebsiella pneumoniae in spaceflight reveal mechanisms underlying environmental adaptability

    PubMed Central

    Li, Jia; Liu, Fei; Wang, Qi; Ge, Pupu; Woo, Patrick C. Y.; Yan, Jinghua; Zhao, Yanlin; Gao, George F.; Liu, Cui Hua; Liu, Changting

    2014-01-01

    The emergence and rapid spread of New Delhi Metallo-beta-lactamase-1 (NDM-1)-producing Klebsiella pneumoniae strains has caused a great concern worldwide. To better understand the mechanisms underlying environmental adaptation of those highly drug-resistant K. pneumoniae strains, we took advantage of the China's Shenzhou 10 spacecraft mission to conduct comparative genomic and transcriptomic analysis of a NDM-1 K. pneumoniae strain (ATCC BAA-2146) being cultivated under different conditions. The samples were recovered from semisolid medium placed on the ground (D strain), in simulated space condition (M strain), or in Shenzhou 10 spacecraft (T strain) for analysis. Our data revealed multiple variations underlying pathogen adaptation into different environments in terms of changes in morphology, H2O2 tolerance and biofilm formation ability, genomic stability and regulation of metabolic pathways. Additionally, we found a few non-coding RNAs to be differentially regulated. The results are helpful for better understanding the adaptive mechanisms of drug-resistant bacterial pathogens. PMID:25163721

  18. Genome-wide primary transcriptome analysis of H2-producing archaeon Thermococcus onnurineus NA1.

    PubMed

    Cho, Suhyung; Kim, Min-Sik; Jeong, Yujin; Lee, Bo-Rahm; Lee, Jung-Hyun; Kang, Sung Gyun; Cho, Byung-Kwan

    2017-02-20

    In spite of their pivotal roles in transcriptional and post-transcriptional processes, the regulatory elements of archaeal genomes are not yet fully understood. Here, we determine the primary transcriptome of the H2-producing archaeon Thermococcus onnurineus NA1. We identified 1,082 purine-rich transcription initiation sites along with well-conserved TATA box, A-rich B recognition element (BRE), and promoter proximal element (PPE) motif in promoter regions, a high pyrimidine nucleotide content (T/C) at the -1 position, and Shine-Dalgarno (SD) motifs (GGDGRD) in 5' untranslated regions (5' UTRs). Along with differential transcript levels, 117 leaderless genes and 86 non-coding RNAs (ncRNAs) were identified, representing diverse cellular functions and potential regulatory functions under the different growth conditions. Interestingly, we observed low GC content in ncRNAs for RNA-based regulation via unstructured forms or interaction with other cellular components. Further comparative analysis of T. onnurineus upstream regulatory sequences with those of closely related archaeal genomes demonstrated that transcription of orthologous genes are initiated by highly conserved promoter sequences, however their upstream sequences for transcriptional and translational regulation are largely diverse. These results provide the genetic information of T. onnurineus for its future application in metabolic engineering.

  19. Genomic and transcriptomic analysis of imatinib resistance in gastrointestinal stromal tumors

    PubMed Central

    Takahashi, Tsuyoshi; Elzawahry, Asmaa; Mimaki, Sachiyo; Furukawa, Eisaku; Nakatsuka, Rie; Nakamura, Hiromi; Nishigaki, Takahiko; Serada, Satoshi; Naka, Tetsuji; Hirota, Seiichi; Shibata, Tatsuhiro; Tsuchihara, Katsuya

    2017-01-01

    Gastrointestinal stromal tumors represent the most common mesenchymal tumor of the digestive tract, driven by gain‐of‐function mutations in KIT. Despite its proven benefits, half of the patients treated with imatinib show disease progression within 2 years due to secondary resistance mutations in KIT. It remains unclear how the genomic and transcriptomic features change during the acquisition of imatinib resistance. Here, we performed exome sequencing and microarray transcription analysis for four imatinib‐resistant cell lines and one cell line briefly exposed to imatinib. We also performed exome sequencing of clinical tumor samples. The cell line briefly exposed to imatinib exhibited few single‐nucleotide variants and copy‐number alterations, but showed marked upregulation of genes related to detoxification and downregulation of genes involved in cell cycle progression. Meanwhile, resistant cell lines harbored numerous genomic changes: amplified genes related to detoxification and deleted genes with cyclin‐dependent kinase activity. Some variants in the resistant samples were traced back to the drug‐sensitive samples, indicating the presence of ancestral subpopulations. The subpopulations carried variants associated with cell death. Pre‐existing cancer cells with genetic alterations promoting apoptosis resistance may serve as a basis whereby cancer cells with critical mutations, such as secondary KIT mutations, can establish full imatinib resistance. © 2017 The Authors Genes, Chromosomes and Cancer Published by Wiley Periodicals, Inc. PMID:27997714

  20. Genome-wide primary transcriptome analysis of H2-producing archaeon Thermococcus onnurineus NA1

    PubMed Central

    Cho, Suhyung; Kim, Min-Sik; Jeong, Yujin; Lee, Bo-Rahm; Lee, Jung-Hyun; Kang, Sung Gyun; Cho, Byung-Kwan

    2017-01-01

    In spite of their pivotal roles in transcriptional and post-transcriptional processes, the regulatory elements of archaeal genomes are not yet fully understood. Here, we determine the primary transcriptome of the H2-producing archaeon Thermococcus onnurineus NA1. We identified 1,082 purine-rich transcription initiation sites along with well-conserved TATA box, A-rich B recognition element (BRE), and promoter proximal element (PPE) motif in promoter regions, a high pyrimidine nucleotide content (T/C) at the −1 position, and Shine-Dalgarno (SD) motifs (GGDGRD) in 5′ untranslated regions (5′ UTRs). Along with differential transcript levels, 117 leaderless genes and 86 non-coding RNAs (ncRNAs) were identified, representing diverse cellular functions and potential regulatory functions under the different growth conditions. Interestingly, we observed low GC content in ncRNAs for RNA-based regulation via unstructured forms or interaction with other cellular components. Further comparative analysis of T. onnurineus upstream regulatory sequences with those of closely related archaeal genomes demonstrated that transcription of orthologous genes are initiated by highly conserved promoter sequences, however their upstream sequences for transcriptional and translational regulation are largely diverse. These results provide the genetic information of T. onnurineus for its future application in metabolic engineering. PMID:28216628

  1. Population genomic footprints of fine-scale differentiation between habitats in Mediterranean blue tits.

    PubMed

    Szulkin, M; Gagnaire, P-A; Bierne, N; Charmantier, A

    2016-01-01

    Linking population genetic variation to the spatial heterogeneity of the environment is of fundamental interest to evolutionary biology and ecology, in particular when phenotypic differences between populations are observed at biologically small spatial scales. Here, we applied restriction-site associated DNA sequencing (RAD-Seq) to test whether phenotypically differentiated populations of wild blue tits (Cyanistes caeruleus) breeding in a highly heterogeneous environment exhibit genetic structure related to habitat type. Using 12 106 SNPs in 197 individuals from deciduous and evergreen oak woodlands, we applied complementary population genomic analyses, which revealed that genetic variation is influenced by both geographical distance and habitat type. A fine-scale genetic differentiation supported by genome- and transcriptome-wide analyses was found within Corsica, between two adjacent habitats where blue tits exhibit marked differences in breeding time while nesting < 6 km apart. Using redundancy analysis (RDA), we show that genomic variation remains associated with habitat type when controlling for spatial and temporal effects. Finally, our results suggest that the observed patterns of genomic differentiation were not driven by a small proportion of highly differentiated loci, but rather emerged through a process such as habitat choice, which reduces gene flow between habitats across the entire genome. The pattern of genomic isolation-by-environment closely matches differentiation observed at the phenotypic level, thereby offering significant potential for future inference of phenotype-genotype associations in a heterogeneous environment.

  2. The First Chameleon Transcriptome: Comparative Genomic Analysis of the OXPHOS System Reveals Loss of COX8 in Iguanian Lizards

    PubMed Central

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system. PMID:24009133

  3. The first Chameleon transcriptome: comparative genomic analysis of the OXPHOS system reveals loss of COX8 in Iguanian lizards.

    PubMed

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system.

  4. Integrative Analysis of Genomics and Transcriptome Data to Identify Potential Functional Genes of BMDs in Females.

    PubMed

    Chen, Yuan-Cheng; Guo, Yan-Fang; He, Hao; Lin, Xu; Wang, Xia-Fang; Zhou, Rou; Li, Wen-Ting; Pan, Dao-Yan; Shen, Jie; Deng, Hong-Wen

    2016-05-01

    Osteoporosis is known to be highly heritable. However, to date, the findings from more than 20 genome-wide association studies (GWASs) have explained less than 6% of genetic risks. Studies suggest that the missing heritability data may be because of joint effects among genes. To identify novel heritability for osteoporosis, we performed a system-level study on bone mineral density (BMD) by weighted gene coexpression network analysis (WGCNA), using the largest GWAS data set for BMD in the field, Genetic Factors for Osteoporosis Consortium (GEFOS-2), and a transcriptomic gene expression data set generated from transiliac bone biopsies in women. A weighted gene coexpression network was generated for 1574 genes with GWAS nominal evidence of association (p ≤ 0.05) based on dissimilarity measurement on the expression data. Twelve distinct gene modules were identified, and four modules showed nominally significant associations with BMD (p ≤ 0.05), but only one module, the yellow module, demonstrated a good correlation between module membership (MM) and gene significance (GS), suggesting that the yellow module serves an important biological role in bone regulation. Interestingly, through characterization of module content and topology, the yellow module was found to be significantly enriched with contractile fiber part (GO:044449), which is widely recognized as having a close relationship between muscle and bone. Furthermore, detailed submodule analyses of important candidate genes (HOMER1, SPTBN1) by all edges within the yellow module implied significant enrichment of functional connections between bone and cytoskeletal protein binding. Our study yielded novel information from system genetics analyses of GWAS data jointly with transcriptomic data. The findings highlighted a module and several genes in the model as playing important roles in the regulation of bone mass in females, which may yield novel insights into the genetic basis of osteoporosis. © 2016

  5. Transcriptomic signature to oxidative stress exposure at the time of embryonic genome activation in bovine blastocysts.

    PubMed

    Cagnone, Gael L M; Sirard, Marc-André

    2013-04-01

    In order to understand how in vitro culture affects embryonic quality, we analyzed survival and global gene expression in bovine blastocysts after exposure to increased oxidative stress conditions. Two pro-oxidant agents, one that acts extracellularly by promoting reactive oxygen species (ROS) production (0.01 mM 2,2'-azobis (2-amidinopropane) dihydrochloride [AAPH]) or another that acts intracellularly by inhibiting glutathione synthesis (0.4 mM buthionine sulfoximine [BSO]) were added separately to in vitro culture media from Day 3 (8-16-cell stage) onward. Transcriptomic analysis was then performed on resulting Day-7 blastocysts. In the literature, these two pro-oxidant conditions were shown to induce delayed degeneration in a proportion of Day-8 blastocysts. In our experiment, no morphological difference was visible, but AAPH tended to decrease the blastocyst rate while BSO significantly reduced it, indicating a differential impact on the surviving population. At the transcriptomic level, blastocysts that survived either pro-oxidant exposure showed oxidative stress and an inflammatory response (ARRB2), although AAPH induced higher disturbances in cellular homeostasis (SERPINE1). Functional genomics of the BSO profile, however, identified differential expression of genes related to glycine metabolism and energy metabolism (TPI1). These differential features might be indicative of pre-degenerative blastocysts (IGFBP7) in the AAPH population whereas BSO exposure would select the most viable individuals (TKDP1). Together, these results illustrate how oxidative disruption of pre-attachment development is associated with systematic up-regulation of several metabolic markers. Moreover, it indicates that a better capacity to survive anti-oxidant depletion may allow for the survival of blastocysts with a quieter metabolism after compaction.

  6. Complete Genome Sequence of Sporisorium scitamineum and Biotrophic Interaction Transcriptome with Sugarcane.

    PubMed

    Taniguti, Lucas M; Schaker, Patricia D C; Benevenuto, Juliana; Peters, Leila P; Carvalho, Giselle; Palhares, Alessandra; Quecine, Maria C; Nunes, Filipe R S; Kmit, Maria C P; Wai, Alvan; Hausner, Georg; Aitken, Karen S; Berkman, Paul J; Fraser, James A; Moolhuijzen, Paula M; Coutinho, Luiz L; Creste, Silvana; Vieira, Maria L C; Kitajima, João P; Monteiro-Vitorello, Claudia B

    2015-01-01

    Sporisorium scitamineum is a biotrophic fungus responsible for the sugarcane smut, a worldwide spread disease. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previous available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence) revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data was also used to reassure gene predictions.

  7. Genomic and transcriptomic insights into the thermo-regulated biosynthesis of validamycin in Streptomyces hygroscopicus 5008

    PubMed Central

    2012-01-01

    Background Streptomyces hygroscopicus 5008 has been used for the production of the antifungal validamycin/jinggangmycin for more than 40 years. A high yield of validamycin is achieved by culturing the strain at 37°C, rather than at 30°C for normal growth and sporulation. The mechanism(s) of its thermo-regulated biosynthesis was largely unknown. Results The 10,383,684-bp genome of strain 5008 was completely sequenced and composed of a linear chromosome, a 164.57-kb linear plasmid, and a 73.28-kb circular plasmid. Compared with other Streptomyces genomes, the chromosome of strain 5008 has a smaller core region and shorter terminal inverted repeats, encodes more α/β hydrolases, major facilitator superfamily transporters, and Mg2+/Mn2+-dependent regulatory phosphatases. Transcriptomic analysis revealed that the expression of 7.5% of coding sequences was increased at 37°C, including biosynthetic genes for validamycin and other three secondary metabolites. At 37°C, a glutamate dehydrogenase was transcriptionally up-regulated, and further proved its involvement in validamycin production by gene replacement. Moreover, efficient synthesis and utilization of intracellular glutamate were noticed in strain 5008 at 37°C, revealing glutamate as the nitrogen source for validamycin biosynthesis. Furthermore, a SARP-family regulatory gene with enhanced transcription at 37°C was identified and confirmed to be positively involved in the thermo-regulation of validamycin production by gene inactivation and transcriptional analysis. Conclusions Strain 5008 seemed to have evolved with specific genomic components to facilitate the thermo-regulated validamycin biosynthesis. The data obtained here will facilitate future studies for validamycin yield improvement and industrial bioprocess optimization. PMID:22827618

  8. Complete Genome Sequence of Sporisorium scitamineum and Biotrophic Interaction Transcriptome with Sugarcane

    PubMed Central

    Benevenuto, Juliana; Peters, Leila P.; Carvalho, Giselle; Palhares, Alessandra; Quecine, Maria C.; Nunes, Filipe R. S.; Kmit, Maria C. P.; Wai, Alvan; Hausner, Georg; Aitken, Karen S.; Berkman, Paul J.; Fraser, James A.; Moolhuijzen, Paula M.; Coutinho, Luiz L.; Creste, Silvana; Vieira, Maria L. C.; Kitajima, João P.; Monteiro-Vitorello, Claudia B.

    2015-01-01

    Sporisorium scitamineum is a biotrophic fungus responsible for the sugarcane smut, a worldwide spread disease. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previous available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence) revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data was also used to reassure gene predictions. PMID:26065709

  9. From genes to milk: genomic organization and epigenetic regulation of the mammary transcriptome.

    PubMed

    Lemay, Danielle G; Pollard, Katherine S; Martin, William F; Freeman Zadrowski, Courtneay; Hernandez, Joseph; Korf, Ian; German, J Bruce; Rijnkels, Monique

    2013-01-01

    Even in genomes lacking operons, a gene's position in the genome influences its potential for expression. The mechanisms by which adjacent genes are co-expressed are still not completely understood. Using lactation and the mammary gland as a model system, we explore the hypothesis that chromatin state contributes to the co-regulation of gene neighborhoods. The mammary gland represents a unique evolutionary model, due to its recent appearance, in the context of vertebrate genomes. An understanding of how the mammary gland is regulated to produce milk is also of biomedical and agricultural importance for human lactation and dairying. Here, we integrate epigenomic and transcriptomic data to develop a comprehensive regulatory model. Neighborhoods of mammary-expressed genes were determined using expression data derived from pregnant and lactating mice and a neighborhood scoring tool, G-NEST. Regions of open and closed chromatin were identified by ChIP-Seq of histone modifications H3K36me3, H3K4me2, and H3K27me3 in the mouse mammary gland and liver tissue during lactation. We found that neighborhoods of genes in regions of uniquely active chromatin in the lactating mammary gland, compared with liver tissue, were extremely rare. Rather, genes in most neighborhoods were suppressed during lactation as reflected in their expression levels and their location in regions of silenced chromatin. Chromatin silencing was largely shared between the liver and mammary gland during lactation, and what distinguished the mammary gland was mainly a small tissue-specific repertoire of isolated, expressed genes. These findings suggest that an advantage of the neighborhood organization is in the collective repression of groups of genes via a shared mechanism of chromatin repression. Genes essential to the mammary gland's uniqueness are isolated from neighbors, and likely have less tolerance for variation in expression, properties they share with genes responsible for an organism's survival.

  10. Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta).

    PubMed

    Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan

    2016-07-01

    The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks ) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed.

  11. Detection of driver protein complexes in breast cancer metastasis by large-scale transcriptome-interactome integration.

    PubMed

    Garcia, Maxime; Finetti, Pascal; Bertucci, Francois; Birnbaum, Daniel; Bidaut, Ghislain

    2014-01-01

    With the development of high-throughput gene expression profiling technologies came the opportunity to define genomic signatures predicting clinical condition or cancer patient outcome. However, such signatures show dependency on training set, lack of generalization, and instability, partly due to microarray data topology. Additional issues for analyzing tumor gene expression are that subtle molecular perturbations in driver genes leading to cancer and metastasis (masked in typical differential expression analysis) may provoke expression changes of greater amplitude in downstream genes (easily detected). In this chapter, we are describing an interactome-based algorithm, Interactome-Transcriptome Integration (ITI) that is used to find a generalizable signature for prediction of breast cancer relapse by superimposition of a large-scale protein-protein interaction data (human interactome) over several gene expression datasets. ITI extracts regions in the interactome whose expression is discriminating for predicting relapse-free survival in cancer and allow detection of subnetworks that constitutes a generalizable and stable genomic signature. In this chapter, we describe the practical aspects of running the full ITI pipeline (subnetwork detection and classification) on six microarray datasets.

  12. Identification of Candidate Adherent-Invasive E. coli Signature Transcripts by Genomic/Transcriptomic Analysis

    PubMed Central

    Zhang, Yuanhao; Rowehl, Leahana; Krumsiek, Julia M.; Orner, Erika P.; Shaikh, Nurmohammad; Tarr, Phillip I.; Sodergren, Erica; Weinstock, George M.; Boedeker, Edgar C.; Xiong, Xuejian; Parkinson, John; Frank, Daniel N.; Li, Ellen; Gathungu, Grace

    2015-01-01

    quantitative polymerase chain reaction assays for 6 genes were conducted on fecal and ileal RNA samples from 22 inflammatory bowel disease (IBD), and 32 patients without IBD (non-IBD). The expression of Cas loci was detected in a higher proportion of CD than non-IBD fecal and ileal RNA samples (p <0.05). These results support a comparative genomic/transcriptomic approach towards identifying candidate AIEC signature transcripts. PMID:26125937

  13. Identification of Candidate Adherent-Invasive E. coli Signature Transcripts by Genomic/Transcriptomic Analysis.

    PubMed

    Zhang, Yuanhao; Rowehl, Leahana; Krumsiek, Julia M; Orner, Erika P; Shaikh, Nurmohammad; Tarr, Phillip I; Sodergren, Erica; Weinstock, George M; Boedeker, Edgar C; Xiong, Xuejian; Parkinson, John; Frank, Daniel N; Li, Ellen; Gathungu, Grace

    2015-01-01

    quantitative polymerase chain reaction assays for 6 genes were conducted on fecal and ileal RNA samples from 22 inflammatory bowel disease (IBD), and 32 patients without IBD (non-IBD). The expression of Cas loci was detected in a higher proportion of CD than non-IBD fecal and ileal RNA samples (p <0.05). These results support a comparative genomic/transcriptomic approach towards identifying candidate AIEC signature transcripts.

  14. Analysis of Plant Pan-Genomes and Transcriptomes with GET_HOMOLOGUES-EST, a Clustering Solution for Sequences of the Same Species.

    PubMed

    Contreras-Moreira, Bruno; Cantalapiedra, Carlos P; García-Pereira, María J; Gordon, Sean P; Vogel, John P; Igartua, Ernesto; Casas, Ana M; Vinuesa, Pablo

    2017-01-01

    The pan-genome of a species is defined as the union of all the genes and non-coding sequences found in all its individuals. However, constructing a pan-genome for plants with large genomes is daunting both in sequencing cost and the scale of the required computational analysis. A more affordable alternative is to focus on the genic repertoire by using transcriptomic data. Here, the software GET_HOMOLOGUES-EST was benchmarked with genomic and RNA-seq data of 19 Arabidopsis thaliana ecotypes and then applied to the analysis of transcripts from 16 Hordeum vulgare genotypes. The goal was to sample their pan-genomes and classify sequences as core, if detected in all accessions, or accessory, when absent in some of them. The resulting sequence clusters were used to simulate pan-genome growth, and to compile Average Nucleotide Identity matrices that summarize intra-species variation. Although transcripts were found to under-estimate pan-genome size by at least 10%, we concluded that clusters of expressed sequences can recapitulate phylogeny and reproduce two properties observed in A. thaliana gene models: accessory loci show lower expression and higher non-synonymous substitution rates than core genes. Finally, accessory sequences were observed to preferentially encode transposon components in both species, plus disease resistance genes in cultivated barleys, and a variety of protein domains from other families that appear frequently associated with presence/absence variation in the literature. These results demonstrate that pan-genome analyses are useful to explore germplasm diversity.

  15. Analysis of Plant Pan-Genomes and Transcriptomes with GET_HOMOLOGUES-EST, a Clustering Solution for Sequences of the Same Species

    PubMed Central

    Contreras-Moreira, Bruno; Cantalapiedra, Carlos P.; García-Pereira, María J.; Gordon, Sean P.; Vogel, John P.; Igartua, Ernesto; Casas, Ana M.; Vinuesa, Pablo

    2017-01-01

    The pan-genome of a species is defined as the union of all the genes and non-coding sequences found in all its individuals. However, constructing a pan-genome for plants with large genomes is daunting both in sequencing cost and the scale of the required computational analysis. A more affordable alternative is to focus on the genic repertoire by using transcriptomic data. Here, the software GET_HOMOLOGUES-EST was benchmarked with genomic and RNA-seq data of 19 Arabidopsis thaliana ecotypes and then applied to the analysis of transcripts from 16 Hordeum vulgare genotypes. The goal was to sample their pan-genomes and classify sequences as core, if detected in all accessions, or accessory, when absent in some of them. The resulting sequence clusters were used to simulate pan-genome growth, and to compile Average Nucleotide Identity matrices that summarize intra-species variation. Although transcripts were found to under-estimate pan-genome size by at least 10%, we concluded that clusters of expressed sequences can recapitulate phylogeny and reproduce two properties observed in A. thaliana gene models: accessory loci show lower expression and higher non-synonymous substitution rates than core genes. Finally, accessory sequences were observed to preferentially encode transposon components in both species, plus disease resistance genes in cultivated barleys, and a variety of protein domains from other families that appear frequently associated with presence/absence variation in the literature. These results demonstrate that pan-genome analyses are useful to explore germplasm diversity. PMID:28261241

  16. Comparative description of ten transcriptomes of newly sequenced invertebrates and efficiency estimation of genomic sampling in non-model taxa

    PubMed Central

    2012-01-01

    Introduction Traditionally, genomic or transcriptomic data have been restricted to a few model or emerging model organisms, and to a handful of species of medical and/or environmental importance. Next-generation sequencing techniques have the capability of yielding massive amounts of gene sequence data for virtually any species at a modest cost. Here we provide a comparative analysis of de novo assembled transcriptomic data for ten non-model species of previously understudied animal taxa. Results cDNA libraries of ten species belonging to five animal phyla (2 Annelida [including Sipuncula], 2 Arthropoda, 2 Mollusca, 2 Nemertea, and 2 Porifera) were sequenced in different batches with an Illumina Genome Analyzer II (read length 100 or 150 bp), rendering between ca. 25 and 52 million reads per species. Read thinning, trimming, and de novo assembly were performed under different parameters to optimize output. Between 67,423 and 207,559 contigs were obtained across the ten species, post-optimization. Of those, 9,069 to 25,681 contigs retrieved blast hits against the NCBI non-redundant database, and approximately 50% of these were assigned with Gene Ontology terms, covering all major categories, and with similar percentages in all species. Local blasts against our datasets, using selected genes from major signaling pathways and housekeeping genes, revealed high efficiency in gene recovery compared to available genomes of closely related species. Intriguingly, our transcriptomic datasets detected multiple paralogues in all phyla and in nearly all gene pathways, including housekeeping genes that are traditionally used in phylogenetic applications for their purported single-copy nature. Conclusions We generated the first study of comparative transcriptomics across multiple animal phyla (comparing two species per phylum in most cases), established the first Illumina-based transcriptomic datasets for sponge, nemertean, and sipunculan species, and generated a tractable

  17. Comparative genomic hybridization and transcriptome analysis with a pan-genome microarray reveal distinctions between JP2 and non-JP2 genotypes of Aggregatibacter actinomycetemcomitans.

    PubMed

    Huang, Y; Kittichotirat, W; Mayer, M P A; Hall, R; Bumgarner, R; Chen, C

    2013-02-01

    It was postulated that the highly virulent JP2 genotype of Aggregatibacter actinomycetemcomitans may possess a constellation of distinct virulence determinants not found in non-JP2 genotypes. This study compared the genome content and the transcriptome of the serotype b JP2 genotype and the closely related serotype b non-JP2 genotype of A. actinomycetemcomitans. A custom-designed pan-genomic microarray of A. actinomycetemcomitans was constructed and validated against a panel of 11 sequenced reference strains. The microarray was subsequently used for comparative genomic hybridization of serotype b strains of JP2 (six strains) and non-JP2 (six strains) genotypes, and for transcriptome analysis of strains of JP2 (three strains) and non-JP2 (two strains). Two JP2-specific and two non-JP2-specific genomic islands were identified. In one instance, distinct genomic islands were found to be inserted into the same locus among strains of different genotypes. Transcriptome analysis identified five operons, including the leukotoxin operon, to have at least two genes with an expression ratio of 2 or greater between genotypes. Two of the differentially expressed operons were members of the membrane-bound nitrate reductase system (nap operon) and the Tol-Pal system of gram-negative bacterial species. This study is the first to demonstrate the differences in the full genome content and gene expression between A. actinomycetemcomitans strains of JP2 and non-JP2 genotypes. The information is essential for designing hypothesis-driven experiments to examine the pathogenic mechanisms of A. actinomycetemcomitans.

  18. Large-Scale Transcriptome Analysis in Faba Bean (Vicia faba L.) under Ascochyta fabae Infection.

    PubMed

    Ocaña, Sara; Seoane, Pedro; Bautista, Rocio; Palomino, Carmen; Claros, Gonzalo M; Torres, Ana M; Madrid, Eva

    2015-01-01

    Faba bean is an important food crop worldwide. However, progress in faba bean genomics lags far behind that of model systems due to limited availability of genetic and genomic information. Using the Illumina platform the faba bean transcriptome from leaves of two lines (29H and Vf136) subjected to Ascochyta fabae infection have been characterized. De novo transcriptome assembly provided a total of 39,185 different transcripts that were functionally annotated, and among these, 13,266 were assigned to gene ontology against Arabidopsis. Quality of the assembly was validated by RT-qPCR amplification of selected transcripts differentially expressed. Comparison of faba bean transcripts with those of better-characterized plant genomes such as Arabidopsis thaliana, Medicago truncatula and Cicer arietinum revealed a sequence similarity of 68.3%, 72.8% and 81.27%, respectively. Moreover, 39,060 single nucleotide polymorphism (SNP) and 3,669 InDels were identified for genotyping applications. Mapping of the sequence reads generated onto the assembled transcripts showed that 393 and 457 transcripts were overexpressed in the resistant (29H) and susceptible genotype (Vf136), respectively. Transcripts involved in plant-pathogen interactions such as leucine rich proteins (LRR) or plant growth regulators involved in plant adaptation to abiotic and biotic stresses were found to be differently expressed in the resistant line. The results reported here represent the most comprehensive transcript database developed so far in faba bean, providing valuable information that could be used to gain insight into the pathways involved in the resistance mechanism against A. fabae and to identify potential resistance genes to be further used in marker assisted selection.

  19. Large-Scale Transcriptome Analysis in Faba Bean (Vicia faba L.) under Ascochyta fabae Infection

    PubMed Central

    Ocaña, Sara; Seoane, Pedro; Bautista, Rocio; Palomino, Carmen; Claros, Gonzalo M.; Torres, Ana M.; Madrid, Eva

    2015-01-01

    Faba bean is an important food crop worldwide. However, progress in faba bean genomics lags far behind that of model systems due to limited availability of genetic and genomic information. Using the Illumina platform the faba bean transcriptome from leaves of two lines (29H and Vf136) subjected to Ascochyta fabae infection have been characterized. De novo transcriptome assembly provided a total of 39,185 different transcripts that were functionally annotated, and among these, 13,266 were assigned to gene ontology against Arabidopsis. Quality of the assembly was validated by RT-qPCR amplification of selected transcripts differentially expressed. Comparison of faba bean transcripts with those of better-characterized plant genomes such as Arabidopsis thaliana, Medicago truncatula and Cicer arietinum revealed a sequence similarity of 68.3%, 72.8% and 81.27%, respectively. Moreover, 39,060 single nucleotide polymorphism (SNP) and 3,669 InDels were identified for genotyping applications. Mapping of the sequence reads generated onto the assembled transcripts showed that 393 and 457 transcripts were overexpressed in the resistant (29H) and susceptible genotype (Vf136), respectively. Transcripts involved in plant-pathogen interactions such as leucine rich proteins (LRR) or plant growth regulators involved in plant adaptation to abiotic and biotic stresses were found to be differently expressed in the resistant line. The results reported here represent the most comprehensive transcript database developed so far in faba bean, providing valuable information that could be used to gain insight into the pathways involved in the resistance mechanism against A. fabae and to identify potential resistance genes to be further used in marker assisted selection. PMID:26267359

  20. Comparative and Transcriptome Analyses Uncover Key Aspects of Coding- and Long Noncoding RNAs in Flatworm Mitochondrial Genomes

    PubMed Central

    Ross, Eric; Blair, David; Guerrero-Hernández, Carlos; Alvarado, Alejandro Sánchez

    2016-01-01

    Exploiting the conservation of various features of mitochondrial genomes has been instrumental in resolving phylogenetic relationships. Despite extensive sequence evidence, it has not previously been possible to conclusively resolve some key aspects of flatworm mitochondrial genomes, including generally conserved traits, such as start codons, noncoding regions, the full complement of tRNAs, and whether ATP8 is, or is not, encoded by this extranuclear genome. In an effort to address these difficulties, we sought to determine the mitochondrial transcriptomes and genomes of sexual and asexual taxa of freshwater triclads, a group previously poorly represented in flatworm mitogenomic studies. We have discovered evidence for an alternative start codon, an extended cox1 gene, a previously undescribed conserved open reading frame, long noncoding RNAs, and a highly conserved gene order across the large evolutionary distances represented within the triclads. Our findings contribute to the expansion and refinement of mitogenomics to address evolutionary issues in this diverse group of animals. PMID:26921295

  1. The Genomics, Epigenomics, and Transcriptomics of HPV-Associated Oropharyngeal Cancer--Understanding the Basis of a Rapidly Evolving Disease.

    PubMed

    Lechner, M; Fenton, T R

    2016-01-01

    Human papillomavirus (HPV) has been shown to represent a major independent risk factor for head and neck squamous cell cancer, in particular for oropharyngeal carcinoma. This type of cancer is rapidly evolving in the Western world, with rising trends particularly in the young, and represents a distinct epidemiological, clinical, and molecular entity. It is the aim of this review to give a detailed description of genomic, epigenomic, transcriptomic, and posttranscriptional changes that underlie the phenotype of this deadly disease. The review will also link these changes and examine what is known about the interactions between the host genome and viral genome, and investigate changes specific for the viral genome. These data are then integrated into an updated model of HPV-induced head and neck carcinogenesis.

  2. Merkel Cell Polyomavirus Exhibits Dominant Control of the Tumor Genome and Transcriptome in Virus-Associated Merkel Cell Carcinoma

    PubMed Central

    Starrett, Gabriel J.; Marcelus, Christina; Cantalupo, Paul G.; Katz, Joshua P.; Cheng, Jingwei; Akagi, Keiko; Thakuria, Manisha; Rabinowits, Guilherme; Wang, Linda C.; Symer, David E.; Pipas, James M.

    2017-01-01

    ABSTRACT Merkel cell polyomavirus is the primary etiological agent of the aggressive skin cancer Merkel cell carcinoma (MCC). Recent studies have revealed that UV radiation is the primary mechanism for somatic mutagenesis in nonviral forms of MCC. Here, we analyze the whole transcriptomes and genomes of primary MCC tumors. Our study reveals that virus-associated tumors have minimally altered genomes compared to non-virus-associated tumors, which are dominated by UV-mediated mutations. Although virus-associated tumors contain relatively small mutation burdens, they exhibit a distinct mutation signature with observable transcriptionally biased kataegic events. In addition, viral integration sites overlap focal genome amplifications in virus-associated tumors, suggesting a potential mechanism for these events. Collectively, our studies indicate that Merkel cell polyomavirus is capable of hijacking cellular processes and driving tumorigenesis to the same severity as tens of thousands of somatic genome alterations. PMID:28049147

  3. Genomics, transcriptomics and proteomics: enabling insights into social evolution and disease challenges for managed and wild bees.

    PubMed

    Trapp, Judith; McAfee, Alison; Foster, Leonard J

    2017-02-01

    Globally, there are over 20 000 bee species (Hymenoptera: Apoidea: Anthophila) with a host of biologically fascinating characteristics. Although they have long been studied as models for social evolution, recent challenges to bee health (mainly diseases and pesticides) have gathered the attention of both public and research communities. Genome sequences of twelve bee species are now complete or under progress, facilitating the application of additional 'omic technologies. Here, we review recent developments in honey bee and native bee research in the genomic era. We discuss the progress in genome sequencing and functional annotation, followed by the enabled comparative genomics, proteomics and transcriptomics applications regarding social evolution and health. Finally, we end with comments on future challenges in the postgenomic era.

  4. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics

    PubMed Central

    Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  5. Breast cancer genome and transcriptome integration implicates specific mutational signatures with immune cell infiltration.

    PubMed

    Smid, Marcel; Rodríguez-González, F Germán; Sieuwerts, Anieta M; Salgado, Roberto; Prager-Van der Smissen, Wendy J C; Vlugt-Daane, Michelle van der; van Galen, Anne; Nik-Zainal, Serena; Staaf, Johan; Brinkman, Arie B; van de Vijver, Marc J; Richardson, Andrea L; Fatima, Aquila; Berentsen, Kim; Butler, Adam; Martin, Sancha; Davies, Helen R; Debets, Reno; Gelder, Marion E Meijer-Van; van Deurzen, Carolien H M; MacGrogan, Gaëtan; Van den Eynden, Gert G G M; Purdie, Colin; Thompson, Alastair M; Caldas, Carlos; Span, Paul N; Simpson, Peter T; Lakhani, Sunil R; Van Laere, Steven; Desmedt, Christine; Ringnér, Markus; Tommasi, Stefania; Eyford, Jorunn; Broeks, Annegien; Vincent-Salomon, Anne; Futreal, P Andrew; Knappskog, Stian; King, Tari; Thomas, Gilles; Viari, Alain; Langerød, Anita; Børresen-Dale, Anne-Lise; Birney, Ewan; Stunnenberg, Hendrik G; Stratton, Mike; Foekens, John A; Martens, John W M

    2016-09-26

    A recent comprehensive whole genome analysis of a large breast cancer cohort was used to link known and novel drivers and substitution signatures to the transcriptome of 266 cases. Here, we validate that subtype-specific aberrations show concordant expression changes for, for example, TP53, PIK3CA, PTEN, CCND1 and CDH1. We find that CCND3 expression levels do not correlate with amplification, while increased GATA3 expression in mutant GATA3 cancers suggests GATA3 is an oncogene. In luminal cases the total number of substitutions, irrespective of type, associates with cell cycle gene expression and adverse outcome, whereas the number of mutations of signatures 3 and 13 associates with immune-response specific gene expression, increased numbers of tumour-infiltrating lymphocytes and better outcome. Thus, while earlier reports imply that the sheer number of somatic aberrations could trigger an immune-response, our data suggests that substitutions of a particular type are more effective in doing so than others.

  6. Genome-Wide Transcriptome Profiling of Region-Specific Vulnerability to Oxidative Stress in the Hippocampus

    PubMed Central

    Wang, Xinkun; Pal, Ranu; Chen, Xue-wen; Kumar, Keshava N.; Kim, Ok-Jin; Michaelis, Elias K.

    2007-01-01

    Neurons in the hippocampal CA1 region are particularly sensitive to oxidative stress (OS), whereas those in CA3 are resistant. To uncover mechanisms for selective CA1 vulnerability to OS, we treated organotypic hippocampal slices with duroquinone and compared transcriptional profiles of CA1 vs. CA3 cells at various intervals. Gene Ontology and biological pathway analyses of differentially expressed genes showed that at all time points, CA1 had higher transcriptional activity of stress/inflammatory response, transition metal transport, ferroxidase, and pre-synaptic signaling activity, while CA3 had higher GABA-signaling, postsynaptic, and calcium and potassium channel activity. Real-time PCR and immunoblots confirmed the transcriptome data and the induction of OS by duroquinone in both hippocampal regions. Our functional genomics approach has identified in CA1 cells molecular pathways as well as unique genes, such as, guanosine deaminase, lipocalin2, synaptotagmin 4, and latrophilin 2, whose time-dependent induction following the initiation of OS may represent attempts at neurite outgrowth, synaptic recovery, and resistance against OS. PMID:17553663

  7. Breast cancer genome and transcriptome integration implicates specific mutational signatures with immune cell infiltration

    PubMed Central

    Smid, Marcel; Rodríguez-González, F. Germán; Sieuwerts, Anieta M.; Salgado, Roberto; Prager-Van der Smissen, Wendy J. C.; Vlugt-Daane, Michelle van der; van Galen, Anne; Nik-Zainal, Serena; Staaf, Johan; Brinkman, Arie B.; van de Vijver, Marc J.; Richardson, Andrea L.; Fatima, Aquila; Berentsen, Kim; Butler, Adam; Martin, Sancha; Davies, Helen R.; Debets, Reno; Gelder, Marion E. Meijer-Van; van Deurzen, Carolien H. M.; MacGrogan, Gaëtan; Van den Eynden, Gert G. G. M.; Purdie, Colin; Thompson, Alastair M.; Caldas, Carlos; Span, Paul N.; Simpson, Peter T.; Lakhani, Sunil R.; Van Laere, Steven; Desmedt, Christine; Ringnér, Markus; Tommasi, Stefania; Eyford, Jorunn; Broeks, Annegien; Vincent-Salomon, Anne; Futreal, P. Andrew; Knappskog, Stian; King, Tari; Thomas, Gilles; Viari, Alain; Langerød, Anita; Børresen-Dale, Anne-Lise; Birney, Ewan; Stunnenberg, Hendrik G.; Stratton, Mike; Foekens, John A.; Martens, John W. M.

    2016-01-01

    A recent comprehensive whole genome analysis of a large breast cancer cohort was used to link known and novel drivers and substitution signatures to the transcriptome of 266 cases. Here, we validate that subtype-specific aberrations show concordant expression changes for, for example, TP53, PIK3CA, PTEN, CCND1 and CDH1. We find that CCND3 expression levels do not correlate with amplification, while increased GATA3 expression in mutant GATA3 cancers suggests GATA3 is an oncogene. In luminal cases the total number of substitutions, irrespective of type, associates with cell cycle gene expression and adverse outcome, whereas the number of mutations of signatures 3 and 13 associates with immune-response specific gene expression, increased numbers of tumour-infiltrating lymphocytes and better outcome. Thus, while earlier reports imply that the sheer number of somatic aberrations could trigger an immune-response, our data suggests that substitutions of a particular type are more effective in doing so than others. PMID:27666519

  8. Genomic and transcriptomic analyses of the tangerine pathotype of Alternaria alternata in response to oxidative stress

    PubMed Central

    Wang, Mingshuang; Sun, Xuepeng; Yu, Dongliang; Xu, Jianping; Chung, Kuangren; Li, Hongye

    2016-01-01

    The tangerine pathotype of Alternaria alternata produces the A. citri toxin (ACT) and is the causal agent of citrus brown spot that results in significant yield losses worldwide. Both the production of ACT and the ability to detoxify reactive oxygen species (ROS) are required for A. alternata pathogenicity in citrus. In this study, we report the 34.41 Mb genome sequence of strain Z7 of the tangerine pathotype of A. alternata. The host selective ACT gene cluster in strain Z7 was identified, which included 25 genes with 19 of them not reported previously. Of these, 10 genes were present only in the tangerine pathotype, representing the most likely candidate genes for this pathotype specialization. A transcriptome analysis of the global effects of H2O2 on gene expression revealed 1108 up-regulated and 498 down-regulated genes. Expressions of those genes encoding catalase, peroxiredoxin, thioredoxin and glutathione were highly induced. Genes encoding several protein families including kinases, transcription factors, transporters, cytochrome P450, ubiquitin and heat shock proteins were found associated with adaptation to oxidative stress. Our data not only revealed the molecular basis of ACT biosynthesis but also provided new insights into the potential pathways that the phytopathogen A. alternata copes with oxidative stress. PMID:27582273

  9. Linking amyotrophic lateral sclerosis and spinal muscular atrophy through RNA-transcriptome homeostasis: a genomics perspective.

    PubMed

    Gama-Carvalho, Margarida; L Garcia-Vaquero, Marina; R Pinto, Francisco; Besse, Florence; Weis, Joachim; Voigt, Aaron; Schulz, Jörg B; De Las Rivas, Javier

    2017-04-01

    In this review, we present our most recent understanding of key biomolecular processes that underlie two motor neuron degenerative disorders, amyotrophic lateral sclerosis, and spinal muscular atrophy. We focus on the role of four multifunctional proteins involved in RNA metabolism (TDP-43, FUS, SMN, and Senataxin) that play a causal role in these diseases. Recent results have led to a novel scenario of intricate connections between these four proteins, bringing transcriptome homeostasis into the spotlight as a common theme in motor neuron degeneration. We review reported functional and physical interactions between these four proteins, highlighting their common association with nuclear bodies and small nuclear ribonucleoprotein particle biogenesis and function. We discuss how these interactions are turning out to be particularly relevant for the control of transcription and chromatin homeostasis, including the recent identification of an association between SMN and Senataxin required to ensure the resolution of DNA-RNA hybrid formation and proper termination by RNA polymerase II. These connections strongly support the existence of common pathways underlying the spinal muscular atrophy and amyotrophic lateral sclerosis phenotype. We also discuss the potential of genome-wide expression profiling, in particular RNA sequencing derived data, to contribute to unravelling the underlying mechanisms. We provide a review of publicly available datasets that have addressed both diseases using these approaches, and highlight the value of investing in cross-disease studies to promote our understanding of the pathways leading to neurodegeneration.

  10. Transcriptome complexity in cardiac development and diseases--an expanding universe between genome and phenome.

    PubMed

    Gao, Chen; Wang, Yibin

    2014-01-01

    With the advancement of transcriptome profiling by micro-arrays and high-throughput RNA-sequencing, transcriptome complexity and its dynamics are revealed at different levels in cardiovascular development and diseases. In this review, we will highlight the recent progress in our knowledge of cardiovascular transcriptome complexity contributed by RNA splicing, RNA editing and noncoding RNAs. The emerging importance of many of these previously under-explored aspects of gene regulation in cardiovascular development and pathology will be discussed.

  11. Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq.

    PubMed

    Macaulay, Iain C; Teng, Mabel J; Haerty, Wilfried; Kumar, Parveen; Ponting, Chris P; Voet, Thierry

    2016-11-01

    Parallel sequencing of a single cell's genome and transcriptome provides a powerful tool for dissecting genetic variation and its relationship with gene expression. Here we present a detailed protocol for G&T-seq, a method for separation and parallel sequencing of genomic DNA and full-length polyA(+) mRNA from single cells. We provide step-by-step instructions for the isolation and lysis of single cells; the physical separation of polyA(+) mRNA from genomic DNA using a modified oligo-dT bead capture and the respective whole-transcriptome and whole-genome amplifications; and library preparation and sequence analyses of these amplification products. The method allows the detection of thousands of transcripts in parallel with the genetic variants captured by the DNA-seq data from the same single cell. G&T-seq differs from other currently available methods for parallel DNA and RNA sequencing from single cells, as it involves physical separation of the DNA and RNA and does not require bespoke microfluidics platforms. The process can be implemented manually or through automation. When performed manually, paired genome and transcriptome sequencing libraries from eight single cells can be produced in ∼3 d by researchers experienced in molecular laboratory work. For users with experience in the programming and operation of liquid-handling robots, paired DNA and RNA libraries from 96 single cells can be produced in the same time frame. Sequence analysis and integration of single-cell G&T-seq DNA and RNA data requires a high level of bioinformatics expertise and familiarity with a wide range of informatics tools.

  12. Simplified DGS procedure for large-scale genome structural study.

    PubMed

    Jung, Yong-Chul; Xu, Jia; Chen, Jun; Kim, Yeong; Winchester, David; Wang, San Ming

    2009-11-01

    Ditag genome scanning (DGS) uses next-generation DNA sequencing to sequence the ends of ditag fragments produced by restriction enzymes. These sequences are compared to known genome sequences to determine their structure. In order to use DGS for large-scale genome structural studies, we have substantially revised the original protocol by replacing the in vivo genomic DNA cloning with in vitro adaptor ligation, eliminating the ditag concatemerization steps, and replacing the 454 sequencer with Solexa or SOLiD sequencers for ditag sequence collection. This revised protocol further increases genome coverage and resolution and allows DGS to be used to analyze multiple genomes simultaneously.

  13. Molecular classification of diffuse cerebral WHO grade II/III gliomas using genome- and transcriptome-wide profiling improves stratification of prognostically distinct patient groups.

    PubMed

    Weller, Michael; Weber, Ruthild G; Willscher, Edith; Riehmer, Vera; Hentschel, Bettina; Kreuz, Markus; Felsberg, Jörg; Beyer, Ulrike; Löffler-Wirth, Henry; Kaulich, Kerstin; Steinbach, Joachim P; Hartmann, Christian; Gramatzki, Dorothee; Schramm, Johannes; Westphal, Manfred; Schackert, Gabriele; Simon, Matthias; Martens, Tobias; Boström, Jan; Hagel, Christian; Sabel, Michael; Krex, Dietmar; Tonn, Jörg C; Wick, Wolfgang; Noell, Susan; Schlegel, Uwe; Radlwimmer, Bernhard; Pietsch, Torsten; Loeffler, Markus; von Deimling, Andreas; Binder, Hans; Reifenberger, Guido

    2015-05-01

    Cerebral gliomas of World Health Organization (WHO) grade II and III represent a major challenge in terms of histological classification and clinical management. Here, we asked whether large-scale genomic and transcriptomic profiling improves the definition of prognostically distinct entities. We performed microarray-based genome- and transcriptome-wide analyses of primary tumor samples from a prospective German Glioma Network cohort of 137 patients with cerebral gliomas, including 61 WHO grade II and 76 WHO grade III tumors. Integrative bioinformatic analyses were employed to define molecular subgroups, which were then related to histology, molecular biomarkers, including isocitrate dehydrogenase 1 or 2 (IDH1/2) mutation, 1p/19q co-deletion and telomerase reverse transcriptase (TERT) promoter mutations, and patient outcome. Genomic profiling identified five distinct glioma groups, including three IDH1/2 mutant and two IDH1/2 wild-type groups. Expression profiling revealed evidence for eight transcriptionally different groups (five IDH1/2 mutant, three IDH1/2 wild type), which were only partially linked to the genomic groups. Correlation of DNA-based molecular stratification with clinical outcome allowed to define three major prognostic groups with characteristic genomic aberrations. The best prognosis was found in patients with IDH1/2 mutant and 1p/19q co-deleted tumors. Patients with IDH1/2 wild-type gliomas and glioblastoma-like genomic alterations, including gain on chromosome arm 7q (+7q), loss on chromosome arm 10q (-10q), TERT promoter mutation and oncogene amplification, displayed the worst outcome. Intermediate survival was seen in patients with IDH1/2 mutant, but 1p/19q intact, mostly astrocytic gliomas, and in patients with IDH1/2 wild-type gliomas lacking the +7q/-10q genotype and TERT promoter mutation. This molecular subgrouping stratified patients into prognostically distinct groups better than histological classification. Addition of gene expression

  14. A large set of 26 new reference transcriptomes dedicated to comparative population genomics in crops and wild relatives.

    PubMed

    Sarah, Gautier; Homa, Felix; Pointet, Stéphanie; Contreras, Sandy; Sabot, François; Nabholz, Benoit; Santoni, Sylvain; Sauné, Laure; Ardisson, Morgane; Chantret, Nathalie; Sauvage, Christopher; Tregear, James; Jourda, Cyril; Pot, David; Vigouroux, Yves; Chair, Hana; Scarcelli, Nora; Billot, Claire; Yahiaoui, Nabila; Bacilieri, Roberto; Khadari, Bouchaib; Boccara, Michel; Barnaud, Adéline; Péros, Jean-Pierre; Labouisse, Jean-Pierre; Pham, Jean-Louis; David, Jacques; Glémin, Sylvain; Ruiz, Manuel

    2016-08-04

    We produced a unique large data set of reference transcriptomes to obtain new knowledge about the evolution of plant genomes and crop domestication. For this purpose, we validated a RNA-Seq data assembly protocol to perform comparative population genomics. For the validation, we assessed and compared the quality of de novo Illumina short-read assemblies using data from two crops for which an annotated reference genome was available, namely grapevine and sorghum. We used the same protocol for the release of 26 new transcriptomes of crop plants and wild relatives, including still understudied crops such as yam, pearl millet and fonio. The species list has a wide taxonomic representation with the inclusion of 15 monocots and 11 eudicots. All contigs were annotated using BLAST, prot4EST and Blast2GO. A strong originality of the data set is that each crop is associated with close relative species, which will permit whole-genome comparative evolutionary studies between crops and their wild-related species. This large resource will thus serve research communities working on both crops and model organisms. All the data are available at http://arcad-bioinformatics.southgreen.fr/.

  15. Modeling cancer metabolism on a genome scale

    PubMed Central

    Yizhak, Keren; Chaneton, Barbara; Gottlieb, Eyal; Ruppin, Eytan

    2015-01-01

    Cancer cells have fundamentally altered cellular metabolism that is associated with their tumorigenicity and malignancy. In addition to the widely studied Warburg effect, several new key metabolic alterations in cancer have been established over the last decade, leading to the recognition that altered tumor metabolism is one of the hallmarks of cancer. Deciphering the full scope and functional implications of the dysregulated metabolism in cancer requires both the advancement of a variety of omics measurements and the advancement of computational approaches for the analysis and contextualization of the accumulated data. Encouragingly, while the metabolic network is highly interconnected and complex, it is at the same time probably the best characterized cellular network. Following, this review discusses the challenges that genome-scale modeling of cancer metabolism has been facing. We survey several recent studies demonstrating the first strides that have been done, testifying to the value of this approach in portraying a network-level view of the cancer metabolism and in identifying novel drug targets and biomarkers. Finally, we outline a few new steps that may further advance this field. PMID:26130389

  16. Modeling cancer metabolism on a genome scale.

    PubMed

    Yizhak, Keren; Chaneton, Barbara; Gottlieb, Eyal; Ruppin, Eytan

    2015-06-30

    Cancer cells have fundamentally altered cellular metabolism that is associated with their tumorigenicity and malignancy. In addition to the widely studied Warburg effect, several new key metabolic alterations in cancer have been established over the last decade, leading to the recognition that altered tumor metabolism is one of the hallmarks of cancer. Deciphering the full scope and functional implications of the dysregulated metabolism in cancer requires both the advancement of a variety of omics measurements and the advancement of computational approaches for the analysis and contextualization of the accumulated data. Encouragingly, while the metabolic network is highly interconnected and complex, it is at the same time probably the best characterized cellular network. Following, this review discusses the challenges that genome-scale modeling of cancer metabolism has been facing. We survey several recent studies demonstrating the first strides that have been done, testifying to the value of this approach in portraying a network-level view of the cancer metabolism and in identifying novel drug targets and biomarkers. Finally, we outline a few new steps that may further advance this field.

  17. Ensembl Genomes 2013: scaling up access to genome-wide data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provi...

  18. Guitar: An R/Bioconductor Package for Gene Annotation Guided Transcriptomic Analysis of RNA-Related Genomic Features.

    PubMed

    Cui, Xiaodong; Wei, Zhen; Zhang, Lin; Liu, Hui; Sun, Lei; Zhang, Shao-Wu; Huang, Yufei; Meng, Jia

    2016-01-01

    Biological features, such as genes and transcription factor binding sites, are often denoted with genome-based coordinates as the genomic features. While genome-based representation is usually very effective in correlating various biological features, it can be tedious to examine the relationship between RNA-related genomic features and the landmarks of RNA transcripts with existing tools due to the difficulty in the conversion between genome-based coordinates and RNA-based coordinates. We developed here an open source Guitar R/Bioconductor package for sketching the transcriptomic view of RNA-related biological features represented by genome based coordinates. Internally, Guitar package extracts the standardized RNA coordinates with respect to the landmarks of RNA transcripts, with which hundreds of millions of RNA-related genomic features can then be efficiently analyzed within minutes. We demonstrated the usage of Guitar package in analyzing posttranscriptional RNA modifications (5-methylcytosine and N6-methyladenosine) derived from high-throughput sequencing approaches (MeRIP-Seq and RNA BS-Seq) and show that RNA 5-methylcytosine (m(5)C) is enriched in 5'UTR. The newly developed Guitar R/Bioconductor package achieves stable performance on the data tested and revealed novel biological insights. It will effectively facilitate the analysis of RNA methylation data and other RNA-related biological features in the future.

  19. Oxidative Stress and Heat-Shock Responses in Desulfovibrio vulgaris by Genome-Wide Transcriptomic Analysis

    SciTech Connect

    Zhang, Weiwen; Culley, David E.; Hogan, Mike; Vitiritti, Luigi; Brockman, Fred J.

    2006-05-30

    Abstract Sulfate-reducing bacteria, like Desulfovibrio vulgaris have developed a set of reactions allowing them to survive in environments. To obtain further knowledge of the protecting mechanisms employed in D. vulgaris against the oxidative stress and heat shock, we performed a genome-wide transcriptomic analysis to determine the cellular responses to both stimuli. The results showed that 130 genes were responsive to oxidative stress, while 427 genes responsive to heat-shock, respectively. Functional analyses suggested that the genes regulated were involved in a variety of cellular functions. Metabolic analysis showed that amino acid biosynthetic pathways were induced by both oxidative stress and heat shock treatments, while fatty acid metabolism, purine and cofactor biosynthesis were induced by heat shock only. Rubrerythrin gene (rbR) were upregulated by the oxidative stress, suggesting its important role in the oxidative resistance, whereas the expression of rubredoxin oxidoreductase (rbO), superoxide ismutase (sodB) and catalase (katA) genes were not subjected to regulation by oxidative stress in D. vulgaris. In addition, the results showed that thioredoxin reductase (trxB) was responsive to oxidative stress, suggesting the thiol-specific redox system might be involved in oxidative protection in D. vulgaris. Comparison of cellular responses to oxidative stress and heat-shock allowed the identification of 66 genes that showed a similar drastic response to both environmental stimuli, implying that they might be part of the general stress response (GSR) network in D. vulgaris, which was further supported by the finding of a conserved motif upstream these common-responsive genes.

  20. A genomic and transcriptomic approach to investigate the blue pigment phenotype in Pseudomonas fluorescens.

    PubMed

    Andreani, Nadia Andrea; Carraro, Lisa; Martino, Maria Elena; Fondi, Marco; Fasolato, Luca; Miotto, Giovanni; Magro, Massimiliano; Vianello, Fabio; Cardazzo, Barbara

    2015-11-20

    Pseudomonas fluorescens is a well-known food spoiler, able to cause serious economic losses in the food industry due to its ability to produce many extracellular, and often thermostable, compounds. The most outstanding spoilage events involving P. fluorescens were blue discoloration of several food stuffs, mainly dairy products. The bacteria involved in such high-profile cases have been identified as belonging to a clearly distinct phylogenetic cluster of the P. fluorescens group. Although the blue pigment has recently been investigated in several studies, the biosynthetic pathway leading to the pigment formation, as well as its chemical nature, remain challenging and unsolved points. In the present paper, genomic and transcriptomic data of 4 P. fluorescens strains (2 blue-pigmenting strains and 2 non-pigmenting strains) were analyzed to evaluate the presence and the expression of blue strain-specific genes. In particular, the pangenome analysis showed the presence in the blue-pigmenting strains of two copies of genes involved in the tryptophan biosynthesis pathway (including trpABCDF). The global expression profiling of blue-pigmenting strains versus non-pigmenting strains showed a general up-regulation of genes involved in iron uptake and a down-regulation of genes involved in primary metabolism. Chromogenic reaction of the blue-pigmenting bacterial cells with Kovac's reagent indicated an indole-derivative as the precursor of the blue pigment. Finally, solubility tests and MALDI-TOF mass spectrometry analysis of the isolated pigment suggested that its molecular structure is very probably a hydrophobic indigo analog.

  1. Capturing the response of Clostridium acetobutylicum to chemical stressors using a regulated genome-scale metabolic model

    DOE PAGES

    Dash, Satyakam; Mueller, Thomas J.; Venkataramanan, Keerthi P.; ...

    2014-10-14

    Clostridia are anaerobic Gram-positive Firmicutes containing broad and flexible systems for substrate utilization, which have been used successfully to produce a range of industrial compounds. Clostridium acetobutylicum has been used to produce butanol on an industrial scale through acetone-butanol-ethanol (ABE) fermentation. A genome-scale metabolic (GSM) model is a powerful tool for understanding the metabolic capacities of an organism and developing metabolic engineering strategies for strain development. The integration of stress related specific transcriptomics information with the GSM model provides opportunities for elucidating the focal points of regulation.

  2. Genomic and transcriptomic hallmarks of poorly differentiated and anaplastic thyroid cancers

    PubMed Central

    Ibrahimpasic, Tihana; Boucai, Laura; Shah, Ronak H.; Dogan, Snjezana; Ricarte-Filho, Julio C.; Krishnamoorthy, Gnana P.; Schultz, Nikolaus; Berger, Michael F.; Sander, Chris; Taylor, Barry S.; Ghossein, Ronald; Ganly, Ian; Fagin, James A.

    2016-01-01

    BACKGROUND. Poorly differentiated thyroid cancer (PDTC) and anaplastic thyroid cancer (ATC) are rare and frequently lethal tumors that so far have not been subjected to comprehensive genetic characterization. METHODS. We performed next-generation sequencing of 341 cancer genes from 117 patient-derived PDTCs and ATCs and analyzed the transcriptome of a representative subset of 37 tumors. Results were analyzed in the context of The Cancer Genome Atlas study (TCGA study) of papillary thyroid cancers (PTC). RESULTS. Compared to PDTCs, ATCs had a greater mutation burden, including a higher frequency of mutations in TP53, TERT promoter, PI3K/AKT/mTOR pathway effectors, SWI/SNF subunits, and histone methyltransferases. BRAF and RAS were the predominant drivers and dictated distinct tropism for nodal versus distant metastases in PDTC. RAS and BRAF sharply distinguished between PDTCs defined by the Turin (PDTC-Turin) versus MSKCC (PDTC-MSK) criteria, respectively. Mutations of EIF1AX, a component of the translational preinitiation complex, were markedly enriched in PDTCs and ATCs and had a striking pattern of co-occurrence with RAS mutations. While TERT promoter mutations were rare and subclonal in PTCs, they were clonal and highly prevalent in advanced cancers. Application of the TCGA-derived BRAF-RAS score (a measure of MAPK transcriptional output) revealed a preserved relationship with BRAF/RAS mutation in PDTCs, whereas ATCs were BRAF-like irrespective of driver mutation. CONCLUSIONS. These data support a model of tumorigenesis whereby PDTCs and ATCs arise from well-differentiated tumors through the accumulation of key additional genetic abnormalities, many of which have prognostic and possible therapeutic relevance. The widespread genomic disruptions in ATC compared with PDTC underscore their greater virulence and higher mortality. FUNDING. This work was supported in part by NIH grants CA50706, CA72597, P50-CA72012, P30-CA008748, and 5T32-CA160001; the Lefkovsky Family

  3. Transcriptomic and proteomic analyses on the supercooling ability and mining of antifreeze proteins of the Chinese white wax scale insect.

    PubMed

    Yu, Shu-Hui; Yang, Pu; Sun, Tao; Qi, Qian; Wang, Xue-Qing; Chen, Xiao-Ming; Feng, Ying; Liu, Bo-Wen

    2016-06-01

    The Chinese white wax scale insect, Ericerus pela, can survive at extremely low temperatures, and some overwintering individuals exhibit supercooling at temperatures below -30°C. To investigate the deep supercooling ability of E. pela, transcriptomic and proteomic analyses were performed to delineate the major gene and protein families responsible for the deep supercooling ability of overwintering females. Gene Ontology (GO) classification and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis indicated that genes involved in the mitogen-activated protein kinase, calcium, and PI3K-Akt signaling pathways and pathways associated with the biosynthesis of soluble sugars, sugar alcohols and free amino acids were dominant. Proteins responsible for low-temperature stress, such as cold acclimation proteins, glycerol biosynthesis-related enzymes and heat shock proteins (HSPs) were identified. However, no antifreeze proteins (AFPs) were identified through sequence similarity search methods. A random forest approach identified 388 putative AFPs in the proteome. The AFP gene ep-afp was expressed in Escherichia coli, and the expressed protein exhibited a thermal hysteresis activity of 0.97°C, suggesting its potential role in the deep supercooling ability of E. pela.

  4. A Differential Genome-Wide Transcriptome Analysis: Impact of Cellular Copper on Complex Biological Processes like Aging and Development

    PubMed Central

    Servos, Jörg; Hamann, Andrea; Grimm, Carolin; Osiewacz, Heinz D.

    2012-01-01

    The regulation of cellular copper homeostasis is crucial in biology. Impairments lead to severe dysfunctions and are known to affect aging and development. Previously, a loss-of-function mutation in the gene encoding the copper-sensing and copper-regulated transcription factor GRISEA of the filamentous fungus Podospora anserina was reported to lead to cellular copper depletion and a pleiotropic phenotype with hypopigmentation of the mycelium and the ascospores, affected fertility and increased lifespan by approximately 60% when compared to the wild type. This phenotype is linked to a switch from a copper-dependent standard to an alternative respiration leading to both a reduced generation of reactive oxygen species (ROS) and of adenosine triphosphate (ATP). We performed a genome-wide comparative transcriptome analysis of a wild-type strain and the copper-depleted grisea mutant. We unambiguously assigned 9,700 sequences of the transcriptome in both strains to the more than 10,600 predicted and annotated open reading frames of the P. anserina genome indicating 90% coverage of the transcriptome. 4,752 of the transcripts differed significantly in abundance with 1,156 transcripts differing at least 3-fold. Selected genes were investigated by qRT-PCR analyses. Apart from this general characterization we analyzed the data with special emphasis on molecular pathways related to the grisea mutation taking advantage of the available complete genomic sequence of P. anserina. This analysis verified but also corrected conclusions from earlier data obtained by single gene analysis, identified new candidates of factors as part of the cellular copper homeostasis system including target genes of transcription factor GRISEA, and provides a rich reference source of quantitative data for further in detail investigations. Overall, the present study demonstrates the importance of systems biology approaches also in cases were mutations in single genes are analyzed to explain the

  5. Large-Scale Transcriptome Analysis of Two Sugarcane Genotypes Contrasting for Lignin Content

    PubMed Central

    Vicentini, Renato; Bottcher, Alexandra; Brito, Michael dos Santos; dos Santos, Adriana Brombini; Creste, Silvana; Landell, Marcos Guimarães de Andrade; Cesarino, Igor; Mazzafera, Paulo

    2015-01-01

    Sugarcane is an important crop worldwide for sugar and first generation ethanol production. Recently, the residue of sugarcane mills, named bagasse, has been considered a promising lignocellulosic biomass to produce the second-generation ethanol. Lignin is a major factor limiting the use of bagasse and other plant lignocellulosic materials to produce second-generation ethanol. Lignin biosynthesis pathway is a complex network and changes in the expression of genes of this pathway have in general led to diverse and undesirable impacts on plant structure and physiology. Despite its economic importance, sugarcane genome was still not sequenced. In this study a high-throughput transcriptome evaluation of two sugarcane genotypes contrasting for lignin content was carried out. We generated a set of 85,151 transcripts of sugarcane using RNA-seq and de novo assembling. More than 2,000 transcripts showed differential expression between the genotypes, including several genes involved in the lignin biosynthetic pathway. This information can give valuable knowledge on the lignin biosynthesis and its interactions with other metabolic pathways in the complex sugarcane genome. PMID:26241317

  6. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    SciTech Connect

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann; Richardson, Paul M.

    2015-09-23

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  7. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    DOE PAGES

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; ...

    2015-09-23

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. Themore » nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.« less

  8. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae).

    PubMed

    Hovde, Blake T; Deodato, Chloe R; Hunsperger, Heather M; Ryken, Scott A; Yost, Will; Jha, Ramesh K; Patterson, Johnathan; Monnat, Raymond J; Barlow, Steven B; Starkenburg, Shawn R; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼ 40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two "red" RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  9. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    PubMed Central

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes. PMID:26397803

  10. Genome and Transcriptome Analysis of the Fungal Pathogen Fusarium oxysporum f. sp. cubense Causing Banana Vascular Wilt Disease

    PubMed Central

    Zeng, Huicai; Fan, Dingding; Zhu, Yabin; Feng, Yue; Wang, Guofen; Peng, Chunfang; Jiang, Xuanting; Zhou, Dajie; Ni, Peixiang; Liang, Changcong; Liu, Lei; Wang, Jun; Mao, Chao

    2014-01-01

    Background The asexual fungus Fusarium oxysporum f. sp. cubense (Foc) causing vascular wilt disease is one of the most devastating pathogens of banana (Musa spp.). To understand the molecular underpinning of pathogenicity in Foc, the genomes and transcriptomes of two Foc isolates were sequenced. Methodology/Principal Findings Genome analysis revealed that the genome structures of race 1 and race 4 isolates were highly syntenic with those of F. oxysporum f. sp. lycopersici strain Fol4287. A large number of putative virulence associated genes were identified in both Foc genomes, including genes putatively involved in root attachment, cell degradation, detoxification of toxin, transport, secondary metabolites biosynthesis and signal transductions. Importantly, relative to the Foc race 1 isolate (Foc1), the Foc race 4 isolate (Foc4) has evolved with some expanded gene families of transporters and transcription factors for transport of toxins and nutrients that may facilitate its ability to adapt to host environments and contribute to pathogenicity to banana. Transcriptome analysis disclosed a significant difference in transcriptional responses between Foc1 and Foc4 at 48 h post inoculation to the banana ‘Brazil’ in comparison with the vegetative growth stage. Of particular note, more virulence-associated genes were up regulated in Foc4 than in Foc1. Several signaling pathways like the mitogen-activated protein kinase Fmk1 mediated invasion growth pathway, the FGA1-mediated G protein signaling pathway and a pathogenicity associated two-component system were activated in Foc4 rather than in Foc1. Together, these differences in gene content and transcription response between Foc1 and Foc4 might account for variation in their virulence during infection of the banana variety ‘Brazil’. Conclusions/Significance Foc genome sequences will facilitate us to identify pathogenicity mechanism involved in the banana vascular wilt disease development. These will thus advance

  11. Selecting Superior De Novo Transcriptome Assemblies: Lessons Learned by Leveraging the Best Plant Genome

    PubMed Central

    Honaas, Loren A.; Wafula, Eric K.; Wickett, Norman J.; Der, Joshua P.; Zhang, Yeting; Edger, Patrick P.; Altman, Naomi S.; Pires, J. Chris; Leebens-Mack, James H.; dePamphilis, Claude W.

    2016-01-01

    Whereas de novo assemblies of RNA-Seq data are being published for a growing number of species across the tree of life, there are currently no broadly accepted methods for evaluating such assemblies. Here we present a detailed comparison of 99 transcriptome assemblies, generated with 6 de novo assemblers including CLC, Trinity, SOAP, Oases, ABySS and NextGENe. Controlled analyses of de novo assemblies for Arabidopsis thaliana and Oryza sativa transcriptomes provide new insights into the strengths and limitations of transcriptome assembly strategies. We find that the leading assemblers generate reassuringly accurate assemblies for the majority of transcripts. At the same time, we find a propensity for assemblers to fail to fully assemble highly expressed genes. Surprisingly, the instance of true chimeric assemblies is very low for all assemblers. Normalized libraries are reduced in highly abundant transcripts, but they also lack 1000s of low abundance transcripts. We conclude that the quality of de novo transcriptome assemblies is best assessed through consideration of a combination of metrics: 1) proportion of reads mapping to an assembly 2) recovery of conserved, widely expressed genes, 3) N50 length statistics, and 4) the total number of unigenes. We provide benchmark Illumina transcriptome data and introduce SCERNA, a broadly applicable modular protocol for de novo assembly improvement. Finally, our de novo assembly of the Arabidopsis leaf transcriptome revealed ~20 putative Arabidopsis genes lacking in the current annotation. PMID:26731733

  12. Selecting Superior De Novo Transcriptome Assemblies: Lessons Learned by Leveraging the Best Plant Genome.

    PubMed

    Honaas, Loren A; Wafula, Eric K; Wickett, Norman J; Der, Joshua P; Zhang, Yeting; Edger, Patrick P; Altman, Naomi S; Pires, J Chris; Leebens-Mack, James H; dePamphilis, Claude W

    2016-01-01

    Whereas de novo assemblies of RNA-Seq data are being published for a growing number of species across the tree of life, there are currently no broadly accepted methods for evaluating such assemblies. Here we present a detailed comparison of 99 transcriptome assemblies, generated with 6 de novo assemblers including CLC, Trinity, SOAP, Oases, ABySS and NextGENe. Controlled analyses of de novo assemblies for Arabidopsis thaliana and Oryza sativa transcriptomes provide new insights into the strengths and limitations of transcriptome assembly strategies. We find that the leading assemblers generate reassuringly accurate assemblies for the majority of transcripts. At the same time, we find a propensity for assemblers to fail to fully assemble highly expressed genes. Surprisingly, the instance of true chimeric assemblies is very low for all assemblers. Normalized libraries are reduced in highly abundant transcripts, but they also lack 1000s of low abundance transcripts. We conclude that the quality of de novo transcriptome assemblies is best assessed through consideration of a combination of metrics: 1) proportion of reads mapping to an assembly 2) recovery of conserved, widely expressed genes, 3) N50 length statistics, and 4) the total number of unigenes. We provide benchmark Illumina transcriptome data and introduce SCERNA, a broadly applicable modular protocol for de novo assembly improvement. Finally, our de novo assembly of the Arabidopsis leaf transcriptome revealed ~20 putative Arabidopsis genes lacking in the current annotation.

  13. Local Adaptation at the Transcriptome Level in Brown Trout: Evidence from Early Life History Temperature Genomic Reaction Norms

    PubMed Central

    Meier, Kristian; Hansen, Michael Møller; Normandeau, Eric; Mensberg, Karen-Lise D.; Frydenberg, Jane; Larsen, Peter Foged; Bekkevold, Dorte; Bernatchez, Louis

    2014-01-01

    Local adaptation and its underlying molecular basis has long been a key focus in evolutionary biology. There has recently been increased interest in the evolutionary role of plasticity and the molecular mechanisms underlying local adaptation. Using transcriptome analysis, we assessed differences in gene expression profiles for three brown trout (Salmo trutta) populations, one resident and two anadromous, experiencing different temperature regimes in the wild. The study was based on an F2 generation raised in a common garden setting. A previous study of the F1 generation revealed different reaction norms and significantly higher QST than FST among populations for two early life-history traits. In the present study we investigated if genomic reaction norm patterns were also present at the transcriptome level. Eggs from the three populations were incubated at two temperatures (5 and 8 degrees C) representing conditions encountered in the local environments. Global gene expression for fry at the stage of first feeding was analysed using a 32k cDNA microarray. The results revealed differences in gene expression between populations and temperatures and population × temperature interactions, the latter indicating locally adapted reaction norms. Moreover, the reaction norms paralleled those observed previously at early life-history traits. We identified 90 cDNA clones among the genes with an interaction effect that were differently expressed between the ecologically divergent populations. These included genes involved in immune- and stress response. We observed less plasticity in the resident as compared to the anadromous populations, possibly reflecting that the degree of environmental heterogeneity encountered by individuals throughout their life cycle will select for variable level of phenotypic plasticity at the transcriptome level. Our study demonstrates the usefulness of transcriptome approaches to identify genes with different temperature reaction norms. The

  14. Identification of large-scale genomic variation in cancer genomes using in silico reference models

    PubMed Central

    Killcoyne, Sarah; del Sol, Antonio

    2016-01-01

    Identifying large-scale structural variation in cancer genomes continues to be a challenge to researchers. Current methods rely on genome alignments based on a reference that can be a poor fit to highly variant and complex tumor genomes. To address this challenge we developed a method that uses available breakpoint information to generate models of structural variations. We use these models as references to align previously unmapped and discordant reads from a genome. By using these models to align unmapped reads, we show that our method can help to identify large-scale variations that have been previously missed. PMID:26264669

  15. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1.

    PubMed

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-02-09

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and approximately more than one billion people eat Monascus-fermented products during their daily life. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6-40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain.

  16. LEMONS – A Tool for the Identification of Splice Junctions in Transcriptomes of Organisms Lacking Reference Genomes

    PubMed Central

    Bouskila, Amos; Chorev, Michal; Carmel, Liran; Mishmar, Dan

    2015-01-01

    RNA-seq is becoming a preferred tool for genomics studies of model and non-model organisms. However, DNA-based analysis of organisms lacking sequenced genomes cannot rely on RNA-seq data alone to isolate most genes of interest, as DNA codes both exons and introns. With this in mind, we designed a novel tool, LEMONS, that exploits the evolutionary conservation of both exon/intron boundary positions and splice junction recognition signals to produce high throughput splice-junction predictions in the absence of a reference genome. When tested on multiple annotated vertebrate mRNA data, LEMONS accurately identified 87% (average) of the splice-junctions. LEMONS was then applied to our updated Mediterranean chameleon transcriptome, which lacks a reference genome, and predicted a total of 90,820 exon-exon junctions. We experimentally verified these splice-junction predictions by amplifying and sequencing twenty randomly selected genes from chameleon DNA templates. Exons and introns were detected in 19 of 20 of the positions predicted by LEMONS. To the best of our knowledge, LEMONS is currently the only experimentally verified tool that can accurately predict splice-junctions in organisms that lack a reference genome. PMID:26606265

  17. Impact of a short-term exposure to spaceflight on the phenotype, genome, transcriptome and proteome of Escherichia coli

    NASA Astrophysics Data System (ADS)

    Li, Tianzhi; Chang, De; Xu, Huiwen; Chen, Jiapeng; Su, Longxiang; Guo, Yinghua; Chen, Zhenhong; Wang, Yajuan; Wang, Li; Wang, Junfeng; Fang, Xiangqun; Liu, Changting

    2015-07-01

    Escherichia coli (E. coli) is the most widely applied model organism in current biological science. As a widespread opportunistic pathogen, E. coli can survive not only by symbiosis with human, but also outside the host as well, which necessitates the evaluation of its response to the space environment. Therefore, to keep humans safe in space, it is necessary to understand how the bacteria respond to this environment. Despite extensive investigations for a few decades, the response of E. coli to the real space environment is still controversial. To better understand the mechanisms how E. coli overcomes harsh environments such as microgravity in space and to investigate whether these factors may induce pathogenic changes in E. coli that are potentially detrimental to astronauts, we conducted detailed genomics, transcriptomic and proteomic studies on E. coli that experienced 17 days of spaceflight. By comparing two flight strains LCT-EC52 and LCT-EC59 to a control strain LCT-EC106 that was cultured under the same temperature conditions on the ground, we identified metabolism changes, polymorphism changes, differentially expressed genes and proteins in the two flight strains. The flight strains differed from the control in the utilization of more than 30 carbon sources. Two single nucleotide polymorphisms (SNPs) and one deletion were identified in the flight strains. The expression level of more than 1000 genes altered in flight strains. Genes involved in chemotaxis, lipid metabolism and cell motility express differently. Moreover, the two flight strains also differed extensively from each other in terms of metabolism, transcriptome and proteome, indicating the impact of space environment on individual cells is heterogeneous and probably genotype-dependent. This study presents the first systematic profile of E. coli genome, transcriptome and proteome after spaceflight, which helps to elucidate the mechanism that controls the adaptation of microbes to the space

  18. Comparative transcriptome assembly and genome-guided profiling for Brettanomyces bruxellensis LAMAP2480 during p-coumaric acid stress

    PubMed Central

    Godoy, Liliana; Vera-Wolf, Patricia; Martinez, Claudio; Ugalde, Juan A.; Ganga, María Angélica

    2016-01-01

    Brettanomyces bruxellensis has been described as the main contaminant yeast in wine production, due to its ability to convert the hydroxycinnamic acids naturally present in the grape phenolic derivatives, into volatile phenols. Currently, there are no studies in B. bruxellensis which explains the resistance mechanisms to hydroxycinnamic acids, and in particular to p-coumaric acid which is directly involved in alterations to wine. In this work, we performed a transcriptome analysis of B. bruxellensis LAMAP248rown in the presence and absence of p-coumaric acid during lag phase. Because of reported genetic variability among B. bruxellensis strains, to complement de novo assembly of the transcripts, we used the high-quality genome of B. bruxellensis AWRI1499, as well as the draft genomes of strains CBS2499 and0 g LAMAP2480. The results from the transcriptome analysis allowed us to propose a model in which the entrance of p-coumaric acid to the cell generates a generalized stress condition, in which the expression of proton pump and efflux of toxic compounds are induced. In addition, these mechanisms could be involved in the outflux of nitrogen compounds, such as amino acids, decreasing the overall concentration and triggering the expression of nitrogen metabolism genes. PMID:27678167

  19. Comparative transcriptome assembly and genome-guided profiling for Brettanomyces bruxellensis LAMAP2480 during p-coumaric acid stress.

    PubMed

    Godoy, Liliana; Vera-Wolf, Patricia; Martinez, Claudio; Ugalde, Juan A; Ganga, María Angélica

    2016-09-28

    Brettanomyces bruxellensis has been described as the main contaminant yeast in wine production, due to its ability to convert the hydroxycinnamic acids naturally present in the grape phenolic derivatives, into volatile phenols. Currently, there are no studies in B. bruxellensis which explains the resistance mechanisms to hydroxycinnamic acids, and in particular to p-coumaric acid which is directly involved in alterations to wine. In this work, we performed a transcriptome analysis of B. bruxellensis LAMAP248rown in the presence and absence of p-coumaric acid during lag phase. Because of reported genetic variability among B. bruxellensis strains, to complement de novo assembly of the transcripts, we used the high-quality genome of B. bruxellensis AWRI1499, as well as the draft genomes of strains CBS2499 and0 g LAMAP2480. The results from the transcriptome analysis allowed us to propose a model in which the entrance of p-coumaric acid to the cell generates a generalized stress condition, in which the expression of proton pump and efflux of toxic compounds are induced. In addition, these mechanisms could be involved in the outflux of nitrogen compounds, such as amino acids, decreasing the overall concentration and triggering the expression of nitrogen metabolism genes.

  20. Scaling, crumpled wires, and genome packing in virions

    NASA Astrophysics Data System (ADS)

    de Holanda, V. H.; Gomes, M. A. F.

    2016-12-01

    The packing of a genome in virions is a topic of intense current interest in biology and biological physics. The area is dominated by allometric scaling relations that connect, e.g., the length of the encapsulated genome and the size of the corresponding virion capsid. Here we report scaling laws obtained from extensive experiments of packing of a macroscopic wire within rigid three-dimensional spherical and nonspherical cavities that can shed light on the details of the genome packing in virions. We show that these results obtained with crumpled wires are comparable to those from a large compilation of biological data from several classes of virions.

  1. In the fast lane: large-scale bacterial genome engineering.

    PubMed

    Fehér, Tamás; Burland, Valerie; Pósfai, György

    2012-07-31

    The last few years have witnessed rapid progress in bacterial genome engineering. The long-established, standard ways of DNA synthesis, modification, transfer into living cells, and incorporation into genomes have given way to more effective, large-scale, robust genome modification protocols. Expansion of these engineering capabilities is due to several factors. Key advances include: (i) progress in oligonucleotide synthesis and in vitro and in vivo assembly methods, (ii) optimization of recombineering techniques, (iii) introduction of parallel, large-scale, combinatorial, and automated genome modification procedures, and (iv) rapid identification of the modifications by barcode-based analysis and sequencing. Combination of the brute force of these techniques with sophisticated bioinformatic design and modeling opens up new avenues for the analysis of gene functions and cellular network interactions, but also in engineering more effective producer strains. This review presents a summary of recent technological advances in bacterial genome engineering.

  2. Genome-scale resources for Thermoanaerobacterium saccharolyticum

    DOE PAGES

    Currie, Devin H.; Raman, Babu; Gowen, Christopher M.; ...

    2015-06-26

    Thermoanaerobacterium saccharolyticum is a hemicellulose-degrading thermophilic anaerobe that was previously engineered to produce ethanol at high yield. For this research, a major project was undertaken to develop this organism into an industrial biocatalyst, but the lack of genome information and resources were recognized early on as a key limitation.

  3. The OME Framework for genome-scale systems biology

    SciTech Connect

    Palsson, Bernhard O.; Ebrahim, Ali; Federowicz, Steve

    2014-12-19

    The life sciences are undergoing continuous and accelerating integration with computational and engineering sciences. The biology that many in the field have been trained on may be hardly recognizable in ten to twenty years. One of the major drivers for this transformation is the blistering pace of advancements in DNA sequencing and synthesis. These advances have resulted in unprecedented amounts of new data, information, and knowledge. Many software tools have been developed to deal with aspects of this transformation and each is sorely needed [1-3]. However, few of these tools have been forced to deal with the full complexity of genome-scale models along with high throughput genome- scale data. This particular situation represents a unique challenge, as it is simultaneously necessary to deal with the vast breadth of genome-scale models and the dizzying depth of high-throughput datasets. It has been observed time and again that as the pace of data generation continues to accelerate, the pace of analysis significantly lags behind [4]. It is also evident that, given the plethora of databases and software efforts [5-12], it is still a significant challenge to work with genome-scale metabolic models, let alone next-generation whole cell models [13-15]. We work at the forefront of model creation and systems scale data generation [16-18]. The OME Framework was borne out of a practical need to enable genome-scale modeling and data analysis under a unified framework to drive the next generation of genome-scale biological models. Here we present the OME Framework. It exists as a set of Python classes. However, we want to emphasize the importance of the underlying design as an addition to the discussions on specifications of a digital cell. A great deal of work and valuable progress has been made by a number of communities [13, 19-24] towards interchange formats and implementations designed to achieve similar goals. While many software tools exist for handling genome-scale

  4. Traumatic Brain Injury Induces Genome-Wide Transcriptomic, Methylomic, and Network Perturbations in Brain and Blood Predicting Neurological Disorders.

    PubMed

    Meng, Qingying; Zhuang, Yumei; Ying, Zhe; Agrawal, Rahul; Yang, Xia; Gomez-Pinilla, Fernando

    2017-02-01

    The complexity of the traumatic brain injury (TBI) pathology, particularly concussive injury, is a serious obstacle for diagnosis, treatment, and long-term prognosis. Here we utilize modern systems biology in a rodent model of concussive injury to gain a thorough view of the impact of TBI on fundamental aspects of gene regulation, which have the potential to drive or alter the course of the TBI pathology. TBI perturbed epigenomic programming, transcriptional activities (expression level and alternative splicing), and the organization of genes in networks centered around genes such as Anax2, Ogn, and Fmod. Transcriptomic signatures in the hippocampus are involved in neuronal signaling, metabolism, inflammation, and blood function, and they overlap with those in leukocytes from peripheral blood. The homology between genomic signatures from blood and brain elicited by TBI provides proof of concept information for development of biomarkers of TBI based on composite genomic patterns. By intersecting with human genome-wide association studies, many TBI signature genes and network regulators identified in our rodent model were causally associated with brain disorders with relevant link to TBI. The overall results show that concussive brain injury reprograms genes which could lead to predisposition to neurological and psychiatric disorders, and that genomic information from peripheral leukocytes has the potential to predict TBI pathogenesis in the brain.

  5. Assessment of the Diversity of Dairy Lactococcus lactis subsp. lactis Isolates by an Integrated Approach Combining Phenotypic, Genomic, and Transcriptomic Analyses ▿ †

    PubMed Central

    Tan-a-ram, Punthip; Cardoso, Tamara; Daveran-Mingot, Marie-Line; Kanchanatawee, Sunthorn; Loubière, Pascal; Girbal, Laurence; Cocaign-Bousquet, Muriel

    2011-01-01

    The intrasubspecies diversity of six strains of Lactococcus lactis subsp. lactis was investigated at the genomic level and in terms of phenotypic and transcriptomic profiles in an ultrafiltration cheese model. The six strains were isolated from various sources, but all exhibited a dairy phenotype (growth in ultrafiltration cheese model and high acidification rate). The six strains exhibited similar behaviors in terms of growth during cheese ripening, while different acidification capabilities were detected. Even if all strains displayed large genomic similarities, sharing a large core genome of almost 2,000 genes, the expression of this core genome directly in the cheese matrix revealed major strain-specific differences that potentially could account for the observed different acidification capabilities. This work demonstrated that significant transcriptomic polymorphisms exist even among Lactococcus lactis subsp. lactis strains with the same dairy origin. PMID:21131529

  6. Characterization of the genome and transcriptome of the blue tit Cyanistes caeruleus: polymorphisms, sex-biased expression and selection signals.

    PubMed

    Mueller, Jakob C; Kuhl, Heiner; Timmermann, Bernd; Kempenaers, Bart

    2016-03-01

    Decoding genomic sequences and determining their variation within populations has potential to reveal adaptive processes and unravel the genetic basis of ecologically relevant trait variation within a species. The blue tit Cyanistes caeruleus--a long-time ecological model species--has been used to investigate fitness consequences of variation in mating and reproductive behaviour. However, very little is known about the underlying genetic changes due to natural and sexual selection in the genome of this songbird. As a step to bridge this gap, we assembled the first draft genome of a single blue tit, mapped the transcriptome of five females and five males to this reference, identified genomewide variants and performed sex-differential expression analysis in the gonads, brain and other tissues. In the gonads, we found a high number of sex-biased genes, and of those, a similar proportion were sex-limited (genes only expressed in one sex) in males and females. However, in the brain, the proportion of female-limited genes within the female-biased gene category (82%) was substantially higher than the proportion of male-limited genes within the male-biased category (6%). This suggests a predominant on-off switching mechanism for the female-limited genes. In addition, most male-biased genes were located on the Z-chromosome, indicating incomplete dosage compensation for the male-biased genes. We called more than 500,000 SNPs from the RNA-seq data. Heterozygote detection in the single reference individual was highly congruent between DNA-seq and RNA-seq calling. Using information from these polymorphisms, we identified potential selection signals in the genome. We list candidate genes which can be used for further sequencing and detailed selection studies, including genes potentially related to meiotic drive evolution. A public genome browser of the blue tit with the described information is available at http://public-genomes-ngs.molgen.mpg.de.

  7. Genomic and Transcriptomic Associations Identify a New Insecticide Resistance Phenotype for the Selective Sweep at the Cyp6g1 Locus of Drosophila melanogaster

    PubMed Central

    Battlay, Paul; Schmidt, Joshua M.; Fournier-Level, Alexandre; Robin, Charles

    2016-01-01

    Scans of the Drosophila melanogaster genome have identified organophosphate resistance loci among those with the most pronounced signature of positive selection. In this study, the molecular basis of resistance to the organophosphate insecticide azinphos-methyl was investigated using the Drosophila Genetic Reference Panel, and genome-wide association. Recently released full transcriptome data were used to extend the utility of the Drosophila Genetic Reference Panel resource beyond traditional genome-wide association studies to allow systems genetics analyses of phenotypes. We found that both genomic and transcriptomic associations independently identified Cyp6g1, a gene involved in resistance to DDT and neonicotinoid insecticides, as the top candidate for azinphos-methyl resistance. This was verified by transgenically overexpressing Cyp6g1 using natural regulatory elements from a resistant allele, resulting in a 6.5-fold increase in resistance. We also identified four novel candidate genes associated with azinphos-methyl resistance, all of which are involved in either regulation of fat storage, or nervous system development. In Cyp6g1, we find a demonstrable resistance locus, a verification that transcriptome data can be used to identify variants associated with insecticide resistance, and an overlap between peaks of a genome-wide association study, and a genome-wide selective sweep analysis. PMID:27317781

  8. Discovery of Genes Related to Insecticide Resistance in Bactrocera dorsalis by Functional Genomic Analysis of a De Novo Assembled Transcriptome

    PubMed Central

    Hsu, Ju-Chun; Wu, Wen-Jer; Feng, Hai-Tung; Haymer, David S.; Chen, Chien-Yu

    2012-01-01

    Insecticide resistance has recently become a critical concern for control of many insect pest species. Genome sequencing and global quantization of gene expression through analysis of the transcriptome can provide useful information relevant to this challenging problem. The oriental fruit fly, Bactrocera dorsalis, is one of the world's most destructive agricultural pests, and recently it has been used as a target for studies of genetic mechanisms related to insecticide resistance. However, prior to this study, the molecular data available for this species was largely limited to genes identified through homology. To provide a broader pool of gene sequences of potential interest with regard to insecticide resistance, this study uses whole transcriptome analysis developed through de novo assembly of short reads generated by next-generation sequencing (NGS). The transcriptome of B. dorsalis was initially constructed using Illumina's Solexa sequencing technology. Qualified reads were assembled into contigs and potential splicing variants (isotigs). A total of 29,067 isotigs have putative homologues in the non-redundant (nr) protein database from NCBI, and 11,073 of these correspond to distinct D. melanogaster proteins in the RefSeq database. Approximately 5,546 isotigs contain coding sequences that are at least 80% complete and appear to represent B. dorsalis genes. We observed a strong correlation between the completeness of the assembled sequences and the expression intensity of the transcripts. The assembled sequences were also used to identify large numbers of genes potentially belonging to families related to insecticide resistance. A total of 90 P450-, 42 GST-and 37 COE-related genes, representing three major enzyme families involved in insecticide metabolism and resistance, were identified. In addition, 36 isotigs were discovered to contain target site sequences related to four classes of resistance genes. Identified sequence motifs were also analyzed to

  9. Reconstructing genome-scale metabolic models with merlin

    PubMed Central

    Dias, Oscar; Rocha, Miguel; Ferreira, Eugénio C.; Rocha, Isabel

    2015-01-01

    The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is a user-friendly Java application that aids the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced. It performs the major steps of the reconstruction process, including the functional genomic annotation of the whole genome and subsequent construction of the portfolio of reactions. Moreover, merlin includes tools for the identification and annotation of genes encoding transport proteins, generating the transport reactions for those carriers. It also performs the compartmentalisation of the model, predicting the organelle localisation of the proteins encoded in the genome and thus the localisation of the metabolites involved in the reactions promoted by such enzymes. The gene-proteins-reactions (GPR) associations are automatically generated and included in the model. Finally, merlin expedites the transition from genomic data to draft metabolic models reconstructions exported in the SBML standard format, allowing the user to have a preliminary view of the biochemical network, which can be manually curated within the environment provided by merlin. PMID:25845595

  10. Reconstructing genome-scale metabolic models with merlin.

    PubMed

    Dias, Oscar; Rocha, Miguel; Ferreira, Eugénio C; Rocha, Isabel

    2015-04-30

    The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is a user-friendly Java application that aids the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced. It performs the major steps of the reconstruction process, including the functional genomic annotation of the whole genome and subsequent construction of the portfolio of reactions. Moreover, merlin includes tools for the identification and annotation of genes encoding transport proteins, generating the transport reactions for those carriers. It also performs the compartmentalisation of the model, predicting the organelle localisation of the proteins encoded in the genome and thus the localisation of the metabolites involved in the reactions promoted by such enzymes. The gene-proteins-reactions (GPR) associations are automatically generated and included in the model. Finally, merlin expedites the transition from genomic data to draft metabolic models reconstructions exported in the SBML standard format, allowing the user to have a preliminary view of the biochemical network, which can be manually curated within the environment provided by merlin.

  11. Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.

    PubMed

    Armero, Alix; Baudouin, Luc; Bocs, Stéphanie; This, Dominique

    2017-01-01

    The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).

  12. Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut

    PubMed Central

    Armero, Alix; Bocs, Stéphanie; This, Dominique

    2017-01-01

    The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/). PMID:28334050

  13. Complete Genome Sequence and Transcriptomic Analysis of the Novel Pathogen Elizabethkingia anophelis in Response to Oxidative Stress.

    PubMed

    Li, Yingying; Liu, Yang; Chew, Su Chuen; Tay, Martin; Salido, May Margarette Santillan; Teo, Jeanette; Lauro, Federico M; Givskov, Michael; Yang, Liang

    2015-05-26

    Elizabethkingia anophelis is an emerging pathogen that can cause life-threatening infections in neonates, severely immunocompromised and postoperative patients. The lack of genomic information on E. anophelis hinders our understanding of its mechanisms of pathogenesis. Here, we report the first complete genome sequence of E. anophelis NUHP1 and assess its response to oxidative stress. Elizabethkingia anophelis NUHP1 has a circular genome of 4,369,828 base pairs and 4,141 predicted coding sequences. Sequence analysis indicates that E. anophelis has well-developed systems for scavenging iron and stress response. Many putative virulence factors and antibiotic resistance genes were identified, underscoring potential host-pathogen interactions and antibiotic resistance. RNA-sequencing-based transcriptome profiling indicates that expressions of genes involved in synthesis of an yersiniabactin-like iron siderophore and heme utilization are highly induced as a protective mechanism toward oxidative stress caused by hydrogen peroxide treatment. Chrome azurol sulfonate assay verified that siderophore production of E. anophelis is increased in the presence of oxidative stress. We further showed that hemoglobin facilitates the growth, hydrogen peroxide tolerance, cell attachment, and biofilm formation of E. anophelis NUHP1. Our study suggests that siderophore production and heme uptake pathways might play essential roles in stress response and virulence of the emerging pathogen E. anophelis.

  14. Complete Genome Sequence and Transcriptomic Analysis of the Novel Pathogen Elizabethkingia anophelis in Response to Oxidative Stress

    PubMed Central

    Li, Yingying; Liu, Yang; Chew, Su Chuen; Tay, Martin; Salido, May Margarette Santillan; Teo, Jeanette; Lauro, Federico M.; Givskov, Michael; Yang, Liang

    2015-01-01

    Elizabethkingia anophelis is an emerging pathogen that can cause life-threatening infections in neonates, severely immunocompromised and postoperative patients. The lack of genomic information on E. anophelis hinders our understanding of its mechanisms of pathogenesis. Here, we report the first complete genome sequence of E. anophelis NUHP1 and assess its response to oxidative stress. Elizabethkingia anophelis NUHP1 has a circular genome of 4,369,828 base pairs and 4,141 predicted coding sequences. Sequence analysis indicates that E. anophelis has well-developed systems for scavenging iron and stress response. Many putative virulence factors and antibiotic resistance genes were identified, underscoring potential host–pathogen interactions and antibiotic resistance. RNA-sequencing-based transcriptome profiling indicates that expressions of genes involved in synthesis of an yersiniabactin-like iron siderophore and heme utilization are highly induced as a protective mechanism toward oxidative stress caused by hydrogen peroxide treatment. Chrome azurol sulfonate assay verified that siderophore production of E. anophelis is increased in the presence of oxidative stress. We further showed that hemoglobin facilitates the growth, hydrogen peroxide tolerance, cell attachment, and biofilm formation of E. anophelis NUHP1. Our study suggests that siderophore production and heme uptake pathways might play essential roles in stress response and virulence of the emerging pathogen E. anophelis. PMID:26019164

  15. Characterization of the mechanism of prolonged adaptation to osmotic stress of Jeotgalibacillus malaysiensis via genome and transcriptome sequencing analyses

    PubMed Central

    Yaakop, Amira Suriaty; Chan, Kok-Gan; Ee, Robson; Lim, Yan Lue; Lee, Siew-Kim; Manan, Fazilah Abd; Goh, Kian Mau

    2016-01-01

    Jeotgalibacillus malaysiensis, a moderate halophilic bacterium isolated from a pelagic area, can endure higher concentrations of sodium chloride (NaCl) than other Jeotgalibacillus type strains. In this study, we therefore chose to sequence and assemble the entire J. malaysiensis genome. This is the first report to provide a detailed analysis of the genomic features of J. malaysiensis, and to perform genetic comparisons between this microorganism and other halophiles. J. malaysiensis encodes a native megaplasmid (pJeoMA), which is greater than 600 kilobases in size, that is absent from other sequenced species of Jeotgalibacillus. Subsequently, RNA-Seq-based transcriptome analysis was utilised to examine adaptations of J. malaysiensis to osmotic stress. Specifically, the eggNOG (evolutionary genealogy of genes: Non-supervised Orthologous Groups) and KEGG (Kyoto Encyclopaedia of Genes and Genomes) databases were used to elucidate the overall effects of osmotic stress on the organism. Generally, saline stress significantly affected carbohydrate, energy, and amino acid metabolism, as well as fatty acid biosynthesis. Our findings also indicate that J. malaysiensis adopted a combination of approaches, including the uptake or synthesis of osmoprotectants, for surviving salt stress. Among these, proline synthesis appeared to be the preferred method for withstanding prolonged osmotic stress in J. malaysiensis. PMID:27641516

  16. Genome-Wide Host-Pathogen Interaction Unveiled by Transcriptomic Response of Diamondback Moth to Fungal Infection

    PubMed Central

    Chu, Zhen-Jian; Wang, Yu-Jun; Ying, Sheng-Hua; Wang, Xiao-Wei; Feng, Ming-Guang

    2016-01-01

    Genome-wide insight into insect pest response to the infection of Beauveria bassiana (fungal insect pathogen) is critical for genetic improvement of fungal insecticides but has been poorly explored. We constructed three pairs of transcriptomes of Plutella xylostella larvae at 24, 36 and 48 hours post treatment of infection (hptI) and of control (hptC) for insight into the host-pathogen interaction at genomic level. There were 2143, 3200 and 2967 host genes differentially expressed at 24, 36 and 48 hptI/hptC respectively. These infection-responsive genes (~15% of the host genome) were enriched in various immune processes, such as complement and coagulation cascades, protein digestion and absorption, and drug metabolism-cytochrome P450. Fungal penetration into cuticle and host defense reaction began at 24 hptI, followed by most intensive host immune response at 36 hptI and attenuated immunity at 48 hptI. Contrastingly, 44% of fungal genes were differentially expressed in the infection course and enriched in several biological processes, such as antioxidant activity, peroxidase activity and proteolysis. There were 1636 fungal genes co-expressed during 24–48 hptI, including 116 encoding putative secretion proteins. Our results provide novel insights into the insect-pathogen interaction and help to probe molecular mechanisms involved in the fungal infection to the global pest. PMID:27043942

  17. The Draft Genome and Transcriptome of Panagrellus redivivus Are Shaped by the Harsh Demands of a Free-Living Lifestyle

    PubMed Central

    Srinivasan, Jagan; Dillman, Adler R.; Macchietto, Marissa G.; Heikkinen, Liisa; Lakso, Merja; Fracchia, Kelley M.; Antoshechkin, Igor; Mortazavi, Ali; Wong, Garry; Sternberg, Paul W.

    2013-01-01

    Nematodes compose an abundant and diverse invertebrate phylum with members inhabiting nearly every ecological niche. Panagrellus redivivus (the “microworm”) is a free-living nematode frequently used to understand the evolution of developmental and behavioral processes given its phylogenetic distance to Caenorhabditis elegans. Here we report the de novo sequencing of the genome, transcriptome, and small RNAs of P. redivivus. Using a combination of automated gene finders and RNA-seq data, we predict 24,249 genes and 32,676 transcripts. Small RNA analysis revealed 248 microRNA (miRNA) hairpins, of which 63 had orthologs in other species. Fourteen miRNA clusters containing 42 miRNA precursors were found. The RNA interference, dauer development, and programmed cell death pathways are largely conserved. Analysis of protein family domain abundance revealed that P. redivivus has experienced a striking expansion of BTB domain-containing proteins and an unprecedented expansion of the cullin scaffold family of proteins involved in multi-subunit ubiquitin ligases, suggesting proteolytic plasticity and/or tighter regulation of protein turnover. The eukaryotic release factor protein family has also been dramatically expanded and suggests an ongoing evolutionary arms race with viruses and transposons. The P. redivivus genome provides a resource to advance our understanding of nematode evolution and biology and to further elucidate the genomic architecture leading to free-living lineages, taking advantage of the many fascinating features of this worm revealed by comparative studies. PMID:23410827

  18. Mapping of chimpanzee full-length cDNAs onto the human genome unveils large potential divergence of the transcriptome.

    PubMed

    Sakate, Ryuichi; Suto, Yumiko; Imanishi, Tadashi; Tanoue, Tetsuya; Hida, Munetomo; Hayasaka, Ikuo; Kusuda, Jun; Gojobori, Takashi; Hashimoto, Katsuyuki; Hirai, Momoki

    2007-09-01

    The genetic basis of the phenotypic difference between human and chimpanzee is one of the most actively pursued issues in current genomics. Although the genomic divergence between the two species has been described, the transcriptomic divergence has not been well documented. Thus, we newly sequenced and analyzed chimpanzee full-length cDNAs (FLcDNAs) representing 87 protein-coding genes. The number of nucleotide substitutions and sites of insertions/deletions (indels) was counted as a measure of sequence divergence between the chimpanzee FLcDNAs and the human genome onto which the FLcDNAs were mapped. Difference in transcription start/termination sites (TSSs/TTSs) and alternative splicing (AS) exons was also counted as a measure of structural divergence between the chimpanzee FLcDNAs and their orthologous human transcripts (NCBI RefSeq). As a result, we found that transposons (Alu) and repetitive segments caused large indels, which strikingly increased the average amount of sequence divergence up to more than 2% in the 3'-UTRs. Moreover, 20 out of the 87 transcripts contained more than 10% structural divergence in length. In particular, two-thirds of the structural divergence was found in the 3'-UTRs, and variable transcription start sites were conspicuous in the 5'-UTRs. As both transcriptional and translational efficiency were supposed to be related to 5'- and 3'-UTR sequences, these results lead to the idea that the difference in gene regulation can be a major cause of the difference in phenotype between human and chimpanzee.

  19. Genome-Wide Host-Pathogen Interaction Unveiled by Transcriptomic Response of Diamondback Moth to Fungal Infection.

    PubMed

    Chu, Zhen-Jian; Wang, Yu-Jun; Ying, Sheng-Hua; Wang, Xiao-Wei; Feng, Ming-Guang

    2016-01-01

    Genome-wide insight into insect pest response to the infection of Beauveria bassiana (fungal insect pathogen) is critical for genetic improvement of fungal insecticides but has been poorly explored. We constructed three pairs of transcriptomes of Plutella xylostella larvae at 24, 36 and 48 hours post treatment of infection (hptI) and of control (hptC) for insight into the host-pathogen interaction at genomic level. There were 2143, 3200 and 2967 host genes differentially expressed at 24, 36 and 48 hptI/hptC respectively. These infection-responsive genes (~15% of the host genome) were enriched in various immune processes, such as complement and coagulation cascades, protein digestion and absorption, and drug metabolism-cytochrome P450. Fungal penetration into cuticle and host defense reaction began at 24 hptI, followed by most intensive host immune response at 36 hptI and attenuated immunity at 48 hptI. Contrastingly, 44% of fungal genes were differentially expressed in the infection course and enriched in several biological processes, such as antioxidant activity, peroxidase activity and proteolysis. There were 1636 fungal genes co-expressed during 24-48 hptI, including 116 encoding putative secretion proteins. Our results provide novel insights into the insect-pathogen interaction and help to probe molecular mechanisms involved in the fungal infection to the global pest.

  20. A blow to the fly - Lucilia cuprina draft genome and transcriptome to support advances in biology and biotechnology.

    PubMed

    Anstead, Clare A; Batterham, Philip; Korhonen, Pasi K; Young, Neil D; Hall, Ross S; Bowles, Vernon M; Richards, Stephen; Scott, Maxwell J; Gasser, Robin B

    2016-01-01

    The blow fly, Lucilia cuprina (Wiedemann, 1830) is a parasitic insect of major global economic importance. Maggots of this fly parasitize the skin of animal hosts, feed on excretions and tissues, and cause severe disease (flystrike or myiasis). Although there has been considerable research on L. cuprina over the years, little is understood about the molecular biology, biochemistry and genetics of this parasitic fly, as well as its relationship with its hosts and the disease that it causes. This situation might change with the recent report of the draft genome and transcriptome of this blow fly, which has given new and global insights into its biology, interactions with the host animal and aspects of insecticide resistance at the molecular level. This genomic resource will likely enable many fundamental and applied research areas in the future. The present article gives a background on L. cuprina and myiasis, a brief account of past and current treatment, prevention and control approaches, and provides a perspective on the impact that the L. cuprina genome should have on future research of this and related parasitic flies, and the design of new and improved interventions for myiasis.

  1. Brain transcriptome of the violet-eared waxbill Uraeginthus granatina and recent evolution in the songbird genome

    PubMed Central

    Balakrishnan, Christopher N.; Chapus, Charles; Brewer, Michael S.; Clayton, David F.

    2013-01-01

    Songbirds are important models for the study of social behaviour and communication. To complement the recent genome sequencing of the domesticated zebra finch, we sequenced the brain transcriptome of a closely related songbird species, the violet-eared waxbill (Uraeginthus granatina). Both the zebra finch and violet-eared waxbill are members of the family Estrildidae, but differ markedly in their social behaviour. Using Roche 454 RNA sequencing, we generated an assembly and annotation of 11 084 waxbill orthologues of 17 475 zebra finch genes (64%), with an average transcript length of 1555 bp. We also identified 5985 single nucleotide polymorphisms (SNPs) of potential utility for future population genomic studies. Comparing the two species, we found evidence for rapid protein evolution (ω) and low polymorphism of the avian Z sex chromosome, consistent with prior studies of more divergent avian species. An intriguing outlier was putative chromosome 4A, which showed a high density of SNPs and low evolutionary rate relative to other chromosomes. Genome-wide ω was identical in zebra finch and violet-eared waxbill lineages, suggesting a similar demographic history with efficient purifying natural selection. Further comparisons of these and other estrildid finches may provide insights into the evolutionary neurogenomics of social behaviour. PMID:24004662

  2. Ecological venomics: How genomics, transcriptomics and proteomics can shed new light on the ecology and evolution of venom.

    PubMed

    Sunagar, Kartik; Morgenstern, David; Reitzel, Adam M; Moran, Yehu

    2016-03-01

    Animal venom is a complex cocktail of bioactive chemicals that traditionally drew interest mostly from biochemists and pharmacologists. However, in recent years the evolutionary and ecological importance of venom is realized as this trait has direct and strong influence on interactions between species. Moreover, venom content can be modulated by environmental factors. Like many other fields of biology, venom research has been revolutionized in recent years by the introduction of systems biology approaches, i.e., genomics, transcriptomics and proteomics. The employment of these methods in venom research is known as 'venomics'. In this review we describe the history and recent advancements of venomics and discuss how they are employed in studying venom in general and in particular in the context of evolutionary ecology. We also discuss the pitfalls and challenges of venomics and what the future may hold for this emerging scientific field.

  3. Genome-wide expression profiling of the transcriptomes of four Paulownia tomentosa accessions in response to drought.

    PubMed

    Dong, Yanpeng; Fan, Guoqiang; Deng, Minjie; Xu, Enkai; Zhao, Zhenli

    2014-10-01

    Paulownia tomentosa is an important foundation forest tree species in semiarid areas. The lack of genetic information hinders research into the mechanisms involved in its response to abiotic stresses. Here, short-read sequencing technology (Illumina) was used to de novo assemble the transcriptome on P. tomentosa. A total of 99,218 unigenes with a mean length of 949 nucleotides were assembled. 68,295 unigenes were selected and the functions of their products were predicted using Clusters of Orthologous Groups, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes annotations. Afterwards, hundreds of genes involved in drought response were identified. Twelve putative drought response genes were analyzed by quantitative real-time polymerase chain reaction. This study provides a dataset of genes and inherent biochemical pathways, which will help in understanding the mechanisms of the water-deficit response in P. tomentosa. To our knowledge, this is the first study to highlight the genetic makeup of P. tomentosa.

  4. The Discovery of Novel Genomic, Transcriptomic, and Proteomic Biomarkers in Cardiovascular and Peripheral Vascular Disease: The State of the Art

    PubMed Central

    de Franciscis, Stefano; Metzinger, Laurent; Serra, Raffaele

    2016-01-01

    Cardiovascular disease (CD) and peripheral vascular disease (PVD) are leading causes of mortality and morbidity in western countries and also responsible of a huge burden in terms of disability, functional decline, and healthcare costs. Biomarkers are measurable biological elements that reflect particular physiological or pathological states or predisposition towards diseases and they are currently widely studied in medicine and especially in CD. In this context, biomarkers can also be used to assess the severity or the evolution of several diseases, as well as the effectiveness of particular therapies. Genomics, transcriptomics, and proteomics have opened new windows on disease phenomena and may permit in the next future an effective development of novel diagnostic and prognostic medicine in order to better prevent or treat CD. This review will consider the current evidence of novel biomarkers with clear implications in the improvement of risk assessment, prevention strategies, and medical decision making in the field of CD. PMID:27298828

  5. The Mediterranean scorpion Mesobuthus gibbosus (Scorpiones, Buthidae): transcriptome analysis and organization of the genome encoding chlorotoxin-like peptides

    PubMed Central

    2014-01-01

    Background Transcrof toxin genes of scorpion species have been published. Up to this moment, no information on the gene characterization of M. gibbosus is available. Results This study provides the first insight into gene expression in venom glands from M. gibbosus scorpion. A cDNA library was generated from the venom glands and subsequently analyzed (301 clones). Sequences from 177 high-quality ESTs were grouped as 48 Mgib sequences, of those 48 sequences, 40 (29 “singletons” and 11 “contigs”) correspond with one or more ESTs. We identified putative precursor sequences and were grouped them in different categories (39 unique transcripts, one with alternative reading frames), resulting in the identification of 12 new toxin-like and 5 antimicrobial precursors (transcripts). The analysis of the gene families revealed several new components categorized among various toxin families with effect on ion channels. Sequence analysis of a new KTx precursor provides evidence to validate a new KTx subfamily (α-KTx 27.x). A second part of this work involves the genomic organization of three Meg-chlorotoxin-like genes (ClTxs). Genomic DNA sequence reveals close similarities (presence of one same-phase intron) with the sole genomic organization of chlorotoxins ever reported (from M. martensii). Conclusions Transcriptome analysis is a powerful strategy that provides complete information of the gene expression and molecular diversity of the venom glands (telson). In this work, we generated the first catalogue of the gene expression and genomic organization of toxins from M. gibbosus. Our result represents a relevant contribution to the knowledge of toxin transcripts and complementary information related with other cell function proteins and venom peptide transcripts. The genomic organization of the chlorotoxin genes may help to understand the diversity of this gene family. PMID:24746279

  6. Territorial Polymers and Large Scale Genome Organization

    NASA Astrophysics Data System (ADS)

    Grosberg, Alexander

    2012-02-01

    Chromatin fiber in interphase nucleus represents effectively a very long polymer packed in a restricted volume. Although polymer models of chromatin organization were considered, most of them disregard the fact that DNA has to stay not too entangled in order to function properly. One polymer model with no entanglements is the melt of unknotted unconcatenated rings. Extensive simulations indicate that rings in the melt at large length (monomer numbers) N approach the compact state, with gyration radius scaling as N^1/3, suggesting every ring being compact and segregated from the surrounding rings. The segregation is consistent with the known phenomenon of chromosome territories. Surface exponent β (describing the number of contacts between neighboring rings scaling as N^β) appears only slightly below unity, β 0.95. This suggests that the loop factor (probability to meet for two monomers linear distance s apart) should decay as s^-γ, where γ= 2 - β is slightly above one. The later result is consistent with HiC data on real human interphase chromosomes, and does not contradict to the older FISH data. The dynamics of rings in the melt indicates that the motion of one ring remains subdiffusive on the time scale well above the stress relaxation time.

  7. Systematic Identification and Assessment of Therapeutic Targets for Breast Cancer Based on Genome-Wide RNA Interference Transcriptomes

    PubMed Central

    Liu, Yang; Yin, Xiaoyao; Zhong, Jing; Guan, Naiyang; Luo, Zhigang; Min, Lishan; Yao, Xing; Bo, Xiaochen; Dai, Licheng; Bai, Hui

    2017-01-01

    With accumulating public omics data, great efforts have been made to characterize the genetic heterogeneity of breast cancer. However, identifying novel targets and selecting the best from the sizeable lists of candidate targets is still a key challenge for targeted therapy, largely owing to the lack of economical, efficient and systematic discovery and assessment to prioritize potential therapeutic targets. Here, we describe an approach that combines the computational evaluation and objective, multifaceted assessment to systematically identify and prioritize targets for biological validation and therapeutic exploration. We first establish the reference gene expression profiles from breast cancer cell line MCF7 upon genome-wide RNA interference (RNAi) of a total of 3689 genes, and the breast cancer query signatures using RNA-seq data generated from tissue samples of clinical breast cancer patients in the Cancer Genome Atlas (TCGA). Based on gene set enrichment analysis, we identified a set of 510 genes that when knocked down could significantly reverse the transcriptome of breast cancer state. We then perform multifaceted assessment to analyze the gene set to prioritize potential targets for gene therapy. We also propose drug repurposing opportunities and identify potentially druggable proteins that have been poorly explored with regard to the discovery of small-molecule modulators. Finally, we obtained a small list of candidate therapeutic targets for four major breast cancer subtypes, i.e., luminal A, luminal B, HER2+ and triple negative breast cancer. This RNAi transcriptome-based approach can be a helpful paradigm for relevant researches to identify and prioritize candidate targets for experimental validation. PMID:28245581

  8. Analysis of CATMA transcriptome data identifies hundreds of novel functional genes and improves gene models in the Arabidopsis genome

    PubMed Central

    Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique; Taconnat, Ludivine; Bitton, Frédérique; Balzergue, Sandrine; Jullien, Pauline E; Ingouff, Mathieu; Thareau, Vincent; Schiex, Thomas; Lecharny, Alain; Renou, Jean-Pierre

    2007-01-01

    Background Since the finishing of the sequencing of the Arabidopsis thaliana genome, the Arabidopsis community and the annotator centers have been working on the improvement of gene annotation at the structural and functional levels. In this context, we have used the large CATMA resource on the Arabidopsis transcriptome to search for genes missed by different annotation processes. Probes on the CATMA microarrays are specific gene sequence tags (GSTs) based on the CDS models predicted by the Eugene software. Among the 24 576 CATMA v2 GSTs, 677 are in regions considered as intergenic by the TAIR annotation. We analyzed the cognate transcriptome data in the CATMA resource and carried out data-mining to characterize novel genes and improve gene models. Results The statistical analysis of the results of more than 500 hybridized samples distributed among 12 organs provides an experimental validation for 465 novel genes. The hybridization evidence was confirmed by RT-PCR approaches for 88% of the 465 novel genes. Comparisons with the current annotation show that these novel genes often encode small proteins, with an average size of 137 aa. Our approach has also led to the improvement of pre-existing gene models through both the extension of 16 CDS and the identification of 13 gene models erroneously constituted of two merged CDS. Conclusion This work is a noticeable step forward in the improvement of the Arabidopsis genome annotation. We increased the number of Arabidopsis validated genes by 465 novel transcribed genes to which we associated several functional annotations such as expression profiles, sequence conservation in plants, cognate transcripts and protein motifs. PMID:17980019

  9. Transcriptome sequencing and genome-wide association analyses reveal lysosomal function and actin cytoskeleton remodeling in schizophrenia and bipolar disorder.

    PubMed

    Zhao, Z; Xu, J; Chen, J; Kim, S; Reimers, M; Bacanu, S-A; Yu, H; Liu, C; Sun, J; Wang, Q; Jia, P; Xu, F; Zhang, Y; Kendler, K S; Peng, Z; Chen, X

    2015-05-01

    Schizophrenia (SCZ) and bipolar disorder (BPD) are severe mental disorders with high heritability. Clinicians have long noticed the similarities of clinic symptoms between these disorders. In recent years, accumulating evidence indicates some shared genetic liabilities. However, what is shared remains elusive. In this study, we conducted whole transcriptome analysis of post-mortem brain tissues (cingulate cortex) from SCZ, BPD and control subjects, and identified differentially expressed genes in these disorders. We found 105 and 153 genes differentially expressed in SCZ and BPD, respectively. By comparing the t-test scores, we found that many of the genes differentially expressed in SCZ and BPD are concordant in their expression level (q⩽0.01, 53 genes; q⩽0.05, 213 genes; q⩽0.1, 885 genes). Using genome-wide association data from the Psychiatric Genomics Consortium, we found that these differentially and concordantly expressed genes were enriched in association signals for both SCZ (P<10(-7)) and BPD (P=0.029). To our knowledge, this is the first time that a substantially large number of genes show concordant expression and association for both SCZ and BPD. Pathway analyses of these genes indicated that they are involved in the lysosome, Fc gamma receptor-mediated phagocytosis, regulation of actin cytoskeleton pathways, along with several cancer pathways. Functional analyses of these genes revealed an interconnected pathway network centered on lysosomal function and the regulation of actin cytoskeleton. These pathways and their interacting network were principally confirmed by an independent transcriptome sequencing data set of the hippocampus. Dysregulation of lysosomal function and cytoskeleton remodeling has direct impacts on endocytosis, phagocytosis, exocytosis, vesicle trafficking, neuronal maturation and migration, neurite outgrowth and synaptic density and plasticity, and different aspects of these processes have been implicated in SCZ and BPD.

  10. Transcriptome Analysis of Kiwifruit (Actinidia chinensis) Bark in Response to Armoured Scale Insect (Hemiberlesia lataniae) Feeding

    PubMed Central

    Hill, M. Garry; Wurms, Kirstin V.; Davy, Marcus W.; Gould, Elaine; Allan, Andrew; Mauchline, Nicola A.; Luo, Zhiwei; Ah Chee, Annette; Stannard, Kate; Storey, Roy D.; Rikkerink, Erik H.

    2015-01-01

    The kiwifruit cultivar Actinidia chinensis ‘Hort16A’ is resistant to the polyphagous armoured scale insect pest Hemiberlesia lataniae (Hemiptera: Diaspididae). A cDNA microarray consisting of 17,512 unigenes selected from over 132,000 expressed sequence tags (ESTs) was used to measure the transcriptomic profile of the A. chinensis ‘Hort16A’ canes in response to a controlled infestation of H. lataniae. After 2 days, 272 transcripts were differentially expressed. After 7 days, 5,284 (30%) transcripts were differentially expressed. The transcripts were grouped into 22 major functional categories using MapMan software. After 7 days, transcripts associated with photosynthesis (photosystem II) were significantly down-regulated, while those associated with secondary metabolism were significantly up-regulated. A total of 643 transcripts associated with response to stress were differentially expressed. This included biotic stress-related transcripts orthologous with pathogenesis related proteins, the phenylpropanoid pathway, NBS-LRR (R) genes, and receptor-like kinase–leucine rich repeat signalling proteins. While transcriptional studies are not conclusive in their own right, results were suggestive of a defence response involving both ETI and PTI, with predominance of the SA signalling pathway. Exogenous application of an SA-mimic decreased H. lataniae growth on A. chinensis ‘Hort16A’ plants in two laboratory experiments. PMID:26571404

  11. Transcriptome Analysis of Kiwifruit (Actinidia chinensis) Bark in Response to Armoured Scale Insect (Hemiberlesia lataniae) Feeding.

    PubMed

    Hill, M Garry; Wurms, Kirstin V; Davy, Marcus W; Gould, Elaine; Allan, Andrew; Mauchline, Nicola A; Luo, Zhiwei; Ah Chee, Annette; Stannard, Kate; Storey, Roy D; Rikkerink, Erik H

    2015-01-01

    The kiwifruit cultivar Actinidia chinensis 'Hort16A' is resistant to the polyphagous armoured scale insect pest Hemiberlesia lataniae (Hemiptera: Diaspididae). A cDNA microarray consisting of 17,512 unigenes selected from over 132,000 expressed sequence tags (ESTs) was used to measure the transcriptomic profile of the A. chinensis 'Hort16A' canes in response to a controlled infestation of H. lataniae. After 2 days, 272 transcripts were differentially expressed. After 7 days, 5,284 (30%) transcripts were differentially expressed. The transcripts were grouped into 22 major functional categories using MapMan software. After 7 days, transcripts associated with photosynthesis (photosystem II) were significantly down-regulated, while those associated with secondary metabolism were significantly up-regulated. A total of 643 transcripts associated with response to stress were differentially expressed. This included biotic stress-related transcripts orthologous with pathogenesis related proteins, the phenylpropanoid pathway, NBS-LRR (R) genes, and receptor-like kinase-leucine rich repeat signalling proteins. While transcriptional studies are not conclusive in their own right, results were suggestive of a defence response involving both ETI and PTI, with predominance of the SA signalling pathway. Exogenous application of an SA-mimic decreased H. lataniae growth on A. chinensis 'Hort16A' plants in two laboratory experiments.

  12. Reconstructing genome-wide regulatory network of E. coli using transcriptome data and predicted transcription factor activities

    PubMed Central

    2011-01-01

    Background Gene regulatory networks play essential roles in living organisms to control growth, keep internal metabolism running and respond to external environmental changes. Understanding the connections and the activity levels of regulators is important for the research of gene regulatory networks. While relevance score based algorithms that reconstruct gene regulatory networks from transcriptome data can infer genome-wide gene regulatory networks, they are unfortunately prone to false positive results. Transcription factor activities (TFAs) quantitatively reflect the ability of the transcription factor to regulate target genes. However, classic relevance score based gene regulatory network reconstruction algorithms use models do not include the TFA layer, thus missing a key regulatory element. Results This work integrates TFA prediction algorithms with relevance score based network reconstruction algorithms to reconstruct gene regulatory networks with improved accuracy over classic relevance score based algorithms. This method is called Gene expression and Transcription factor activity based Relevance Network (GTRNetwork). Different combinations of TFA prediction algorithms and relevance score functions have been applied to find the most efficient combination. When the integrated GTRNetwork method was applied to E. coli data, the reconstructed genome-wide gene regulatory network predicted 381 new regulatory links. This reconstructed gene regulatory network including the predicted new regulatory links show promising biological significances. Many of the new links are verified by known TF binding site information, and many other links can be verified from the literature and databases such as EcoCyc. The reconstructed gene regulatory network is applied to a recent transcriptome analysis of E. coli during isobutanol stress. In addition to the 16 significantly changed TFAs detected in the original paper, another 7 significantly changed TFAs have been detected by

  13. Large-scale investigation of genomic markers for severe periodontitis.

    PubMed

    Suzuki, Asami; Ji, Guijin; Numabe, Yukihiro; Ishii, Keisuke; Muramatsu, Masaaki; Kamoi, Kyuichi

    2004-09-01

    The purpose of the present study was to investigate the genomic markers for periodontitis, using large-scale single-nucleotide polymorphism (SNP) association studies comparing healthy volunteers and patients with periodontitis. Genomic DNA was obtained from 19 healthy volunteers and 22 patients with severe periodontitis, all of whom were Japanese. The subjects were genotyped at 637 SNPs in 244 genes on a large scale, using the TaqMan polymerase chain reaction (PCR) system. Statistically significant differences in allele and genotype frequencies were analyzed with Fisher's exact test. We found statistically significant differences (P < 0.01) between the healthy volunteers and patients with severe periodontitis in the following genes; gonadotropin-releasing hormone 1 (GNRH1), phosphatidylinositol 3-kinase regulatory 1 (PIK3R1), dipeptidylpeptidase 4 (DPP4), fibrinogen-like 2 (FGL2), and calcitonin receptor (CALCR). These results suggest that SNPs in the GNRH1, PIK3R1, DPP4, FGL2, and CALCR genes are genomic markers for severe periodontitis. Our findings indicate the necessity of analyzing SNPs in genes on a large scale (i.e., genome-wide approach), to identify genomic markers for periodontitis.

  14. Genome-scale engineering for systems and synthetic biology

    PubMed Central

    Esvelt, Kevin M; Wang, Harris H

    2013-01-01

    Genome-modification technologies enable the rational engineering and perturbation of biological systems. Historically, these methods have been limited to gene insertions or mutations at random or at a few pre-defined locations across the genome. The handful of methods capable of targeted gene editing suffered from low efficiencies, significant labor costs, or both. Recent advances have dramatically expanded our ability to engineer cells in a directed and combinatorial manner. Here, we review current technologies and methodologies for genome-scale engineering, discuss the prospects for extending efficient genome modification to new hosts, and explore the implications of continued advances toward the development of flexibly programmable chasses, novel biochemistries, and safer organismal and ecological engineering. PMID:23340847

  15. Genome-scale engineering for systems and synthetic biology.

    PubMed

    Esvelt, Kevin M; Wang, Harris H

    2013-01-01

    Genome-modification technologies enable the rational engineering and perturbation of biological systems. Historically, these methods have been limited to gene insertions or mutations at random or at a few pre-defined locations across the genome. The handful of methods capable of targeted gene editing suffered from low efficiencies, significant labor costs, or both. Recent advances have dramatically expanded our ability to engineer cells in a directed and combinatorial manner. Here, we review current technologies and methodologies for genome-scale engineering, discuss the prospects for extending efficient genome modification to new hosts, and explore the implications of continued advances toward the development of flexibly programmable chasses, novel biochemistries, and safer organismal and ecological engineering.

  16. Ensembl Genomes 2013: scaling up access to genome-wide data

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Hughes, Daniel Seth Toney; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Langridge, Nicholas; McDowall, Mark D.; Maheswari, Uma; Maslen, Gareth; Nuhn, Michael; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Toneva, Iliana; Tuli, Mary Ann; Walts, Brandon; Williams, Gareth; Wilson, Derek; Youens-Clark, Ken; Monaco, Marcela K.; Stein, Joshua; Wei, Xuehong; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin Lee; Kulesha, Eugene; Lawson, Daniel; Staines, Daniel Michael

    2014-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future. PMID:24163254

  17. Single Cell Genomics: Advances and Future Perspectives

    PubMed Central

    Macaulay, Iain C.; Voet, Thierry

    2014-01-01

    Advances in whole-genome and whole-transcriptome amplification have permitted the sequencing of the minute amounts of DNA and RNA present in a single cell, offering a window into the extent and nature of genomic and transcriptomic heterogeneity which occurs in both normal development and disease. Single-cell approaches stand poised to revolutionise our capacity to understand the scale of genomic, epigenomic, and transcriptomic diversity that occurs during the lifetime of an individual organism. Here, we review the major technological and biological breakthroughs achieved, describe the remaining challenges to overcome, and provide a glimpse into the promise of recent and future developments. PMID:24497842

  18. Symbiodinium Transcriptomes: Genome Insights into the Dinoflagellate Symbionts of Reef-Building Corals

    PubMed Central

    Sunagawa, Shinichi; Yum, Lauren K.; DeSalvo, Michael K.; Lindquist, Erika; Coffroth, Mary Alice; Voolstra, Christian R.; Medina, Mónica

    2012-01-01

    Dinoflagellates are unicellular algae that are ubiquitously abundant in aquatic environments. Species of the genus Symbiodinium form symbiotic relationships with reef-building corals and other marine invertebrates. Despite their ecologic importance, little is known about the genetics of dinoflagellates in general and Symbiodinium in particular. Here, we used 454 sequencing to generate transcriptome data from two Symbiodinium species from different clades (clade A and clade B). With more than 56,000 assembled sequences per species, these data represent the largest transcriptomic resource for dinoflagellates to date. Our results corroborate previous observations that dinoflagellates possess the complete nucleosome machinery. We found a complete set of core histones as well as several H3 variants and H2A.Z in one species. Furthermore, transcriptome analysis points toward a low number of transcription factors in Symbiodinium spp. that also differ in the distribution of DNA-binding domains relative to other eukaryotes. In particular the cold shock domain was predominant among transcription factors. Additionally, we found a high number of antioxidative genes in comparison to non-symbiotic but evolutionary related organisms. These findings might be of relevance in the context of the role that Symbiodinium spp. play as coral symbionts. Our data represent the most comprehensive dinoflagellate EST data set to date. This study provides a comprehensive resource to further analyze the genetic makeup, metabolic capacities, and gene repertoire of Symbiodinium and dinoflagellates. Overall, our findings indicate that Symbiodinium possesses some unique characteristics, in particular the transcriptional regulation in Symbiodinium may differ from the currently known mechanisms of eukaryotic gene regulation. PMID:22529998

  19. Genome resequencing in Populus: Revealing large-scale genome variation and implications on specialized-trait genomics

    SciTech Connect

    Muchero, Wellington; Labbe, Jessy L; Priya, Ranjan; DiFazio, Steven P; Tuskan, Gerald A

    2014-01-01

    To date, Populus ranks among a few plant species with a complete genome sequence and other highly developed genomic resources. With the first genome sequence among all tree species, Populus has been adopted as a suitable model organism for genomic studies in trees. However, far from being just a model species, Populus is a key renewable economic resource that plays a significant role in providing raw materials for the biofuel and pulp and paper industries. Therefore, aside from leading frontiers of basic tree molecular biology and ecological research, Populus leads frontiers in addressing global economic challenges related to fuel and fiber production. The latter fact suggests that research aimed at improving quality and quantity of Populus as a raw material will likely drive the pursuit of more targeted and deeper research in order to unlock the economic potential tied in molecular biology processes that drive this tree species. Advances in genome sequence-driven technologies, such as resequencing individual genotypes, which in turn facilitates large scale SNP discovery and identification of large scale polymorphisms are key determinants of future success in these initiatives. In this treatise we discuss implications of genome sequence-enable technologies on Populus genomic and genetic studies of complex and specialized-traits.

  20. Unraveling the 3D genome: genomics tools for multi-scale exploration

    PubMed Central

    Risca, Viviana I.; Greenleaf, William J.

    2015-01-01

    A decade of rapid method development has begun to yield exciting insights into the three-dimensional architecture of the metazoan genome and the roles it may play in regulating transcription. We review here core methods and new tools in the modern genomicist’s toolbox at three length scales, ranging from single base pair to megabase scale chromosomal domains, and discuss the emerging picture of the 3D genome that these tools have revealed. Blind spots remain, especially at intermediate length scales spanning a few nucleosomes, but thanks in part to new technologies that permit targeted alteration of chromatin states and time-resolved studies, the next decade holds great promise for hypothesis-driven research into the mechanisms that drive genome architecture and transcriptional regulation. PMID:25887733

  1. Genome-scale cold stress response regulatory networks in ten Arabidopsis thaliana ecotypes

    PubMed Central

    2013-01-01

    Background Low temperature leads to major crop losses every year. Although several studies have been conducted focusing on diversity of cold tolerance level in multiple phenotypically divergent Arabidopsis thaliana (A. thaliana) ecotypes, genome-scale molecular understanding is still lacking. Results In this study, we report genome-scale transcript response diversity of 10 A. thaliana ecotypes originating from different geographical locations to non-freezing cold stress (10°C). To analyze the transcriptional response diversity, we initially compared transcriptome changes in all 10 ecotypes using Arabidopsis NimbleGen ATH6 microarrays. In total 6061 transcripts were significantly cold regulated (p < 0.01) in 10 ecotypes, including 498 transcription factors and 315 transposable elements. The majority of the transcripts (75%) showed ecotype specific expression pattern. By using sequence data available from Arabidopsis thaliana 1001 genome project, we further investigated sequence polymorphisms in the core cold stress regulon genes. Significant numbers of non-synonymous amino acid changes were observed in the coding region of the CBF regulon genes. Considering the limited knowledge about regulatory interactions between transcription factors and their target genes in the model plant A. thaliana, we have adopted a powerful systems genetics approach- Network Component Analysis (NCA) to construct an in-silico transcriptional regulatory network model during response to cold stress. The resulting regulatory network contained 1,275 nodes and 7,720 connections, with 178 transcription factors and 1,331 target genes. Conclusions A. thaliana ecotypes exhibit considerable variation in transcriptome level responses to non-freezing cold stress treatment. Ecotype specific transcripts and related gene ontology (GO) categories were identified to delineate natural variation of cold stress regulated differential gene expression in the model plant A. thaliana. The predicted

  2. From genes to milk: Genomic organization and epigenetic regulation of the mammary transcriptome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Even in genomes lacking operons, a gene's position in the genome influences its potential for expression. The mechanisms by which adjacent genes are co-expressed are still not completely understood. Using lactation and the mammary gland as a model system, we explore the hypothesis that chromatin sta...

  3. Salivary Genomics, Transcriptomics and Proteomics: The Emerging Concept of the Oral Ecosystem and their Use in the Early Diagnosis of Cancer and other Diseases

    PubMed Central

    Fábián, T.K; Fejérdy, P; Csermely, P

    2008-01-01

    There is an increasingly growing interest world-wide for the genomics, transcriptomics and proteomics of saliva and the oral cavity, since they provide a non-invasive source of unprecedently rich genetic information. The complexity of oral systems biology goes much beyond the human genome, transcriptome and proteome revealed by oral mucosal cells, gingival crevicular fluid, and saliva, and includes the complexity of the oral microbiota, the symbiotic assembly of bacterial, fungal and other microbial flora in the oral cavity. In our review we summarize the recent information on oral genomics, transcriptomics and proteomics, of both human and microbial origin. We also give an introduction and practical advice on sample collection, handling and storage for analysis. Finally, we show the usefulness of salivary and oral genomics in early diagnosis of cancer, as well as in uncovering other systemic diseases, infections and oral disorders. We close the review by highlighting a number of possible exploratory pathways in this emerging, hot research field. PMID:19424479

  4. Analysis of the Phlebiopsis gigantea Genome, Transcriptome and Secretome Provides Insight into Its Pioneer Colonization Strategies of Wood

    DOE PAGES

    Hori, Chiaki; Ishida, Takuya; Igarashi, Kiyohiko; ...

    2014-12-04

    Collectively classified as white-rot fungi, certain basidiomycetes efficiently degrade the major structural polymers of wood cell walls. A small subset of these Agaricomycetes, exemplified by Phlebiopsis gigantea, is capable of colonizing freshly exposed conifer sapwood despite its high content of extractives, which retards the establishment of other fungal species. The mechanism(s) by which P. gigantea tolerates and metabolizes resinous compounds have not been explored. Here, we report the annotated P. gigantea genome and compare profiles of its transcriptome and secretome when cultured on freshcut versus solvent-extracted loblolly pine wood. The P. gigantea genome contains a conventional repertoire of hydrolase genesmore » involved in cellulose/hemicellulose degradation, whose patterns of expression were relatively unperturbed by the absence of extractives. The expression of genes typically ascribed to lignin degradation was also largely unaffected. In contrast, genes likely involved in the transformation and detoxification of wood extractives were highly induced in its presence. Their products included an ABC transporter, lipases, cytochrome P450s, glutathione S-transferase and aldehyde dehydrogenase. Other regulated genes of unknown function and several constitutively expressed genes are also likely involved in P. gigantea’s extractives metabolism. These results contribute to our fundamental understanding of pioneer colonization of conifer wood and provide insight into the diverse chemistries employed by fungi in carbon cycling processes.« less

  5. Analysis of the Phlebiopsis gigantea Genome, Transcriptome and Secretome Provides Insight into Its Pioneer Colonization Strategies of Wood

    SciTech Connect

    Hori, Chiaki; Ishida, Takuya; Igarashi, Kiyohiko; Samejima, Masahiro; Suzuki, Hitoshi; Master, Emma; Ferreira, Patricia; Ruiz-Dueñas, Francisco J.; Held, Benjamin; Canessa, Paulo; Larrondo, Luis F.; Schmoll, Monika; Druzhinina, Irina S.; Kubicek, Christian P.; Gaskell, Jill A.; Kersten, Phil; St. John, Franz; Glasner, Jeremy; Sabat, Grzegorz; Splinter BonDurant, Sandra; Syed, Khajamohiddin; Yadav, Jagjit; Mgbeahuruike, Anthony C.; Kovalchuk, Andriy; Asiegbu, Fred O.; Lackner, Gerald; Hoffmeister, Dirk; Rencoret, Jorge; Gutiérrez, Ana; Sun, Hui; Lindquist, Erika; Barry, Kerrie; Riley, Robert; Grigoriev, Igor V.; Henrissat, Bernard; Berka, Randy M.; Martínez, Angel T.; Covert, Sarah F.; Blanchette, Robert A.; Cullen, Daniel; Copenhaver, Gregory P.

    2014-12-04

    Collectively classified as white-rot fungi, certain basidiomycetes efficiently degrade the major structural polymers of wood cell walls. A small subset of these Agaricomycetes, exemplified by Phlebiopsis gigantea, is capable of colonizing freshly exposed conifer sapwood despite its high content of extractives, which retards the establishment of other fungal species. The mechanism(s) by which P. gigantea tolerates and metabolizes resinous compounds have not been explored. Here, we report the annotated P. gigantea genome and compare profiles of its transcriptome and secretome when cultured on freshcut versus solvent-extracted loblolly pine wood. The P. gigantea genome contains a conventional repertoire of hydrolase genes involved in cellulose/hemicellulose degradation, whose patterns of expression were relatively unperturbed by the absence of extractives. The expression of genes typically ascribed to lignin degradation was also largely unaffected. In contrast, genes likely involved in the transformation and detoxification of wood extractives were highly induced in its presence. Their products included an ABC transporter, lipases, cytochrome P450s, glutathione S-transferase and aldehyde dehydrogenase. Other regulated genes of unknown function and several constitutively expressed genes are also likely involved in P. gigantea’s extractives metabolism. These results contribute to our fundamental understanding of pioneer colonization of conifer wood and provide insight into the diverse chemistries employed by fungi in carbon cycling processes.

  6. Genomic organization, transcriptomic analysis, and functional characterization of avian α- and β-keratins in diverse feather forms.

    PubMed

    Ng, Chen Siang; Wu, Ping; Fan, Wen-Lang; Yan, Jie; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Mao, Chi-Tang; Chen, Jun-Jie; Lu, Mei-Yeh Jade; Ho, Meng-Ru; Widelitz, Randall B; Chen, Chih-Feng; Chuong, Cheng-Ming; Li, Wen-Hsiung

    2014-08-24

    Feathers are hallmark avian integument appendages, although they were also present on theropods. They are composed of flexible corneous materials made of α- and β-keratins, but their genomic organization and their functional roles in feathers have not been well studied. First, we made an exhaustive search of α- and β-keratin genes in the new chicken genome assembly (Galgal4). Then, using transcriptomic analysis, we studied α- and β-keratin gene expression patterns in five types of feather epidermis. The expression patterns of β-keratin genes were different in different feather types, whereas those of α-keratin genes were less variable. In addition, we obtained extensive α- and β-keratin mRNA in situ hybridization data, showing that α-keratins and β-keratins are preferentially expressed in different parts of the feather components. Together, our data suggest that feather morphological and structural diversity can largely be attributed to differential combinations of α- and β-keratin genes in different intrafeather regions and/or feather types from different body parts. The expression profiles provide new insights into the evolutionary origin and diversification of feathers. Finally, functional analysis using mutant chicken keratin forms based on those found in the human α-keratin mutation database led to abnormal phenotypes. This demonstrates that the chicken can be a convenient model for studying the molecular biology of human keratin-based diseases.

  7. Genomic Organization, Transcriptomic Analysis, and Functional Characterization of Avian α- and β-Keratins in Diverse Feather Forms

    PubMed Central

    Fan, Wen-Lang; Yan, Jie; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Mao, Chi-Tang; Chen, Jun-Jie; Lu, Mei-Yeh Jade; Ho, Meng-Ru; Widelitz, Randall B.; Chen, Chih-Feng; Chuong, Cheng-Ming; Li, Wen-Hsiung

    2014-01-01

    Feathers are hallmark avian integument appendages, although they were also present on theropods. They are composed of flexible corneous materials made of α- and β-keratins, but their genomic organization and their functional roles in feathers have not been well studied. First, we made an exhaustive search of α- and β-keratin genes in the new chicken genome assembly (Galgal4). Then, using transcriptomic analysis, we studied α- and β-keratin gene expression patterns in five types of feather epidermis. The expression patterns of β-keratin genes were different in different feather types, whereas those of α-keratin genes were less variable. In addition, we obtained extensive α- and β-keratin mRNA in situ hybridization data, showing that α-keratins and β-keratins are preferentially expressed in different parts of the feather components. Together, our data suggest that feather morphological and structural diversity can largely be attributed to differential combinations of α- and β-keratin genes in different intrafeather regions and/or feather types from different body parts. The expression profiles provide new insights into the evolutionary origin and diversification of feathers. Finally, functional analysis using mutant chicken keratin forms based on those found in the human α-keratin mutation database led to abnormal phenotypes. This demonstrates that the chicken can be a convenient model for studying the molecular biology of human keratin-based diseases. PMID:25152353

  8. Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction

    PubMed Central

    Nichol, Sarah; Tracey, Alan; Holroyd, Nancy; Cotton, James A.; Stanley, Eleanor J.; Zarowiecki, Magdalena; Liu, Jimmy Z.; Huckvale, Thomas; Cooper, Philip J.; Grencis, Richard K.; Berriman, Matthew

    2014-01-01

    Whipworms are common soil-transmitted helminths that cause debilitating chronic infections in man. These nematodes are only distantly related to Caenorhabditis elegans and have evolved to occupy an unusual niche, tunneling through epithelial cells of the large intestine. Here we present the genome sequences of the human-infective Trichuris trichiura and the murine laboratory model T. muris. Based on whole transcriptome analyses we identify many genes that are expressed in a gender- or life stage-specific manner and characterise the transcriptional landscape of a morphological region with unique biological adaptations, namely bacillary band and stichosome, found only in whipworms and related parasites. Using RNAseq data from whipworm-infected mice we describe the regulated Th1-like immune response of the chronically infected cecum in unprecedented detail. In silico screening identifies numerous potential new drug targets against trichuriasis. Together, these genomes and associated functional data elucidate key aspects of the molecular host-parasite interactions that define chronic whipworm infection. PMID:24929830

  9. Analysis of the Phlebiopsis gigantea Genome, Transcriptome and Secretome Provides Insight into Its Pioneer Colonization Strategies of Wood

    PubMed Central

    Hori, Chiaki; Ishida, Takuya; Igarashi, Kiyohiko; Samejima, Masahiro; Suzuki, Hitoshi; Master, Emma; Ferreira, Patricia; Ruiz-Dueñas, Francisco J.; Held, Benjamin; Canessa, Paulo; Larrondo, Luis F.; Schmoll, Monika; Druzhinina, Irina S.; Kubicek, Christian P.; Gaskell, Jill A.; Kersten, Phil; St. John, Franz; Glasner, Jeremy; Sabat, Grzegorz; Splinter BonDurant, Sandra; Syed, Khajamohiddin; Yadav, Jagjit; Mgbeahuruike, Anthony C.; Kovalchuk, Andriy; Asiegbu, Fred O.; Lackner, Gerald; Hoffmeister, Dirk; Rencoret, Jorge; Gutiérrez, Ana; Sun, Hui; Lindquist, Erika; Barry, Kerrie; Riley, Robert; Grigoriev, Igor V.; Henrissat, Bernard; Kües, Ursula; Berka, Randy M.; Martínez, Angel T.; Covert, Sarah F.; Blanchette, Robert A.; Cullen, Daniel

    2014-01-01

    Collectively classified as white-rot fungi, certain basidiomycetes efficiently degrade the major structural polymers of wood cell walls. A small subset of these Agaricomycetes, exemplified by Phlebiopsis gigantea, is capable of colonizing freshly exposed conifer sapwood despite its high content of extractives, which retards the establishment of other fungal species. The mechanism(s) by which P. gigantea tolerates and metabolizes resinous compounds have not been explored. Here, we report the annotated P. gigantea genome and compare profiles of its transcriptome and secretome when cultured on fresh-cut versus solvent-extracted loblolly pine wood. The P. gigantea genome contains a conventional repertoire of hydrolase genes involved in cellulose/hemicellulose degradation, whose patterns of expression were relatively unperturbed by the absence of extractives. The expression of genes typically ascribed to lignin degradation was also largely unaffected. In contrast, genes likely involved in the transformation and detoxification of wood extractives were highly induced in its presence. Their products included an ABC transporter, lipases, cytochrome P450s, glutathione S-transferase and aldehyde dehydrogenase. Other regulated genes of unknown function and several constitutively expressed genes are also likely involved in P. gigantea's extractives metabolism. These results contribute to our fundamental understanding of pioneer colonization of conifer wood and provide insight into the diverse chemistries employed by fungi in carbon cycling processes. PMID:25474575

  10. Genome-wide transcriptomic analysis of the sporophyte of the moss Physcomitrella patens.

    PubMed

    O'Donoghue, Martin-Timothy; Chater, Caspar; Wallace, Simon; Gray, Julie E; Beerling, David J; Fleming, Andrew J

    2013-09-01

    Bryophytes, the most basal of the extant land plants, diverged at least 450 million years ago. A major feature of these plants is the biphasic alternation of generations between a dominant haploid gametophyte and a minor diploid sporophyte phase. These dramatic differences in form and function occur in a constant genetic background, raising the question of whether the switch from gametophyte-to-sporophyte development reflects major changes in the spectrum of genes being expressed or alternatively whether only limited changes in gene expression occur and the differences in plant form are due to differences in how the gene products are put together. This study performed replicated microarray analyses of RNA from several thousand dissected and developmentally staged sporophytes of the moss Physcomitrella patens, allowing analysis of the transcriptomes of the sporophyte and early gametophyte, as well as the early stages of moss sporophyte development. The data indicate that more significant changes in transcript profile occur during the switch from gametophyte to sporophyte than recently reported, with over 12% of the entire transcriptome of P. patens being altered during this major developmental transition. Analysis of the types of genes contributing to these differences supports the view of the early sporophyte being energetically and nutritionally dependent on the gametophyte, provides a profile of homologues to genes involved in angiosperm stomatal development and physiology which suggests a deeply conserved mechanism of stomatal control, and identifies a novel series of transcription factors associated with moss sporophyte development.

  11. Genome-wide transcriptomic analysis of the sporophyte of the moss Physcomitrella patens

    PubMed Central

    O’Donoghue, Martin-Timothy; Chater, Caspar; Wallace, Simon; Gray, Julie E.; Beerling, David J.; Fleming, Andrew J.

    2013-01-01

    Bryophytes, the most basal of the extant land plants, diverged at least 450 million years ago. A major feature of these plants is the biphasic alternation of generations between a dominant haploid gametophyte and a minor diploid sporophyte phase. These dramatic differences in form and function occur in a constant genetic background, raising the question of whether the switch from gametophyte-to-sporophyte development reflects major changes in the spectrum of genes being expressed or alternatively whether only limited changes in gene expression occur and the differences in plant form are due to differences in how the gene products are put together. This study performed replicated microarray analyses of RNA from several thousand dissected and developmentally staged sporophytes of the moss Physcomitrella patens, allowing analysis of the transcriptomes of the sporophyte and early gametophyte, as well as the early stages of moss sporophyte development. The data indicate that more significant changes in transcript profile occur during the switch from gametophyte to sporophyte than recently reported, with over 12% of the entire transcriptome of P. patens being altered during this major developmental transition. Analysis of the types of genes contributing to these differences supports the view of the early sporophyte being energetically and nutritionally dependent on the gametophyte, provides a profile of homologues to genes involved in angiosperm stomatal development and physiology which suggests a deeply conserved mechanism of stomatal control, and identifies a novel series of transcription factors associated with moss sporophyte development. PMID:23888066

  12. Scalable Parameter Estimation for Genome-Scale Biochemical Reaction Networks

    PubMed Central

    Kaltenbacher, Barbara; Hasenauer, Jan

    2017-01-01

    Mechanistic mathematical modeling of biochemical reaction networks using ordinary differential equation (ODE) models has improved our understanding of small- and medium-scale biological processes. While the same should in principle hold for large- and genome-scale processes, the computational methods for the analysis of ODE models which describe hundreds or thousands of biochemical species and reactions are missing so far. While individual simulations are feasible, the inference of the model parameters from experimental data is computationally too intensive. In this manuscript, we evaluate adjoint sensitivity analysis for parameter estimation in large scale biochemical reaction networks. We present the approach for time-discrete measurement and compare it to state-of-the-art methods used in systems and computational biology. Our comparison reveals a significantly improved computational efficiency and a superior scalability of adjoint sensitivity analysis. The computational complexity is effectively independent of the number of parameters, enabling the analysis of large- and genome-scale models. Our study of a comprehensive kinetic model of ErbB signaling shows that parameter estimation using adjoint sensitivity analysis requires a fraction of the computation time of established methods. The proposed method will facilitate mechanistic modeling of genome-scale cellular processes, as required in the age of omics. PMID:28114351

  13. Genome-scale metabolic representation of Amycolatopsis balhimycina.

    PubMed

    Vongsangnak, Wanwipa; Figueiredo, Luís Filipe; Förster, Jochen; Weber, Tilmann; Thykaer, Jette; Stegmann, Evi; Wohlleben, Wolfgang; Nielsen, Jens

    2012-07-01

    Infection caused by methicillin-resistant Staphylococcus aureus (MRSA) is an increasing societal problem. Typically, glycopeptide antibiotics are used in the treatment of these infections. The most comprehensively studied glycopeptide antibiotic biosynthetic pathway is that of balhimycin biosynthesis in Amycolatopsis balhimycina. The balhimycin yield obtained by A. balhimycina is, however, low and there is therefore a need to improve balhimycin production. In this study, we performed genome sequencing, assembly and annotation analysis of A. balhimycina and further used these annotated data to reconstruct a genome-scale metabolic model for the organism. Here we generated an almost complete A. balhimycina genome sequence comprising 10,562,587 base pairs assembled into 2,153 contigs. The high GC-genome (∼ 69%) includes 8,585 open reading frames (ORFs). We used our integrative toolbox called SEQTOR for functional annotation and then integrated annotated data with biochemical and physiological information available for this organism to reconstruct a genome-scale metabolic model of A. balhimycina. The resulting metabolic model contains 583 ORFs as protein encoding genes (7% of the predicted 8,585 ORFs), 407 EC numbers, 647 metabolites and 1,363 metabolic reactions. During the analysis of the metabolic model, linear, quadratic and evolutionary programming algorithms using flux balance analysis (FBA), minimization of metabolic adjustment (MOMA), and OptGene, respectively were applied as well as phenotypic behavior and improved balhimycin production were simulated. The A. balhimycina model shows a good agreement between in silico data and experimental data and also identifies key reactions associated with increased balhimycin production. The reconstruction of the genome-scale metabolic model of A. balhimycina serves as a basis for physiological characterization. The model allows a rational design of engineering strategies for increasing balhimycin production in A

  14. Automation on the generation of genome-scale metabolic models.

    PubMed

    Reyes, R; Gamermann, D; Montagud, A; Fuente, D; Triana, J; Urchueguía, J F; de Córdoba, P Fernández

    2012-12-01

    Nowadays, the reconstruction of genome-scale metabolic models is a nonautomatized and interactive process based on decision making. This lengthy process usually requires a full year of one person's work in order to satisfactory collect, analyze, and validate the list of all metabolic reactions present in a specific organism. In order to write this list, one manually has to go through a huge amount of genomic, metabolomic, and physiological information. Currently, there is no optimal algorithm that allows one to automatically go through all this information and generate the models taking into account probabilistic criteria of unicity and completeness that a biologist would consider. This work presents the automation of a methodology for the reconstruction of genome-scale metabolic models for any organism. The methodology that follows is the automatized version of the steps implemented manually for the reconstruction of the genome-scale metabolic model of a photosynthetic organism, Synechocystis sp. PCC6803. The steps for the reconstruction are implemented in a computational platform (COPABI) that generates the models from the probabilistic algorithms that have been developed. For validation of the developed algorithm robustness, the metabolic models of several organisms generated by the platform have been studied together with published models that have been manually curated. Network properties of the models, like connectivity and average shortest mean path of the different models, have been compared and analyzed.

  15. Large-scale data mining pilot project in human genome

    SciTech Connect

    Musick, R.; Fidelis, R.; Slezak, T.

    1997-05-01

    This whitepaper briefly describes a new, aggressive effort in large- scale data Livermore National Labs. The implications of `large- scale` will be clarified Section. In the short term, this effort will focus on several @ssion-critical questions of Genome project. We will adapt current data mining techniques to the Genome domain, to quantify the accuracy of inference results, and lay the groundwork for a more extensive effort in large-scale data mining. A major aspect of the approach is that we will be fully-staffed data warehousing effort in the human Genome area. The long term goal is strong applications- oriented research program in large-@e data mining. The tools, skill set gained will be directly applicable to a wide spectrum of tasks involving a for large spatial and multidimensional data. This includes applications in ensuring non-proliferation, stockpile stewardship, enabling Global Ecology (Materials Database Industrial Ecology), advancing the Biosciences (Human Genome Project), and supporting data for others (Battlefield Management, Health Care).

  16. Unraveling Fungal Radiation Resistance Regulatory Networks through the Genome-Wide Transcriptome and Genetic Analyses of Cryptococcus neoformans

    PubMed Central

    Jung, Kwang-Woo; Yang, Dong-Hoon; Kim, Min-Kyu; Seo, Ho Seong

    2016-01-01

    ABSTRACT The basidiomycetous fungus Cryptococcus neoformans has been known to be highly radiation resistant and has been found in fatal radioactive environments such as the damaged nuclear reactor at Chernobyl. To elucidate the mechanisms underlying the radiation resistance phenotype of C. neoformans, we identified genes affected by gamma radiation through genome-wide transcriptome analysis and characterized their functions. We found that genes involved in DNA damage repair systems were upregulated in response to gamma radiation. Particularly, deletion of recombinase RAD51 and two DNA-dependent ATPase genes, RAD54 and RDH54, increased cellular susceptibility to both gamma radiation and DNA-damaging agents. A variety of oxidative stress response genes were also upregulated. Among them, sulfiredoxin contributed to gamma radiation resistance in a peroxiredoxin/thioredoxin-independent manner. Furthermore, we found that genes involved in molecular chaperone expression, ubiquitination systems, and autophagy were induced, whereas genes involved in the biosynthesis of proteins and fatty acids/sterols were downregulated. Most importantly, we discovered a number of novel C. neoformans genes, the expression of which was modulated by gamma radiation exposure, and their deletion rendered cells susceptible to gamma radiation exposure, as well as DNA damage insults. Among these genes, we found that a unique transcription factor containing the basic leucine zipper domain, named Bdr1, served as a regulator of the gamma radiation resistance of C. neoformans by controlling expression of DNA repair genes, and its expression was regulated by the evolutionarily conserved DNA damage response protein kinase Rad53. Taken together, the current transcriptome and functional analyses contribute to the understanding of the unique molecular mechanism of the radiation-resistant fungus C. neoformans. PMID:27899501

  17. Unraveling Fungal Radiation Resistance Regulatory Networks through the Genome-Wide Transcriptome and Genetic Analyses of Cryptococcus neoformans.

    PubMed

    Jung, Kwang-Woo; Yang, Dong-Hoon; Kim, Min-Kyu; Seo, Ho Seong; Lim, Sangyong; Bahn, Yong-Sun

    2016-11-29

    The basidiomycetous fungus Cryptococcus neoformans has been known to be highly radiation resistant and has been found in fatal radioactive environments such as the damaged nuclear reactor at Chernobyl. To elucidate the mechanisms underlying the radiation resistance phenotype of C. neoformans, we identified genes affected by gamma radiation through genome-wide transcriptome analysis and characterized their functions. We found that genes involved in DNA damage repair systems were upregulated in response to gamma radiation. Particularly, deletion of recombinase RAD51 and two DNA-dependent ATPase genes, RAD54 and RDH54, increased cellular susceptibility to both gamma radiation and DNA-damaging agents. A variety of oxidative stress response genes were also upregulated. Among them, sulfiredoxin contributed to gamma radiation resistance in a peroxiredoxin/thioredoxin-independent manner. Furthermore, we found that genes involved in molecular chaperone expression, ubiquitination systems, and autophagy were induced, whereas genes involved in the biosynthesis of proteins and fatty acids/sterols were downregulated. Most importantly, we discovered a number of novel C. neoformans genes, the expression of which was modulated by gamma radiation exposure, and their deletion rendered cells susceptible to gamma radiation exposure, as well as DNA damage insults. Among these genes, we found that a unique transcription factor containing the basic leucine zipper domain, named Bdr1, served as a regulator of the gamma radiation resistance of C. neoformans by controlling expression of DNA repair genes, and its expression was regulated by the evolutionarily conserved DNA damage response protein kinase Rad53. Taken together, the current transcriptome and functional analyses contribute to the understanding of the unique molecular mechanism of the radiation-resistant fungus C. neoformans IMPORTANCE: Although there are no natural environments under intense radiation, some living organisms

  18. Life-style transitions in plant pathogenic Colletotrichum fungi deciphered by genome and transcriptome analyses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Colletotrichum species are devastating fungal pathogens of major crop plants worldwide. Infection involves differentiation of specialized cell-types associated with host surface penetration, growth inside living host cells (biotrophy) and tissue destruction (necrotrophy). Here we report genome and t...

  19. Next-generation transcriptome assembly

    SciTech Connect

    Martin, Jeffrey A.; Wang, Zhong

    2011-09-01

    Transcriptomics studies often rely on partial reference transcriptomes that fail to capture the full catalog of transcripts and their variations. Recent advances in sequencing technologies and assembly algorithms have facilitated the reconstruction of the entire transcriptome by deep RNA sequencing (RNA-seq), even without a reference genome. However, transcriptome assembly from billions of RNA-seq reads, which are often very short, poses a significant informatics challenge. This Review summarizes the recent developments in transcriptome assembly approaches - reference-based, de novo and combined strategies-along with some perspectives on transcriptome assembly in the near future.

  20. An integrated genomic-transcriptomic approach supports a role for the proto-oncogene BCL3 in atherosclerosis.

    PubMed

    Marchetti, Giovanna; Girelli, Domenico; Zerbinati, Carlotta; Lunghi, Barbara; Friso, Simonetta; Meneghetti, Silvia; Coen, Matteo; Gagliano, Teresa; Guastella, Giuseppe; Bochaton-Piallat, Marie-Luce; Pizzolo, Francesca; Mascoli, Francesco; Malerba, Giovanni; Bovolenta, Matteo; Ferracin, Manuela; Olivieri, Oliviero; Bernardi, Francesco; Martinelli, Nicola

    2015-03-01

    Data with border-line statistical significance, copiously generated in genome-wide association studies of coronary artery disease (CAD), could include functionally relevant associations. We propose an integrated genomic and transcriptomic approach for unravelling new potential genetic signatures of atherosclerosis. Fifteen among 91 single nucleotide polymorphisms (SNPs) were first selected for association in a sex- and age-adjusted model by examining 510 patients with CAD and myocardial infarction and 388 subjects with normal coronary arteries (CAD-free) in the replication stages of a genome-wide association study. We investigated the expression of 71 genes proximal to the 15 tag-SNPs by two subsequent steps of microarray-based mRNA profiling, the former in vascular smooth muscle cell populations, isolated from non-atherosclerotic and atherosclerotic human carotid portions, and the latter in whole carotid specimens. BCL3 and PVRL2, contiguously located on chromosome 19, and ABCA1, extensively investigated before, were found to be differentially expressed. BCL3 and PVRL2 SNPs were genotyped within a second population of CAD patients (n=442) and compared with CAD-free subjects (n=393). The carriership of the BCL3 rs2965169 G allele was more represented among CAD patients and remained independently associated with CAD after adjustment for all the traditional cardiovascular risk factors (odds ratio=1.70 with 95% confidence interval 1.07-2.71), while the BCL3 rs8100239 A allele correlated with metabolic abnormalities. The up-regulation of BCL3 mRNA levels in atherosclerotic tissue samples was consistent with BCL3 protein expression, which was detected by immunostaining in the intima-media of atherosclerotic specimens, but not within non-atherosclerotic ones. Our integrated approach suggests a role for BCL3 in cardiovascular diseases.

  1. Comparative genomic and transcriptomic analyses of NaCl-tolerant Staphylococcus sp. OJ82 isolated from fermented seafood.

    PubMed

    Choi, Sungjong; Jung, Jaejoon; Jeon, Che Ok; Park, Woojun

    2014-01-01

    Bacteria belonging to the Staphylococcus genus reside in various natural environments; however, only disease-associated Staphylococcus strains have received attention while ecological function and physiologies of non-pathogenic strains were often neglected. Because high level of tolerance against NaCl is a common trait of Staphylococcus, we investigated the characteristics of halotolerance in Staphylococcus sp. OJ82 isolated from fermented seafood containing a high concentration of NaCl. Among the 292 isolates screened, OJ82 showed the highest β-galactosidase and extracellular protease activities under high-salt conditions. Comparative genomic analysis with other Staphylococcus strains showed that (a) replication origins are highly conserved, (b) the OJ82 strain has a high number of amino acid transport- and metabolism-related genes, and (c) OJ82 has many unique proteins (15 %) and 12 prophage-related genomic islands. RNA-seq analysis under high-salt conditions showed that genes involved in cell membranes, transport, osmotic stress, ATP synthesis, and translation are highly expressed. OJ82 may use the ribulose monophosphate pathway to detoxify some toxic intermediates under high-salt conditions. Six new and three known non-coding small RNAs of the OJ82 strain were also found in the RNA-seq analysis. Genomic and transcriptomic analyses identified target β-galactosidase and extracellular protease. Interestingly, the OJ82 strain became resistant to bacteriocin produced by the Bacillus strain only under high-salt conditions. Our data showed that the OJ82 strain adapted to high-salt conditions by expressing core cellular processes (translation, ATP production) and defense genes (membrane synthesis, compatible solute transports, ribulose monophosphate pathway) could survive bacteriocin exposure under high-salt conditions.

  2. Genome-Guided Analysis and Whole Transcriptome Profiling of the Mesophilic Syntrophic Acetate Oxidising Bacterium Syntrophaceticus schinkii

    PubMed Central

    Manzoor, Shahid; Bongcam-Rudloff, Erik; Schnürer, Anna; Müller, Bettina

    2016-01-01

    Syntrophaceticus schinkii is a mesophilic, anaerobic bacterium capable of oxidising acetate to CO2 and H2 in intimate association with a methanogenic partner, a syntrophic relationship which operates close to the energetic limits of microbial life. Syntrophaceticus schinkii has been identified as a key organism in engineered methane-producing processes relying on syntrophic acetate oxidation as the main methane-producing pathway. However, due to strict cultivation requirements and difficulties in reconstituting the thermodynamically unfavourable acetate oxidation, the physiology of this functional group is poorly understood. Genome-guided and whole transcriptome analyses performed in the present study provide new insights into habitat adaptation, syntrophic acetate oxidation and energy conservation. The working draft genome of Syntrophaceticus schinkii indicates limited metabolic capacities, with lack of organic nutrient uptake systems, chemotactic machineries, carbon catabolite repression and incomplete biosynthesis pathways. Ech hydrogenase, [FeFe] hydrogenases, [NiFe] hydrogenases, F1F0-ATP synthase and membrane-bound and cytoplasmic formate dehydrogenases were found clearly expressed, whereas Rnf and a predicted oxidoreductase/heterodisulphide reductase complex, both found encoded in the genome, were not expressed under syntrophic growth condition. A transporter sharing similarities to the high-affinity acetate transporters of aceticlastic methanogens was also found expressed, suggesting that Syntrophaceticus schinkii can potentially compete with methanogens for acetate. Acetate oxidation seems to proceed via the Wood-Ljungdahl pathway as all genes involved in this pathway were highly expressed. This study shows that Syntrophaceticus schinkii is a highly specialised, habitat-adapted organism relying on syntrophic acetate oxidation rather than metabolic versatility. By expanding its complement of respiratory complexes, it might overcome limiting bioenergetic

  3. Genome, transcriptome, and secretome analysis of wood decay fungus postia placenta supports unique mechanisms of lignocellulose conversion

    SciTech Connect

    Martinez, Diego; Challacombe, Jean F; Misra, Monica; Xie, Gary; Brettin, Thomas; Morgenstern, Ingo; Hibbett, David; Schmoll, Monika; Kubicek, Christian P; Ferreira, Patricia; Ruiz - Duenase, Francisco J; Martinez, Angel T; Kersten, Phil; Hammel, Kenneth E; Vanden Wymelenberg, Amber; Gaskell, Jill; Lindquist, Erika; Sabati, Grzegorz; Bondurant, Sandra S; Larrondo, Luis F; Canessa, Paulo; Vicunna, Rafael; Yadavk, Jagiit; Doddapaneni, Harshavardhan; Subramaniank, Venkataramanan; Pisabarro, Antonio G; Lavin, Jose L; Oguiza, Jose A; Master, Emma; Henrissat, Bernard; Coutinho, Pedro M; Harris, Paul; Magnuson, Jon K; Baker, Scott; Bruno, Kenneth; Kenealy, William; Hoegger, Patrik J; Kues, Ursula; Ramaiva, Preethi; Lucas, Susan; Salamov, Asaf; Shapiro, Harris; Tuh, Hank; Chee, Christine L; Teter, Sarah; Yaver, Debbie; James, Tim; Mokrejs, Martin; Pospisek, Martin; Grigoriev, Igor; Rokhsar, Dan; Berka, Randy; Cullen, Dan

    2008-01-01

    Brown-rot fungi such as Postia placenta are common inhabitants of forest ecosystems and are also largely responsible for the destructive decay of wooden structures. Rapid depolymerization of cellulose is a distinguishing feature of brown-rot, but the biochemical mechanisms and underlying genetics are poorly understood. Systematic examination of the P. placenta genome, transcriptome and secretome revealed unique extracellular enzyme systems, including an unusual repertoire of extracellular glycoside hydrolases. Genes encoding exocellobiohydrolases and cellulose-binding domains, typical of cellulolytic microbes, are absent in this efficient cellulose-degrading fungus. When P. placenta was grown in medium containing cellulose as sole carbon source, transcripts corresponding to many hemicellulases and to a single putative {beta}-1-4 endoglucanase were expressed at high levels relative to glucose grown cultures. These transcript profiles were confirmed by direct identification of peptides by liquid chromatography-tandem mass spectrometry (LC{center_dot}MSIMS). Also upregulated during growth on cellulose medium were putative iron reductases, quinone reductase, and structurally divergent oxidases potentially involved in extracellular generation of Fe(II) and H202. These observations are consistent with a biodegradative role for Fenton chemistry in which Fe(II) and H202 react to form hydroxyl radicals, highly reactive oxidants capable of depolymerizing cellulose. The P. placenta genome resources provide unparalleled opportunities for investigating such unusual mechanisms of cellulose conversion. More broadly, the genome offers insight into the diversification of lignocellulose degrading mechanisms in fungi. Comparisons to the closely related white-rot fungus Phanerochaete chrysosporium support an evolutionary shift from white-rot to brown-rot during which the capacity for efficient depolymerization of lignin was lost.

  4. Transcriptome Profiling in Rat Inbred Strains and Experimental Cross Reveals Discrepant Genetic Architecture of Genome-Wide Gene Expression

    PubMed Central

    Kaisaki, Pamela J.; Otto, Georg W.; Argoud, Karène; Collins, Stephan C.; Wallis, Robert H.; Wilder, Steven P.; Yau, Anthony C. Y.; Hue, Christophe; Calderari, Sophie; Bihoreau, Marie-Thérèse; Cazier, Jean-Baptiste; Mott, Richard; Gauguier, Dominique

    2016-01-01

    To test the impact of genetic heterogeneity on cis- and trans-mediated mechanisms of gene expression regulation, we profiled the transcriptome of adipose tissue in 20 inbred congenic strains derived from diabetic Goto–Kakizaki (GK) rats and Brown–Norway (BN) controls, which contain well-defined blocks (1–183 Mb) of genetic polymorphisms, and in 123 genetically heterogeneous rats of an (GK × BN)F2 offspring. Within each congenic we identified 73–1351 differentially expressed genes (DEGs), only 7.7% of which mapped within the congenic blocks, and which may be regulated in cis. The remainder localized outside the blocks, and therefore must be regulated in trans. Most trans-regulated genes exhibited approximately twofold expression changes, consistent with monoallelic expression. Altered biological pathways were replicated between congenic strains sharing blocks of genetic polymorphisms, but polymorphisms at different loci also had redundant effects on transcription of common distant genes and pathways. We mapped 2735 expression quantitative trait loci (eQTL) in the F2 cross, including 26% predominantly cis-regulated genes, which validated DEGs in congenic strains. A hotspot of >300 eQTL in a 10 cM region of chromosome 1 was enriched in DEGs in a congenic strain. However, many DEGs among GK, BN and congenic strains did not replicate as eQTL in F2 hybrids, demonstrating distinct mechanisms of gene expression when alleles segregate in an outbred population or are fixed homozygous across the entire genome or in short genomic regions. Our analysis provides conceptual advances in our understanding of the complex architecture of genome expression and pathway regulation, and suggests a prominent impact of epistasis and monoallelic expression on gene transcription. PMID:27646706

  5. Genome Sequence and Transcriptome Analysis of Meat-Spoilage-Associated Lactic Acid Bacterium Lactococcus piscium MKFS47

    PubMed Central

    Johansson, Per; Laine, Pia; Smolander, Olli-Pekka; Sonck, Matti; Rahkila, Riitta; Jääskeläinen, Elina; Paulin, Lars; Auvinen, Petri; Björkroth, Johanna

    2015-01-01

    Lactococcus piscium is a psychrotrophic lactic acid bacterium and is known to be one of the predominant species within spoilage microbial communities in cold-stored packaged foods, particularly in meat products. Its presence in such products has been associated with the formation of buttery and sour off-odors. Nevertheless, the spoilage potential of L. piscium varies dramatically depending on the strain and growth conditions. Additional knowledge about the genome is required to explain such variation, understand its phylogeny, and study gene functions. Here, we present the complete and annotated genomic sequence of L. piscium MKFS47, combined with a time course analysis of the glucose catabolism-based transcriptome. In addition, a comparative analysis of gene contents was done for L. piscium MKFS47 and 29 other lactococci, revealing three distinct clades within the genus. The genome of L. piscium MKFS47 consists of one chromosome, carrying 2,289 genes, and two plasmids. A wide range of carbohydrates was predicted to be fermented, and growth on glycerol was observed. Both carbohydrate and glycerol catabolic pathways were significantly upregulated in the course of time as a result of glucose exhaustion. At the same time, differential expression of the pyruvate utilization pathways, implicated in the formation of spoilage substances, switched the metabolism toward a heterofermentative mode. In agreement with data from previous inoculation studies, L. piscium MKFS47 was identified as an efficient producer of buttery-odor compounds under aerobic conditions. Finally, genes and pathways that may contribute to increased survival in meat environments were considered. PMID:25819958

  6. Intraindividual dynamics of transcriptome and genome-wide stability of DNA methylation

    PubMed Central

    Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Shiwa, Yuh; Ono, Kanako; Suzuki, Sadafumi; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Shimizu, Atsushi

    2016-01-01

    Cytosine methylation at CpG dinucleotides is an epigenetic mechanism that affects the gene expression profiles responsible for the functional differences in various cells and tissues. Although gene expression patterns are dynamically altered in response to various stimuli, the intraindividual dynamics of DNA methylation in human cells are yet to be fully understood. Here, we investigated the extent to which DNA methylation contributes to the dynamics of gene expression by collecting 24 blood samples from two individuals over a period of 3 months. Transcriptome and methylome association analyses revealed that only ~2% of dynamic changes in gene expression could be explained by the intraindividual variation of DNA methylation levels in peripheral blood mononuclear cells and purified monocytes. These results showed that DNA methylation levels remain stable for at least several months, suggesting that disease-associated DNA methylation markers are useful for estimating the risk of disease manifestation. PMID:27192970

  7. MEMOSys: Bioinformatics platform for genome-scale metabolic models

    PubMed Central

    2011-01-01

    Background Recent advances in genomic sequencing have enabled the use of genome sequencing in standard biological and biotechnological research projects. The challenge is how to integrate the large amount of data in order to gain novel biological insights. One way to leverage sequence data is to use genome-scale metabolic models. We have therefore designed and implemented a bioinformatics platform which supports the development of such metabolic models. Results MEMOSys (MEtabolic MOdel research and development System) is a versatile platform for the management, storage, and development of genome-scale metabolic models. It supports the development of new models by providing a built-in version control system which offers access to the complete developmental history. Moreover, the integrated web board, the authorization system, and the definition of user roles allow collaborations across departments and institutions. Research on existing models is facilitated by a search system, references to external databases, and a feature-rich comparison mechanism. MEMOSys provides customizable data exchange mechanisms using the SBML format to enable analysis in external tools. The web application is based on the Java EE framework and offers an intuitive user interface. It currently contains six annotated microbial metabolic models. Conclusions We have developed a web-based system designed to provide researchers a novel application facilitating the management and development of metabolic models. The system is freely available at http://www.icbi.at/MEMOSys. PMID:21276275

  8. A meta-analysis of the genomic and transcriptomic composition of complex life

    PubMed Central

    Liu, Ganqiang; Mattick, John S.; Taft, Ryan J.

    2013-01-01

    It is now clear that animal genomes are predominantly non-protein-coding, and that these sequences encode a wide array of RNA transcripts and other regulatory elements that are fundamental to the development of complex life. We have previously argued that the proportion of an animal genome that is non-protein-coding DNA (ncDNA) correlates well with its apparent biological complexity. Here we extend on that work and, using data from a total of 1,627 prokaryotic and 153 eukaryotic complete and annotated genomes, show that the proportion of ncDNA per haploid genome is significantly positively correlated with a previously published proxy of biological complexity, the number of distinct cell types. This is in contrast to the amount of the genome that encodes proteins, which we show is essentially unchanged across Metazoa. Furthermore, using a total of 179 RNA-seq data sets from nematode (47), fruit fly (72), zebrafish (20) and human (42), we show, consistent with other recent reports, that the vast majority of ncDNA in animals is transcribed. This includes more than 60 human loci previously considered “gene deserts,” many of which are expressed tissue-specifically and associated with previously reported GWAS SNPs. These results suggest that ncDNA, and the ncRNAs encoded within it, may be intimately involved in the evolution, maintenance and development of complex life. PMID:23759593

  9. Selection and evaluation of novel reference genes for quantitative reverse transcription PCR (qRT-PCR) based on genome and transcriptome data in Brassica napus L.

    PubMed

    Yang, Hongli; Liu, Jing; Huang, Shunmou; Guo, Tingting; Deng, Linbin; Hua, Wei

    2014-03-15

    Selection of reference genes in Brassica napus, a tetraploid (4×) species, is a very difficult task without information on genome and transcriptome. By now, only several traditional reference genes which show significant expression differentiation under different conditions are used in B. napus. In the present study, based on genome and transcriptome data of the rapeseed Zhongshuang-11 cultivar, 14 candidate reference genes were screened for investigation in different tissues, cultivars, and treated conditions of B. napus. These genes were as follows: ELF5, ENTH, F-BOX7, F-BOX2, FYPP1, GDI1, GYF, MCP2d, OTP80, PPR, SPOC, Unknown1, Unknown2 and UBA. Among them, excluding GYF and FYPP1, another 12 genes, were identified to perform better than traditional reference genes ACTIN7 and GAPDH. To further validate the accuracy of the newly developed reference genes in normalization, expression levels of BnCAT1 (B. napus catalase 1) in different rapeseed tissues and seedlings under stress conditions were normalized by the three most stable reference genes PPR, GDI1, and ENTH and little difference existed in normalization results. To the best of our knowledge, this is the first time B. napus reference genes have been provided with the help of complete genome and transcriptome information. The new reference genes provided in this study are more accurate than previously reported reference genes in quantifying expression levels of B. napus genes.

  10. Capturing the response of Clostridium acetobutylicum to chemical stressors using a regulated genome-scale metabolic model

    SciTech Connect

    Dash, Satyakam; Mueller, Thomas J.; Venkataramanan, Keerthi P.; Papoutsakis, Eleftherios T.; Maranas, Costas D.

    2014-10-14

    Clostridia are anaerobic Gram-positive Firmicutes containing broad and flexible systems for substrate utilization, which have been used successfully to produce a range of industrial compounds. Clostridium acetobutylicum has been used to produce butanol on an industrial scale through acetone-butanol-ethanol (ABE) fermentation. A genome-scale metabolic (GSM) model is a powerful tool for understanding the metabolic capacities of an organism and developing metabolic engineering strategies for strain development. The integration of stress related specific transcriptomics information with the GSM model provides opportunities for elucidating the focal points of regulation.

  11. Transcriptome analysis of tube foot and large scale marker discovery in sea cucumber, Apostichopus japonicus.

    PubMed

    Zhou, Xiaoxu; Wang, Hongdi; Cui, Jun; Qiu, Xuemei; Chang, Yaqing; Wang, Xiuli

    2016-12-01

    Tube foot as one of the ambulacral appendages types in Aspidochirote holothurioids, is known for their functions in locomotion, feeding, chemoreception, light sensitivity and respiration. In this study, we explored the characteristic of transcriptome in the tube foot of sea cucumber (Apostichopus japonicus). Our results showed that among 390 unigenes which specifically expressed in the tube foot, 190 of them were annotated. Based on the assembly transcriptome, we found 219,860 SNPs from 34,749 unigenes, 97,683, 53,624, 27,767 and 40,786 were located in CDSs, 5'-UTRs, 3'-UTRs and non-CDS separately. Furthermore, 12,114 SSRs were detected from 7394 unigenes. Target genes of four specifically expressed miRNAs (miR-29a, miR-29b, miR-278-3p and miR-2005) in tube foot were also predicted based on the transcriptome, which contain immune-related factors (MBL, VLRA, AjC3, MyD88, CFB), skin pigmentation (MITF), candidate regeneration factor (TRP) and holothurians autolysis-related factor (CL). These results develop a relatively large number of molecular markers and transcriptome resources, and will provide a foundation for further analyses on the function and molecular mechanisms underlying A. japonicas tube foot.

  12. Parallel analysis of Arabidopsis circadian clock mutants reveals different scales of transcriptome and proteome regulation

    PubMed Central

    Graf, Alexander; Coman, Diana; Walsh, Sean; Flis, Anna; Stitt, Mark; Gruissem, Wilhelm

    2017-01-01

    The circadian clock regulates physiological processes central to growth and survival. To date, most plant circadian clock studies have relied on diurnal transcriptome changes to elucidate molecular connections between the circadian clock and observable phenotypes in wild-type plants. Here, we have integrated RNA-sequencing and protein mass spectrometry data to comparatively analyse the lhycca1, prr7prr9, gi and toc1 circadian clock mutant rosette at the end of day and end of night. Each mutant affects specific sets of genes and proteins, suggesting that the circadian clock regulation is modular. Furthermore, each circadian clock mutant maintains its own dynamically fluctuating transcriptome and proteome profile specific to subcellular compartments. Most of the measured protein levels do not correlate with changes in their corresponding transcripts. Transcripts and proteins that have coordinated changes in abundance are enriched for carbohydrate- and cold-responsive genes. Transcriptome changes in all four circadian clock mutants also affect genes encoding starch degradation enzymes, transcription factors and protein kinases. The comprehensive transcriptome and proteome datasets demonstrate that future system-driven research of the circadian clock requires multi-level experimental approaches. Our work also shows that further work is needed to elucidate the roles of post-translational modifications and protein degradation in the regulation of clock-related processes. PMID:28250106

  13. Integration of genetic, genomic and transcriptomic information identifies putative regulators of adventitious root formation in Populus

    DOE PAGES

    Ribeiro, Cintia L.; Silva, Cynthia M.; Drost, Derek R.; ...

    2016-03-16

    In this study, adventitious roots (AR) develop from tissues other than the primary root, in a process physiologically regulated by phytohormones. Adventitious roots provide structural support and contribute to water and nutrient absorption, and are critical for commercial vegetative propagation of several crops. Here we quantified the number of AR, root architectural traits and root biomass in cuttings from a pseudo-backcross population of Populus deltoides and Populus trichocarpa. Quantitative trait loci (QTL) mapping and whole-transcriptome analysis of individuals with alternative QTL alleles for AR number were used to identify putative regulators of AR development. As a result, parental individuals andmore » progeny showed extensive segregation for AR developmental traits. Quantitative trait loci for number of AR mapped consistently in the same interval of linkage group (LG) II and LG XIV, explaining 7–10 % of the phenotypic variation. A time series transcriptome analysis identified 26,121 genes differentially expressed during AR development, particularly during the first 24 h after cuttings were harvested. Of those, 1929 genes were differentially regulated between individuals carrying alternative alleles for the two QTL for number of AR, in one or more time point. Eighty-one of these genes were physically located within the QTL intervals for number of AR, including putative homologs of the Arabidopsis genes SUPERROOT2 (SUR2) and TRYPTOPHAN SYNTHASE ALPHA CHAIN (TSA1), both of which are involved in the auxin indole-3-acetic acid (IAA) biosynthesis pathway. In conclusion, this study suggests the involvement of two genes of the tryptophan-dependent auxin biosynthesis pathway, SUR2 and TSA1, in the regulation of a critical trait for the clonal propagation of woody species. A possible model for this regulation is that poplar individuals that have poor AR formation synthesize auxin indole-3-acetic acid (IAA) primarily through the tryptophan (Trp) pathway. Much of

  14. Integration of genetic, genomic and transcriptomic information identifies putative regulators of adventitious root formation in Populus

    SciTech Connect

    Ribeiro, Cintia L.; Silva, Cynthia M.; Drost, Derek R.; Novaes, Evandro; Novaes, Carolina R. D. B.; Dervinis, Christopher; Kirst, Matias

    2016-03-16

    In this study, adventitious roots (AR) develop from tissues other than the primary root, in a process physiologically regulated by phytohormones. Adventitious roots provide structural support and contribute to water and nutrient absorption, and are critical for commercial vegetative propagation of several crops. Here we quantified the number of AR, root architectural traits and root biomass in cuttings from a pseudo-backcross population of Populus deltoides and Populus trichocarpa. Quantitative trait loci (QTL) mapping and whole-transcriptome analysis of individuals with alternative QTL alleles for AR number were used to identify putative regulators of AR development. As a result, parental individuals and progeny showed extensive segregation for AR developmental traits. Quantitative trait loci for number of AR mapped consistently in the same interval of linkage group (LG) II and LG XIV, explaining 7–10 % of the phenotypic variation. A time series transcriptome analysis identified 26,121 genes differentially expressed during AR development, particularly during the first 24 h after cuttings were harvested. Of those, 1929 genes were differentially regulated between individuals carrying alternative alleles for the two QTL for number of AR, in one or more time point. Eighty-one of these genes were physically located within the QTL intervals for number of AR, including putative homologs of the Arabidopsis genes SUPERROOT2 (SUR2) and TRYPTOPHAN SYNTHASE ALPHA CHAIN (TSA1), both of which are involved in the auxin indole-3-acetic acid (IAA) biosynthesis pathway. In conclusion, this study suggests the involvement of two genes of the tryptophan-dependent auxin biosynthesis pathway, SUR2 and TSA1, in the regulation of a critical trait for the clonal propagation of woody species. A possible model for this regulation is that poplar individuals that have poor AR formation synthesize auxin indole-3-acetic acid (IAA) primarily through the tryptophan (Trp) pathway. Much of the Trp

  15. Genome, transcriptome, and functional analyses of Penicillium expansum provide new insights into secondary metabolism and pathogenicity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The relationship between secondary metabolism and infection in pathogenic fungi has remained largely elusive. Penicillium comprises a group of plant pathogens with varying host specificities and with the ability to produce a wide array of secondary metabolites. The genomes of three Penicillium exp...

  16. Genomic divergences among cattle, dog and human estimated from large-scale alignments of genomic sequences

    PubMed Central

    Liu, George E; Matukumalli, Lakshmi K; Sonstegard, Tad S; Shade, Larry L; Van Tassell, Curtis P

    2006-01-01

    Background Approximately 11 Mb of finished high quality genomic sequences were sampled from cattle, dog and human to estimate genomic divergences and their regional variation among these lineages. Results Optimal three-way multi-species global sequence alignments for 84 cattle clones or loci (each >50 kb of genomic sequence) were constructed using the human and dog genome assemblies as references. Genomic divergences and substitution rates were examined for each clone and for various sequence classes under different functional constraints. Analysis of these alignments revealed that the overall genomic divergences are relatively constant (0.32–0.37 change/site) for pairwise comparisons among cattle, dog and human; however substitution rates vary across genomic regions and among different sequence classes. A neutral mutation rate (2.0–2.2 × 10(-9) change/site/year) was derived from ancestral repetitive sequences, whereas the substitution rate in coding sequences (1.1 × 10(-9) change/site/year) was approximately half of the overall rate (1.9–2.0 × 10(-9) change/site/year). Relative rate tests also indicated that cattle have a significantly faster rate of substitution as compared to dog and that this difference is about 6%. Conclusion This analysis provides a large-scale and unbiased assessment of genomic divergences and regional variation of substitution rates among cattle, dog and human. It is expected that these data will serve as a baseline for future mammalian molecular evolution studies. PMID:16759380

  17. Elucidating the modes of action for bioactive compounds in a cell-specific manner by large-scale chemically-induced transcriptomics

    PubMed Central

    Iwata, Michio; Sawada, Ryusuke; Iwata, Hiroaki; Kotera, Masaaki; Yamanishi, Yoshihiro

    2017-01-01

    The identification of the modes of action of bioactive compounds is a major challenge in chemical systems biology of diseases. Genome-wide expression profiling of transcriptional responses to compound treatment for human cell lines is a promising unbiased approach for the mode-of-action analysis. Here we developed a novel approach to elucidate the modes of action of bioactive compounds in a cell-specific manner using large-scale chemically-induced transcriptome data acquired from the Library of Integrated Network-based Cellular Signatures (LINCS), and analyzed 16,268 compounds and 68 human cell lines. First, we performed pathway enrichment analyses of regulated genes to reveal active pathways among 163 biological pathways. Next, we explored potential target proteins (including primary targets and off-targets) with cell-specific transcriptional similarity using chemical–protein interactome. Finally, we predicted new therapeutic indications for 461 diseases based on the target proteins. We showed the usefulness of the proposed approach in terms of prediction coverage, interpretation, and large-scale applicability, and validated the new prediction results experimentally by an in vitro cellular assay. The approach has a high potential for advancing drug discovery and repositioning. PMID:28071740

  18. The transcriptomes of novel marmoset monkey embryonic stem cell lines reflect distinct genomic features

    PubMed Central

    Debowski, Katharina; Drummer, Charis; Lentes, Jana; Cors, Maren; Dressel, Ralf; Lingner, Thomas; Salinas-Riester, Gabriela; Fuchs, Sigrid; Sasaki, Erika; Behr, Rüdiger

    2016-01-01

    Embryonic stem cells (ESCs) are useful for the study of embryonic development. However, since research on naturally conceived human embryos is limited, non-human primate (NHP) embryos and NHP ESCs represent an excellent alternative to the corresponding human entities. Though, ESC lines derived from naturally conceived NHP embryos are still very rare. Here, we report the generation and characterization of four novel ESC lines derived from natural preimplantation embryos of the common marmoset monkey (Callithrix jacchus). For the first time we document derivation of NHP ESCs derived from morula stages. We show that quantitative chromosome-wise transcriptome analyses precisely reflect trisomies present in both morula-derived ESC lines. We also demonstrate that the female ESC lines exhibit different states of X-inactivation which is impressively reflected by the abundance of the lncRNA X inactive-specific transcript (XIST). The novel marmoset ESC lines will promote basic primate embryo and ESC studies as well as preclinical testing of ESC-based regenerative approaches in NHP. PMID:27385131

  19. Transcriptomics and functional genomics of plant defence induction by phloem-feeding insects.

    PubMed

    Thompson, Gary A; Goggin, Fiona L

    2006-01-01

    The relationship between phloem-feeding insects (PFIs) and plants offers an intriguing example of a highly specialized biotic interaction. These insects have evolved to survive on a nutritionally imbalanced diet of phloem sap, and to minimize wound responses in their host plants. As a consequence, plant perception of and responses to PFIs differ from plant interactions with other insect-feeding guilds. Transcriptome-wide analyses of gene expression are currently being applied to characterize plant responses to PFIs in crop plants with race-specific innate resistance, as well as in compatible interactions with susceptible hosts. Recent studies indicate that PFIs induce transcriptional reprogramming in their host plants, and that plant responses to PFIs appear to be quantitatively and qualitatively different from responses to other insects or pathogens. Transcript profiling studies also suggest that PFIs induce cell wall modifications, reduce photosynthetic activity, manipulate source-sink relations, and modify secondary metabolism in their hosts, and many of these responses appear to occur within the phloem tissue. Plant responses to these insects appear to be regulated in part by the salicylate, jasmonate, and ethylene signalling pathways. As additional transcript profiling data become available, forward and reverse genetic approaches will be necessary to determine which changes in gene expression influence resistance or susceptibility to PFIs.

  20. Identification of candidate genes associated with porcine meat color traits by genome-wide transcriptome analysis

    PubMed Central

    Li, Bojiang; Dong, Chao; Li, Pinghua; Ren, Zhuqing; Wang, Han; Yu, Fengxiang; Ning, Caibo; Liu, Kaiqing; Wei, Wei; Huang, Ruihua; Chen, Jie; Wu, Wangjun; Liu, Honglin

    2016-01-01

    Meat color is considered to be the most important indicator of meat quality, however, the molecular mechanisms underlying traits related to meat color remain mostly unknown. In this study, to elucidate the molecular basis of meat color, we constructed six cDNA libraries from biceps femoris (Bf) and soleus (Sol), which exhibit obvious differences in meat color, and analyzed the whole-transcriptome differences between Bf (white muscle) and Sol (red muscle) using high-throughput sequencing technology. Using DEseq2 method, we identified 138 differentially expressed genes (DEGs) between Bf and Sol. Using DEGseq method, we identified 770, 810, and 476 DEGs in comparisons between Bf and Sol in three separate animals. Of these DEGs, 52 were overlapping DEGs. Using these data, we determined the enriched GO terms, metabolic pathways and candidate genes associated with meat color traits. Additionally, we mapped 114 non-redundant DEGs to the meat color QTLs via a comparative analysis with the porcine quantitative trait loci (QTL) database. Overall, our data serve as a valuable resource for identifying genes whose functions are critical for meat color traits and can accelerate studies of the molecular mechanisms of meat color formation. PMID:27748458

  1. Animal tracking meets migration genomics: transcriptomic analysis of a partially migratory bird species.

    PubMed

    Franchini, Paolo; Irisarri, Iker; Fudickar, Adam; Schmidt, Andreas; Meyer, Axel; Wikelski, Martin; Partecke, Jesko

    2017-03-17

    Seasonal migration is a widespread phenomenon, which is found in many different lineages of animals. This spectacular behaviour allows animals to avoid seasonally adverse environmental conditions to exploit more favourable habitats. Migration has been intensively studied in birds, which display astonishing variation in migration strategies, thus providing a powerful system for studying the ecological and evolutionary processes that shape migratory behaviour. Despite intensive research, the genetic basis of migration remains largely unknown. Here we used state-of-the-art radio-tracking technology to characterize the migratory behaviour of a partially migratory population of European blackbirds (Turdus merula) in southern Germany. We compared gene expression of resident and migrant individuals using high-throughput transcriptomics in blood samples. Analyses of sequence variation revealed a non-significant genetic structure between blackbirds differing by their migratory phenotype. We detected only four differentially expressed genes between migrants and residents, which might be associated with hyperphagia, moulting, and enhanced DNA replication and transcription. The most pronounced changes in gene expression occurred between migratory birds depending on when, in relation to their date of departure, blood was collected. Overall, the differentially expressed genes detected in this analysis may play crucial roles in determining the decision to migrate, or in controlling the physiological processes required for the onset of migration. These results provide new insights into, and testable hypotheses for, the molecular mechanisms controlling the migratory phenotype and its underlying physiological mechanisms in blackbirds and other migratory bird species. This article is protected by copyright. All rights reserved.

  2. Characterizing acetogenic metabolism using a genome-scale metabolic reconstruction of Clostridium ljungdahlii

    SciTech Connect

    Nagarajan, H; Sahin, M; Nogales, J; Latif, H; Lovley, DR; Ebrahim, A; Zengler, K

    2013-11-25

    Background: The metabolic capabilities of acetogens to ferment a wide range of sugars, to grow autotrophically on H-2/CO2, and more importantly on synthesis gas (H-2/CO/CO2) make them very attractive candidates as production hosts for biofuels and biocommodities. Acetogenic metabolism is considered one of the earliest modes of bacterial metabolism. A thorough understanding of various factors governing the metabolism, in particular energy conservation mechanisms, is critical for metabolic engineering of acetogens for targeted production of desired chemicals. Results: Here, we present the genome-scale metabolic network of Clostridium ljungdahlii, the first such model for an acetogen. This genome-scale model (iHN637) consisting of 637 genes, 785 reactions, and 698 metabolites captures all the major central metabolic and biosynthetic pathways, in particular pathways involved in carbon fixation and energy conservation. A combination of metabolic modeling, with physiological and transcriptomic data provided insights into autotrophic metabolism as well as aided the characterization of a nitrate reduction pathway in C. ljungdahlii. Analysis of the iHN637 metabolic model revealed that flavin based electron bifurcation played a key role in energy conservation during autotrophic growth and helped identify genes for some of the critical steps in this mechanism. Conclusions: iHN637 represents a predictive model that recapitulates experimental data, and provides valuable insights into the metabolic response of C. ljungdahlii to genetic perturbations under various growth conditions. Thus, the model will be instrumental in guiding metabolic engineering of C. ljungdahlii for the industrial production of biocommodities and biofuels.

  3. Characterizing acetogenic metabolism using a genome-scale metabolic reconstruction of Clostridium ljungdahlii

    PubMed Central

    2013-01-01

    Background The metabolic capabilities of acetogens to ferment a wide range of sugars, to grow autotrophically on H2/CO2, and more importantly on synthesis gas (H2/CO/CO2) make them very attractive candidates as production hosts for biofuels and biocommodities. Acetogenic metabolism is considered one of the earliest modes of bacterial metabolism. A thorough understanding of various factors governing the metabolism, in particular energy conservation mechanisms, is critical for metabolic engineering of acetogens for targeted production of desired chemicals. Results Here, we present the genome-scale metabolic network of Clostridium ljungdahlii, the first such model for an acetogen. This genome-scale model (iHN637) consisting of 637 genes, 785 reactions, and 698 metabolites captures all the major central metabolic and biosynthetic pathways, in particular pathways involved in carbon fixation and energy conservation. A combination of metabolic modeling, with physiological and transcriptomic data provided insights into autotrophic metabolism as well as aided the characterization of a nitrate reduction pathway in C. ljungdahlii. Analysis of the iHN637 metabolic model revealed that flavin based electron bifurcation played a key role in energy conservation during autotrophic growth and helped identify genes for some of the critical steps in this mechanism. Conclusions iHN637 represents a predictive model that recapitulates experimental data, and provides valuable insights into the metabolic response of C. ljungdahlii to genetic perturbations under various growth conditions. Thus, the model will be instrumental in guiding metabolic engineering of C. ljungdahlii for the industrial production of biocommodities and biofuels. PMID:24274140

  4. Genome-Wide Association and Transcriptome Analyses Reveal Candidate Genes Underlying Yield-determining Traits in Brassica napus

    PubMed Central

    Lu, Kun; Peng, Liu; Zhang, Chao; Lu, Junhua; Yang, Bo; Xiao, Zhongchun; Liang, Ying; Xu, Xingfu; Qu, Cunmin; Zhang, Kai; Liu, Liezhao; Zhu, Qinlong; Fu, Minglian; Yuan, Xiaoyan; Li, Jiana

    2017-01-01

    Yield is one of the most important yet complex crop traits. To improve our understanding of the genetic basis of yield establishment, and to identify candidate genes responsible for yield improvement in Brassica napus, we performed genome-wide association studies (GWAS) for seven yield-determining traits [main inflorescence pod number (MIPN), branch pod number (BPN), pod number per plant (PNP), seed number per pod (SPP), thousand seed weight, main inflorescence yield (MIY), and branch yield], using data from 520 diverse B. napus accessions from two different yield environments. In total, we detected 128 significant single nucleotide polymorphisms (SNPs), 93 of which were revealed as novel by integrative analysis. A combination of GWAS and transcriptome sequencing on 21 haplotype blocks from samples pooled by four extremely high-yielding or low-yielding accessions revealed the differential expression of 14 crucial candiate genes (such as Bna.MYB83, Bna.SPL5, and Bna.ROP3) associated with multiple traits or containing multiple SNPs associated with the same trait. Functional annotation and expression pattern analyses further demonstrated that these 14 candiate genes might be important in developmental processes and biomass accumulation, thus affecting the yield establishment of B. napus. These results provide valuable information for understanding the genetic mechanisms underlying the establishment of high yield in B. napus, and lay the foundation for developing high-yielding B. napus varieties. PMID:28261256

  5. Genome-wide transcriptome analysis of the plant pathogen Xanthomonas identifies sRNAs with putative virulence functions

    PubMed Central

    Schmidtke, Cornelius; Findeiß, Sven; Sharma, Cynthia M.; Kuhfuß, Juliane; Hoffmann, Steve; Vogel, Jörg; Stadler, Peter F.; Bonas, Ulla

    2012-01-01

    The Gram-negative plant-pathogenic bacterium Xanthomonas campestris pv. vesicatoria (Xcv) is an important model to elucidate the mechanisms involved in the interaction with the host. To gain insight into the transcriptome of the Xcv strain 85–10, we took a differential RNA sequencing (dRNA-seq) approach. Using a novel method to automatically generate comprehensive transcription start site (TSS) maps we report 1421 putative TSSs in the Xcv genome. Genes in Xcv exhibit a poorly conserved −10 promoter element and no consensus Shine-Dalgarno sequence. Moreover, 14% of all mRNAs are leaderless and 13% of them have unusually long 5′-UTRs. Northern blot analyses confirmed 16 intergenic small RNAs and seven cis-encoded antisense RNAs in Xcv. Expression of eight intergenic transcripts was controlled by HrpG and HrpX, key regulators of the Xcv type III secretion system. More detailed characterization identified sX12 as a small RNA that controls virulence of Xcv by affecting the interaction of the pathogen and its host plants. The transcriptional landscape of Xcv is unexpectedly complex, featuring abundant antisense transcripts, alternative TSSs and clade-specific small RNAs. PMID:22080557

  6. Genome wide transcriptome profiling of Fusarium oxysporum f sp. ciceris conidial germination reveals new insights into infection-related genes

    PubMed Central

    Sharma, Mamta; Sengupta, Anindita; Ghosh, Raju; Agarwal, Gaurav; Tarafdar, Avijit; Nagavardhini, A; Pande, Suresh; Varshney, Rajeev K

    2016-01-01

    Vascular wilt caused by Fusarium oxysporum f. sp. ciceris (Foc) is a serious disease of chickpea (Cicer arietinum L.) accounting for approximately 10–15% annual crop loss. The fungus invades the plant via roots, colonizes the xylem vessels and prevents the upward translocation of water and nutrients. Infection is initiated by conidia that invade the host tissue often by penetration of intact epidermal cells. Here, we report the characterization of the transcriptome of Foc sequenced using Illumina Hiseq technology during its conidial germination at different time points. Genome-wide expression profiling revealed that genes linked to fungal development are transcribed in successive ways. Analysis showed that Foc have large sets of germination-related genes and families of genes encoding secreted effectors, cell wall/pectin-degrading enzymes, metabolism related enzymes, transporters and peptidases. We found that metabolism related enzymes are up-regulated at early time point whereas most transporters and secondary metabolites important for tissue colonization and pathogenicity are up-regulated later as evident from the qRT-PCR. The study demonstrated that early conidial germination in Foc is accompanied by rapid shifts in gene expression that prepare the fungus for germ tube outgrowth, host cell invasion and pathogenesis. This work lays the foundation for facilitating further research towards understanding this host-pathogen interaction. PMID:27853284

  7. Unraveling adaptation of Pontibacter korlensis to radiation and infertility in desert through complete genome and comparative transcriptomic analysis

    PubMed Central

    Dai, Jun; Dai, Wenkui; Qiu, Chuangzhao; Yang, Zhenyu; Zhang, Yi; Zhou, Mengzhou; Zhang, Lei; Fang, Chengxiang; Gao, Qiang; Yang, Qiao; Li, Xin; Wang, Zhi; Wang, Zhiyong; Jia, Zhenhua; Chen, Xiong

    2015-01-01

    The desert is a harsh habitat for flora and microbial life due to its aridness and strong radiation. In this study, we constructed the first complete and deeply annotated genome of the genus Pontibacter (Pontibacter korlensis X14-1T = CCTCC AB 206081T, X14-1). Reconstruction of the sugar metabolism process indicated that strain X14-1 can utilize diverse sugars, including cellulose, starch and sucrose; this result is consistent with previous experiments. Strain X14-1 is also able to resist desiccation and radiation in the desert through well-armed systems related to DNA repair, radical oxygen species (ROS) detoxification and the OstAB and TreYZ pathways for trehalose synthesis. A comparative transcriptomic analysis under gamma radiation revealed that strain X14-1 presents high-efficacy operating responses to radiation, including the robust expression of catalase and the manganese transport protein. Evaluation of 73 novel genes that are differentially expressed showed that some of these genes may contribute to the strain’s adaptation to radiation and desiccation through ferric transport and preservation. PMID:26057562

  8. Recurrent mutation of the ID3 gene in Burkitt lymphoma identified by integrated genome, exome and transcriptome sequencing.

    PubMed

    Richter, Julia; Schlesner, Matthias; Hoffmann, Steve; Kreuz, Markus; Leich, Ellen; Burkhardt, Birgit; Rosolowski, Maciej; Ammerpohl, Ole; Wagener, Rabea; Bernhart, Stephan H; Lenze, Dido; Szczepanowski, Monika; Paulsen, Maren; Lipinski, Simone; Russell, Robert B; Adam-Klages, Sabine; Apic, Gordana; Claviez, Alexander; Hasenclever, Dirk; Hovestadt, Volker; Hornig, Nadine; Korbel, Jan O; Kube, Dieter; Langenberger, David; Lawerenz, Chris; Lisfeld, Jasmin; Meyer, Katharina; Picelli, Simone; Pischimarov, Jordan; Radlwimmer, Bernhard; Rausch, Tobias; Rohde, Marius; Schilhabel, Markus; Scholtysik, René; Spang, Rainer; Trautmann, Heiko; Zenz, Thorsten; Borkhardt, Arndt; Drexler, Hans G; Möller, Peter; MacLeod, Roderick A F; Pott, Christiane; Schreiber, Stefan; Trümper, Lorenz; Loeffler, Markus; Stadler, Peter F; Lichter, Peter; Eils, Roland; Küppers, Ralf; Hummel, Michael; Klapper, Wolfram; Rosenstiel, Philip; Rosenwald, Andreas; Brors, Benedikt; Siebert, Reiner

    2012-12-01

    Burkitt lymphoma is a mature aggressive B-cell lymphoma derived from germinal center B cells. Its cytogenetic hallmark is the Burkitt translocation t(8;14)(q24;q32) and its variants, which juxtapose the MYC oncogene with one of the three immunoglobulin loci. Consequently, MYC is deregulated, resulting in massive perturbation of gene expression. Nevertheless, MYC deregulation alone seems not to be sufficient to drive Burkitt lymphomagenesis. By whole-genome, whole-exome and transcriptome sequencing of four prototypical Burkitt lymphomas with immunoglobulin gene (IG)-MYC translocation, we identified seven recurrently mutated genes. One of these genes, ID3, mapped to a region of focal homozygous loss in Burkitt lymphoma. In an extended cohort, 36 of 53 molecularly defined Burkitt lymphomas (68%) carried potentially damaging mutations of ID3. These were strongly enriched at somatic hypermutation motifs. Only 6 of 47 other B-cell lymphomas with the IG-MYC translocation (13%) carried ID3 mutations. These findings suggest that cooperation between ID3 inactivation and IG-MYC translocation is a hallmark of Burkitt lymphomagenesis.

  9. Genome-wide transcriptomic analysis uncovers the molecular basis underlying early flowering and apetalous characteristic in Brassica napus L

    PubMed Central

    Yu, Kunjiang; Wang, Xiaodong; Chen, Feng; Chen, Song; Peng, Qi; Li, Hongge; Zhang, Wei; Hu, Maolong; Chu, Pu; Zhang, Jiefu; Guan, Rongzhan

    2016-01-01

    Floral transition and petal onset, as two main aspects of flower development, are crucial to rapeseed evolutionary success and yield formation. Currently, very little is known regarding the genetic architecture that regulates flowering time and petal morphogenesis in Brassica napus. In the present study, a genome-wide transcriptomic analysis was performed with an absolutely apetalous and early flowering line, APL01, and a normally petalled line, PL01, using high-throughput RNA sequencing. In total, 13,205 differential expressed genes were detected, of which 6111 genes were significantly down-regulated, while 7094 genes were significantly up-regulated in the young inflorescences of APL01 compared with PL01. The expression levels of a vast number of genes involved in protein biosynthesis were altered in response to the early flowering and apetalous character. Based on the putative rapeseed flowering genes, an early flowering network, mainly comprised of vernalization and photoperiod pathways, was built. Additionally, 36 putative upstream genes possibly governing the apetalous character of line APL01 were identified, and six genes potentially regulating petal origination were obtained by combining with three petal-related quantitative trait loci. These findings will facilitate understanding of the molecular mechanisms underlying floral transition and petal initiation in B. napus. PMID:27460760

  10. Integration of genomic, transcriptomic and proteomic data identifies two biologically distinct subtypes of invasive lobular breast cancer

    PubMed Central

    Michaut, Magali; Chin, Suet-Feung; Majewski, Ian; Severson, Tesa M.; Bismeijer, Tycho; de Koning, Leanne; Peeters, Justine K.; Schouten, Philip C.; Rueda, Oscar M.; Bosma, Astrid J.; Tarrant, Finbarr; Fan, Yue; He, Beilei; Xue, Zheng; Mittempergher, Lorenza; Kluin, Roelof J.C.; Heijmans, Jeroen; Snel, Mireille; Pereira, Bernard; Schlicker, Andreas; Provenzano, Elena; Ali, Hamid Raza; Gaber, Alexander; O’Hurley, Gillian; Lehn, Sophie; Muris, Jettie J.F.; Wesseling, Jelle; Kay, Elaine; Sammut, Stephen John; Bardwell, Helen A.; Barbet, Aurélie S.; Bard, Floriane; Lecerf, Caroline; O’Connor, Darran P.; Vis, Daniël J.; Benes, Cyril H.; McDermott, Ultan; Garnett, Mathew J.; Simon, Iris M.; Jirström, Karin; Dubois, Thierry; Linn, Sabine C.; Gallagher, William M.; Wessels, Lodewyk F.A.; Caldas, Carlos; Bernards, Rene

    2016-01-01

    Invasive lobular carcinoma (ILC) is the second most frequently occurring histological breast cancer subtype after invasive ductal carcinoma (IDC), accounting for around 10% of all breast cancers. The molecular processes that drive the development of ILC are still largely unknown. We have performed a comprehensive genomic, transcriptomic and proteomic analysis of a large ILC patient cohort and present here an integrated molecular portrait of ILC. Mutations in CDH1 and in the PI3K pathway are the most frequent molecular alterations in ILC. We identified two main subtypes of ILCs: (i) an immune related subtype with mRNA up-regulation of PD-L1, PD-1 and CTLA-4 and greater sensitivity to DNA-damaging agents in representative cell line models; (ii) a hormone related subtype, associated with Epithelial to Mesenchymal Transition (EMT), and gain of chromosomes 1q and 8q and loss of chromosome 11q. Using the somatic mutation rate and eIF4B protein level, we identified three groups with different clinical outcomes, including a group with extremely good prognosis. We provide a comprehensive overview of the molecular alterations driving ILC and have explored links with therapy response. This molecular characterization may help to tailor treatment of ILC through the application of specific targeted, chemo- and/or immune-therapies. PMID:26729235

  11. Complete genome sequence and transcriptomic analysis of a novel marine strain Bacillus weihaiensis reveals the mechanism of brown algae degradation.

    PubMed

    Zhu, Yueming; Chen, Peng; Bao, Yunjuan; Men, Yan; Zeng, Yan; Yang, Jiangang; Sun, Jibin; Sun, Yuanxia

    2016-11-30

    A novel marine strain representing efficient degradation ability toward brown algae was isolated, identified, and assigned to Bacillus weihaiensis Alg07. The alga-associated marine bacteria promote the nutrient cycle and perform important functions in the marine ecosystem. The de novo sequencing of the B. weihaiensis Alg07 genome was carried out. Results of gene annotation and carbohydrate-active enzyme analysis showed that the strain harbored enzymes that can completely degrade alginate and laminarin, which are the specific polysaccharides of brown algae. We also found genes for the utilization of mannitol, the major storage monosaccharide in the cell of brown algae. To understand the process of brown algae decomposition by B. weihaiensis Alg07, RNA-seq transcriptome analysis and qRT-PCR were performed. The genes involved in alginate metabolism were all up-regulated in the initial stage of kelp degradation, suggesting that the strain Alg07 first degrades alginate to destruct the cell wall so that the laminarin and mannitol are released and subsequently decomposed. The key genes involved in alginate and laminarin degradation were expressed in Escherichia coli and characterized. Overall, the model of brown algae degradation by the marine strain Alg07 was established, and novel alginate lyases and laminarinase were discovered.

  12. Complete genome sequence and transcriptomic analysis of a novel marine strain Bacillus weihaiensis reveals the mechanism of brown algae degradation

    PubMed Central

    Zhu, Yueming; Chen, Peng; Bao, Yunjuan; Men, Yan; Zeng, Yan; Yang, Jiangang; Sun, Jibin; Sun, Yuanxia

    2016-01-01

    A novel marine strain representing efficient degradation ability toward brown algae was isolated, identified, and assigned to Bacillus weihaiensis Alg07. The alga-associated marine bacteria promote the nutrient cycle and perform important functions in the marine ecosystem. The de novo sequencing of the B. weihaiensis Alg07 genome was carried out. Results of gene annotation and carbohydrate-active enzyme analysis showed that the strain harbored enzymes that can completely degrade alginate and laminarin, which are the specific polysaccharides of brown algae. We also found genes for the utilization of mannitol, the major storage monosaccharide in the cell of brown algae. To understand the process of brown algae decomposition by B. weihaiensis Alg07, RNA-seq transcriptome analysis and qRT-PCR were performed. The genes involved in alginate metabolism were all up-regulated in the initial stage of kelp degradation, suggesting that the strain Alg07 first degrades alginate to destruct the cell wall so that the laminarin and mannitol are released and subsequently decomposed. The key genes involved in alginate and laminarin degradation were expressed in Escherichia coli and characterized. Overall, the model of brown algae degradation by the marine strain Alg07 was established, and novel alginate lyases and laminarinase were discovered. PMID:27901120

  13. Genomic, Transcriptomic, and Proteomic Analysis Provide Insights Into the Cold Adaptation Mechanism of the Obligate Psychrophilic Fungus Mrakia psychrophila

    PubMed Central

    Su, Yao; Jiang, Xianzhi; Wu, Wenping; Wang, Manman; Hamid, M. Imran; Xiang, Meichun; Liu, Xingzhong

    2016-01-01

    Mrakia psychrophila is an obligate psychrophilic fungus. The cold adaptation mechanism of psychrophilic fungi remains unknown. Comparative genomics analysis indicated that M. psychrophila had a specific codon usage preference, especially for codons of Gly and Arg and its major facilitator superfamily (MFS) transporter gene family was expanded. Transcriptomic analysis revealed that genes involved in ribosome and energy metabolism were upregulated at 4°, while genes involved in unfolded protein binding, protein processing in the endoplasmic reticulum, proteasome, spliceosome, and mRNA surveillance were upregulated at 20°. In addition, genes related to unfolded protein binding were alternatively spliced. Consistent with other psychrophiles, desaturase and glycerol 3-phosphate dehydrogenase, which are involved in biosynthesis of unsaturated fatty acid and glycerol respectively, were upregulated at 4°. Cold adaptation of M. psychrophila is mediated by synthesizing unsaturated fatty acids to maintain membrane fluidity and accumulating glycerol as a cryoprotectant. The proteomic analysis indicated that the correlations between the dynamic patterns between transcript level changes and protein level changes for some pathways were positive at 4°, but negative at 20°. The death of M. psychrophila above 20° might be caused by an unfolded protein response. PMID:27633791

  14. Carotenoid metabolic profiling and transcriptome-genome mining reveal functional equivalence among blue-pigmented copepods and appendicularia.

    PubMed

    Mojib, Nazia; Amad, Maan; Thimma, Manjula; Aldanondo, Naroa; Kumaran, Mande; Irigoien, Xabier

    2014-06-01

    The tropical oligotrophic oceanic areas are characterized by high water transparency and annual solar radiation. Under these conditions, a large number of phylogenetically diverse mesozooplankton species living in the surface waters (neuston) are found to be blue pigmented. In the present study, we focused on understanding the metabolic and genetic basis of the observed blue phenotype functional equivalence between the blue-pigmented organisms from the phylum Arthropoda, subclass Copepoda (Acartia fossae) and the phylum Chordata, class Appendicularia (Oikopleura dioica) in the Red Sea. Previous studies have shown that carotenoid-protein complexes are responsible for blue coloration in crustaceans. Therefore, we performed carotenoid metabolic profiling using both targeted and nontargeted (high-resolution mass spectrometry) approaches in four different blue-pigmented genera of copepods and one blue-pigmented species of appendicularia. Astaxanthin was found to be the principal carotenoid in all the species. The pathway analysis showed that all the species can synthesize astaxanthin from β-carotene, ingested from dietary sources, via 3-hydroxyechinenone, canthaxanthin, zeaxanthin, adonirubin or adonixanthin. Further, using de novo assembled transcriptome of blue A. fossae (subclass Copepoda), we identified highly expressed homologous β-carotene hydroxylase enzymes and putative carotenoid-binding proteins responsible for astaxanthin formation and the blue phenotype. In blue O. dioica (class Appendicularia), corresponding putative genes were identified from the reference genome. Collectively, our data provide molecular evidences for the bioconversion and accumulation of blue astaxanthin-protein complexes underpinning the observed ecological functional equivalence and adaptive convergence among neustonic mesozooplankton.

  15. Genome-wide transcriptome profiling provides insights into floral bud development of summer-flowering Camellia azalea.

    PubMed

    Fan, Zhengqi; Li, Jiyuan; Li, Xinlei; Wu, Bin; Wang, Jiangying; Liu, Zhongchi; Yin, Hengfu

    2015-05-15

    The transition from vegetative to reproductive growth in woody perennials involves pathways controlling flowering timing, bud dormancy and outgrowth in responses to seasonal cues. However little is known about the mechanism governing the adaptation of signaling pathways to environmental conditions in trees. Camellia azalea is a rare species in this genus flowering during summer, which provides a unique resource for floral timing breeding. Here we reported a comprehensive transcriptomics study to capture the global gene profiles during floral bud development in C. azalea. We examined the genome-wide gene expression between three developmental stages including floral bud initiation, floral organ differentiation and bud outgrowth, and identified nine co-expression clusters with distinctive patterns. Further, we identified the differential expressed genes (DEGs) during development and characterized the functional properties of DEGs by Gene Ontology analysis. We showed that transition from floral bud initiation to floral organ differentiation required changes of genes in flowering timing regulation, while transition to floral bud outgrowth was regulated by various pathways such as cold and light signaling, phytohormone pathways and plant metabolisms. Further analyses of dormancy associated MADS-box genes revealed that SVP- and AGL24- like genes displayed distinct expression patterns suggesting divergent roles during floral bud development.

  16. First insights into the transcriptome and development of new genomic tools of a widespread circum-Mediterranean tree species, Pinus halepensis Mill.

    PubMed

    Pinosio, S; González-Martínez, S C; Bagnoli, F; Cattonaro, F; Grivet, D; Marroni, F; Lorenzo, Z; Pausas, J G; Verdú, M; Vendramin, G G

    2014-07-01

    Aleppo pine (Pinus halepensis Mill.) is a relevant conifer species for studying adaptive responses to drought and fire regimes in the Mediterranean region. In this study, we performed Illumina next-generation sequencing of two phenotypically divergent Aleppo pine accessions with the aims of (i) characterizing the transcriptome through Illumina RNA-Seq on trees phenotypically divergent for adaptive traits linked to fire adaptation and drought, (ii) performing a functional annotation of the assembled transcriptome, (iii) identifying genes with accelerated evolutionary rates, (iv) studying the expression levels of the annotated genes and (v) developing gene-based markers for population genomic and association genetic studies. The assembled transcriptome consisted of 48,629 contigs and covered about 54.6 Mbp. The comparison of Aleppo pine transcripts to Picea sitchensis protein-coding sequences resulted in the detection of 34,014 SNPs across species, with a Ka /Ks average value of 0.216, suggesting that the majority of the assembled genes are under negative selection. Several genes were differentially expressed across the two pine accessions with contrasted phenotypes, including a glutathione-s-transferase, a cellulose synthase and a cobra-like protein. A large number of new markers (3334 amplifiable SSRs and 28,236 SNPs) have been identified which should facilitate future population genomics and association genetics in this species. A 384-SNP Oligo Pool Assay for genotyping with the Illumina VeraCode technology has been designed which showed an high overall SNP conversion rate (76.6%). Our results showed that Illumina next-generation sequencing is a valuable technology to obtain an extensive overview on whole transcriptomes of nonmodel species with large genomes.

  17. Genome-scale models of bacterial metabolism: reconstruction and applications

    PubMed Central

    Durot, Maxime; Bourguignon, Pierre-Yves; Schachter, Vincent

    2009-01-01

    Genome-scale metabolic models bridge the gap between genome-derived biochemical information and metabolic phenotypes in a principled manner, providing a solid interpretative framework for experimental data related to metabolic states, and enabling simple in silico experiments with whole-cell metabolism. Models have been reconstructed for almost 20 bacterial species, so far mainly through expert curation efforts integrating information from the literature with genome annotation. A wide variety of computational methods exploiting metabolic models have been developed and applied to bacteria, yielding valuable insights into bacterial metabolism and evolution, and providing a sound basis for computer-assisted design in metabolic engineering. Recent advances in computational systems biology and high-throughput experimental technologies pave the way for the systematic reconstruction of metabolic models from genomes of new species, and a corresponding expansion of the scope of their applications. In this review, we provide an introduction to the key ideas of metabolic modeling, survey the methods, and resources that enable model reconstruction and refinement, and chart applications to the investigation of global properties of metabolic systems, the interpretation of experimental results, and the re-engineering of their biochemical capabilities. PMID:19067749

  18. Genome-scale evidence of the nematode-arthropod clade

    PubMed Central

    Dopazo, Hernán; Dopazo, Joaquín

    2005-01-01

    Background The issue of whether coelomates form a single clade, the Coelomata, or whether all animals that moult an exoskeleton (such as the coelomate arthropods and the pseudocoelomate nematodes) form a distinct clade, the Ecdysozoa, is the most puzzling issue in animal systematics and a major open-ended subject in evolutionary biology. Previous single-gene and genome-scale analyses designed to resolve the issue have produced contradictory results. Here we present the first genome-scale phylogenetic evidence that strongly supports the Ecdysozoa hypothesis. Results Through the most extensive phylogenetic analysis carried out to date, the complete genomes of 11 eukaryotic species have been analyzed in order to find homologous sequences derived from 18 human chromosomes. Phylogenetic analysis of datasets showing an increased adjustment to equal evolutionary rates between nematode and arthropod sequences produced a gradual change from support for Coelomata to support for Ecdysozoa. Transition between topologies occurred when fast-evolving sequences of Caenorhabditis elegans were removed. When chordate, nematode and arthropod sequences were constrained to fit equal evolutionary rates, the Ecdysozoa topology was statistically accepted whereas Coelomata was rejected. Conclusions The reliability of a monophyletic group clustering arthropods and nematodes was unequivocally accepted in datasets where traces of the long-branch attraction effect were removed. This is the first phylogenomic evidence to strongly support the 'moulting clade' hypothesis. PMID:15892869

  19. The genome and transcriptome of Phalaenopsis yield insights into floral organ development and flowering regulation.

    PubMed

    Huang, Jian-Zhi; Lin, Chih-Peng; Cheng, Ting-Chi; Huang, Ya-Wen; Tsai, Yi-Jung; Cheng, Shu-Yun; Chen, Yi-Wen; Lee, Chueh-Pai; Chung, Wan-Chia; Chang, Bill Chia-Han; Chin, Shih-Wen; Lee, Chen-Yu; Chen, Fure-Chyi

    2016-01-01

    The Phalaenopsis orchid is an important potted flower of high economic value around the world. We report the 3.1 Gb draft genome assembly of an important winter flowering Phalaenopsis 'KHM190' cultivar. We generated 89.5 Gb RNA-seq and 113 million sRNA-seq reads to use these data to identify 41,153 protein-coding genes and 188 miRNA families. We also generated a draft genome for Phalaenopsis pulcherrima 'B8802,' a summer flowering species, via resequencing. Comparison of genome data between the two Phalaenopsis cultivars allowed the identification of 691,532 single-nucleotide polymorphisms. In this study, we reveal that the key role of PhAGL6b in the regulation of labellum organ development involves alternative splicing in the big lip mutant. Petal or sepal overexpressing PhAGL6b leads to the conversion into a lip-like structure. We also discovered that the gibberellin pathway that regulates the expression of flowering time genes during the reproductive phase change is induced by cool temperature. Our work thus depicted a valuable resource for the flowering control, flower architecture development, and breeding of the Phalaenopsis orchids.

  20. Genomic and Transcriptomic Evidence for Carbohydrate Consumption among Microorganisms in a Cold Seep Brine Pool

    PubMed Central

    Zhang, Weipeng; Ding, Wei; Yang, Bo; Tian, Renmao; Gu, Shuo; Luo, Haiwei; Qian, Pei-Yuan

    2016-01-01

    The detailed lifestyle of microorganisms in deep-sea brine environments remains largely unexplored. Using a carefully calibrated genome binning approach, we reconstructed partial to nearly-complete genomes of 51 microorganisms in biofilms from the Thuwal cold seep brine pool of the Red Sea. The recovered metagenome-assembled genomes (MAGs) belong to six different phyla: Actinobacteria, Proteobacteria, Candidatus Cloacimonetes, Candidatus Marinimicrobia, Bathyarchaeota, and Thaumarchaeota. By comparison with close relatives of these microorganisms, we identified a number of unique genes associated with organic carbon metabolism and energy generation. These genes included various glycoside hydrolases, nitrate and sulfate reductases, putative bacterial microcompartment biosynthetic clusters (BMC), and F420H2 dehydrogenases. Phylogenetic analysis suggested that the acquisition of these genes probably occurred through horizontal gene transfer (HGT). Metatranscriptomics illustrated that glycoside hydrolases are among the most highly expressed genes. Our results suggest that the microbial inhabitants are well adapted to this brine environment, and anaerobic carbohydrate consumption mediated by glycoside hydrolases and electron transport systems (ETSs) is a dominant process performed by microorganisms from various phyla within this ecosystem. PMID:27895636

  1. Genomic and Transcriptomic Evidence for Carbohydrate Consumption among Microorganisms in a Cold Seep Brine Pool.

    PubMed

    Zhang, Weipeng; Ding, Wei; Yang, Bo; Tian, Renmao; Gu, Shuo; Luo, Haiwei; Qian, Pei-Yuan

    2016-01-01

    The detailed lifestyle of microorganisms in deep-sea brine environments remains largely unexplored. Using a carefully calibrated genome binning approach, we reconstructed partial to nearly-complete genomes of 51 microorganisms in biofilms from the Thuwal cold seep brine pool of the Red Sea. The recovered metagenome-assembled genomes (MAGs) belong to six different phyla: Actinobacteria, Proteobacteria, Candidatus Cloacimonetes, Candidatus Marinimicrobia, Bathyarchaeota, and Thaumarchaeota. By comparison with close relatives of these microorganisms, we identified a number of unique genes associated with organic carbon metabolism and energy generation. These genes included various glycoside hydrolases, nitrate and sulfate reductases, putative bacterial microcompartment biosynthetic clusters (BMC), and F420H2 dehydrogenases. Phylogenetic analysis suggested that the acquisition of these genes probably occurred through horizontal gene transfer (HGT). Metatranscriptomics illustrated that glycoside hydrolases are among the most highly expressed genes. Our results suggest that the microbial inhabitants are well adapted to this brine environment, and anaerobic carbohydrate consumption mediated by glycoside hydrolases and electron transport systems (ETSs) is a dominant process performed by microorganisms from various phyla within this ecosystem.

  2. A whole-genome sequence and transcriptome perspective on HER2-positive breast cancers

    PubMed Central

    Ferrari, Anthony; Vincent-Salomon, Anne; Pivot, Xavier; Sertier, Anne-Sophie; Thomas, Emilie; Tonon, Laurie; Boyault, Sandrine; Mulugeta, Eskeatnaf; Treilleux, Isabelle; MacGrogan, Gaëtan; Arnould, Laurent; Kielbassa, Janice; Le Texier, Vincent; Blanché, Hélène; Deleuze, Jean-François; Jacquemier, Jocelyne; Mathieu, Marie-Christine; Penault-Llorca, Frédérique; Bibeau, Frédéric; Mariani, Odette; Mannina, Cécile; Pierga, Jean-Yves; Trédan, Olivier; Bachelot, Thomas; Bonnefoi, Hervé; Romieu, Gilles; Fumoleau, Pierre; Delaloge, Suzette; Rios, Maria; Ferrero, Jean-Marc; Tarpin, Carole; Bouteille, Catherine; Calvo, Fabien; Gut, Ivo Glynne; Gut, Marta; Martin, Sancha; Nik-Zainal, Serena; Stratton, Michael R.; Pauporté, Iris; Saintigny, Pierre; Birnbaum, Daniel; Viari, Alain; Thomas, Gilles

    2016-01-01

    HER2-positive breast cancer has long proven to be a clinically distinct class of breast cancers for which several targeted therapies are now available. However, resistance to the treatment associated with specific gene expressions or mutations has been observed, revealing the underlying diversity of these cancers. Therefore, understanding the full extent of the HER2-positive disease heterogeneity still remains challenging. Here we carry out an in-depth genomic characterization of 64 HER2-positive breast tumour genomes that exhibit four subgroups, based on the expression data, with distinctive genomic features in terms of somatic mutations, copy-number changes or structural variations. The results suggest that, despite being clinically defined by a specific gene amplification, HER2-positive tumours melt into the whole luminal–basal breast cancer spectrum rather than standing apart. The results also lead to a refined ERBB2 amplicon of 106 kb and show that several cases of amplifications are compatible with a breakage–fusion–bridge mechanism. PMID:27406316

  3. Comprehensive Genome-Wide Transcriptomic Analysis of Immature Articular Cartilage following Ischemic Osteonecrosis of the Femoral Head in Piglets

    PubMed Central

    Adapala, Naga Suresh; Kim, Harry K. W.

    2016-01-01

    Objective Ischemic osteonecrosis of the femoral head (ONFH) in piglets results in an ischemic injury to the immature articular cartilage. The molecular changes in the articular cartilage in response to ONFH have not been investigated using a transcriptomic approach. The purpose of this study was to perform a genome-wide transcriptomic analysis to identify genes that are upregulated in the immature articular cartilage following ONFH. Methods ONFH was induced in the right femoral head of 6-week old piglets. The unoperated femoral head was used as the normal control. At 24 hours (acute ischemic-hypoxic injury), 2 weeks (avascular necrosis in the femoral head) and 4 weeks (early repair) after surgery (n = 4 piglets/time point), RNA was isolated from the articular cartilage of the femoral head. A microarray analysis was performed using Affymetrix Porcine GeneChip Array. An enrichment analysis and functional clustering of the genes upregulated due to ONFH were performed using DAVID and STRING software, respectively. The increased expression of selected genes was confirmed by a real-time qRTPCR analysis. Results Induction of ONFH resulted in the upregulation of 383 genes at 24 hours, 122 genes at 2 weeks and 124 genes at 4 weeks compared to the normal controls. At 24 hours, the genes involved in oxidoreductive, cell-survival, and angiogenic responses were significantly enriched among the upregulated genes. These genes were involved in HIF-1, PI3K-Akt, and MAPK signaling pathways. At 2 weeks, secretory and signaling proteins involved in angiogenic and inflammatory responses, PI3K-Akt and matrix-remodeling pathways were significantly enriched. At 4 weeks, genes that represent inflammatory cytokines and chemokine signaling pathways were significantly enriched. Several index genes (genes that are upregulated at more than one time point following ONFH and are known to be important in various biological processes) including HIF-1A, VEGFA, IL-6, IL6R, IL-8, CCL2, FGF2, TGFB2

  4. 13C metabolic flux analysis at a genome-scale.

    PubMed

    Gopalakrishnan, Saratram; Maranas, Costas D

    2015-11-01

    Metabolic models used in 13C metabolic flux analysis generally include a limited number of reactions primarily from central metabolism. They typically omit degradation pathways, complete cofactor balances, and atom transition contributions for reactions outside central metabolism. This study addresses the impact on prediction fidelity of scaling-up mapping models to a genome-scale. The core mapping model employed in this study accounts for (75 reactions and 65 metabolites) primarily from central metabolism. The genome-scale metabolic mapping model (GSMM) (697 reaction and 595 metabolites) is constructed using as a basis the iAF1260 model upon eliminating reactions guaranteed not to carry flux based on growth and fermentation data for a minimal glucose growth medium. Labeling data for 17 amino acid fragments obtained from cells fed with glucose labeled at the second carbon was used to obtain fluxes and ranges. Metabolic fluxes and confidence intervals are estimated, for both core and genome-scale mapping models, by minimizing the sum of square of differences between predicted and experimentally measured labeling patterns using the EMU decomposition algorithm. Overall, we find that both topology and estimated values of the metabolic fluxes remain largely consistent between core and GSM model. Stepping up to a genome-scale mapping model leads to wider flux inference ranges for 20 key reactions present in the core model. The glycolysis flux range doubles due to the possibility of active gluconeogenesis, the TCA flux range expanded by 80% due to the availability of a bypass through arginine consistent with labeling data, and the transhydrogenase reaction flux was essentially unresolved due to the presence of as many as five routes for the inter-conversion of NADPH to NADH afforded by the genome-scale model. By globally accounting for ATP demands in the GSMM model the unused ATP decreased drastically with the lower bound matching the maintenance ATP requirement. A non

  5. Pesticide toxicogenomics across scales: in vitro transcriptome predicts mechanisms and outcomes of exposure in vivo

    PubMed Central

    Porreca, Immacolata; D’Angelo, Fulvio; De Franceschi, Lucia; Mattè, Alessandro; Ceccarelli, Michele; Iolascon, Achille; Zamò, Alberto; Russo, Filomena; Ravo, Maria; Tarallo, Roberta; Scarfò, Marzia; Weisz, Alessandro; De Felice, Mario; Mallardo, Massimo; Ambrosino, Concetta

    2016-01-01

    In vitro Omics analysis (i.e. transcriptome) is suggested to predict in vivo toxicity and adverse effects in humans, although the causal link between high-throughput data and effects in vivo is not easily established. Indeed, the chemical-organism interaction can involve processes, such as adaptation, not established in cell cultures. Starting from this consideration we investigate the transcriptomic response of immortalized thyrocytes to ethylenthiourea and chlorpyrifos. In vitro data revealed specific and common genes/mechanisms of toxicity, controlling the proliferation/survival of the thyrocytes and unrelated hematopoietic cell lineages. These results were phenotypically confirmed in vivo by the reduction of circulating T4 hormone and the development of pancytopenia after long exposure. Our data imply that in vitro toxicogenomics is a powerful tool in predicting adverse effects in vivo, experimentally confirming the vision described as Tox21c (Toxicity Testing in the 21st century) although not fully recapitulating the biocomplexity of a living animal. PMID:27905518

  6. Fish-T1K (Transcriptomes of 1,000 Fishes) Project: large-scale transcriptome data for fish evolution studies.

    PubMed

    Sun, Ying; Huang, Yu; Li, Xiaofeng; Baldwin, Carole C; Zhou, Zhuocheng; Yan, Zhixiang; Crandall, Keith A; Zhang, Yong; Zhao, Xiaomeng; Wang, Min; Wong, Alex; Fang, Chao; Zhang, Xinhui; Huang, Hai; Lopez, Jose V; Kilfoyle, Kirk; Zhang, Yong; Ortí, Guillermo; Venkatesh, Byrappa; Shi, Qiong

    2016-01-01

    Ray-finned fishes (Actinopterygii) represent more than 50 % of extant vertebrates and are of great evolutionary, ecologic and economic significance, but they are relatively underrepresented in 'omics studies. Increased availability of transcriptome data for these species will allow researchers to better understand changes in gene expression, and to carry out functional analyses. An international project known as the "Transcriptomes of 1,000 Fishes" (Fish-T1K) project has been established to generate RNA-seq transcriptome sequences for 1,000 diverse species of ray-finned fishes. The first phase of this project has produced transcriptomes from more than 180 ray-finned fishes, representing 142 species and covering 51 orders and 109 families. Here we provide an overview of the goals of this project and the work done so far.

  7. Genome-scale rates of evolutionary change in bacteria

    PubMed Central

    Duchêne, Sebastian; Holt, Kathryn E.; Weill, François-Xavier; Le Hello, Simon; Hawkey, Jane; Edwards, David J.; Fourment, Mathieu

    2016-01-01

    Estimating the rates at which bacterial genomes evolve is critical to understanding major evolutionary and ecological processes such as disease emergence, long-term host–pathogen associations and short-term transmission patterns. The surge in bacterial genomic data sets provides a new opportunity to estimate these rates and reveal the factors that shape bacterial evolutionary dynamics. For many organisms estimates of evolutionary rate display an inverse association with the time-scale over which the data are sampled. However, this relationship remains unexplored in bacteria due to the difficulty in estimating genome-wide evolutionary rates, which are impacted by the extent of temporal structure in the data and the prevalence of recombination. We collected 36 whole genome sequence data sets from 16 species of bacterial pathogens to systematically estimate and compare their evolutionary rates and assess the extent of temporal structure in the absence of recombination. The majority (28/36) of data sets possessed sufficient clock-like structure to robustly estimate evolutionary rates. However, in some species reliable estimates were not possible even with ‘ancient DNA’ data sampled over many centuries, suggesting that they evolve very slowly or that they display extensive rate variation among lineages. The robustly estimated evolutionary rates spanned several orders of magnitude, from approximately 10−5 to 10−8 nucleotide substitutions per site year−1. This variation was negatively associated with sampling time, with this relationship best described by an exponential decay curve. To avoid potential estimation biases, such time-dependency should be considered when inferring evolutionary time-scales in bacteria. PMID:28348834

  8. Transcriptome-scale similarities between mouse and human skeletal muscles with normal and myopathic phenotypes

    PubMed Central

    Kho, Alvin T; Kang, Peter B; Kohane, Isaac S; Kunkel, Louis M

    2006-01-01

    Background Mouse and human skeletal muscle transcriptome profiles vary by muscle type, raising the question of which mouse muscle groups have the greatest molecular similarities to human skeletal muscle. Methods Orthologous (whole, sub-) transcriptome profiles were compared among four mouse-human transcriptome datasets: (M) six muscle groups obtained from three mouse strains (wildtype, mdx, mdx5cv); (H1) biopsied human quadriceps from controls and Duchenne muscular dystrophy patients; (H2) four different control human muscle types obtained at autopsy; and (H3) 12 different control human tissues (ten non-muscle). Results Of the six mouse muscles examined, mouse soleus bore the greatest molecular similarities to human skeletal muscles, independent of the latters' anatomic location/muscle type, disease state, age and sampling method (autopsy versus biopsy). Significant similarity to any one mouse muscle group was not observed for non-muscle human tissues (dataset H3), indicating this finding to be muscle specific. Conclusion This observation may be partly explained by the higher type I fiber content of soleus relative to the other mouse muscles sampled. PMID:16522209

  9. Genome-wide genetic and transcriptomic investigation of variation in antibody response to dietary antigens

    PubMed Central

    Rubicz, Rohina; Yolken, Robert; Alaedini, Armin; Drigalenko, Eugene; Charlesworth, Jac C.; Carless, Melanie A.; Severance, Emily G.; Krivogorsky, Bogdana; Dyer, Thomas D.; Kent, Jack W.; Curran, Joanne E.; Johnson, Matthew P.; Cole, Shelley A.; Almasy, Laura; Moses, Eric K.; Blangero, John; Göring, Harald H.H.

    2014-01-01

    Increased immunoglobulin G (IgG) response to dietary antigens can be associated with gastrointestinal dysfunction and autoimmunity. The underlying processes contributing to these adverse reactions remain largely unknown, and it is likely that genetic factors play a role. Here we estimate heritability and attempt to localize genetic factors influencing IgG antibody levels against food-derived antigens using an integrative genomics approach. IgG antibody levels were determined by ELISA in >1300 Mexican Americans for the following food antigens: wheat gliadin; bovine casein; and two forms of bovine serum albumin (BSA-a and BSA-b). Pedigree-based variance components methods were used to estimate additive genetic heritability (h2), perform genome-wide association analyses, and identify transcriptional signatures (based on 19,858 transcripts from peripheral blood lymphocytes). Heritability estimates were significant for all traits (0.15-0.53), and shared environment (based on shared residency among study participants) was significant for casein (0.09) and BSA-a (0.33). Genome-wide significant evidence of association was obtained only for antibody to gliadin (p=8.57×10-8), mapping to the human leukocyte antigen II region, with HLA-DRA and BTNL2 as the best candidate genes. Lack of association of known celiac disease risk alleles HLA-DQ2.5 and -DQ8 with anti-gliadin antibodies in the studied population suggests a separate genetic etiology. Significant transcriptional signatures were found for all IgG levels except BSA-b. These results demonstrate that individual genetic differences contribute to food antigen antibody measures in this population. Further investigations may elucidate the underlying immunological processes involved. PMID:24962563

  10. Genome Sequencing and Comparative Transcriptomics of the Model Entomopathogenic Fungi Metarhizium anisopliae and M. acridum

    PubMed Central

    Shang, Yanfang; Duan, Zhibing; Hu, Xiao; Xie, Xue-Qin; Zhou, Gang; Peng, Guoxiong; Luo, Zhibing; Huang, Wei; Wang, Bing; Fang, Weiguo; Wang, Sibao; Zhong, Yi; Ma, Li-Jun; St. Leger, Raymond J.; Zhao, Guo-Ping; Pei, Yan; Feng, Ming-Guang; Xia, Yuxian; Wang, Chengshu

    2011-01-01

    Metarhizium spp. are being used as environmentally friendly alternatives to chemical insecticides, as model systems for studying insect-fungus interactions, and as a resource of genes for biotechnology. We present a comparative analysis of the genome sequences of the broad-spectrum insect pathogen Metarhizium anisopliae and the acridid-specific M. acridum. Whole-genome analyses indicate that the genome structures of these two species are highly syntenic and suggest that the genus Metarhizium evolved from plant endophytes or pathogens. Both M. anisopliae and M. acridum have a strikingly larger proportion of genes encoding secreted proteins than other fungi, while ∼30% of these have no functionally characterized homologs, suggesting hitherto unsuspected interactions between fungal pathogens and insects. The analysis of transposase genes provided evidence of repeat-induced point mutations occurring in M. acridum but not in M. anisopliae. With the help of pathogen-host interaction gene database, ∼16% of Metarhizium genes were identified that are similar to experimentally verified genes involved in pathogenicity in other fungi, particularly plant pathogens. However, relative to M. acridum, M. anisopliae has evolved with many expanded gene families of proteases, chitinases, cytochrome P450s, polyketide synthases, and nonribosomal peptide synthetases for cuticle-degradation, detoxification, and toxin biosynthesis that may facilitate its ability to adapt to heterogenous environments. Transcriptional analysis of both fungi during early infection processes provided further insights into the genes and pathways involved in infectivity and specificity. Of particular note, M. acridum transcribed distinct G-protein coupled receptors on cuticles from locusts (the natural hosts) and cockroaches, whereas M. anisopliae transcribed the same receptor on both hosts. This study will facilitate the identification of virulence genes and the development of improved biocontrol strains

  11. The Dynamic Genome and Transcriptome of the Human Fungal Pathogen Blastomyces and Close Relative Emmonsia

    PubMed Central

    Gallo, Juan E.; Holder, Jason; Sullivan, Thomas D.; Marty, Amber J.; Carmen, John C.; Chen, Zehua; Ding, Li; Gujja, Sharvari; Magrini, Vincent; Misas, Elizabeth; Mitreva, Makedonka; Priest, Margaret; Saif, Sakina; Whiston, Emily A.; Young, Sarah; Zeng, Qiandong; Goldman, William E.; Mardis, Elaine R.; Taylor, John W.; McEwen, Juan G.; Clay, Oliver K.; Klein, Bruce S.; Cuomo, Christina A.

    2015-01-01

    Three closely related thermally dimorphic pathogens are causal agents of major fungal diseases affecting humans in the Americas: blastomycosis, histoplasmosis and paracoccidioidomycosis. Here we report the genome sequence and analysis of four strains of the etiological agent of blastomycosis, Blastomyces, and two species of the related genus Emmonsia, typically pathogens of small mammals. Compared to related species, Blastomyces genomes are highly expanded, with long, often sharply demarcated tracts of low GC-content sequence. These GC-poor isochore-like regions are enriched for gypsy elements, are variable in total size between isolates, and are least expanded in the avirulent B. dermatitidis strain ER-3 as compared with the virulent B. gilchristii strain SLH14081. The lack of similar regions in related species suggests these isochore-like regions originated recently in the ancestor of the Blastomyces lineage. While gene content is highly conserved between Blastomyces and related fungi, we identified changes in copy number of genes potentially involved in host interaction, including proteases and characterized antigens. In addition, we studied gene expression changes of B. dermatitidis during the interaction of the infectious yeast form with macrophages and in a mouse model. Both experiments highlight a strong antioxidant defense response in Blastomyces, and upregulation of dioxygenases in vivo suggests that dioxide produced by antioxidants may be further utilized for amino acid metabolism. We identify a number of functional categories upregulated exclusively in vivo, such as secreted proteins, zinc acquisition proteins, and cysteine and tryptophan metabolism, which may include critical virulence factors missed before in in vitro studies. Across the dimorphic fungi, loss of certain zinc acquisition genes and differences in amino acid metabolism suggest unique adaptations of Blastomyces to its host environment. These results reveal the dynamics of genome evolution

  12. Genomic and transcriptomic predictors of response levels to endurance exercise training.

    PubMed

    Sarzynski, Mark A; Ghosh, Sujoy; Bouchard, Claude

    2016-05-28

    Predicting the responsiveness to regular exercise is a topic of great relevance due to its potential role in personalized exercise medicine applications. The present review focuses on cardiorespiratory fitness (commonly measured by maximal oxygen uptake, V̇O2 max ), a trait with wide-ranging impact on health and performance indicators. Gains in V̇O2 max demonstrate large inter-individual variation even in response to standardized exercise training programmes. The estimated ΔVO2 max heritability of 47% suggests that genomic-based predictors alone are insufficient to account for the total trainability variance. Candidate gene and genome-wide linkage studies have not significantly contributed to our understanding of the molecular basis of trainability. A genome-wide association study suggested that V̇O2 max trainability is influenced by multiple genes of small effects, but these findings still await rigorous replication. Valuable evidence, however, has been obtained by combining skeletal muscle transcript abundance profiles with common DNA variants for the prediction of the V̇O2 max response to exercise training. Although the physiological determinants of V̇O2 max measured at a given time are largely enunciated, what is poorly understood are the details of tissue-specific molecular mechanisms that limit V̇O2 max and related signalling pathways in response to exercise training. Bioinformatics explorations based on thousands of variants have been used to interrogate pathways and systems instead of single variants and genes, and the main findings, along with those from exercise experimental studies, have been summarized here in a working model of V̇O2 max trainability.

  13. Beyond the transcriptome: completion of act one of the Immunological Genome Project.

    PubMed

    Kim, Charles C; Lanier, Lewis L

    2013-10-01

    The Immunological Genome Consortium has generated a public resource (www.immgen.org) that provides a compendium of gene expression profiles of ∼270 leukocyte subsets in the mouse. This effort established carefully standardized operating procedures that resulted in a transcriptional dataset of unprecedented comprehensiveness and quality. The findings have been detailed recently in a series of publications providing molecular insights into the development, heterogeneity, and/or function of these cellular lineages and distinct subpopulations. Here, we review the key findings of these studies, highlighting what has been gained and how the knowledge can be used to accelerate progress toward a comprehensive understanding of the immune system.

  14. Pangolin genomes and the evolution of mammalian scales and immunity.

    PubMed

    Choo, Siew Woh; Rayko, Mike; Tan, Tze King; Hari, Ranjeev; Komissarov, Aleksey; Wee, Wei Yee; Yurchenko, Andrey A; Kliver, Sergey; Tamazian, Gaik; Antunes, Agostinho; Wilson, Richard K; Warren, Wesley C; Koepfli, Klaus-Peter; Minx, Patrick; Krasheninnikova, Ksenia; Kotze, Antoinette; Dalton, Desire L; Vermaak, Elaine; Paterson, Ian C; Dobrynin, Pavel; Sitam, Frankie Thomas; Rovie-Ryan, Jeffrine J; Johnson, Warren E; Yusoff, Aini Mohamed; Luo, Shu-Jin; Karuppannan, Kayal Vizi; Fang, Gang; Zheng, Deyou; Gerstein, Mark B; Lipovich, Leonard; O'Brien, Stephen J; Wong, Guat Jah

    2016-10-01

    Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Strikingly, we found that interferon epsilon (IFNE), exclusively expressed in epithelial cells and important in skin and mucosal immunity, is pseudogenized in all African and Asian pangolin species that we examined, perhaps impacting resistance to infection. We propose that scale development was an innovation that provided protection against injuries or stress and reduced pangolin vulnerability to infection. Further evidence of specialized adaptations was evident from positively selected genes involving immunity-related pathways, inflammation, energy storage and metabolism, muscular and nervous systems, and scale/hair development. Olfactory receptor gene families are significantly expanded in pangolins, reflecting their well-developed olfaction system. This study provides insights into mammalian adaptation and functional diversification, new research tools and questions, and perhaps a new natural IFNE-deficient animal model for studying mammalian immunity.

  15. Pangolin genomes and the evolution of mammalian scales and immunity

    PubMed Central

    Rayko, Mike; Tan, Tze King; Hari, Ranjeev; Komissarov, Aleksey; Wee, Wei Yee; Yurchenko, Andrey A.; Kliver, Sergey; Tamazian, Gaik; Antunes, Agostinho; Wilson, Richard K.; Warren, Wesley C.; Koepfli, Klaus-Peter; Minx, Patrick; Krasheninnikova, Ksenia; Kotze, Antoinette; Dalton, Desire L.; Vermaak, Elaine; Paterson, Ian C.; Dobrynin, Pavel; Sitam, Frankie Thomas; Rovie-Ryan, Jeffrine J.; Johnson, Warren E.; Yusoff, Aini Mohamed; Luo, Shu-Jin; Karuppannan, Kayal Vizi; Fang, Gang; Zheng, Deyou; Gerstein, Mark B.; Lipovich, Leonard; O'Brien, Stephen J.; Wong, Guat Jah

    2016-01-01

    Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Strikingly, we found that interferon epsilon (IFNE), exclusively expressed in epithelial cells and important in skin and mucosal immunity, is pseudogenized in all African and Asian pangolin species that we examined, perhaps impacting resistance to infection. We propose that scale development was an innovation that provided protection against injuries or stress and reduced pangolin vulnerability to infection. Further evidence of specialized adaptations was evident from positively selected genes involving immunity-related pathways, inflammation, energy storage and metabolism, muscular and nervous systems, and scale/hair development. Olfactory receptor gene families are significantly expanded in pangolins, reflecting their well-developed olfaction system. This study provides insights into mammalian adaptation and functional diversification, new research tools and questions, and perhaps a new natural IFNE-deficient animal model for studying mammalian immunity. PMID:27510566

  16. The somatic mutation profiles of 2,433 breast cancers refines their genomic and transcriptomic landscapes

    PubMed Central

    Pereira, Bernard; Chin, Suet-Feung; Rueda, Oscar M.; Vollan, Hans-Kristian Moen; Provenzano, Elena; Bardwell, Helen A.; Pugh, Michelle; Jones, Linda; Russell, Roslin; Sammut, Stephen-John; Tsui, Dana W. Y.; Liu, Bin; Dawson, Sarah-Jane; Abraham, Jean; Northen, Helen; Peden, John F.; Mukherjee, Abhik; Turashvili, Gulisa; Green, Andrew R.; McKinney, Steve; Oloumi, Arusha; Shah, Sohrab; Rosenfeld, Nitzan; Murphy, Leigh; Bentley, David R.; Ellis, Ian O.; Purushotham, Arnie; Pinder, Sarah E.; Børresen-Dale, Anne-Lise; Earl, Helena M.; Pharoah, Paul D.; Ross, Mark T.; Aparicio, Samuel; Caldas, Carlos

    2016-01-01

    The genomic landscape of breast cancer is complex, and inter- and intra-tumour heterogeneity are important challenges in treating the disease. In this study, we sequence 173 genes in 2,433 primary breast tumours that have copy number aberration (CNA), gene expression and long-term clinical follow-up data. We identify 40 mutation-driver (Mut-driver) genes, and determine associations between mutations, driver CNA profiles, clinical-pathological parameters and survival. We assess the clonal states of Mut-driver mutations, and estimate levels of intra-tumour heterogeneity using mutant-allele fractions. Associations between PIK3CA mutations and reduced survival are identified in three subgroups of ER-positive cancer (defined by amplification of 17q23, 11q13–14 or 8q24). High levels of intra-tumour heterogeneity are in general associated with a worse outcome, but highly aggressive tumours with 11q13–14 amplification have low levels of intra-tumour heterogeneity. These results emphasize the importance of genome-based stratification of breast cancer, and have important implications for designing therapeutic strategies. PMID:27161491

  17. Global correlation of genome and transcriptome changes in classical Hodgkin lymphoma.

    PubMed

    Kluiver, Joost; Kok, Klaas; Pfeil, Ines; de Jong, Debora; Blokzijl, Tjasso; Harms, Geert; van der Vlies, Pieter; Diepstra, Arjan; Atayar, Ciğdem; Poppema, Sibrand; Küppers, Ralf; van den Berg, Anke

    2007-03-01

    To identify genes involved in the pathogenesis of classical Hodgkin lymphoma (cHL), we performed serial analysis of gene expression (SAGE) and array-based comparative genomic hybridization (aCGH). Comparison of SAGE libraries of cHL cell lines L428 and L1236 with that of germinal centre B cells revealed consistent overexpression of only 14 genes. In contrast, 141 genes were downregulated in both cHL cell lines, including many B cell and HLA genes. aCGH revealed gain of 2p, 7p, 9p, 11q and Xq and loss of 4q and 11q. Eighteen percent of the differentially expressed genes mapped to regions with loss or gain and a good correlation was observed between underexpression and loss or overexpression and gain of DNA. Remarkably, gain of 2p and 9p did not correlate with increased expression of the proposed target genes c-REL and JAK2. Downregulation of many genes within the HLA region also did not correlate with loss of DNA. FSCN1 and IRAK1 mapping at genomic loci (7p and Xq) that frequently showed gain were overexpressed in cHL cell lines and might be involved in the pathogenesis of cHL.

  18. Genomic and Transcriptomic Alterations Associated with STAT3 Activation in Head and Neck Cancer

    PubMed Central

    Peyser, Noah D.; Pendleton, Kelsey; Gooding, William E.; Lui, Vivian W. Y.; Johnson, Daniel E.; Grandis, Jennifer R.

    2016-01-01

    Background Hyperactivation of STAT3 via constitutive phosphorylation of tyrosine 705 (Y705) is common in most human cancers, including head and neck squamous carcinoma (HNSCC). STAT3 is rarely mutated in cancer and the (epi)genetic alterations that lead to STAT3 activation are incompletely understood. Here we used an unbiased approach to identify genomic and epigenomic changes associated with pSTAT3(Y705) expression using data generated by The Cancer Genome Atlas (TCGA). Methods and Findings Mutation, mRNA expression, promoter methylation, and copy number alteration data were extracted from TCGA and examined in the context of pSTAT3(Y705) protein expression. mRNA expression levels of 1279 genes were found to be associated with pSTAT3(705) expression. Association of pSTAT3(Y705) expression with caspase-8 mRNA expression was validated by immunoblot analysis in HNSCC cells. Mutation, promoter hypermethylation, and copy number alteration of any gene were not significantly associated with increased pSTAT3(Y705) protein expression. Conclusions These cumulative results suggest that unbiased approaches may be useful in identifying the molecular underpinnings of oncogenic signaling, including STAT3 activation, in HNSCC. Larger datasets will likely be necessary to elucidate signaling consequences of infrequent alterations. PMID:27855189

  19. Genome-scale constraint-based modeling of Geobacter metallireducens

    PubMed Central

    Sun, Jun; Sayyar, Bahareh; Butler, Jessica E; Pharkya, Priti; Fahland, Tom R; Famili, Iman; Schilling, Christophe H; Lovley, Derek R; Mahadevan, Radhakrishnan

    2009-01-01

    Background Geobacter metallireducens was the first organism that can be grown in pure culture to completely oxidize organic compounds with Fe(III) oxide serving as electron acceptor. Geobacter species, including G. sulfurreducens and G. metallireducens, are used for bioremediation and electricity generation from waste organic matter and renewable biomass. The constraint-based modeling approach enables the development of genome-scale in silico models that can predict the behavior of complex biological systems and their responses to the environments. Such a modeling approach was applied to provide physiological and ecological insights on the metabolism of G. metallireducens. Results The genome-scale metabolic model of G. metallireducens was constructed to include 747 genes and 697 reactions. Compared to the G. sulfurreducens model, the G. metallireducens metabolic model contains 118 unique reactions that reflect many of G. metallireducens' specific metabolic capabilities. Detailed examination of the G. metallireducens model suggests that its central metabolism contains several energy-inefficient reactions that are not present in the G. sulfurreducens model. Experimental biomass yield of G. metallireducens growing on pyruvate was lower than the predicted optimal biomass yield. Microarray data of G. metallireducens growing with benzoate and acetate indicated that genes encoding these energy-inefficient reactions were up-regulated by benzoate. These results suggested that the energy-inefficient reactions were likely turned off during G. metallireducens growth with acetate for optimal biomass yield, but were up-regulated during growth with complex electron donors such as benzoate for rapid energy generation. Furthermore, several computational modeling approaches were applied to accelerate G. metallireducens research. For example, growth of G. metallireducens with different electron donors and electron acceptors were studied using the genome-scale metabolic model, which

  20. Genome-scale study reveals reduced metabolic adaptability in patients with non-alcoholic fatty liver disease.

    PubMed

    Hyötyläinen, Tuulia; Jerby, Livnat; Petäjä, Elina M; Mattila, Ismo; Jäntti, Sirkku; Auvinen, Petri; Gastaldelli, Amalia; Yki-Järvinen, Hannele; Ruppin, Eytan; Orešič, Matej

    2016-02-03

    Non-alcoholic fatty liver disease (NAFLD) is a major risk factor leading to chronic liver disease and type 2 diabetes. Here we chart liver metabolic activity and functionality in NAFLD by integrating global transcriptomic data, from human liver biopsies, and metabolic flux data, measured across the human splanchnic vascular bed, within a genome-scale model of human metabolism. We show that an increased amount of liver fat induces mitochondrial metabolism, lipolysis, glyceroneogenesis and a switch from lactate to glycerol as substrate for gluconeogenesis, indicating an intricate balance of exacerbated opposite metabolic processes in glycemic regulation. These changes were associated with reduced metabolic adaptability on a network level in the sense that liver fat accumulation puts increasing demands on the liver to adaptively regulate metabolic responses to maintain basic liver functions. We propose that failure to meet excessive metabolic challenges coupled with reduced metabolic adaptability may lead to a vicious pathogenic cycle leading to the co-morbidities of NAFLD.

  1. Genome-scale study reveals reduced metabolic adaptability in patients with non-alcoholic fatty liver disease

    PubMed Central

    Hyötyläinen, Tuulia; Jerby, Livnat; Petäjä, Elina M.; Mattila, Ismo; Jäntti, Sirkku; Auvinen, Petri; Gastaldelli, Amalia; Yki-Järvinen, Hannele; Ruppin, Eytan; Orešič, Matej

    2016-01-01

    Non-alcoholic fatty liver disease (NAFLD) is a major risk factor leading to chronic liver disease and type 2 diabetes. Here we chart liver metabolic activity and functionality in NAFLD by integrating global transcriptomic data, from human liver biopsies, and metabolic flux data, measured across the human splanchnic vascular bed, within a genome-scale model of human metabolism. We show that an increased amount of liver fat induces mitochondrial metabolism, lipolysis, glyceroneogenesis and a switch from lactate to glycerol as substrate for gluconeogenesis, indicating an intricate balance of exacerbated opposite metabolic processes in glycemic regulation. These changes were associated with reduced metabolic adaptability on a network level in the sense that liver fat accumulation puts increasing demands on the liver to adaptively regulate metabolic responses to maintain basic liver functions. We propose that failure to meet excessive metabolic challenges coupled with reduced metabolic adaptability may lead to a vicious pathogenic cycle leading to the co-morbidities of NAFLD. PMID:26839171

  2. Global transcriptomic profiling using small volumes of whole blood: a cost-effective method for translational genomic biomarker identification in small animals.

    PubMed

    Fricano, Meagan M; Ditewig, Amy C; Jung, Paul M; Liguori, Michael J; Blomme, Eric A G; Yang, Yi

    2011-01-01

    Blood is an ideal tissue for the identification of novel genomic biomarkers for toxicity or efficacy. However, using blood for transcriptomic profiling presents significant technical challenges due to the transcriptomic changes induced by ex vivo handling and the interference of highly abundant globin mRNA. Most whole blood RNA stabilization and isolation methods also require significant volumes of blood, limiting their effective use in small animal species, such as rodents. To overcome these challenges, a QIAzol-based RNA stabilization and isolation method (QSI) was developed to isolate sufficient amounts of high quality total RNA from 25 to 500 μL of rat whole blood. The method was compared to the standard PAXgene Blood RNA System using blood collected from rats exposed to saline or lipopolysaccharide (LPS). The QSI method yielded an average of 54 ng total RNA per μL of rat whole blood with an average RNA Integrity Number (RIN) of 9, a performance comparable with the standard PAXgene method. Total RNA samples were further processed using the NuGEN Ovation Whole Blood Solution system and cDNA was hybridized to Affymetrix Rat Genome 230 2.0 Arrays. The microarray QC parameters using RNA isolated with the QSI method were within the acceptable range for microarray analysis. The transcriptomic profiles were highly correlated with those using RNA isolated with the PAXgene method and were consistent with expected LPS-induced inflammatory responses. The present study demonstrated that the QSI method coupled with NuGEN Ovation Whole Blood Solution system is cost-effective and particularly suitable for transcriptomic profiling of minimal volumes of whole blood, typical of those obtained with small animal species.

  3. Human whole genome genotype and transcriptome data for Alzheimer’s and other neurodegenerative diseases

    PubMed Central

    Allen, Mariet; Carrasquillo, Minerva M.; Funk, Cory; Heavner, Benjamin D.; Zou, Fanggeng; Younkin, Curtis S.; Burgess, Jeremy D.; Chai, High-Seng; Crook, Julia; Eddy, James A.; Li, Hongdong; Logsdon, Ben; Peters, Mette A.; Dang, Kristen K.; Wang, Xue; Serie, Daniel; Wang, Chen; Nguyen, Thuy; Lincoln, Sarah; Malphrus, Kimberly; Bisceglio, Gina; Li, Ma; Golde, Todd E.; Mangravite, Lara M.; Asmann, Yan; Price, Nathan D.; Petersen, Ronald C.; Graff-Radford, Neill R.; Dickson, Dennis W.; Younkin, Steven G.; Ertekin-Taner, Nilüfer

    2016-01-01

    Previous genome-wide association studies (GWAS), conducted by our group and others, have identified loci that harbor risk variants for neurodegenerative diseases, including Alzheimer's disease (AD). Human disease variants are enriched for polymorphisms that affect gene expression, including some that are known to associate with expression changes in the brain. Postulating that many variants confer risk to neurodegenerative disease via transcriptional regulatory mechanisms, we have analyzed gene expression levels in the brain tissue of subjects with AD and related diseases. Herein, we describe our collective datasets comprised of GWAS data from 2,099 subjects; microarray gene expression data from 773 brain samples, 186 of which also have RNAseq; and an independent cohort of 556 brain samples with RNAseq. We expect that these datasets, which are available to all qualified researchers, will enable investigators to explore and identify transcriptional mechanisms contributing to neurodegenerative diseases. PMID:27727239

  4. Design and application of genome-scale reconstructed metabolic models.

    PubMed

    Rocha, Isabel; Förster, Jochen; Nielsen, Jens

    2008-01-01

    In this chapter, the process for the reconstruction of genome-scale metabolic networks is described, and some of the main applications of such models are illustrated. The reconstruction process can be viewed as an iterative process where information obtained from several sources is combined to construct a preliminary set of reactions and constraints. This involves steps such as genome annotation; identification of the reactions from the annotated genome sequence and available literature; determination of the reaction stoichiometry; definition of compartmentation and assignment of localization; determination of the biomass composition; measurement, calculation, or fitting of energy requirements; and definition of additional constraints. The reaction and constraint sets, after debugging, may be integrated into a stoichiometric model that can be used for simulation using tools such as Flux Balance Analysis (Section 3.8). From the flux distributions obtained, physiologic parameters such as growth yields or minimal medium components can be calculated, and their distance from similar experimental data provides a basis from where the model may need to be improved.

  5. Genome-scale metabolic models: reconstruction and analysis.

    PubMed

    Baart, Gino J E; Martens, Dirk E

    2012-01-01

    Metabolism can be defined as the complete set of chemical reactions that occur in living organisms in order to maintain life. Enzymes are the main players in this process as they are responsible for catalyzing the chemical reactions. The enzyme-reaction relationships can be used for the reconstruction of a network of reactions, which leads to a metabolic model of metabolism. A genome-scale metabolic network of chemical reactions that take place inside a living organism is primarily reconstructed from the information that is present in its genome and the literature and involves steps such as functional annotation of the genome, identification of the associated reactions and determination of their stoichiometry, assignment of localization, determination of the biomass composition, estimation of energy requirements, and definition of model constraints. This information can be integrated into a stoichiometric model of metabolism that can be used for detailed analysis of the metabolic potential of the organism using constraint-based modeling approaches and hence is valuable in understanding its metabolic capabilities.

  6. Mechanism for DNA transposons to generate introns on genomic scales.

    PubMed

    Huff, Jason T; Zilberman, Daniel; Roy, Scott W

    2016-10-27

    The discovery of introns four decades ago was one of the most unexpected findings in molecular biology. Introns are sequences interrupting genes that must be removed as part of messenger RNA production. Genome sequencing projects have shown that most eukaryotic genes contain at least one intron, and frequently many. Comparison of these genomes reveals a history of long evolutionary periods during which few introns were gained, punctuated by episodes of rapid, extensive gain. However, although several detailed mechanisms for such episodic intron generation have been proposed, none has been empirically supported on a genomic scale. Here we show how short, non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from the gene sequence that is duplicated upon transposon insertion, allowing perfect splicing out of the RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between pre-existing nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases and prevalence of nucleosome-sized exons observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism that can plausibly account for episodes of rapid, extensive intron gain during eukaryotic evolution.

  7. Transcriptomics and Functional Genomics of ROS-Induced Cell Death Regulation by RADICAL-INDUCED CELL DEATH1

    PubMed Central

    Salojärvi, Jarkko; Cui, Fuqiang; Sipari, Nina; Leppälä, Johanna; Lamminmäki, Airi; Tomai, Gloria; Narayanasamy, Shaman; Reddy, Ramesha A.; Keinänen, Markku; Overmyer, Kirk; Kangasjärvi, Jaakko

    2014-01-01

    Plant responses to changes in environmental conditions are mediated by a network of signaling events leading to downstream responses, including changes in gene expression and activation of cell death programs. Arabidopsis thaliana RADICAL-INDUCED CELL DEATH1 (RCD1) has been proposed to regulate plant stress responses by protein-protein interactions with transcription factors. Furthermore, the rcd1 mutant has defective control of cell death in response to apoplastic reactive oxygen species (ROS). Combining transcriptomic and functional genomics approaches we first used microarray analysis in a time series to study changes in gene expression after apoplastic ROS treatment in rcd1. To identify a core set of cell death regulated genes, RCD1-regulated genes were clustered together with other array experiments from plants undergoing cell death or treated with various pathogens, plant hormones or other chemicals. Subsequently, selected rcd1 double mutants were constructed to further define the genetic requirements for the execution of apoplastic ROS induced cell death. Through the genetic analysis we identified WRKY70 and SGT1b as cell death regulators functioning downstream of RCD1 and show that quantitative rather than qualitative differences in gene expression related to cell death appeared to better explain the outcome. Allocation of plant energy to defenses diverts resources from growth. Recently, a plant response termed stress-induced morphogenic response (SIMR) was proposed to regulate the balance between defense and growth. Using a rcd1 double mutant collection we show that SIMR is mostly independent of the classical plant defense signaling pathways and that the redox balance is involved in development of SIMR. PMID:24550736

  8. Predicting novel pathways in genome-scale metabolic networks.

    PubMed

    Schuster, Stefan; de Figueiredo, Luís F; Kaleta, Christoph

    2010-10-01

    Elementary-modes analysis has become a well-established theoretical tool in metabolic pathway analysis. It allows one to decompose complex metabolic networks into the smallest functional entities, which can be interpreted as biochemical pathways. This analysis has, in medium-size metabolic networks, led to the successful theoretical prediction of hitherto unknown pathways. For illustration, we discuss the example of the phosphoenolpyruvate-glyoxylate cycle in Escherichia coli. Elementary-modes analysis meets with the problem of combinatorial explosion in the number of pathways with increasing system size, which has hampered scaling it up to genome-wide models. We present a novel approach to overcoming this obstacle. That approach is based on elementary flux patterns, which are defined as sets of reactions representing the basic routes through a particular subsystem that are compatible with admissible fluxes in a (possibly) much larger metabolic network. The subsystem can be made up by reactions in which we are interested in, for example, reactions producing a certain metabolite. This allows one to predict novel metabolic pathways in genome-scale networks.

  9. Transcriptomic and genomic analysis of cellulose fermentation by Clostridium thermocellum ATCC 27405

    SciTech Connect

    Raman, Babu; McKeown, Catherine K; Rodriguez, Jr., Miguel; Brown, Steven D; Mielenz, Jonathan R

    2011-01-01

    The ability of Clostridium thermocellum ATCC 27405 wild-type strain to hydrolyze cellulose and ferment the degradation products directly to ethanol and other metabolic byproducts makes it an attractive candidate for consolidated bioprocessing of cellulosic biomass to biofuels. In this study, whole-genome microarrays were used to investigate the expression of C. thermocellum mRNA during growth on crystalline cellulose in controlled replicate batch fermentations. A time-series analysis of gene expression revealed changes in transcript levels of {approx}40% of genes ({approx}1300 out of 3198 ORFs encoded in the genome) during transition from early-exponential to late-stationary phase. K-means clustering of genes with statistically significant changes in transcript levels identified six distinct clusters of temporal expression. Broadly, genes involved in energy production, translation, glycolysis and amino acid, nucleotide and coenzyme metabolism displayed a decreasing trend in gene expression as cells entered stationary phase. In comparison, genes involved in cell structure and motility, chemotaxis, signal transduction and transcription showed an increasing trend in gene expression. Hierarchical clustering of cellulosome-related genes highlighted temporal changes in composition of this multi-enzyme complex during batch growth on crystalline cellulose, with increased expression of several genes encoding hydrolytic enzymes involved in degradation of non-cellulosic substrates in stationary phase. Overall, the results suggest that under low substrate availability, growth slows due to decreased metabolic potential and C. thermocellum alters its gene expression to (i) modulate the composition of cellulosomes that are released into the environment with an increased proportion of enzymes than can efficiently degrade plant polysaccharides other than cellulose, (ii) enhance signal transduction and chemotaxis mechanisms perhaps to sense the oligosaccharide hydrolysis products

  10. Transcriptomic and genomic evidence for Streptococcus agalactiae adaptation to the bovine environment

    PubMed Central

    2013-01-01

    Background Streptococcus agalactiae is a major cause of bovine mastitis, which is the dominant health disorder affecting milk production within the dairy industry and is responsible for substantial financial losses to the industry worldwide. However, there is considerable evidence for host adaptation (ecotypes) within S. agalactiae, with both bovine and human sourced isolates showing a high degree of distinctiveness, suggesting differing ability to cause mastitis. Here, we (i) generate RNAseq data from three S. agalactiae isolates (two putative bovine adapted and one human) and (ii) compare publicly available whole genome shotgun sequence data from an additional 202 isolates, obtained from six host species, to elucidate possible genetic factors/adaptations likely important for S. agalactiae growth and survival in the bovine mammary gland. Results Tests for differential expression showed distinct expression profiles for the three isolates when grown in bovine milk. A key finding for the two putatively bovine adapted isolates was the up regulation of a lactose metabolism operon (Lac.2) that was strongly correlated with the bovine environment (all 36 bovine sourced isolates on GenBank possessed the operon, in contrast to only 8/151 human sourced isolates). Multi locus sequence typing of all genome sequences and phylogenetic analysis using conserved operon genes from 44 S. agalactiae isolates and 16 additional Streptococcus species provided strong evidence for acquisition of the operon via multiple lateral gene transfer events, with all Streptococcus species known to be major causes of mastitis, identified as possible donors. Furthermore, lactose fermentation tests were only positive for isolates possessing Lac.2. Combined, these findings suggest that lactose metabolism is likely an important adaptation to the bovine environment. Additional up regulation in the bovine adapted isolates included genes involved in copper homeostasis, metabolism of purine, pyrimidine

  11. The genome, transcriptome, and proteome of the nematode Steinernema carpocapsae: evolutionary signatures of a pathogenic lifestyle.

    PubMed

    Rougon-Cardoso, Alejandra; Flores-Ponce, Mitzi; Ramos-Aboites, Hilda Eréndira; Martínez-Guerrero, Christian Eduardo; Hao, You-Jin; Cunha, Luis; Rodríguez-Martínez, Jonathan Alejandro; Ovando-Vázquez, Cesaré; Bermúdez-Barrientos, José Roberto; Abreu-Goodger, Cei; Chavarría-Hernández, Norberto; Simões, Nelson; Montiel, Rafael

    2016-11-23

    The entomopathogenic nematode Steinernema carpocapsae has been widely used for the biological control of insect pests. It shares a symbiotic relationship with the bacterium Xenorhabdus nematophila, and is emerging as a genetic model to study symbiosis and pathogenesis. We obtained a high-quality draft of the nematode's genome comprising 84,613,633 bp in 347 scaffolds, with an N50 of 1.24 Mb. To improve annotation, we sequenced both short and long RNA and conducted shotgun proteomic analyses. S. carpocapsae shares orthologous genes with other parasitic nematodes that are absent in the free-living nematode C. elegans, it has ncRNA families that are enriched in parasites, and expresses proteins putatively associated with parasitism and pathogenesis, suggesting an active role for the nematode during the pathogenic process. Host and parasites might engage in a co-evolutionary arms-race dynamic with genes participating in their interaction showing signatures of positive selection. Our analyses indicate that the consequence of this arms race is better characterized by positive selection altering specific functions instead of just increasing the number of positively selected genes, adding a new perspective to these co-evolutionary theories. We identified a protein, ATAD-3, that suggests a relevant role for mitochondrial function in the evolution and mechanisms of nematode parasitism.

  12. The genome, transcriptome, and proteome of the nematode Steinernema carpocapsae: evolutionary signatures of a pathogenic lifestyle

    PubMed Central

    Rougon-Cardoso, Alejandra; Flores-Ponce, Mitzi; Ramos-Aboites, Hilda Eréndira; Martínez-Guerrero, Christian Eduardo; Hao, You-Jin; Cunha, Luis; Rodríguez-Martínez, Jonathan Alejandro; Ovando-Vázquez, Cesaré; Bermúdez-Barrientos, José Roberto; Abreu-Goodger, Cei; Chavarría-Hernández, Norberto; Simões, Nelson; Montiel, Rafael

    2016-01-01

    The entomopathogenic nematode Steinernema carpocapsae has been widely used for the biological control of insect pests. It shares a symbiotic relationship with the bacterium Xenorhabdus nematophila, and is emerging as a genetic model to study symbiosis and pathogenesis. We obtained a high-quality draft of the nematode’s genome comprising 84,613,633 bp in 347 scaffolds, with an N50 of 1.24 Mb. To improve annotation, we sequenced both short and long RNA and conducted shotgun proteomic analyses. S. carpocapsae shares orthologous genes with other parasitic nematodes that are absent in the free-living nematode C. elegans, it has ncRNA families that are enriched in parasites, and expresses proteins putatively associated with parasitism and pathogenesis, suggesting an active role for the nematode during the pathogenic process. Host and parasites might engage in a co-evolutionary arms-race dynamic with genes participating in their interaction showing signatures of positive selection. Our analyses indicate that the consequence of this arms race is better characterized by positive selection altering specific functions instead of just increasing the number of positively selected genes, adding a new perspective to these co-evolutionary theories. We identified a protein, ATAD-3, that suggests a relevant role for mitochondrial function in the evolution and mechanisms of nematode parasitism. PMID:27876851

  13. Combined Large-Scale Phenotyping and Transcriptomics in Maize Reveals a Robust Growth Regulatory Network1[OPEN

    PubMed Central

    Herman, Dorota; Slabbinck, Bram; Pè, Mario Enrico

    2016-01-01

    Leaves are vital organs for biomass and seed production because of their role in the generation of metabolic energy and organic compounds. A better understanding of the molecular networks underlying leaf development is crucial to sustain global requirements for food and renewable energy. Here, we combined transcriptome profiling of proliferative leaf tissue with in-depth phenotyping of the fourth leaf at later stages of development in 197 recombinant inbred lines of two different maize (Zea mays) populations. Previously, correlation analysis in a classical biparental mapping population identified 1,740 genes correlated with at least one of 14 traits. Here, we extended these results with data from a multiparent advanced generation intercross population. As expected, the phenotypic variability was found to be larger in the latter population than in the biparental population, although general conclusions on the correlations among the traits are comparable. Data integration from the two diverse populations allowed us to identify a set of 226 genes that are robustly associated with diverse leaf traits. This set of genes is enriched for transcriptional regulators and genes involved in protein synthesis and cell wall metabolism. In order to investigate the molecular network context of the candidate gene set, we integrated our data with publicly available functional genomics data and identified a growth regulatory network of 185 genes. Our results illustrate the power of combining in-depth phenotyping with transcriptomics in mapping populations to dissect the genetic control of complex traits and present a set of candidate genes for use in biomass improvement. PMID:26754667

  14. The ClinSeq Project: Piloting large-scale genome sequencing for research in genomic medicine

    PubMed Central

    Biesecker, Leslie G.; Mullikin, James C.; Facio, Flavia M.; Turner, Clesson; Cherukuri, Praveen F.; Blakesley, Robert W.; Bouffard, Gerard G.; Chines, Peter S.; Cruz, Pedro; Hansen, Nancy F.; Teer, Jamie K.; Maskeri, Baishali; Young, Alice C.; Manolio, Teri A.; Wilson, Alexander F.; Finkel, Toren; Hwang, Paul; Arai, Andrew; Remaley, Alan T.; Sachdev, Vandana; Shamburek, Robert; Cannon, Richard O.; Green, Eric D.

    2009-01-01

    ClinSeq is a pilot project to investigate the use of whole-genome sequencing as a tool for clinical research. By piloting the acquisition of large amounts of DNA sequence data from individual human subjects, we are fostering the development of hypothesis-generating approaches for performing research in genomic medicine, including the exploration of issues related to the genetic architecture of disease, implementation of genomic technology, informed consent, disclosure of genetic information, and archiving, analyzing, and displaying sequence data. In the initial phase of ClinSeq, we are enrolling roughly 1000 participants; the evaluation of each includes obtaining a detailed family and medical history, as well as a clinical evaluation. The participants are being consented broadly for research on many traits and for whole-genome sequencing. Initially, Sanger-based sequencing of 300–400 genes thought to be relevant to atherosclerosis is being performed, with the resulting data analyzed for rare, high-penetrance variants associated with specific clinical traits. The participants are also being consented to allow the contact of family members for additional studies of sequence variants to explore their potential association with specific phenotypes. Here, we present the general considerations in designing ClinSeq, preliminary results based on the generation of an initial 826 Mb of sequence data, the findings for several genes that serve as positive controls for the project, and our views about the potential implications of ClinSeq. The early experiences with ClinSeq illustrate how large-scale medical sequencing can be a practical, productive, and critical component of research in genomic medicine. PMID:19602640

  15. MicroRNAs shape circadian hepatic gene expression on a transcriptome-wide scale

    PubMed Central

    Du, Ngoc-Hien; Arpat, Alaaddin Bulak; De Matos, Mara; Gatfield, David

    2014-01-01

    A considerable proportion of mammalian gene expression undergoes circadian oscillations. Post-transcriptional mechanisms likely make important contributions to mRNA abundance rhythms. We have investigated how microRNAs (miRNAs) contribute to core clock and clock-controlled gene expression using mice in which miRNA biogenesis can be inactivated in the liver. While the hepatic core clock was surprisingly resilient to miRNA loss, whole transcriptome sequencing uncovered widespread effects on clock output gene expression. Cyclic transcription paired with miRNA-mediated regulation was thus identified as a frequent phenomenon that affected up to 30% of the rhythmic transcriptome and served to post-transcriptionally adjust the phases and amplitudes of rhythmic mRNA accumulation. However, only few mRNA rhythms were actually generated by miRNAs. Overall, our study suggests that miRNAs function to adapt clock-driven gene expression to tissue-specific requirements. Finally, we pinpoint several miRNAs predicted to act as modulators of rhythmic transcripts, and identify rhythmic pathways particularly prone to miRNA regulation. DOI: http://dx.doi.org/10.7554/eLife.02510.001 PMID:24867642

  16. The Tetraodon nigroviridis reference transcriptome: developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome

    PubMed Central

    Basu, Swaraj; Hadzhiev, Yavor; Petrosino, Giuseppe; Nepal, Chirag; Gehrig, Jochen; Armant, Olivier; Ferg, Marco; Strahle, Uwe; Sanges, Remo; Müller, Ferenc

    2016-01-01

    Pufferfish such as fugu and tetraodon carry the smallest genomes among all vertebrates and are ideal for studying genome evolution. However, comparative genomics using these species is hindered by the poor annotation of their genomes. We performed RNA sequencing during key stages of maternal to zygotic transition of Tetraodon nigroviridis and report its first developmental transcriptome. We assembled 61,033 transcripts (23,837 loci) representing 80% of the annotated gene models and 3816 novel coding transcripts from 2667 loci. We demonstrate the similarities of gene expression profiles between pufferfish and zebrafish during maternal to zygotic transition and annotated 1120 long non-coding RNAs (lncRNAs) many of which differentially expressed during development. The promoters for 60% of the assembled transcripts result validated by CAGE-seq. Despite the extreme compaction of the tetraodon genome and the dramatic loss of transposons, the length of lncRNA exons remain comparable to that of other vertebrates and a small set of lncRNAs appears enriched for transposable elements suggesting a selective pressure acting on lncRNAs length and composition. Finally, a set of lncRNAs are microsyntenic between teleost and vertebrates, which indicates potential regulatory interactions between lncRNAs and their flanking coding genes. Our work provides a fundamental molecular resource for vertebrate comparative genomics and embryogenesis studies. PMID:27628538

  17. Genome-wide transcriptomic analysis reveals correlation between higher WRKY61 expression and reduced symptom severity in Turnip crinkle virus infected Arabidopsis thaliana

    PubMed Central

    Gao, Ruimin; Liu, Peng; Yong, Yuhan; Wong, Sek-Man

    2016-01-01

    Turnip crinkle virus (TCV) is a carmovirus that infects many Arabidopsis ecotypes. Most studies mainly focused on discovery of resistance genes against TCV infection, and there is no Next Generation Sequencing based comparative genome wide transcriptome analysis reported. In this study, RNA-seq based transcriptome analysis revealed that 238 (155 up-regulated and 83 down-regulated) significant differentially expressed genes with at least 15-fold change were determined. Fifteen genes (including upregulated, unchanged and downregulated) were selected for RNA-seq data validation using quantitative real-time PCR, which showed consistencies between these two sets of data. GO enrichment analysis showed that numerous terms such as stress, immunity, defence and chemical stimulus were affected in TCV-infected plants. One putative plant defence related gene named WRKY61 was selected for further investigation. It showed that WRKY61 overexpression plants displayed reduced symptoms and less virus accumulation, as compared to wild type (WT) and WRKY61 deficient lines, suggesting that higher WRKY61 expression level reduced TCV viral accumulation. In conclusion, our transcriptome analysis showed that global gene expression was detected in TCV-infected Arabidopsis thaliana. WRKY61 gene was shown to be negatively correlated with TCV infection and viral symptoms, which may be connected to plant immunity pathways. PMID:27086702

  18. Joint genome-wide association and transcriptome sequencing reveals a complex polygenic network underlying hypocotyl elongation in rapeseed (Brassica napus L.)

    PubMed Central

    Luo, Xiang; Xue, Zhifei; Ma, Chaozhi; Hu, Kaining; Zeng, Ziru; Dou, Shengwei; Tu, Jinxing; Shen, Jinxiong; Yi, Bin; Fu, Tingdong

    2017-01-01

    Hypocotyl elongation is considered an important typical seedling trait contributing directly to an increase in and stabilization of the yield in Brassica napus, but its molecular genetic mechanism is poorly understood. In the present study, hypocotyl lengths of 210 lines were measured in an illuminated culture room. A genome-wide association study (GWAS) was performed with 23,435 single nucleotide polymorphisms (SNPs) for hypocotyl length. Three lines with long hypocotyl length and three lines with short hypocotyl length from one doubled haploid line (DH) population were used for transcriptome sequencing. A GWAS followed by transcriptome analysis identified 29 differentially expressed genes associated with significant SNPs in B. napus. These genes regulate hypocotyl elongation by mediating flowering morphogenesis, circadian clock, hormone biosynthesis, or important metabolic signaling pathways. Among these genes, BnaC07g46770D negatively regulates hypocotyl elongation directly, as well as flowering time. Our results indicate that a joint GWAS and transcriptome analysis has significant potential for identifying the genes responsible for hypocotyl elongation; The extension of hypocotyl is a complex biological process regulated by a polygenic network. PMID:28139730

  19. Molecular Models Underlying the Sensitive and Specific Determination of Copy Number Changes in the Human Genome and Transcriptome

    NASA Astrophysics Data System (ADS)

    Laderman, Stephen

    2004-03-01

    DNA microarrays enable the measurement of relative abundances of messenger RNA (mRNA) molecules across biological conditions on a gene-by-gene basis for tens of thousands of genes at a time, encompassing essentially all the mRNA molecules present in the sample's cells. These highly multiplexed molecular measurements are greatly accelerating basic and applied research in disease progression and developmental biology. In the study of cancer and certain genetic disorders, similar snapshots of copy numbers of genomic DNA are also of great interest. For example, tumor suppressor genes that protect against improper cell growth are lost from the genome and oncogenes that accelerate cell proliferation are multiplied within the genome as cancer cells evolve. Microarray measurements of mRNA and DNA are themselves an aggregate of molecular level phenomena. Quantitative models of in vitro behavior at the molecular scale are essential for creating effective measurement systems that have sufficient sensitivity and specificity to realize the highest utility. Such models are actively used in critical areas such as designing the means of incorporating fluorophores into the sample molecules, building high quality DNA sequences onto the microarrays, and developing assay parameters that balance high sensitivity with an ability to distinguish genes from each other. Trends in the applications of the technology ensure continued development towards even more precise, reproducible, sensitive and specific measurement systems, encouraging the further development and application of relevant molecular models.

  20. Genome-scale Proteome Quantification by DEEP SEQ Mass Spectrometry

    PubMed Central

    Zhou, Feng; Lu, Yu; Ficarro, Scott B.; Adelmant, Guillaume; Jiang, Wenyu; Luckey, C. John; Marto, Jarrod A.

    2013-01-01

    Advances in chemistry and massively parallel detection underlie DNA sequencing platforms that are poised for application in personalized medicine. In stark contrast, systematic generation of protein-level data lags well-behind genomics in virtually every aspect: depth of coverage, throughput, ease of sample preparation, and experimental time. Here, to bridge this gap, we develop an approach based on simple detergent lysis and single-enzyme digest, extreme, orthogonal separation of peptides, and true nanoflow LC-MS/MS that provides high peak capacity and ionization efficiency. This automated, deep efficient peptide sequencing and quantification (DEEP SEQ) mass spectrometry platform provides genome-scale proteome coverage equivalent to RNA-seq ribosomal profiling and accurate quantification for multiplexed isotope labels. In a model of the embryonic to epiblast transition in murine stem cells, we unambiguously quantify 11,352 gene products that span 70% of Swiss-Prot and capture protein regulation across the full detectable range of high-throughput gene expression and protein translation. PMID:23863870

  1. Next-generation genome-scale models for metabolic engineering.

    PubMed

    King, Zachary A; Lloyd, Colton J; Feist, Adam M; Palsson, Bernhard O

    2015-12-01

    Constraint-based reconstruction and analysis (COBRA) methods have become widely used tools for metabolic engineering in both academic and industrial laboratories. By employing a genome-scale in silico representation of the metabolic network of a host organism, COBRA methods can be used to predict optimal genetic modifications that improve the rate and yield of chemical production. A new generation of COBRA models and methods is now being developed--encompassing many biological processes and simulation strategies-and next-generation models enable new types of predictions. Here, three key examples of applying COBRA methods to strain optimization are presented and discussed. Then, an outlook is provided on the next generation of COBRA models and the new types of predictions they will enable for systems metabolic engineering.

  2. A refined genome-scale reconstruction of Chlamydomonas metabolism provides a platform for systems-level analyses

    PubMed Central

    Imam, Saheed; Schäuble, Sascha; Valenzuela, Jacob; de Lomana, Adrián López García; Carter, Warren; Price, Nathan D.; Baliga, Nitin S.

    2015-01-01

    Summary Microalgae have reemerged as organisms of prime biotechnological interest due to their ability to synthesize a suite of valuable chemicals. To harness the capabilities of these organisms, we need a comprehensive systems-level understanding of their metabolism, which can be fundamentally achieved through large-scale mechanistic models of metabolism. In this study, we present a revised and significantly improved genome-scale metabolic model for the widely-studied microalga, Chlamydomonas reinhardtii. The model, iCre1355, represents a major advance over previous models, both in content and predictive power. iCre1355 encompasses a broad range of metabolic functions encoded across the nuclear, chloroplast and mitochondrial genomes accounting for 1355 genes (1460 transcripts), 2394 and 1133 metabolites. We found improved performance over the previous metabolic model based on comparisons of predictive accuracy across 306 phenotypes (from 81 mutants), lipid yield analysis and growth rates derived from chemostat-grown cells (under 3 conditions). Measurement of macronutrient uptake revealed carbon and phosphate to be good predictors of growth rate, while nitrogen consumption appeared to be in excess. We analyzed high-resolution time series transcriptomics data using iCre1355 to uncover dynamic pathway-level changes that occur in response to nitrogen starvation and changes in light intensity. This approach enabled accurate prediction of growth rates, the cessation of growth and accumulation of triacylglycerols during nitrogen starvation, and the temporal response of different growth-associated pathways to increased light intensity. Thus, iCre1355 represents an experimentally-validated genome-scale reconstruction of C. reinhardtii metabolism that should serve as a useful resource for studying the metabolic processes of this and related microalgae. PMID:26485611

  3. Genomic, Transcriptomic and Metabolomic Studies of Two Well-Characterized, Laboratory-Derived Vancomycin-Intermediate Staphylococcus aureus Strains Derived from the Same Parent Strain

    PubMed Central

    Hattangady, Dipti S.; Singh, Atul K.; Muthaiyan, Arun; Jayaswal, Radheshyam K.; Gustafson, John E.; Ulanov, Alexander V.; Li, Zhong; Wilkinson, Brian J.; Pfeltz, Richard F.

    2015-01-01

    Complete genome comparisons, transcriptomic and metabolomic studies were performed on two laboratory-selected, well-characterized vancomycin-intermediate Staphylococcus aureus (VISA) derived from the same parent MRSA that have changes in cell wall composition and decreased autolysis. A variety of mutations were found in the VISA, with more in strain 13136p−m+V20 (vancomycin MIC = 16 µg/mL) than strain 13136p−m+V5 (MIC = 8 µg/mL). Most of the mutations have not previously been associated with the VISA phenotype; some were associated with cell wall metabolism and many with stress responses, notably relating to DNA damage. The genomes and transcriptomes of the two VISA support the importance of gene expression regulation to the VISA phenotype. Similarities in overall transcriptomic and metabolomic data indicated that the VISA physiologic state includes elements of the stringent response, such as downregulation of protein and nucleotide synthesis, the pentose phosphate pathway and nutrient transport systems. Gene expression for secreted virulence determinants was generally downregulated, but was more variable for surface-associated virulence determinants, although capsule formation was clearly inhibited. The importance of activated stress response elements could be seen across all three analyses, as in the accumulation of osmoprotectant metabolites such as proline and glutamate. Concentrations of potential cell wall precursor amino acids and glucosamine were increased in the VISA strains. Polyamines were decreased in the VISA, which may facilitate the accrual of mutations. Overall, the studies confirm the wide variability in mutations and gene expression patterns that can lead to the VISA phenotype. PMID:27025616

  4. Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii): The Identification of Genes and Markers Associated with Reproduction

    PubMed Central

    Jung, Hyungtaek; Yoon, Byung-Ha; Kim, Woo-Jin; Kim, Dong-Wook; Hurwood, David A.; Lyons, Russell E.; Salin, Krishna R.; Kim, Heui-Soo; Baek, Ilseon; Chand, Vincent; Mather, Peter B.

    2016-01-01

    The giant freshwater prawn, Macrobrachium rosenbergii, a sexually dimorphic decapod crustacean is currently the world’s most economically important cultured freshwater crustacean species. Despite its economic importance, there is currently a lack of genomic resources available for this species, and this has limited exploration of the molecular mechanisms that control the M. rosenbergii sex-differentiation system more widely in freshwater prawns. Here, we present the first hybrid transcriptome from M. rosenbergii applying RNA-Seq technologies directed at identifying genes that have potential functional roles in reproductive-related traits. A total of 13,733,210 combined raw reads (1720 Mbp) were obtained from Ion-Torrent PGM and 454 FLX. Bioinformatic analyses based on three state-of-the-art assemblers, the CLC Genomic Workbench, Trans-ABySS, and Trinity, that use single and multiple k-mer methods respectively, were used to analyse the data. The influence of multiple k-mers on assembly performance was assessed to gain insight into transcriptome assembly from short reads. After optimisation, de novo assembly resulted in 44,407 contigs with a mean length of 437 bp, and the assembled transcripts were further functionally annotated to detect single nucleotide polymorphisms and simple sequence repeat motifs. Gene expression analysis was also used to compare expression patterns from ovary and testis tissue libraries to identify genes with potential roles in reproduction and sex differentiation. The large transcript set assembled here represents the most comprehensive set of transcriptomic resources ever developed for reproduction traits in M. rosenbergii, and the large number of genetic markers predicted should constitute an invaluable resource for future genetic research studies on M. rosenbergii and can be applied more widely on other freshwater prawn species in the genus Macrobrachium. PMID:27164098

  5. Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii): The Identification of Genes and Markers Associated with Reproduction.

    PubMed

    Jung, Hyungtaek; Yoon, Byung-Ha; Kim, Woo-Jin; Kim, Dong-Wook; Hurwood, David A; Lyons, Russell E; Salin, Krishna R; Kim, Heui-Soo; Baek, Ilseon; Chand, Vincent; Mather, Peter B

    2016-05-07

    The giant freshwater prawn, Macrobrachium rosenbergii, a sexually dimorphic decapod crustacean is currently the world's most economically important cultured freshwater crustacean species. Despite its economic importance, there is currently a lack of genomic resources available for this species, and this has limited exploration of the molecular mechanisms that control the M. rosenbergii sex-differentiation system more widely in freshwater prawns. Here, we present the first hybrid transcriptome from M. rosenbergii applying RNA-Seq technologies directed at identifying genes that have potential functional roles in reproductive-related traits. A total of 13,733,210 combined raw reads (1720 Mbp) were obtained from Ion-Torrent PGM and 454 FLX. Bioinformatic analyses based on three state-of-the-art assemblers, the CLC Genomic Workbench, Trans-ABySS, and Trinity, that use single and multiple k-mer methods respectively, were used to analyse the data. The influence of multiple k-mers on assembly performance was assessed to gain insight into transcriptome assembly from short reads. After optimisation, de novo assembly resulted in 44,407 contigs with a mean length of 437 bp, and the assembled transcripts were further functionally annotated to detect single nucleotide polymorphisms and simple sequence repeat motifs. Gene expression analysis was also used to compare expression patterns from ovary and testis tissue libraries to identify genes with potential roles in reproduction and sex differentiation. The large transcript set assembled here represents the most comprehensive set of transcriptomic resources ever developed for reproduction traits in M. rosenbergii, and the large number of genetic markers predicted should constitute an invaluable resource for future genetic research studies on M. rosenbergii and can be applied more widely on other freshwater prawn species in the genus Macrobrachium.

  6. Accomplishments in genome-scale in silico modeling for industrial and medical biotechnology

    PubMed Central

    Milne, Caroline B.; Kim, Pan-Jun; Eddy, James A.; Price, Nathan D.

    2011-01-01

    Driven by advancements in high-throughput biological technologies and the growing number of sequenced genomes, the construction of in silico models at the genome scale has provided powerful tools to investigate a vast array of biological systems and applications. Here, we review comprehensively the uses of such models in industrial and medical biotechnology, including biofuel generation, food production, and drug development. While the use of in silico models is still in its early stages for delivering to industry, significant initial successes have been achieved. For the cases presented here, genome-scale models predict engineering strategies to enhance properties of interest in an organism or to inhibit harmful mechanisms of pathogens or in disease. Going forward, genome-scale in silico models promise to extend their application and analysis scope to become a transformative tool in biotechnology. As such, genome-scale models can provide a basis for rational genome-scale engineering and synthetic biology. PMID:19946878

  7. Advances in Swine Transcriptomics

    PubMed Central

    Tuggle, Christopher K.; Wang, Yanfang; Couture, Oliver

    2007-01-01

    The past five years have seen a tremendous rise in porcine transcriptomic data. Available porcine Expressed Sequence Tags (ESTs) have expanded greatly, with over 623,000 ESTs deposited in Genbank. ESTs have been used to expand the pig-human comparative maps, but such data has also been used in many ways to understand pig gene expression. Several methods have been used to identify genes differentially expressed (DE) in specific tissues or cell types under different treatments. These include open screening methods such as suppression subtractive hybridization, differential display, serial analysis of gene expression, and EST sequence frequency, as well as closed methods that measure expression of a defined set of sequences such as hybridization to membrane arrays and microarrays. The use of microarrays to begin large-scale transcriptome analysis has been recently reported, using either specialized or broad-coverage arrays. This review covers published results using the above techniques in the pig, as well as unpublished data provided by the research community, and reports on unpublished Affymetrix data from our group. Published and unpublished bioinformatics efforts are discussed, including recent work by our group to integrate two broad-coverage microarray platforms. We conclude by predicting experiments that will become possible with new anticipated tools and data, including the porcine genome sequence. We emphasize that the need for bioinformatics infrastructure to efficiently store and analyze the expanding amounts of gene expression data is critical, and that this deficit has emerged as a limiting factor for acceleration of genomic understanding in the pig. PMID:17384733

  8. Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling.

    PubMed

    Medina, Ignacio; Carbonell, José; Pulido, Luis; Madeira, Sara C; Goetz, Stefan; Conesa, Ana; Tárraga, Joaquín; Pascual-Montano, Alberto; Nogales-Cadenas, Ruben; Santoyo, Javier; García, Francisco; Marbà, Martina; Montaner, David; Dopazo, Joaquín

    2010-07-01

    Babelomics is a response to the growing necessity of integrating and analyzing different types of genomic data in an environment that allows an easy functional interpretation of the results. Babelomics includes a complete suite of methods for the analysis of gene expression data that include normalization (covering most commercial platforms), pre-processing, differential gene expression (case-controls, multiclass, survival or continuous values), predictors, clustering; large-scale genotyping assays (case controls and TDTs, and allows population stratification analysis and correction). All these genomic data analysis facilities are integrated and connected to multiple options for the functional interpretation of the experiments. Different methods of functional enrichment or gene set enrichment can be used to understand the functional basis of the experiment analyzed. Many sources of biological information, which include functional (GO, KEGG, Biocarta, Reactome, etc.), regulatory (Transfac, Jaspar, ORegAnno, miRNAs, etc.), text-mining or protein-protein interaction modules can be used for this purpose. Finally a tool for the de novo functional annotation of sequences has been included in the system. This provides support for the functional analysis of non-model species. Mirrors of Babelomics or command line execution of their individual components are now possible. Babelomics is available at http://www.babelomics.org.

  9. Phylogenetic marker development for target enrichment from transcriptome and genome skim data: the pipeline and its application in southern African Oxalis (Oxalidaceae).

    PubMed

    Schmickl, Roswitha; Liston, Aaron; Zeisek, Vojtěch; Oberlander, Kenneth; Weitemier, Kevin; Straub, Shannon C K; Cronn, Richard C; Dreyer, Léanne L; Suda, Jan

    2016-09-01

    Phylogenetics benefits from using a large number of putatively independent nuclear loci and their combination with other sources of information, such as the plastid and mitochondrial genomes. To facilitate the selection of orthologous low-copy nuclear (LCN) loci for phylogenetics in nonmodel organisms, we created an automated and interactive script to select hundreds of LCN loci by a comparison between transcriptome and genome skim data. We used our script to obtain LCN genes for southern African Oxalis (Oxalidaceae), a speciose plant lineage in the Greater Cape Floristic Region. This resulted in 1164 LCN genes greater than 600 bp. Using target enrichment combined with genome skimming (Hyb-Seq), we obtained on average 1141 LCN loci, nearly the whole plastid genome and the nrDNA cistron from 23 southern African Oxalis species. Despite a wide range of gene trees, the phylogeny based on the LCN genes was very robust, as retrieved through various gene and species tree reconstruction methods as well as concatenation. Cytonuclear discordance was strong. This indicates that organellar phylogenies alone are unlikely to represent the species tree and stresses the utility of Hyb-Seq in phylogenetics.

  10. The genome and transcriptome of Trichormus sp. NMC-1: insights into adaptation to extreme environments on the Qinghai-Tibet Plateau

    PubMed Central

    Qiao, Qin; Huang, Yanyan; Qi, Ji; Qu, Mingzhi; Jiang, Chen; Lin, Pengcheng; Li, Renhui; Song, Lirong; Yonezawa, Takahiro; Hasegawa, Masami; Crabbe, M. James C.; Chen, Fan; Zhang, Ticao; Zhong, Yang

    2016-01-01

    The Qinghai-Tibet Plateau (QTP) has the highest biodiversity for an extreme environment worldwide, and provides an ideal natural laboratory to study adaptive evolution. In this study, we generated a draft genome sequence of cyanobacteria Trichormus sp. NMC-1 in the QTP and performed whole transcriptome sequencing under low temperature to investigate the genetic mechanism by which T. sp. NMC-1 adapted to the specific environment. Its genome sequence was 5.9 Mb with a G+C content of 39.2% and encompassed a total of 5362 CDS. A phylogenomic tree indicated that this strain belongs to the Trichormus and Anabaena cluster. Genome comparison between T. sp. NMC-1 and six relatives showed that functionally unknown genes occupied a much higher proportion (28.12%) of the T. sp. NMC-1 genome. In addition, functions of specific, significant positively selected, expanded orthogroups, and differentially expressed genes involved in signal transduction, cell wall/membrane biogenesis, secondary metabolite biosynthesis, and energy production and conversion were analyzed to elucidate specific adaptation traits. Further analyses showed that the CheY-like genes, extracellular polysaccharide and mycosporine-like amino acids might play major roles in adaptation to harsh environments. Our findings indicate that sophisticated genetic mechanisms are involved in cyanobacterial adaptation to the extreme environment of the QTP. PMID:27381465

  11. Genome-scale mRNA and small RNA transcriptomic insights into initiation of citrus apomixis

    PubMed Central

    Long, Jian-Mei; Liu, Zheng; Wu, Xiao-Meng; Fang, Yan-Ni; Jia, Hui-Hui; Xie, Zong-Zhou; Deng, Xiu-Xin; Guo, Wen-Wu

    2016-01-01

    Nucellar embryony (NE) is an adventitious form of apomixis common in citrus, wherein asexual embryos initiate directly from nucellar cells surrounding the embryo sac. NE enables the fixation of desirable agronomic traits and the production of clonal offspring of virus-free rootstock, but impedes progress in hybrid breeding. In spite of the great importance of NE in citrus breeding and commercial production, little is understood about the underlying molecular mechanisms. In this study, the stages of nucellar embryo initiation (NEI) were determined for two polyembryonic citrus cultivars via histological observation. To explore the genes and regulatory pathways involved in NEI, we performed mRNA-seq and sRNA-seq analyses of ovules immediately prior to and at stages during NEI in the two pairs of cultivars. A total of 305 differentially expressed genes (DEGs) were identified between the poly- and monoembryonic ovules. Gene ontology (GO) analysis revealed that several processes are significantly enriched based on DEGs. In particular, response to stress, and especially response to oxidative stress, was over-represented in polyembryonic ovules. Nearly 150 miRNAs, comprising ~90 conserved and ~60 novel miRNAs, were identified in the ovules of either cultivar pair. Only two differentially expressed miRNAs (DEMs) were identified, of which the novel miRN23-5p was repressed whereas the targets accumulated in the polyembryonic ovules. This integrated study on the transcriptional and post-transcriptional regulatory profiles between poly- and monoembryonic citrus ovules provides new insights into the mechanism of NE, which should contribute to revealing the regulatory mechanisms of plant apomixis. PMID:27619233

  12. Genome-scale mRNA and small RNA transcriptomic insights into initiation of citrus apomixis.

    PubMed

    Long, Jian-Mei; Liu, Zheng; Wu, Xiao-Meng; Fang, Yan-Ni; Jia, Hui-Hui; Xie, Zong-Zhou; Deng, Xiu-Xin; Guo, Wen-Wu

    2016-10-01

    Nucellar embryony (NE) is an adventitious form of apomixis common in citrus, wherein asexual embryos initiate directly from nucellar cells surrounding the embryo sac. NE enables the fixation of desirable agronomic traits and the production of clonal offspring of virus-free rootstock, but impedes progress in hybrid breeding. In spite of the great importance of NE in citrus breeding and commercial production, little is understood about the underlying molecular mechanisms. In this study, the stages of nucellar embryo initiation (NEI) were determined for two polyembryonic citrus cultivars via histological observation. To explore the genes and regulatory pathways involved in NEI, we performed mRNA-seq and sRNA-seq analyses of ovules immediately prior to and at stages during NEI in the two pairs of cultivars. A total of 305 differentially expressed genes (DEGs) were identified between the poly- and monoembryonic ovules. Gene ontology (GO) analysis revealed that several processes are significantly enriched based on DEGs. In particular, response to stress, and especially response to oxidative stress, was over-represented in polyembryonic ovules. Nearly 150 miRNAs, comprising ~90 conserved and ~60 novel miRNAs, were identified in the ovules of either cultivar pair. Only two differentially expressed miRNAs (DEMs) were identified, of which the novel miRN23-5p was repressed whereas the targets accumulated in the polyembryonic ovules. This integrated study on the transcriptional and post-transcriptional regulatory profiles between poly- and monoembryonic citrus ovules provides new insights into the mechanism of NE, which should contribute to revealing the regulatory mechanisms of plant apomixis.

  13. Architecture of thermal adaptation in an Exiguobacterium sibiricum strain isolated from 3 million year old permafrost: A genome and transcriptome approach

    PubMed Central

    Rodrigues, Debora F; Ivanova, Natalia; He, Zhili; Huebner, Marianne; Zhou, Jizhong; Tiedje, James M

    2008-01-01

    Background Many microorganisms have a wide temperature growth range and versatility to tolerate large thermal fluctuations in diverse environments, however not many have been fully explored over their entire growth temperature range through a holistic view of its physiology, genome, and transcriptome. We used Exiguobacterium sibiricum strain 255-15, a psychrotrophic bacterium from 3 million year old Siberian permafrost that grows from -5°C to 39°C to study its thermal adaptation. Results The E. sibiricum genome has one chromosome and two small plasmids with a total of 3,015 protein-encoding genes (CDS), and a GC content of 47.7%. The genome and transcriptome analysis along with the organism's known physiology was used to better understand its thermal adaptation. A total of 27%, 3.2%, and 5.2% of E. sibiricum CDS spotted on the DNA microarray detected differentially expressed genes in cells grown at -2.5°C, 10°C, and 39°C, respectively, when compared to cells grown at 28°C. The hypothetical and unknown genes represented 10.6%, 0.89%, and 2.3% of the CDS differentially expressed when grown at -2.5°C, 10°C, and 39°C versus 28°C, respectively. Conclusion The results show that E. sibiricum is constitutively adapted to cold temperatures stressful to mesophiles since little differential gene expression was observed between 4°C and 28°C, but at the extremities of its Arrhenius growth profile, namely -2.5°C and 39°C, several physiological and metabolic adaptations associated with stress responses were observed. PMID:19019206

  14. RNA-Seq of Guar (Cyamopsis tetragonoloba, L. Taub.) Leaves: De novo Transcriptome Assembly, Functional Annotation and Development of Genomic Resources

    PubMed Central

    Tanwar, Umesh K.; Pruthi, Vikas; Randhawa, Gursharn S.

    2017-01-01

    Genetic improvement in industrially important guar (Cyamopsis tetragonoloba, L. Taub.) crop has been hindered due to the lack of sufficient genomic or transcriptomic resources. In this study, RNA-Seq technology was employed to characterize the transcriptome of leaf tissues from two guar varieties, namely, M-83 and RGC-1066. Approximately 30 million high-quality pair-end reads of each variety generated by Illumina HiSeq platform were used for de novo assembly by Trinity program. A total of 62,146 non-redundant unigenes with an average length of 679 bp were obtained. The quality assessment of assembled unigenes revealed 87.50% of complete and 97.18% partial core eukaryotic genes (CEGs). Sequence similarity analyses and annotation of the unigenes against non-redundant protein (Nr) and Gene Ontology (GO) databases identified 175,882 GO annotations. A total of 11,308 guar unigenes were annotated with various enzyme codes (EC) and categorized in six categories with 55 subclasses. The annotation of biochemical pathways resulted in a total of 11,971 unigenes assigned with 145 KEGG maps and 1759 enzyme codes. The species distribution analysis of the unigenes showed highest similarity with Glycine max genes. A total of 5773 potential simple sequence repeats (SSRs) and 3594 high-quality single nucleotide polymorphisms (SNPs) were identified. Out of 20 randomly selected SSRs for wet laboratory validation, 13 showed consistent PCR amplification in both guar varieties. In silico studies identified 145 polymorphic SSR markers in two varieties. To the best of our knowledge, this is the first report on transcriptome analysis and SNPs identification in guar till date. PMID:28210265

  15. Genome-scale genetic engineering in Escherichia coli.

    PubMed

    Jeong, Jaehwan; Cho, Namjin; Jung, Daehee; Bang, Duhee

    2013-11-01

    Genome engineering has been developed to create useful strains for biological studies and industrial uses. However, a continuous challenge remained in the field: technical limitations in high-throughput screening and precise manipulation of strains. Today, technical improvements have made genome engineering more rapid and efficient. This review introduces recent advances in genome engineering technologies applied to Escherichia coli as well as multiplex automated genome engineering (MAGE), a recent technique proposed as a powerful toolkit due to its straightforward process, rapid experimental procedures, and highly efficient properties.

  16. Temporal profiling of the coding and noncoding murine cytomegalovirus transcriptomes.

    PubMed

    Lacaze, Paul; Forster, Thorsten; Ross, Alan; Kerr, Lorraine E; Salvo-Chirnside, Eliane; Lisnic, Vanda Juranic; López-Campos, Guillermo H; García-Ramírez, José J; Messerle, Martin; Trgovcich, Joanne; Angulo, Ana; Ghazal, Peter

    2011-06-01

    The global transcriptional program of murine cytomegalovirus (MCMV), involving coding, noncoding, and antisense transcription, remains unknown. Here we report an oligonucleotide custom microarray platform capable of measuring both coding and noncoding transcription on a genome-wide scale. By profiling MCMV wild-type and immediate-early mutant strains in fibroblasts, we found rapid activation of the transcriptome by 6.5 h postinfection, with absolute dependency on ie3, but not ie1 or ie2, for genomic programming of viral gene expression. Evidence is also presented to show, for the first time, genome-wide noncoding and bidirectional transcription at late stages of MCMV infection.

  17. Genomics and Transcriptomics Analyses of the Oil-Accumulating Basidiomycete Yeast Trichosporon oleaginosus: Insights into Substrate Utilization and Alternative Evolutionary Trajectories of Fungal Mating Systems

    PubMed Central

    Bracharz, Felix; Lorenzen, Jan; Kracht, Octavia N.; Chovatia, Mansi; Daum, Chris; Deshpande, Shweta; Lipzen, Anna; Nolan, Matt; Ohm, Robin A.; Grigoriev, Igor V.; Sun, Sheng; Heitman, Joseph

    2015-01-01

    ABSTRACT Microbial fermentation of agro-industrial waste holds great potential for reducing the environmental impact associated with the production of lipids for industrial purposes from plant biomass. However, the chemical complexity of many residues currently prevents efficient conversion into lipids, creating a high demand for strains with the ability to utilize all energy-rich components of agricultural residues. Here, we present results of genome and transcriptome analyses of Trichosporon oleaginosus. This oil-accumulating yeast is able to grow on a wide variety of substrates, including pentoses and N-acetylglucosamine, making it an interesting candidate for biotechnological applications. Transcriptomics shows specific changes in gene expression patterns under lipid-accumulating conditions. Furthermore, gene content and expression analyses indicate that T. oleaginosus is well-adapted for the utilization of chitin-rich biomass. We also focused on the T. oleaginosus mating type, because this species is a member of the Tremellomycetes, a group that has been intensively analyzed as a model for the evolution of sexual development, the best-studied member being Cryptococcus neoformans. The structure of the T. oleaginosus mating-type regions differs significantly from that of other Tremellomycetes and reveals a new evolutionary trajectory paradigm. Comparative analysis shows that recruitment of developmental genes to the ancestral tetrapolar mating-type loci occurred independently in the Trichosporon and Cryptococcus lineages, supporting the hypothesis of a trend toward larger mating-type regions in fungi. PMID:26199329

  18. Whole genome and transcriptome analyses of environmental antibiotic sensitive and multi-resistant Pseudomonas aeruginosa isolates exposed to waste water and tap water.

    PubMed

    Schwartz, Thomas; Armant, Olivier; Bretschneider, Nancy; Hahn, Alexander; Kirchen, Silke; Seifert, Martin; Dötsch, Andreas

    2015-01-01

    The fitness of sensitive and resistant Pseudomonas aeruginosa in different aquatic environments depends on genetic capacities and transcriptional regulation. Therefore, an antibiotic-sensitive isolate PA30 and a multi-resistant isolate PA49 originating from waste waters were compared via whole genome and transcriptome Illumina sequencing after exposure to municipal waste water and tap water. A number of different genomic islands (e.g. PAGIs, PAPIs) were identified in the two environmental isolates beside the highly conserved core genome. Exposure to tap water and waste water exhibited similar transcriptional impacts on several gene clusters (antibiotic and metal resistance, genetic mobile elements, efflux pumps) in both environmental P. aeruginosa isolates. The MexCD-OprJ efflux pump was overexpressed in PA49 in response to waste water. The expression of resistance genes, genetic mobile elements in PA49 was independent from the water matrix. Consistently, the antibiotic sensitive strain PA30 did not show any difference in expression of the intrinsic resistance determinants and genetic mobile elements. Thus, the exposure of both isolates to polluted waste water and oligotrophic tap water resulted in similar expression profiles of mentioned genes. However, changes in environmental milieus resulted in rather unspecific transcriptional responses than selected and stimuli-specific gene regulation.

  19. The genome and life-stage specific transcriptomes of Globodera pallida elucidate key aspects of plant parasitism by a cyst nematode

    PubMed Central

    2014-01-01

    Background Globodera pallida is a devastating pathogen of potato crops, making it one of the most economically important plant parasitic nematodes. It is also an important model for the biology of cyst nematodes. Cyst nematodes and root-knot nematodes are the two most important plant parasitic nematode groups and together represent a global threat to food security. Results We present the complete genome sequence of G. pallida, together with transcriptomic data from most of the nematode life cycle, particularly focusing on the life cycle stages involved in root invasion and establishment of the biotrophic feeding site. Despite the relatively close phylogenetic relationship with root-knot nematodes, we describe a very different gene family content between the two groups and in particular extensive differences in the repertoire of effectors, including an enormous expansion of the SPRY domain protein family in G. pallida, which includes the SPRYSEC family of effectors. This highlights the distinct biology of cyst nematodes compared to the root-knot nematodes that were, until now, the only sedentary plant parasitic nematodes for which genome information was available. We also present in-depth descriptions of the repertoires of other genes likely to be important in understanding the unique biology of cyst nematodes and of potential drug targets and other targets for their control. Conclusions The data and analyses we present will be central in exploiting post-genomic approaches in the development of much-needed novel strategies for the control of G. pallida and related pathogens. PMID:24580726

  20. Transcriptomes of the desiccation-tolerant resurrection plant Craterostigma plantagineum.

    PubMed

    Rodriguez, Maria C Suarez; Edsgärd, Daniel; Hussain, Syed S; Alquezar, David; Rasmussen, Morten; Gilbert, Thomas; Nielsen, Bjørn H; Bartels, Dorothea; Mundy, John

    2010-07-01

    Studies of the resurrection plant Craterostigma plantagineum have revealed some of the mechanisms which these desiccation-tolerant plants use to survive environments with extreme dehydration and restricted seasonal water. Most resurrection plants are polyploid with large genomes, which has hindered efforts to obtain whole genome sequences and perform mutational analysis. However, the application of deep sequencing technologies to transcriptomics now permits large-scale analyses of gene expression patterns despite the lack of a reference genome. Here we use pyro-sequencing to characterize the transcriptomes of C. plantagineum leaves at four stages of dehydration and rehydration. This reveals that genes involved in several pathways, such as those required for vitamin K and thiamin biosynthesis, are tightly regulated at the level of gene expression. Our analysis also provides a comprehensive picture of the array of cellular responses controlled by gene expression that allow resurrection plants to survive desiccation.

  1. Transcriptome Analysis of Two Vicia sativa Subspecies: Mining Molecular Markers to Enhance Genomic Resources for Vetch Improvement.

    PubMed

    Kim, Tae-Sung; Raveendar, Sebastin; Suresh, Sundan; Lee, Gi-An; Lee, Jung-Ro; Cho, Joon-Hyeong; Lee, Sok-Young; Ma, Kyung-Ho; Cho, Gyu-Taek; Chung, Jong-Wook

    2015-11-02

    The vetch (Vicia sativa) is one of the most important annual forage legumes globally due to its multiple uses and high nutritional content. Despite these agronomical benefits, many drawbacks, including cyano-alanine toxin, has reduced the agronomic value of vetch varieties. Here, we used 454 technology to sequence the two V. sativa subspecies (ssp. sativa and ssp. nigra) to enrich functional information and genetic marker resources for the vetch research community. A total of 86,532 and 47,103 reads produced 35,202 and 18,808 unigenes with average lengths of 735 and 601 bp for V. sativa sativa and V. sativa nigra, respectively. Gene Ontology annotations and the cluster of orthologous gene classes were used to annotate the function of the Vicia transcriptomes. The Vicia transcriptome sequences were then mined for simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers. About 13% and 3% of the Vicia unigenes contained the putative SSR and SNP sequences, respectively. Among those SSRs, 100 were chosen for the validation and the polymorphism test using the Vicia germplasm set. Thus, our approach takes advantage of the utility of transcriptomic data to expedite a vetch breeding program.

  2. Transcriptome Analysis of Two Vicia sativa Subspecies: Mining Molecular Markers to Enhance Genomic Resources for Vetch Improvement

    PubMed Central

    Kim, Tae-Sung; Raveendar, Sebastin; Suresh, Sundan; Lee, Gi-An; Lee, Jung-Ro; Cho, Joon-Hyeong; Lee, Sok-Young; Ma, Kyung-Ho; Cho, Gyu-Taek; Chung, Jong-Wook

    2015-01-01

    The vetch (Vicia sativa) is one of the most important annual forage legumes globally due to its multiple uses and high nutritional content. Despite these agronomical benefits, many drawbacks, including cyano-alanine toxin, has reduced the agronomic value of vetch varieties. Here, we used 454 technology to sequence the two V. sativa subspecies (ssp. sativa and ssp. nigra) to enrich functional information and genetic marker resources for the vetch research community. A total of 86,532 and 47,103 reads produced 35,202 and 18,808 unigenes with average lengths of 735 and 601 bp for V. sativa sativa and V. sativa nigra, respectively. Gene Ontology annotations and the cluster of orthologous gene classes were used to annotate the function of the Vicia transcriptomes. The Vicia transcriptome sequences were then mined for simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers. About 13% and 3% of the Vicia unigenes contained the putative SSR and SNP sequences, respectively. Among those SSRs, 100 were chosen for the validation and the polymorphism test using the Vicia germplasm set. Thus, our approach takes advantage of the utility of transcriptomic data to expedite a vetch breeding program. PMID:26540077

  3. Genome-scale transcriptional analyses of first-generation interspecific sunflower hybrids reveals broad regulatory compatibility

    PubMed Central

    2013-01-01

    Background Interspecific hybridization creates individuals harboring diverged genomes. The interaction of these genomes can generate successful evolutionary novelty or disadvantageous genomic conflict. Annual sunflowers Helianthus annuus and H. petiolaris have a rich history of hybridization in natural populations. Although first-generation hybrids generally have low fertility, hybrid swarms that include later generation and fully fertile backcross plants have been identified, as well as at least three independently-originated stable hybrid taxa. We examine patterns of transcript accumulation in the earliest stages of hybridization of these species via analyses of transcriptome sequences from laboratory-derived F1 offspring of an inbred H. annuus cultivar and a wild H. petiolaris accession. Results While nearly 14% of the reference transcriptome showed significant accumulation differences between parental accessions, total F1 transcript levels showed little evidence of dominance, as midparent transcript levels were highly predictive of transcript accumulation in F1 plants. Allelic bias in F1 transcript accumulation was detected in 20% of transcripts containing sufficient polymorphism to distinguish parental alleles; however the magnitude of these biases were generally smaller than differences among parental accessions. Conclusions While analyses of allelic bias suggest that cis regulatory differences between H. annuus and H. petiolaris are common, their effect on transcript levels may be more subtle than trans-acting regulatory differences. Overall, these analyses found little evidence of regulatory incompatibility or dominance interactions between parental genomes within F1 hybrid individuals, although it is unclear whether this is a legacy or an enabler of introgression between species. PMID:23701699

  4. High-throughput generation, optimization and analysis of genome-scale metabolic models.

    SciTech Connect

    Henry, C. S.; DeJongh, M.; Best, A. A.; Frybarger, P. M.; Linsay, B.; Stevens, R. L.

    2010-09-01

    Genome-scale metabolic models have proven to be valuable for predicting organism phenotypes from genotypes. Yet efforts to develop new models are failing to keep pace with genome sequencing. To address this problem, we introduce the Model SEED, a web-based resource for high-throughput generation, optimization and analysis of genome-scale metabolic models. The Model SEED integrates existing methods and introduces techniques to automate nearly every step of this process, taking {approx}48 h to reconstruct a metabolic model from an assembled genome sequence. We apply this resource to generate 130 genome-scale metabolic models representing a taxonomically diverse set of bacteria. Twenty-two of the models were validated against available gene essentiality and Biolog data, with the average model accuracy determined to be 66% before optimization and 87% after optimization.

  5. Adaptation and evolution of deep-sea scale worms (Annelida: Polynoidae): insights from transcriptome comparison with a shallow-water species

    PubMed Central

    Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M.Y.; Qian, Pei-Yuan; Qiu, Jian-Wen

    2017-01-01

    Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.).

  6. Developmental transcriptome of Aplysia californica.

    PubMed

    Heyland, Andreas; Vue, Zer; Voolstra, Christian R; Medina, Mónica; Moroz, Leonid L

    2011-03-15

    Genome-wide transcriptional changes in development provide important insight into mechanisms underlying growth, differentiation, and patterning. However, such large-scale developmental studies have been limited to a few representatives of Ecdysozoans and Chordates. Here, we characterize transcriptomes of embryonic, larval, and metamorphic development in the marine mollusc Aplysia californica and reveal novel molecular components associated with life history transitions. Specifically, we identify more than 20 signal peptides, putative hormones, and transcription factors in association with early development and metamorphic stages-many of which seem to be evolutionarily conserved elements of signal transduction pathways. We also characterize genes related to biomineralization-a critical process of molluscan development. In summary, our experiment provides the first large-scale survey of gene expression in mollusc development, and complements previous studies on the regulatory mechanisms underlying body plan patterning and the formation of larval and juvenile structures. This study serves as a resource for further functional annotation of transcripts and genes in Aplysia, specifically and molluscs in general. A comparison of the Aplysia developmental transcriptome with similar studies in the zebra fish Danio rerio, the fruit fly Drosophila melanogaster, the nematode Caenorhabditis elegans, and other studies on molluscs suggests an overall highly divergent pattern of gene regulatory mechanisms that are likely a consequence of the different developmental modes of these organisms.

  7. Developmental Transcriptome of Aplysia californica

    PubMed Central

    HEYLAND, ANDREAS; VUE, ZER; VOOLSTRA, CHRISTIAN R.; MEDINA, MÓNICA; MOROZ, LEONID L.

    2014-01-01

    Genome-wide transcriptional changes in development provide important insight into mechanisms underlying growth, differentiation, and patterning. However, such large-scale developmental studies have been limited to a few representatives of Ecdysozoans and Chordates. Here, we characterize transcriptomes of embryonic, larval, and metamorphic development in the marine mollusc Aplysia californica and reveal novel molecular components associated with life history transitions. Specifically, we identify more than 20 signal peptides, putative hormones, and transcription factors in association with early development and metamorphic stages—many of which seem to be evolutionarily conserved elements of signal transduction pathways. We also characterize genes related to biomineralization—a critical process of molluscan development. In summary, our experiment provides the first large-scale survey of gene expression in mollusc development, and complements previous studies on the regulatory mechanisms underlying body plan patterning and the formation of larval and juvenile structures. This study serves as a resource for further functional annotation of transcripts and genes in Aplysia, specifically and molluscs in general. A comparison of the Aplysia developmental transcriptome with similar studies in the zebra fish Danio rerio, the fruit fly Drosophila melanogaster, the nematode Caenorhabditis elegans, and other studies on molluscs suggests an overall highly divergent pattern of gene regulatory mechanisms that are likely a consequence of the different developmental modes of these organisms. PMID:21328528

  8. Decoding Synteny Blocks and Large-Scale Duplications in Mammalian and Plant Genomes

    NASA Astrophysics Data System (ADS)

    Peng, Qian; Alekseyev, Max A.; Tesler, Glenn; Pevzner, Pavel A.

    The existing synteny block reconstruction algorithms use anchors (e.g., orthologous genes) shared over all genomes to construct the synteny blocks for multiple genomes. This approach, while efficient for a few genomes, cannot be scaled to address the need to construct synteny blocks in many mammalian genomes that are currently being sequenced. The problem is that the number of anchors shared among all genomes quickly decreases with the increase in the number of genomes. Another problem is that many genomes (plant genomes in particular) had extensive duplications, which makes decoding of genomic architecture and rearrangement analysis in plants difficult. The existing synteny block generation algorithms in plants do not address the issue of generating non-overlapping synteny blocks suitable for analyzing rearrangements and evolution history of duplications. We present a new algorithm based on the A-Bruijn graph framework that overcomes these difficulties and provides a unified approach to synteny block reconstruction for multiple genomes, and for genomes with large duplications.

  9. The Bluejay genome browser.

    PubMed

    Soh, Jung; Gordon, Paul M K; Sensen, Christoph W

    2012-03-01

    The Bluejay genome browser is a stand-alone visualization tool for the multi-scale viewing of annotated genomes and other genomic elements. Bluejay allows users to customize display features to suit their needs, and produces publication-quality graphics. Bluejay provides a multitude of ways to interrelate biological data at the genome scale. Users can load gene expression data into a genome display for expression visualization in context. Multiple genomes can be compared concurrently, including time series expression data, based on Gene Ontology labels. External, context-sensitive biological Web Services are linked to the displayed genomic elements ad hoc for in-depth genomic data analysis and interpretation. Users can mark multiple points of interest in a genome by creating waypoints, and exploit them for easy navigation of single or multiple genomes. Using this comprehensive visual environment, users can study a gene not just in relation to its genome, but also its transcriptome and evolutionary origins. Written in Java, Bluejay is platform-independent and is freely available from http://bluejay.ucalgary.ca.

  10. Bioinformatic Analyses of Unique (Orphan) Core Genes of the Genus Acidithiobacillus: Functional Inferences and Use As Molecular Probes for Genomic and Metagenomic/Transcriptomic Interrogation

    PubMed Central

    González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S.

    2016-01-01

    Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia. These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e−5. None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus, making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD). PMID:28082953

  11. Mining whole genomes and transcriptomes of Jatropha (Jatropha curcas) and Castor bean (Ricinus communis) for NBS-LRR genes and defense response associated transcription factors.

    PubMed

    Sood, Archit; Jaiswal, Varun; Chanumolu, Sree Krishna; Malhotra, Nikhil; Pal, Tarun; Chauhan, Rajinder Singh

    2014-11-01

    Jatropha (Jatropha curcas L.) and Castor bean (Ricinus communis) are oilseed crops of family Euphorbiaceae with the potential of producing high quality biodiesel and having industrial value. Both the bioenergy plants are becoming susceptible to various biotic stresses directly affecting the oil quality and content. No report exists as of today on analysis of Nucleotide Binding Site-Leucine Rich Repeat (NBS-LRR) gene repertoire and defense response transcription factors in both the plant species. In silico analysis of whole genomes and transcriptomes identified 47 new NBS-LRR genes in both the species and 122 and 318 defense response related transcription factors in Jatropha and Castor bean, respectively. The identified NBS-LRR genes and defense response transcription factors were mapped onto the respective genomes. Common and unique NBS-LRR genes and defense related transcription factors were identified in both the plant species. All NBS-LRR genes in both the species were characterized into Toll/interleukin-1 receptor NBS-LRRs (TNLs) and coiled-coil NBS-LRRs (CNLs), position on contigs, gene clusters and motifs and domains distribution. Transcript abundance or expression values were measured for all NBS-LRR genes and defense response transcription factors, suggesting their functional role. The current study provides a repertoire of NBS-LRR genes and transcription factors which can be used in not only dissecting the molecular basis of disease resistance phenotype but also in developing disease resistant genotypes in Jatropha and Castor bean through transgenic or molecular breeding approaches.

  12. Transcriptome analysis of tetraploid cells identifies cyclin D2 as a facilitator of adaptation to genome doubling in the presence of p53

    PubMed Central

    Potapova, Tamara A.; Seidel, Christopher W.; Box, Andrew C.; Rancati, Giulia; Li, Rong

    2016-01-01

    Tetraploidization, or genome doubling, is a prominent event in tumorigenesis, primarily because cell division in polyploid cells is error-prone and produces aneuploid cells. This study investigates changes in gene expression evoked in acute and adapted tetraploid cells and their effect on cell-cycle progression. Acute polyploidy was generated by knockdown of the essential regulator of cytokinesis anillin, which resulted in cytokinesis failure and formation of binucleate cells, or by chemical inhibition of Aurora kinases, causing abnormal mitotic exit with formation of single cells with aberrant nuclear morphology. Transcriptome analysis of these acute tetraploid cells revealed common signatures of activation of the tumor-suppressor protein p53. Suppression of proliferation in these cells was dependent on p53 and its transcriptional target, CDK inhibitor p21. Rare proliferating tetraploid cells can emerge from acute polyploid populations. Gene expression analysis of single cell–derived, adapted tetraploid clones showed up-regulation of several p53 target genes and cyclin D2, the activator of CDK4/6/2. Overexpression of cyclin D2 in diploid cells strongly potentiated the ability to proliferate with increased DNA content despite the presence of functional p53. These results indicate that p53-mediated suppression of proliferation of polyploid cells can be averted by increased levels of oncogenes such as cyclin D2, elucidating a possible route for tetraploidy-mediated genomic instability in carcinogenesis. PMID:27559130

  13. Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus)

    PubMed Central

    Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

    2016-01-01

    The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269

  14. Genome-wide landscape of DNA methylomes and their relationship with mRNA and miRNA transcriptomes in oxidative and glycolytic skeletal muscles

    PubMed Central

    Shen, Linyuan; Du, Jingjing; Xia, Yudong; Tan, Zhendong; Fu, Yuhua; Yang, Qiong; Li, Xuewei; Tang, Guoqing; Jiang, Yanzhi; Wang, Jinyong; Li, Mingzhou; Zhang, Shunhua; Zhu, Li

    2016-01-01

    The physiological, biochemical and functional differences between oxidative and glycolytic muscles play important roles in human metabolic health and in animal meat quality. To explore these differences, we determined the genome-wide landscape of DNA methylomes and their relationship with the mRNA and miRNA transcriptomes of the oxidative muscle psoas major (PMM) and the glycolytic muscle longissimus dorsi (LDM). We observed the hypo-methylation of sub-telomeric regions. A high mitochondrial content contributed to fast replicative senescence in PMM. The differentially methylated regions (DMRs) in promoters (478) and gene bodies (5,718) were mainly enriched in GTPase regulator activity and signaling cascade-mediated pathways. Integration analysis revealed that the methylation status within gene promoters (or gene bodies) and miRNA promoters was negatively correlated with mRNA and miRNA expression, respectively. Numerous genes were closely related to distinct phenotypic traits between LDM and PMM. For example, the hyper-methylation and down-regulation of HK-2 and PFKFB4 were related to decrease glycolytic potential in PMM. In addition, promoter hypo-methylation and the up-regulation of miR-378 silenced the expression of the target genes and promoted capillary biosynthesis in PMM. Together, these results improve understanding of muscle metabolism and development from genomic and epigenetic perspectives. PMID:27561200

  15. Genome-Wide Transcriptome Profiling of Mycobacterium smegmatis MC² 155 Cultivated in Minimal Media Supplemented with Cholesterol, Androstenedione or Glycerol.

    PubMed

    Li, Qun; Ge, Fanglan; Tan, Yunya; Zhang, Guangxiang; Li, Wei

    2016-05-07

    Mycobacterium smegmatis strain MC² 155 is an attractive model organism for the study of M. tuberculosis and other mycobacterial pathogens, as it can grow well using cholesterol as a carbon resource. However, its global transcriptomic response remains largely unrevealed. In this study, M. smegmatis MC² 155 cultivated in androstenedione, cholesterol and glycerol supplemented media were collected separately for a RNA-Sequencing study. The results showed that 6004, 6681 and 6348 genes were expressed in androstenedione, cholesterol and glycerol supplemented media, and 5891 genes were expressed in all three conditions, with 237 specially expressed in cholesterol added medium. A total of 1852 and 454 genes were significantly up-regulated by cholesterol compared with the other two supplements. Only occasional changes were observed in basic carbon and nitrogen metabolism, while almost all of the genes involved in cholesterol catabolism and mammalian cell entry (MCE) were up-regulated by cholesterol, but not by androstenedione. Eleven and 16 gene clusters were induced by cholesterol when compared with glycerol or androstenedione, respectively. This study provides a comprehensive analysis of the cholesterol responsive transcriptome of M. smegmatis. Our results indicated that cholesterol induced many more genes and increased the expression of the majority of genes involved in cholesterol degradation and MCE in M. smegmatis, while androstenedione did not have the same effect.

  16. Resolution of deep eudicot phylogeny and their temporal diversification using nuclear genes from transcriptomic and genomic datasets.

    PubMed

    Zeng, Liping; Zhang, Ning; Zhang, Qiang; Endress, Peter K; Huang, Jie; Ma, Hong

    2017-03-14

    Explosive diversification is widespread in eukaryotes, making it difficult to resolve phylogenetic relationships. Eudicots contain c. 75% of extant flowering plants, are important for human livelihood and terrestrial ecosystems, and have probably experienced explosive diversifications. The eudicot phylogenetic relationships, especially among those of the Pentapetalae, remain unresolved. Here, we present a highly supported eudicot phylogeny and diversification rate shifts using 31 newly generated transcriptomes and 88 other datasets covering 70% of eudicot orders. A highly supported eudicot phylogeny divided Pentapetalae into two groups: one with rosids, Saxifragales, Vitales and Santalales; the other containing asterids, Caryophyllales and Dilleniaceae, with uncertainty for Berberidopsidales. Molecular clock analysis estimated that crown eudicots originated c. 146 Ma, considerably earlier than earliest tricolpate pollen fossils and most other molecular clock estimates, and Pentapetalae sequentially diverged into eight major lineages within c. 15 Myr. Two identified increases of diversification rate are located in the stems leading to Pentapetalae and asterids, and lagged behind the gamma hexaploidization. The nuclear genes from newly generated transcriptomes revealed a well-resolved eudicot phylogeny, sequential separation of major core eudicot lineages and temporal mode of diversifications, providing new insights into the evolutionary trend of morphologies and contributions to the diversification of eudicots.

  17. Large-scale genomic analysis of ovarian carcinomas.

    PubMed

    Gorringe, Kylie L; Campbell, Ian G

    2009-04-01

    Epithelial ovarian cancers are typified by frequent genomic aberrations that have been difficult to unravel. Recently, high-resolution array technologies have provided the first glimpse of the remarkable complexity of these aberrations with some ovarian cancers containing hundreds of copy number breakpoints, micro-deletions and amplifications. Many of these alterations contain cancer-related genes suggesting that the majority is disease-associated and not just the product of random genomic instability. Future developments such as next-generation sequencing and integrated analysis of data from multiple array platforms on large numbers of samples are poised to revolutionize our understanding of this complex disease.

  18. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    SciTech Connect

    Tartakovsky, Guzel D.; Tartakovsky, Alexandre M.; Scheibe, Timothy D.; Fang, Yilin; Mahadevan, Radhakrishnan; Lovley, Derek R.

    2013-09-07

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparisonto prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model under

  19. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    NASA Astrophysics Data System (ADS)

    Scheibe, T. D.; Tartakovsky, G.; Tartakovsky, A. M.; Fang, Y.; Mahadevan, R.; Lovley, D. R.

    2012-12-01

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparison to prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model

  20. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    NASA Astrophysics Data System (ADS)

    Tartakovsky, G. D.; Tartakovsky, A. M.; Scheibe, T. D.; Fang, Y.; Mahadevan, R.; Lovley, D. R.

    2013-09-01

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparison to prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model

  1. RGS2 expression predicts amyloid-β sensitivity, MCI and Alzheimer's disease: genome-wide transcriptomic profiling and bioinformatics data mining

    PubMed Central

    Hadar, A; Milanesi, E; Squassina, A; Niola, P; Chillotti, C; Pasmanik-Chor, M; Yaron, O; Martásek, P; Rehavi, M; Weissglas-Volkov, D; Shomron, N; Gozes, I; Gurwitz, D

    2016-01-01

    Alzheimer's disease (AD) is the most frequent cause of dementia. Misfolded protein pathological hallmarks of AD are brain deposits of amyloid-β (Aβ) plaques and phosphorylated tau neurofibrillary tangles. However, doubts about the role of Aβ in AD pathology have been raised as Aβ is a common component of extracellular brain deposits found, also by in vivo imaging, in non-demented aged individuals. It has been suggested that some individuals are more prone to Aβ neurotoxicity and hence more likely to develop AD when aging brains start accumulating Aβ plaques. Here, we applied genome-wide transcriptomic profiling of lymphoblastoid cells lines (LCLs) from healthy individuals and AD patients for identifying genes that predict sensitivity to Aβ. Real-time PCR validation identified 3.78-fold lower expression of RGS2 (regulator of G-protein signaling 2; P=0.0085) in LCLs from healthy individuals exhibiting high vs low Aβ sensitivity. Furthermore, RGS2 showed 3.3-fold lower expression (P=0.0008) in AD LCLs compared with controls. Notably, RGS2 expression in AD LCLs correlated with the patients' cognitive function. Lower RGS2 expression levels were also discovered in published expression data sets from postmortem AD brain tissues as well as in mild cognitive impairment and AD blood samples compared with controls. In conclusion, Aβ sensitivity phenotyping followed by transcriptomic profiling and published patient data mining identified reduced peripheral and brain expression levels of RGS2, a key regulator of G-protein-coupled receptor signaling and neuronal plasticity. RGS2 is suggested as a novel AD biomarker (alongside other genes) toward early AD detection and future disease modifying therapeutics. PMID:27701409

  2. Genome-wide transcriptomic and proteomic analyses of bollworm-infested developing cotton bolls revealed the genes and pathways involved in the insect pest defence mechanism.

    PubMed

    Kumar, Saravanan; Kanakachari, Mogilicherla; Gurusamy, Dhandapani; Kumar, Krishan; Narayanasamy, Prabhakaran; Kethireddy Venkata, Padmalatha; Solanke, Amolkumar; Gamanagatti, Savita; Hiremath, Vamadevaiah; Katageri, Ishwarappa S; Leelavathi, Sadhu; Kumar, Polumetla Ananda; Reddy, Vanga Siva

    2016-06-01

    Cotton bollworm, Helicoverpa armigera, is a major insect pest that feeds on cotton bolls causing extensive damage leading to crop and productivity loss. In spite of such a major impact, cotton plant response to bollworm infection is yet to be witnessed. In this context, we have studied the genome-wide response of cotton bolls infested with bollworm using transcriptomic and proteomic approaches. Further, we have validated this data using semi-quantitative real-time PCR. Comparative analyses have revealed that 39% of the transcriptome and 35% of the proteome were differentially regulated during bollworm infestation. Around 36% of significantly regulated transcripts and 45% of differentially expressed proteins were found to be involved in signalling followed by redox regulation. Further analysis showed that defence-related stress hormones and their lipid precursors, transcription factors, signalling molecules, etc. were stimulated, whereas the growth-related counterparts were suppressed during bollworm infestation. Around 26% of the significantly up-regulated proteins were defence molecules, while >50% of the significantly down-regulated were related to photosynthesis and growth. Interestingly, the biosynthesis genes for synergistically regulated jasmonate, ethylene and suppressors of the antagonistic factor salicylate were found to be up-regulated, suggesting a choice among stress-responsive phytohormone regulation. Manual curation of the enzymes and TFs highlighted the components of retrograde signalling pathways. Our data suggest that a selective regulatory mechanism directs the reallocation of metabolic resources favouring defence over growth under bollworm infestation and these insights could be exploited to develop bollworm-resistant cotton varieties.

  3. Accomplishments in genome-scale in silico modeling for industrial and medical biotechnology.

    PubMed

    Milne, Caroline B; Kim, Pan-Jun; Eddy, James A; Price, Nathan D

    2009-12-01

    Driven by advancements in high-throughput biological technologies and the growing number of sequenced genomes, the construction of in silico models at the genome scale has provided powerful tools to investigate a vast array of biological systems and applications. Here, we review comprehensively the uses of such models in industrial and medical biotechnology, including biofuel generation, food production, and drug development. While the use of in silico models is still in its early stages for delivering to industry, significant initial successes have been achieved. For the cases presented here, genome-scale models predict engineering strategies to enhance properties of interest in an organism or to inhibit harmful mechanisms of pathogens. Going forward, genome-scale in silico models promise to extend their application and analysis scope to become a trans-formative tool in biotechnology.

  4. Genome Assembly of the Fungus Cochliobolus miyabeanus, and Transcriptome Analysis during Early Stages of Infection on American Wildrice (Zizania palustris L.)

    PubMed Central

    Castell-Miller, Claudia V.; Gutierrez-Gonzalez, Juan J.; Tu, Zheng Jin; Bushley, Kathryn E.; Hainaut, Matthieu; Henrissat, Bernard; Samac, Deborah A.

    2016-01-01

    The fungus Cochliobolus miyabeanus causes severe leaf spot disease on rice (Oryza sativa) and two North American specialty crops, American wildrice (Zizania palustris) and switchgrass (Panicum virgatum). Despite the importance of C. miyabeanus as a disease-causing agent in wildrice, little is known about either the mechanisms of pathogenicity or host defense responses. To start bridging these gaps, the genome of C. miyabeanus strain TG12bL2 was shotgun sequenced using Illumina technology. The genom