Sample records for pyrosequencing based transcriptome

  1. Transcriptome assembly and digital gene expression atlas of the rainbow trout

    USDA-ARS?s Scientific Manuscript database

    Background: Transcriptome analysis is a preferred method for gene discovery, marker development and gene expression profiling in non-model organisms. Previously, we sequenced a transcriptome reference using Sanger-based and 454-pyrosequencing, however, a transcriptome assembly is still incomplete an...

  2. Pyrosequencing the Bemisia tabaci Transcriptome Reveals a Highly Diverse Bacterial Community and a Robust System for Insecticide Resistance

    PubMed Central

    Wu, Qing-jun; Wang, Shao-li; Yang, Xin; Yang, Ni-na; Li, Ru-mei; Jiao, Xiao-guo; Pan, Hui-peng; Liu, Bai-ming; Su, Qi; Xu, Bao-yun; Hu, Song-nian; Zhou, Xu-guo; Zhang, You-jun

    2012-01-01

    Background Bemisia tabaci (Gennadius) is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. Methodology and Principal Findings Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the nohit. A total of 50,835 (28.45%) unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10–5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis indentified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. Conclusions This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the B. tabaci complex. Moreover, current pyrosequencing effort greatly enriched the existing whitefly EST database, and makes RNAseq a viable option for future genomic analysis. PMID:22558125

  3. 454 pyrosequencing based transcriptome analysis of Zygaena filipendulae with focus on genes involved in biosynthesis of cyanogenic glucosides.

    PubMed

    Zagrobelny, Mika; Scheibye-Alsing, Karsten; Jensen, Niels Bjerg; Møller, Birger Lindberg; Gorodkin, Jan; Bak, Søren

    2009-12-02

    An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up for new niches with fewer competitors. To study the molecular mechanism behind the co-adaption in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known, able to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases) were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates support the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being convergent between plants and insects. Pyrosequencing is an attractive approach to gain access to genes in the biosynthesis of bio-active natural products from insects and other organisms, for which the genome sequence is not known. Based on analysis of the Z. filipendulae transcriptome, promising gene candidates for biosynthesis of cyanogenic glucosides was identified, and the suitability of Z. filipendulae as a model system for cyanogenesis in insects is evident.

  4. Metatranscriptomics and Pyrosequencing Facilitate Discovery of Potential Viral Natural Enemies of the Invasive Caribbean Crazy Ant, Nylanderia pubens

    PubMed Central

    Valles, Steven M.; Oi, David H.; Yu, Fahong; Tan, Xin-Xing; Buss, Eileen A.

    2012-01-01

    Background Nylanderia pubens (Forel) is an invasive ant species that in recent years has developed into a serious nuisance problem in the Caribbean and United States. A rapidly expanding range, explosive localized population growth, and control difficulties have elevated this ant to pest status. Professional entomologists and the pest control industry in the United States are urgently trying to understand its biology and develop effective control methods. Currently, no known biological-based control agents are available for use in controlling N. pubens. Methodology and Principal Findings Metagenomics and pyrosequencing techniques were employed to examine the transcriptome of field-collected N. pubens colonies in an effort to identify virus infections with potential to serve as control agents against this pest ant. Pyrosequencing (454-platform) of a non-normalized N. pubens expression library generated 1,306,177 raw sequence reads comprising 450 Mbp. Assembly resulted in generation of 59,017 non-redundant sequences, including 27,348 contigs and 31,669 singlets. BLAST analysis of these non-redundant sequences identified 51 of potential viral origin. Additional analyses winnowed this list of potential viruses to three that appear to replicate in N. pubens. Conclusions Pyrosequencing the transcriptome of field-collected samples of N. pubens has identified at least three sequences that are likely of viral origin and, in which, N. pubens serves as host. In addition, the N. pubens transcriptome provides a genetic resource for the scientific community which is especially important at this early stage of developing a knowledgebase for this new pest. PMID:22384082

  5. De Novo Transcriptome of the Hemimetabolous German Cockroach (Blattella germanica)

    PubMed Central

    Zhou, Xiaojie; Qian, Kun; Tong, Ying; Zhu, Junwei Jerry; Qiu, Xinghui; Zeng, Xiaopeng

    2014-01-01

    Background The German cockroach, Blattella germanica, is an important insect pest that transmits various pathogens mechanically and causes severe allergic diseases. This insect has long served as a model system for studies of insect biology, physiology and ecology. However, the lack of genome or transcriptome information heavily hinder our further understanding about the German cockroach in every aspect at a molecular level and on a genome-wide scale. To explore the transcriptome and identify unique sequences of interest, we subjected the B. germanica transcriptome to massively parallel pyrosequencing and generated the first reference transcriptome for B. germanica. Methodology/Principal Findings A total of 1,365,609 raw reads with an average length of 529 bp were generated via pyrosequencing the mixed cDNA library from different life stages of German cockroach including maturing oothecae, nymphs, adult females and males. The raw reads were de novo assembled to 48,800 contigs and 3,961 singletons with high-quality unique sequences. These sequences were annotated and classified functionally in terms of BLAST, GO and KEGG, and the genes putatively coding detoxification enzyme systems, insecticide targets, key components in systematic RNA interference, immunity and chemoreception pathways were identified. A total of 3,601 SSRs (Simple Sequence Repeats) loci were also predicted. Conclusions/Significance The whole transcriptome pyrosequencing data from this study provides a usable genetic resource for future identification of potential functional genes involved in various biological processes. PMID:25265537

  6. Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout

    USDA-ARS?s Scientific Manuscript database

    Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complimented by transcriptome information that will enhance genome assembly and annotation. Previously, we reported a transcriptome reference sequence using a 19X coverage of Sanger and 454-pyrosequencing dat...

  7. New in-depth rainbow trout transcriptome reference and digital atlas of gene expression

    USDA-ARS?s Scientific Manuscript database

    Sequencing the rainbow trout genome is underway and a transcriptome reference sequence is required to help in genome assembly and gene discovery. Previously, we reported a transcriptome reference sequence using a 19X coverage of 454-pyrosequencing data. Although this work added a great wealth of ann...

  8. Pyrosequencing the Manduca sexta larval midgut transcriptome: messages for digestion, detoxification and defence.

    PubMed

    Pauchet, Y; Wilkinson, P; Vogel, H; Nelson, D R; Reynolds, S E; Heckel, D G; ffrench-Constant, R H

    2010-02-01

    The tobacco hornworm Manduca sexta is an important model for insect physiology but genomic and transcriptomic data are currently lacking. Following a recent pyrosequencing study generating immune related expressed sequence tags (ESTs), here we use this new technology to define the M. sexta larval midgut transcriptome. We generated over 387,000 midgut ESTs, using a combination of Sanger and 454 sequencing, and classified predicted proteins into those involved in digestion, detoxification and immunity. In many cases the depth of 454 pyrosequencing coverage allowed us to define the entire cDNA sequence of a particular gene. Many new M. sexta genes are described including up to 36 new cytochrome P450s, some of which have been implicated in the metabolism of host plant-derived nicotine. New lepidopteran gene families such as the beta-fructofuranosidases, previously thought to be restricted to Bombyx mori, are also described. An unexpectedly high number of ESTs were involved in immunity, for example 39 contigs encoding serpins, and the increasingly appreciated role of the midgut in insect immunity is discussed. Similar studies of other tissues will allow for a tissue by tissue description of the M. sexta transcriptome and will form an essential complimentary step on the road to genome sequencing and annotation.

  9. Digital Marine Bioprospecting: Mining New Neurotoxin Drug Candidates from the Transcriptomes of Cold-Water Sea Anemones

    PubMed Central

    Urbarova, Ilona; Karlsen, Bård Ove; Okkenhaug, Siri; Seternes, Ole Morten; Johansen, Steinar D.; Emblem, Åse

    2012-01-01

    Marine bioprospecting is the search for new marine bioactive compounds and large-scale screening in extracts represents the traditional approach. Here, we report an alternative complementary protocol, called digital marine bioprospecting, based on deep sequencing of transcriptomes. We sequenced the transcriptomes from the adult polyp stage of two cold-water sea anemones, Bolocera tuediae and Hormathia digitata. We generated approximately 1.1 million quality-filtered sequencing reads by 454 pyrosequencing, which were assembled into approximately 120,000 contigs and 220,000 single reads. Based on annotation and gene ontology analysis we profiled the expressed mRNA transcripts according to known biological processes. As a proof-of-concept we identified polypeptide toxins with a potential blocking activity on sodium and potassium voltage-gated channels from digital transcriptome libraries. PMID:23170083

  10. Pyrosequencing the Midgut Transcriptome of the Banana Weevil Cosmopolites sordidus (Germar) (Coleoptera: Curculionidae) Reveals Multiple Protease-Like Transcripts.

    PubMed

    Valencia, Arnubio; Wang, Haichuan; Soto, Alberto; Aristizabal, Manuel; Arboleda, Jorge W; Eyun, Seong-Il; Noriega, Daniel D; Siegfried, Blair

    2016-01-01

    The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.

  11. Pyrosequencing the Midgut Transcriptome of the Banana Weevil Cosmopolites sordidus (Germar) (Coleoptera: Curculionidae) Reveals Multiple Protease-Like Transcripts

    PubMed Central

    Valencia, Arnubio; Wang, Haichuan; Soto, Alberto; Aristizabal, Manuel; Arboleda, Jorge W.; Eyun, Seong-il; Noriega, Daniel D.; Siegfried, Blair

    2016-01-01

    The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest. PMID:26949943

  12. Transcriptome Exploration in Leymus chinensis under Saline-Alkaline Treatment Using 454 Pyrosequencing

    PubMed Central

    Sun, Yepeng; Wang, Fawei; Wang, Nan; Dong, Yuanyuan; Liu, Qi; Zhao, Lei; Chen, Huan; Liu, Weican; Yin, Hailong; Zhang, Xiaomei; Yuan, Yanxi; Li, Haiyan

    2013-01-01

    Background Leymus chinensis (Trin.) Tzvel. is a high saline-alkaline tolerant forage grass genus of the tribe Gramineae family, which also plays an important role in protection of natural environment. To date, little is known about the saline-alkaline tolerance of L. chinensis on the molecular level. To better understand the molecular mechanism of saline-alkaline tolerance in L. chinensis, 454 pyrosequencing was used for the transcriptome study. Results We used Roche-454 massive parallel pyrosequencing technology to sequence two different cDNA libraries that were built from the two samples of control and under saline-alkaline treatment (optimal stress concentration-Hoagland solution with 100 mM NaCl and 200 mM NaHCO3). A total of 363,734 reads in control group and 526,267 reads in treatment group with an average length of 489 bp and 493 bp were obtained, respectively. The reads were assembled into 104,105 unigenes with MIRA sequence assemable software, among which, 73,665 unigenes were in control group, 88,016 unigenes in treatment group and 57,576 unigenes in both groups. According to the comparative expression analysis between the two groups with the threshold of “log2 Ratio ≥1”, there were 36,497 up-regulated unegenes and 18,218 down-regulated unigenes predicted to be the differentially expressed genes. After gene annotation and pathway enrichment analysis, most of them were involved in stress and tolerant function, signal transduction, energy production and conversion, and inorganic ion transport. Furthermore, 16 of these differentially expressed genes were selected for real-time PCR validation, and they were successfully confirmed with the results of 454 pyrosequencing. Conclusions This work is the first time to study the transcriptome of L. chinensis under saline-alkaline treatment based on the 454-FLX massively parallel DNA sequencing platform. It also deepened studies on molecular mechanisms of saline-alkaline in L. chinensis, and constituted a database for future studies. PMID:23365637

  13. Construction of a robust microarray from a non-model species (largemouth bass) using pyrosequencing technology

    PubMed Central

    Garcia-Reyero, Natàlia; Griffitt, Robert J.; Liu, Li; Kroll, Kevin J.; Farmerie, William G.; Barber, David S.; Denslow, Nancy D.

    2009-01-01

    A novel custom microarray for largemouth bass (Micropterus salmoides) was designed with sequences obtained from a normalized cDNA library using the 454 Life Sciences GS-20 pyrosequencer. This approach yielded in excess of 58 million bases of high-quality sequence. The sequence information was combined with 2,616 reads obtained by traditional suppressive subtractive hybridizations to derive a total of 31,391 unique sequences. Annotation and coding sequences were predicted for these transcripts where possible. 16,350 annotated transcripts were selected as target sequences for the design of the custom largemouth bass oligonucleotide microarray. The microarray was validated by examining the transcriptomic response in male largemouth bass exposed to 17β-œstradiol. Transcriptomic responses were assessed in liver and gonad, and indicated gene expression profiles typical of exposure to œstradiol. The results demonstrate the potential to rapidly create the tools necessary to assess large scale transcriptional responses in non-model species, paving the way for expanded impact of toxicogenomics in ecotoxicology. PMID:19936325

  14. Transcriptomic analysis reveals numerous diverse protein kinases and transcription factors involved in desiccation tolerance in the resurrection plant Myrothamnus flabellifolia

    USDA-ARS?s Scientific Manuscript database

    The woody resurrection plant Myrothamnus flabellifolia has remarkable tolerance to desiccation. Pyro-sequencing technology permitted us to analyze the transcriptome of M. flabellifolia during both dehydration and rehydration. We identified a total of 8287 and 8542 differentially transcribed genes du...

  15. Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing

    PubMed Central

    2011-01-01

    Background A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis) of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5.165,220 nucleotides (8.27%) were masked by RepeatMasker, the vast majority of which corresponding to class I (retroelements) and class II (DNA transposons) mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlaying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large divergence between A. mexicanus and A. picadoi, and a closer kinship between A. mexicanus and C. godmani. Conclusions Our comparative next-generation sequencing (NGS) analysis reveals taxon-specific trends governing the formulation of the venom arsenal. Knowledge of the venom proteome provides hints on the translation efficiency of toxin-coding transcripts, contributing thereby to a more accurate interpretation of the transcriptome. The application of NGS to the analysis of snake venom transcriptomes, may represent the tool for opening the door to systems venomics. PMID:21605378

  16. Global characterization of Artemisia annua glandular trichome transcriptome using 454 pyrosequencing

    PubMed Central

    Wang, Wei; Wang, Yejun; Zhang, Qing; Qi, Yan; Guo, Dianjing

    2009-01-01

    Background Glandular trichomes produce a wide variety of commercially important secondary metabolites in many plant species. The most prominent anti-malarial drug artemisinin, a sesquiterpene lactone, is produced in glandular trichomes of Artemisia annua. However, only limited genomic information is currently available in this non-model plant species. Results We present a global characterization of A. annua glandular trichome transcriptome using 454 pyrosequencing. Sequencing runs using two normalized cDNA collections from glandular trichomes yielded 406,044 expressed sequence tags (average length = 210 nucleotides), which assembled into 42,678 contigs and 147,699 singletons. Performing a second sequencing run only increased the number of genes identified by ~30%, indicating that massively parallel pyrosequencing provides deep coverage of the A. annua trichome transcriptome. By BLAST search against the NCBI non-redundant protein database, putative functions were assigned to over 28,573 unigenes, including previously undescribed enzymes likely involved in sesquiterpene biosynthesis. Comparison with ESTs derived from trichome collections of other plant species revealed expressed genes in common functional categories across different plant species. RT-PCR analysis confirmed the expression of selected unigenes and novel transcripts in A. annua glandular trichomes. Conclusion The presence of contigs corresponding to enzymes for terpenoids and flavonoids biosynthesis suggests important metabolic activity in A. annua glandular trichomes. Our comprehensive survey of genes expressed in glandular trichome will facilitate new gene discovery and shed light on the regulatory mechanism of artemisinin metabolism and trichome function in A. annua. PMID:19818120

  17. Comparing de novo assemblers for 454 transcriptome data

    PubMed Central

    2010-01-01

    Background Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcriptome assembly projects use only one program for assembling 454 pyrosequencing reads, but there is no evidence that the programs used to date are optimal. We have carried out a systematic comparison of five assemblers (CAP3, MIRA, Newbler, SeqMan and CLC) to establish best practices for transcriptome assemblies, using a new dataset from the parasitic nematode Litomosoides sigmodontis. Results Although no single assembler performed best on all our criteria, Newbler 2.5 gave longer contigs, better alignments to some reference sequences, and was fast and easy to use. SeqMan assemblies performed best on the criterion of recapitulating known transcripts, and had more novel sequence than the other assemblers, but generated an excess of small, redundant contigs. The remaining assemblers all performed almost as well, with the exception of Newbler 2.3 (the version currently used by most assembly projects), which generated assemblies that had significantly lower total length. As different assemblers use different underlying algorithms to generate contigs, we also explored merging of assemblies and found that the merged datasets not only aligned better to reference sequences than individual assemblies, but were also more consistent in the number and size of contigs. Conclusions Transcriptome assemblies are smaller than genome assemblies and thus should be more computationally tractable, but are often harder because individual contigs can have highly variable read coverage. Comparing single assemblers, Newbler 2.5 performed best on our trial data set, but other assemblers were closely comparable. Combining differently optimal assemblies from different programs however gave a more credible final product, and this strategy is recommended. PMID:20950480

  18. High-Throughput Sequence Analysis of Turbot (Scophthalmus maximus) Transcriptome Using 454-Pyrosequencing for the Discovery of Antiviral Immune Genes

    PubMed Central

    Pereiro, Patricia; Balseiro, Pablo; Romero, Alejandro; Dios, Sonia; Forn-Cuni, Gabriel; Fuste, Berta; Planas, Josep V.; Beltran, Sergi; Novoa, Beatriz; Figueras, Antonio

    2012-01-01

    Background Turbot (Scophthalmus maximus L.) is an important aquacultural resource both in Europe and Asia. However, there is little information on gene sequences available in public databases. Currently, one of the main problems affecting the culture of this flatfish is mortality due to several pathogens, especially viral diseases which are not treatable. In order to identify new genes involved in immune defense, we conducted 454-pyrosequencing of the turbot transcriptome after different immune stimulations. Methodology/Principal Findings Turbot were injected with viral stimuli to increase the expression level of immune-related genes. High-throughput deep sequencing using 454-pyrosequencing technology yielded 915,256 high-quality reads. These sequences were assembled into 55,404 contigs that were subjected to annotation steps. Intriguingly, 55.16% of the deduced protein was not significantly similar to any sequences in the databases used for the annotation and only 0.85% of the BLASTx top-hits matched S. maximus protein sequences. This relatively low level of annotation is possibly due to the limited information for this specie and other flatfish in the database. These results suggest the identification of a large number of new genes in turbot and in fish in general. A more detailed analysis showed the presence of putative members of several innate and specific immune pathways. Conclusions/Significance To our knowledge, this study is the first transcriptome analysis using 454-pyrosequencing for turbot. Previously, there were only 12,471 EST and less of 1,500 nucleotide sequences for S. maximus in NCBI database. Our results provide a rich source of data (55,404 contigs and 181,845 singletons) for discovering and identifying new genes, which will serve as a basis for microarray construction, gene expression characterization and for identification of genetic markers to be used in several applications. Immune stimulation in turbot was very effective, obtaining an enormous variety of sequences belonging to genes involved in the defense mechanisms. PMID:22629298

  19. Characterization of the rainbow trout transcriptome using Sanger and 454-Pyrosequencing approaches

    USDA-ARS?s Scientific Manuscript database

    BACKGROUND: Rainbow trout is an important fish species for aquaculture and a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence to facilitate the development...

  20. Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches

    USDA-ARS?s Scientific Manuscript database

    Background: Rainbow trout is an important fish for aquaculture and recreational fisheries and serves as a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence...

  1. Gene discovery using massively parallel pyrosequencing to develop ESTs for the flesh fly Sarcophaga crassipalpis

    USDA-ARS?s Scientific Manuscript database

    Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptom...

  2. Developmental Gene Discovery in a Hemimetabolous Insect: De Novo Assembly and Annotation of a Transcriptome for the Cricket Gryllus bimaculatus

    PubMed Central

    Zeng, Victor; Ewen-Campen, Ben; Horch, Hadley W.; Roth, Siegfried; Mito, Taro; Extavour, Cassandra G.

    2013-01-01

    Most genomic resources available for insects represent the Holometabola, which are insects that undergo complete metamorphosis like beetles and flies. In contrast, the Hemimetabola (direct developing insects), representing the basal branches of the insect tree, have very few genomic resources. We have therefore created a large and publicly available transcriptome for the hemimetabolous insect Gryllus bimaculatus (cricket), a well-developed laboratory model organism whose potential for functional genetic experiments is currently limited by the absence of genomic resources. cDNA was prepared using mRNA obtained from adult ovaries containing all stages of oogenesis, and from embryo samples on each day of embryogenesis. Using 454 Titanium pyrosequencing, we sequenced over four million raw reads, and assembled them into 21,512 isotigs (predicted transcripts) and 120,805 singletons with an average coverage per base pair of 51.3. We annotated the transcriptome manually for over 400 conserved genes involved in embryonic patterning, gametogenesis, and signaling pathways. BLAST comparison of the transcriptome against the NCBI non-redundant protein database (nr) identified significant similarity to nr sequences for 55.5% of transcriptome sequences, and suggested that the transcriptome may contain 19,874 unique transcripts. For predicted transcripts without significant similarity to known sequences, we assessed their similarity to other orthopteran sequences, and determined that these transcripts contain recognizable protein domains, largely of unknown function. We created a searchable, web-based database to allow public access to all raw, assembled and annotated data. This database is to our knowledge the largest de novo assembled and annotated transcriptome resource available for any hemimetabolous insect. We therefore anticipate that these data will contribute significantly to more effective and higher-throughput deployment of molecular analysis tools in Gryllus. PMID:23671567

  3. Assessment of replicate bias in 454 pyrosequencing and a multi-purpose read-filtering tool.

    PubMed

    Jérôme, Mariette; Noirot, Céline; Klopp, Christophe

    2011-05-26

    Roche 454 pyrosequencing platform is often considered the most versatile of the Next Generation Sequencing technology platforms, permitting the sequencing of large genomes, the analysis of variations or the study of transcriptomes. A recent reported bias leads to the production of multiple reads for a unique DNA fragment in a random manner within a run. This bias has a direct impact on the quality of the measurement of the representation of the fragments using the reads. Other cleaning steps are usually performed on the reads before assembly or alignment. PyroCleaner is a software module intended to clean 454 pyrosequencing reads in order to ease the assembly process. This program is a free software and is distributed under the terms of the GNU General Public License as published by the Free Software Foundation. It implements several filters using criteria such as read duplication, length, complexity, base-pair quality and number of undetermined bases. It also permits to clean flowgram files (.sff) of paired-end sequences generating on one hand validated paired-ends file and the other hand single read file. Read cleaning has always been an important step in sequence analysis. The pyrocleaner python module is a Swiss knife dedicated to 454 reads cleaning. It includes commonly used filters as well as specialised ones such as duplicated read removal and paired-end read verification.

  4. Transcriptome sequencing and annotation of the microalgae Dunaliella tertiolecta: Pathway description and gene discovery for production of next-generation biofuels

    PubMed Central

    2011-01-01

    Background Biodiesel or ethanol derived from lipids or starch produced by microalgae may overcome many of the sustainability challenges previously ascribed to petroleum-based fuels and first generation plant-based biofuels. The paucity of microalgae genome sequences, however, limits gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for the non-model microalgae species, Dunaliella tertiolecta, and identify pathways and genes of importance related to biofuel production. Results Next generation DNA pyrosequencing technology applied to D. tertiolecta transcripts produced 1,363,336 high quality reads with an average length of 400 bases. Following quality and size trimming, ~ 45% of the high quality reads were assembled into 33,307 isotigs with a 31-fold coverage and 376,482 singletons. Assembled sequences and singletons were subjected to BLAST similarity searches and annotated with Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology (KO) identifiers. These analyses identified the majority of lipid and starch biosynthesis and catabolism pathways in D. tertiolecta. Conclusions The construction of metabolic pathways involved in the biosynthesis and catabolism of fatty acids, triacylglycrols, and starch in D. tertiolecta as well as the assembled transcriptome provide a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock. PMID:21401935

  5. Transcriptome Analysis of Barbarea vulgaris Infested with Diamondback Moth (Plutella xylostella) Larvae

    PubMed Central

    Shen, Di; Wang, Haiping; Wu, Qingjun; Lu, Peng; Qiu, Yang; Song, Jiangping; Zhang, Youjun; Li, Xixiang

    2013-01-01

    Background The diamondback moth (DBM, Plutella xylostella) is a crucifer-specific pest that causes significant crop losses worldwide. Barbarea vulgaris (Brassicaceae) can resist DBM and other herbivorous insects by producing feeding-deterrent triterpenoid saponins. Plant breeders have long aimed to transfer this insect resistance to other crops. However, a lack of knowledge on the biosynthetic pathways and regulatory networks of these insecticidal saponins has hindered their practical application. A pyrosequencing-based transcriptome analysis of B. vulgaris during DBM larval feeding was performed to identify genes and gene networks responsible for saponin biosynthesis and its regulation at the genome level. Principal Findings Approximately 1.22, 1.19, 1.16, 1.23, 1.16, 1.20, and 2.39 giga base pairs of clean nucleotides were generated from B. vulgaris transcriptomes sampled 1, 4, 8, 12, 24, and 48 h after onset of P. xylostella feeding and from non-inoculated controls, respectively. De novo assembly using all data of the seven transcriptomes generated 39,531 unigenes. A total of 37,780 (95.57%) unigenes were annotated, 14,399 of which were assigned to one or more gene ontology terms and 19,620 of which were assigned to 126 known pathways. Expression profiles revealed 2,016–4,685 up-regulated and 557–5188 down-regulated transcripts. Secondary metabolic pathways, such as those of terpenoids, glucosinolates, and phenylpropanoids, and its related regulators were elevated. Candidate genes for the triterpene saponin pathway were found in the transcriptome. Orthological analysis of the transcriptome with four other crucifer transcriptomes identified 592 B. vulgaris-specific gene families with a P-value cutoff of 1e−5. Conclusion This study presents the first comprehensive transcriptome analysis of B. vulgaris subjected to a series of DBM feedings. The biosynthetic and regulatory pathways of triterpenoid saponins and other DBM deterrent metabolites in this plant were classified. The results of this study will provide useful data for future investigations on pest-resistance phytochemistry and plant breeding. PMID:23696897

  6. Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids

    PubMed Central

    2011-01-01

    Background Orchids are one of the most diversified angiosperms, but few genomic resources are available for these non-model plants. In addition to the ecological significance, Phalaenopsis has been considered as an economically important floriculture industry worldwide. We aimed to use massively parallel 454 pyrosequencing for a global characterization of the Phalaenopsis transcriptome. Results To maximize sequence diversity, we pooled RNA from 10 samples of different tissues, various developmental stages, and biotic- or abiotic-stressed plants. We obtained 206,960 expressed sequence tags (ESTs) with an average read length of 228 bp. These reads were assembled into 8,233 contigs and 34,630 singletons. The unigenes were searched against the NCBI non-redundant (NR) protein database. Based on sequence similarity with known proteins, these analyses identified 22,234 different genes (E-value cutoff, e-7). Assembled sequences were annotated with Gene Ontology, Gene Family and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Among these annotations, over 780 unigenes encoding putative transcription factors were identified. Conclusion Pyrosequencing was effective in identifying a large set of unigenes from Phalaenopsis. The informative EST dataset we developed constitutes a much-needed resource for discovery of genes involved in various biological processes in Phalaenopsis and other orchid species. These transcribed sequences will narrow the gap between study of model organisms with many genomic resources and species that are important for ecological and evolutionary studies. PMID:21749684

  7. Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.

    PubMed

    Ning, Juan; Wang, Minxiao; Li, Chaolun; Sun, Song

    2013-01-01

    Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs) that provide a resource for gene function studies. Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide foundation for future genetic and genomic studies of this and related species.

  8. Transcriptomics of the Bed Bug (Cimex lectularius)

    PubMed Central

    Rajarapu, Swapna P.; Jones, Susan C.; Mittapalli, Omprakash

    2011-01-01

    Background Bed bugs (Cimex lectularius) are blood-feeding insects poised to become one of the major pests in households throughout the United States. Resistance of C. lectularius to insecticides/pesticides is one factor thought to be involved in its sudden resurgence. Despite its high-impact status, scant knowledge exists at the genomic level for C. lectularius. Hence, we subjected the C. lectularius transcriptome to 454 pyrosequencing in order to identify potential genes involved in pesticide resistance. Methodology and Principal Findings Using 454 pyrosequencing, we obtained a total of 216,419 reads with 79,596,412 bp, which were assembled into 35,646 expressed sequence tags (3902 contigs and 31744 singletons). Nearly 85.9% of the C. lectularius sequences showed similarity to insect sequences, but 44.8% of the deduced proteins of C. lectularius did not show similarity with sequences in the GenBank non-redundant database. KEGG analysis revealed putative members of several detoxification pathways involved in pesticide resistance. Lamprin domains, Protein Kinase domains, Protein Tyrosine Kinase domains and cytochrome P450 domains were among the top Pfam domains predicted for the C. lectularius sequences. An initial assessment of putative defense genes, including a cytochrome P450 and a glutathione-S-transferase (GST), revealed high transcript levels for the cytochrome P450 (CYP9) in pesticide-exposed versus pesticide-susceptible C. lectularius populations. A significant number of single nucleotide polymorphisms (296) and microsatellite loci (370) were predicted in the C. lectularius sequences. Furthermore, 59 putative sequences of Wolbachia were retrieved from the database. Conclusions To our knowledge this is the first study to elucidate the genetic makeup of C. lectularius. This pyrosequencing effort provides clues to the identification of potential detoxification genes involved in pesticide resistance of C. lectularius and lays the foundation for future functional genomics studies. PMID:21283830

  9. Transcriptomics of In Vitro Immune-Stimulated Hemocytes from the Manila Clam Ruditapes philippinarum Using High-Throughput Sequencing

    PubMed Central

    Moreira, Rebeca; Balseiro, Pablo; Planas, Josep V.; Fuste, Berta; Beltran, Sergi; Novoa, Beatriz; Figueras, Antonio

    2012-01-01

    Background The Manila clam (Ruditapes philippinarum) is a worldwide cultured bivalve species with important commercial value. Diseases affecting this species can result in large economic losses. Because knowledge of the molecular mechanisms of the immune response in bivalves, especially clams, is scarce and fragmentary, we sequenced RNA from immune-stimulated R. philippinarum hemocytes by 454-pyrosequencing to identify genes involved in their immune defense against infectious diseases. Methodology and Principal Findings High-throughput deep sequencing of R. philippinarum using 454 pyrosequencing technology yielded 974,976 high-quality reads with an average read length of 250 bp. The reads were assembled into 51,265 contigs and the 44.7% of the translated nucleotide sequences into protein were annotated successfully. The 35 most frequently found contigs included a large number of immune-related genes, and a more detailed analysis showed the presence of putative members of several immune pathways and processes like the apoptosis, the toll like signaling pathway and the complement cascade. We have found sequences from molecules never described in bivalves before, especially in the complement pathway where almost all the components are present. Conclusions This study represents the first transcriptome analysis using 454-pyrosequencing conducted on R. philippinarum focused on its immune system. Our results will provide a rich source of data to discover and identify new genes, which will serve as a basis for microarray construction and the study of gene expression as well as for the identification of genetic markers. The discovery of new immune sequences was very productive and resulted in a large variety of contigs that may play a role in the defense mechanisms of Ruditapes philippinarum. PMID:22536348

  10. Massively parallel pyrosequencing-based transcriptome analyses of small brown planthopper (Laodelphax striatellus), a vector insect transmitting rice stripe virus (RSV)

    PubMed Central

    2010-01-01

    Background The small brown planthopper (Laodelphax striatellus) is an important agricultural pest that not only damages rice plants by sap-sucking, but also acts as a vector that transmits rice stripe virus (RSV), which can cause even more serious yield loss. Despite being a model organism for studying entomology, population biology, plant protection, molecular interactions among plants, viruses and insects, only a few genomic sequences are available for this species. To investigate its transcriptome and determine the differences between viruliferous and naïve L. striatellus, we employed 454-FLX high-throughput pyrosequencing to generate EST databases of this insect. Results We obtained 201,281 and 218,681 high-quality reads from viruliferous and naïve L. striatellus, respectively, with an average read length as 230 bp. These reads were assembled into contigs and two EST databases were generated. When all reads were combined, 16,885 contigs and 24,607 singletons (a total of 41,492 unigenes) were obtained, which represents a transcriptome of the insect. BlastX search against the NCBI-NR database revealed that only 6,873 (16.6%) of these unigenes have significant matches. Comparison of the distribution of GO classification among viruliferous, naïve, and combined EST databases indicated that these libraries are broadly representative of the L. striatellus transcriptomes. Functionally diverse transcripts from RSV, endosymbiotic bacteria Wolbachia and yeast-like symbiotes were identified, which reflects the possible lifestyles of these microbial symbionts that live in the cells of the host insect. Comparative genomic analysis revealed that L. striatellus encodes similar innate immunity regulatory systems as other insects, such as RNA interference, JAK/STAT and partial Imd cascades, which might be involved in defense against viral infection. In addition, we determined the differences in gene expression between vector and naïve samples, which generated a list of candidate genes that are potentially involved in the symbiosis of L. striatellus and RSV. Conclusions To our knowledge, the present study is the first description of a genomic project for L. striatellus. The identification of transcripts from RSV, Wolbachia, yeast-like symbiotes and genes abundantly expressed in viruliferous insect, provided a starting-point for investigating the molecular basis of symbiosis among these organisms. PMID:20462456

  11. Transcriptomic analysis of grain amaranth (Amaranthus hypochondriacus) using 454 pyrosequencing: comparison with A. tuberculatus, expression profiling in stems and in response to biotic and abiotic stress

    PubMed Central

    2011-01-01

    Background Amaranthus hypochondriacus, a grain amaranth, is a C4 plant noted by its ability to tolerate stressful conditions and produce highly nutritious seeds. These possess an optimal amino acid balance and constitute a rich source of health-promoting peptides. Although several recent studies, mostly involving subtractive hybridization strategies, have contributed to increase the relatively low number of grain amaranth expressed sequence tags (ESTs), transcriptomic information of this species remains limited, particularly regarding tissue-specific and biotic stress-related genes. Thus, a large scale transcriptome analysis was performed to generate stem- and (a)biotic stress-responsive gene expression profiles in grain amaranth. Results A total of 2,700,168 raw reads were obtained from six 454 pyrosequencing runs, which were assembled into 21,207 high quality sequences (20,408 isotigs + 799 contigs). The average sequence length was 1,064 bp and 930 bp for isotigs and contigs, respectively. Only 5,113 singletons were recovered after quality control. Contigs/isotigs were further incorporated into 15,667 isogroups. All unique sequences were queried against the nr, TAIR, UniRef100, UniRef50 and Amaranthaceae EST databases for annotation. Functional GO annotation was performed with all contigs/isotigs that produced significant hits with the TAIR database. Only 8,260 sequences were found to be homologous when the transcriptomes of A. tuberculatus and A. hypochondriacus were compared, most of which were associated with basic house-keeping processes. Digital expression analysis identified 1,971 differentially expressed genes in response to at least one of four stress treatments tested. These included several multiple-stress-inducible genes that could represent potential candidates for use in the engineering of stress-resistant plants. The transcriptomic data generated from pigmented stems shared similarity with findings reported in developing stems of Arabidopsis and black cottonwood (Populus trichocarpa). Conclusions This study represents the first large-scale transcriptomic analysis of A. hypochondriacus, considered to be a highly nutritious and stress-tolerant crop. Numerous genes were found to be induced in response to (a)biotic stress, many of which could further the understanding of the mechanisms that contribute to multiple stress-resistance in plants, a trait that has potential biotechnological applications in agriculture. PMID:21752295

  12. Gene discovery using massively parallel pyrosequencing to develop ESTs for the flesh fly Sarcophaga crassipalpis

    PubMed Central

    Hahn, Daniel A; Ragland, Gregory J; Shoemaker, D DeWayne; Denlinger, David L

    2009-01-01

    Background Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptomic data are available for members of this genus. We used massively parallel pyrosequencing on the Roche 454-FLX platform to produce a substantial EST dataset for the flesh fly Sarcophaga crassipalpis. To maximize sequence diversity, we pooled RNA extracted from whole bodies of all life stages and normalized the cDNA pool after reverse transcription. Results We obtained 207,110 ESTs with an average read length of 241 bp. These reads assembled into 20,995 contigs and 31,056 singletons. Using BLAST searches of the NR and NT databases we were able to identify 11,757 unique gene elements (E<0.0001) representing approximately 9,000 independent transcripts. Comparison of the distribution of S. crassipalpis unigenes among GO Biological Process functional groups with that of the Drosophila melanogaster transcriptome suggests that our ESTs are broadly representative of the flesh fly transcriptome. Insertion and deletion errors in 454 sequencing present a serious hurdle to comparative transcriptome analysis. Aided by a new approach to correcting for these errors, we performed a comparative analysis of genetic divergence across GO categories among S. crassipalpis, D. melanogaster, and Anopheles gambiae. The results suggest that non-synonymous substitutions occur at similar rates across categories, although genes related to response to stimuli may evolve slightly faster. In addition, we identified over 500 potential microsatellite loci and more than 12,000 SNPs among our ESTs. Conclusion Our data provides the first large-scale EST-project for flesh flies, a much-needed resource for exploring this model species. In addition, we identified a large number of potential microsatellite and SNP markers that could be used in population and systematic studies of S. crassipalpis and other flesh flies. PMID:19454017

  13. Impact of a novel protein meal on the gastrointestinal microbiota and the host transcriptome of larval zebrafish Danio rerio

    PubMed Central

    Rurangwa, Eugene; Sipkema, Detmer; Kals, Jeroen; ter Veld, Menno; Forlenza, Maria; Bacanu, Gianina M.; Smidt, Hauke; Palstra, Arjan P.

    2015-01-01

    Larval zebrafish was subjected to a methodological exploration of the gastrointestinal microbiota and transcriptome. Assessed was the impact of two dietary inclusion levels of a novel protein meal (NPM) of animal origin (ragworm Nereis virens) on the gastrointestinal tract (GIT). Microbial development was assessed over the first 21 days post egg fertilization (dpf) through 16S rRNA gene-based microbial composition profiling by pyrosequencing. Differentially expressed genes in the GIT were demonstrated at 21 dpf by whole transcriptome sequencing (mRNAseq). Larval zebrafish showed rapid temporal changes in microbial colonization but domination occurred by one to three bacterial species generally belonging to Proteobacteria and Firmicutes. The high iron content of NPM may have led to an increased relative abundance of bacteria that were related to potential pathogens and bacteria with an increased iron metabolism. Functional classification of the 328 differentially expressed genes indicated that the GIT of larvae fed at higher NPM level was more active in transmembrane ion transport and protein synthesis. mRNAseq analysis did not reveal a major activation of genes involved in the immune response or indicating differences in iron uptake and homeostasis in zebrafish fed at the high inclusion level of NPM. PMID:25983694

  14. De novo assembly and characterization of leaf transcriptome for the development of functional molecular markers of the extremophile multipurpose tree species Prosopis alba

    PubMed Central

    2013-01-01

    Background Prosopis alba (Fabaceae) is an important native tree adapted to arid and semiarid regions of north-western Argentina which is of great value as multipurpose species. Despite its importance, the genomic resources currently available for the entire Prosopis genus are still limited. Here we describe the development of a leaf transcriptome and the identification of new molecular markers that could support functional genetic studies in natural and domesticated populations of this genus. Results Next generation DNA pyrosequencing technology applied to P. alba transcripts produced a total of 1,103,231 raw reads with an average length of 421 bp. De novo assembling generated a set of 15,814 isotigs and 71,101 non-assembled sequences (singletons) with an average of 991 bp and 288 bp respectively. A total of 39,000 unique singletons were identified after clustering natural and artificial duplicates from pyrosequencing reads. Regarding the non-redundant sequences or unigenes, 22,095 out of 54,814 were successfully annotated with Gene Ontology terms. Moreover, simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 5,992 and 6,236 markers, respectively, throughout the genome. For the validation of the the predicted SSR markers, a subset of 87 SSRs selected through functional annotation evidence was successfully amplified from six DNA samples of seedlings. From this analysis, 11 of these 87 SSRs were identified as polymorphic. Additionally, another set of 123 nuclear polymorphic SSRs were determined in silico, of which 50% have the probability of being effectively polymorphic. Conclusions This study generated a successful global analysis of the P. alba leaf transcriptome after bioinformatic and wet laboratory validations of RNA-Seq data. The limited set of molecular markers currently available will be significantly increased with the thousands of new markers that were identified in this study. This information will strongly contribute to genomics resources for P. alba functional analysis and genetics. Finally, it will also potentially contribute to the development of population-based genome studies in the genera. PMID:24125525

  15. Analysis of the Olive Fruit Fly Bactrocera oleae Transcriptome and Phylogenetic Classification of the Major Detoxification Gene Families

    PubMed Central

    Rombauts, Stephane; Chrisargiris, Antonis; Van Leeuwen, Thomas; Vontas, John

    2013-01-01

    The olive fruit fly Bactrocera oleae has a unique ability to cope with olive flesh, and is the most destructive pest of olives worldwide. Its control has been largely based on the use of chemical insecticides, however, the selection of insecticide resistance against several insecticides has evolved. The study of detoxification mechanisms, which allow the olive fruit fly to defend against insecticides, and/or phytotoxins possibly present in the mesocarp, has been hampered by the lack of genomic information in this species. In the NCBI database less than 1,000 nucleotide sequences have been deposited, with less than 10 detoxification gene homologues in total. We used 454 pyrosequencing to produce, for the first time, a large transcriptome dataset for B. oleae. A total of 482,790 reads were assembled into 14,204 contigs. More than 60% of those contigs (8,630) were larger than 500 base pairs, and almost half of them matched with genes of the order of the Diptera. Analysis of the Gene Ontology (GO) distribution of unique contigs, suggests that, compared to other insects, the assembly is broadly representative for the B. oleae transcriptome. Furthermore, the transcriptome was found to contain 55 P450, 43 GST-, 15 CCE- and 18 ABC transporter-genes. Several of those detoxification genes, may putatively be involved in the ability of the olive fruit fly to deal with xenobiotics, such as plant phytotoxins and insecticides. In summary, our study has generated new data and genomic resources, which will substantially facilitate molecular studies in B. oleae, including elucidation of detoxification mechanisms of xenobiotic, as well as other important aspects of olive fruit fly biology. PMID:23824998

  16. Comparative day/night metatranscriptomic analysis of microbial communities in the North Pacific subtropical gyre.

    PubMed

    Poretsky, Rachel S; Hewson, Ian; Sun, Shulei; Allen, Andrew E; Zehr, Jonathan P; Moran, Mary Ann

    2009-06-01

    Metatranscriptomic analyses of microbial assemblages (< 5 microm) from surface water at the Hawaiian Ocean Time-Series (HOT) revealed community-wide metabolic activities and day/night patterns of differential gene expression. Pyrosequencing produced 75 558 putative mRNA reads from a day transcriptome and 75 946 from a night transcriptome. Taxonomic binning of annotated mRNAs indicated that Cyanobacteria contributed a greater percentage of the transcripts (54% of annotated sequences) than expected based on abundance (35% of cell counts and 21% 16S rRNA of libraries), and may represent the most actively transcribing cells in this surface ocean community in both the day and night. Major heterotrophic taxa contributing to the community transcriptome included alpha-Proteobacteria (19% of annotated sequences, most of which were SAR11-related) and gamma-Proteobacteria (4%). The composition of transcript pools was consistent with models of prokaryotic gene expression, including operon-based transcription patterns and an abundance of genes predicted to be highly expressed. Metabolic activities that are shared by many microbial taxa (e.g. glycolysis, citric acid cycle, amino acid biosynthesis and transcription and translation machinery) were well represented among the community transcripts. There was an overabundance of transcripts for photosynthesis, C1 metabolism and oxidative phosphorylation in the day compared with night, and evidence that energy acquisition is coordinated with solar radiation levels for both autotrophic and heterotrophic microbes. In contrast, housekeeping activities such as amino acid biosynthesis, membrane synthesis and repair, and vitamin biosynthesis were overrepresented in the night transcriptome. Direct sequencing of these environmental transcripts has provided detailed information on metabolic and biogeochemical responses of a microbial community to solar forcing.

  17. PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics

    PubMed Central

    2012-01-01

    Background The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has a limited genomic resource. Description With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, the transcriptomics data of peanut is rapidly accumulated in both the public databases and private sectors. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors. Conclusion As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop. PMID:22712730

  18. Impact of enrofloxacin on the human intestinal microbiota revealed by comparative molecular analysis.

    PubMed

    Kim, Bong-Soo; Kim, Jong Nam; Yoon, Seok-Hwan; Chun, Jongsik; Cerniglia, Carl E

    2012-06-01

    The indigenous human intestinal microbiota could be disrupted by residues of antibiotics in foods as well as therapeutically administered antibiotics to humans. These disruptions may lead to adverse health outcomes. To observe the possible impact of residues of antibiotics at concentrations below therapeutic levels on human intestinal microbiota, we performed studies using in vitro cultures of fecal suspensions from three individuals with 10 different concentrations (0, 0.1, 0.5, 1, 5, 10, 15, 25, 50 and 150 μg/ml) of the fluoroquinolone, enrofloxacin. The bacterial communities of the control and enrofloxacin dosed fecal samples were analyzed by denaturing gradient gel electrophoresis (DGGE) and pyrosequencing. In addition, changes of functional gene expression were analyzed by a pyrosequencing-based random whole-community mRNA sequencing method. Although each individual had a unique microbial composition, the communities of all individuals were affected by enrofloxacin. The proportions of two phyla, namely, Bacteroidetes and Proteobacteria, were significantly reduced with increasing concentrations of enrofloxacin exposure, while the proportion of Firmicutes increased. Principal Coordinate Analysis (PCoA) using the Fast UniFrac indicated that the community structures of intestinal microbiota were shifted by enrofloxacin. Most of the mRNA transcripts and the anti-microbial drug resistance genes increased with increasing concentrations of enrofloxacin. 16S rRNA gene pyrosequencing of control and enrofloxacin treated fecal suspensions provided valuable information of affected bacterial taxa down to the species level, and the community transcriptomic analyses using mRNA revealed the functional gene expression responses of the changed bacterial communities by enrofloxacin. Published by Elsevier Ltd.

  19. Characterization of the Antarctic sea urchin (Sterechinus neumayeri) transcriptome and mitogenome: a molecular resource for phylogenetics, ecophysiology and global change biology.

    PubMed

    Dilly, G F; Gaitán-Espitia, J D; Hofmann, G E

    2015-03-01

    This is the first de novo transcriptome and complete mitochondrial genome of an Antarctic sea urchin species sequenced to date. Sterechinus neumayeri is an Antarctic sea urchin and a model species for ecology, development, physiology and global change biology. To identify transcripts important to ocean acidification (OA) and thermal stress, this transcriptome was created pooling, and 13 larval samples representing developmental stages on day 11 (late gastrula), 19 (early pluteus) and 30 (mid pluteus) maintained at three CO2 levels (421, 652, and 1071 μatm) as well as four additional heat-shocked samples. The normalized cDNA pool was sequenced using emulsion PCR (pyrosequencing) resulting in 1.34M reads with an average read length of 492 base pairs. 40,994 isotigs were identified, averaging 1188 bp with a median coverage of 11×. Additional primer design and gap sequencing were required to complete the mitochondrial genome. The mitogenome of S. neumayeri is a circular DNA molecule with a length of 15 684 bp that contains all 37 genes normally found in metazoans. We detail the main features of the transcriptome and the mitogenome architecture and investigate the phylogenetic relationships of S. neumayeri within Echinoidea. In addition, we provide comparative analyses of S. neumayeri with its closest relative, Strongylocentrotus purpuratus, including a list of potential OA gene targets. The resources described here will support a variety of quantitative (genomic, proteomic, multistress and comparative) studies to interrogate physiological responses to OA and other stressors in this important Antarctic calcifier. © 2014 John Wiley & Sons Ltd.

  20. Transcript expression profiling for adventitious roots of Panax ginseng Meyer.

    PubMed

    Subramaniyam, Sathiyamoorthy; Mathiyalagan, Ramya; Natarajan, Sathishkumar; Kim, Yu-Jin; Jang, Moon-Gi; Park, Jun-Hyung; Yang, Deok Chun

    2014-08-01

    Panax ginseng Meyer is one of the major medicinal plants in oriental countries belonging to the Araliaceae family which are the primary source for ginsenosides. However, very few genes were characterized for ginsenoside pathway, due to the limited genome information. Through this study, we obtained a comprehensive transcriptome from adventitious roots, which were treated with methyl jasmonic acids for different time points (control, 2h, 6h, 12h, and 24h) and sequenced by RNA 454 pyrosequencing technology. Reference transcriptome 39,304,529 (0.04GB) was obtained from 5,724,987,880 bases (5.7GB) of 22 libraries by de novo assembly and 35,266 (58.5%) transcripts were annotated with biological schemas (GO and KEGG). The digital gene expression patterns were obtained from in vitro grown adventitious root sequences which mapped to reference, from that, 3813 (6.3%) unique transcripts were involved in ≥2 fold up and downregulations. Finally, candidates for ginsenoside pathway genes were predicted from observed expression patterns. Among them, 30 transcription factors, 20 cytochromes, and 11 glycosyl transferases were predicted as ginsenoside candidates. These data can remarkably expand the existing transcriptome resources of Panax, especially to predict existence of gene networks in P. ginseng. The entity of the data provides a valuable platform to reveal more on secondary metabolism and abiotic stresses from P. ginseng in vitro grown adventitious roots. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Transcriptomic analysis of Siberian ginseng (Eleutherococcus senticosus) to discover genes involved in saponin biosynthesis.

    PubMed

    Hwang, Hwan-Su; Lee, Hyoshin; Choi, Yong Eui

    2015-03-14

    Eleutherococcus senticosus, Siberian ginseng, is a highly valued woody medicinal plant belonging to the family Araliaceae. E. senticosus produces a rich variety of saponins such as oleanane-type, noroleanane-type, 29-hydroxyoleanan-type, and lupane-type saponins. Genomic or transcriptomic approaches have not been used to investigate the saponin biosynthetic pathway in this plant. In this study, de novo sequencing was performed to select candidate genes involved in the saponin biosynthetic pathway. A half-plate 454 pyrosequencing run produced 627,923 high-quality reads with an average sequence length of 422 bases. De novo assembly generated 72,811 unique sequences, including 15,217 contigs and 57,594 singletons. Approximately 48,300 (66.3%) unique sequences were annotated using BLAST similarity searches. All of the mevalonate pathway genes for saponin biosynthesis starting from acetyl-CoA were isolated. Moreover, 206 reads of cytochrome P450 (CYP) and 145 reads of uridine diphosphate glycosyltransferase (UGT) sequences were isolated. Based on methyl jasmonate (MeJA) treatment and real-time PCR (qPCR) analysis, 3 CYPs and 3 UGTs were finally selected as candidate genes involved in the saponin biosynthetic pathway. The identified sequences associated with saponin biosynthesis will facilitate the study of the functional genomics of saponin biosynthesis and genetic engineering of E. senticosus.

  2. Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.

    PubMed

    Taheri, Sima; Lee Abdullah, Thohirah; Yusop, Mohd Rafii; Hanafi, Mohamed Musa; Sahebi, Mahbod; Azizi, Parisa; Shamshiri, Redmond Ramin

    2018-02-13

    Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq) and related tools for mining and development of microsatellites in plants.

  3. Transcriptome analysis of eyestalk and hemocytes in the ridgetail white prawn Exopalaemon carinicauda: assembly, annotation and marker discovery.

    PubMed

    Li, Jitao; Li, Jian; Chen, Ping; Liu, Ping; He, Yuying

    2015-01-01

    The ridgetail white prawn Exopalaemon carinicauda is one of major economic mariculture species in eastern China. The deficiency of genomic and transcriptomic data is becoming the bottleneck of further researches on its good traits. In the present study, 454 pyrosequencing was undertaken to investigate the transcriptome profiles of E. carinicauda. A collection of 1,028,710 sequence reads (459.59 Mb) obtained from cDNA prepared from eyestalk and hemocytes was assembled into 162,056 expressed sequence tags (ESTs). Of these, 29.88 % of 48,428 contigs and 70.12 % of 113,628 singlets possessed high similarities to sequences in the GenBank non-redundant database, with most significant (E value <1e(-10)) unigenes matches occurring with crustacean and insect sequences. KEGG analysis of unigenes identified putative members of biological pathways related to growth and immunity. In addition, we obtained a total of putative 125,112 SNPs and 13,467 microsatellites. These results will contribute to the understanding of the genome makeup and provide useful information for future functional genomic research in E. carinicauda.

  4. Transcriptome analysis of Capsicum annuum varieties Mandarin and Blackcluster: assembly, annotation and molecular marker discovery.

    PubMed

    Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Cho, Young-Il; Lee, Hye-Eun; Kim, Do-Sun; Woo, Jong-Gyu; Cho, Myeong-Cheoul

    2014-01-10

    Next generation sequencing technologies have proven to be a rapid and cost-effective means to assemble and characterize gene content and identify molecular markers in various organisms. Pepper (Capsicum annuum L., Solanaceae) is a major staple vegetable crop, which is economically important and has worldwide distribution. High-throughput transcriptome profiling of two pepper cultivars, Mandarin and Blackcluster, using 454 GS-FLX pyrosequencing yielded 279,221 and 316,357 sequenced reads with a total 120.44 and 142.54Mb of sequence data (average read length of 431 and 450 nucleotides). These reads resulted from 17,525 and 16,341 'isogroups' and were assembled into 19,388 and 18,057 isotigs, and 22,217 and 13,153 singletons for both the cultivars, respectively. Assembled sequences were annotated functionally based on homology to genes in multiple public databases. Detailed sequence variant analysis identified a total of 9701 and 12,741 potential SNPs which eventually resulted in 1025 and 1059 genotype specific SNPs, for both the varieties, respectively, after examining SNP frequency distribution for each mapped unigenes. These markers for pepper will be highly valuable for marker-assisted breeding and other genetic studies. © 2013 Elsevier B.V. All rights reserved.

  5. De novo assembly and transcriptome analysis of five major tissues of Jatropha curcas L. using GS FLX titanium platform of 454 pyrosequencing

    PubMed Central

    2011-01-01

    Background Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. Results From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. Conclusion The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of jatropha to increase oil content, and to modify oil composition. PMID:21492485

  6. De novo assembly and transcriptome analysis of five major tissues of Jatropha curcas L. using GS FLX titanium platform of 454 pyrosequencing.

    PubMed

    Natarajan, Purushothaman; Parani, Madasamy

    2011-04-15

    Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of jatropha to increase oil content, and to modify oil composition.

  7. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.

    PubMed

    Fonseca, Fernando Campos de Assis; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Souza Júnior, José Dijair Antonino; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima

    2015-01-01

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that presents a long life cycle and which efforts to control it using pesticides have been inefficient. Although its economical relevance, only a few DNA sequences are available for this species in the GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.

  8. Massive sequencing of Ulmus minor’s transcriptome provides new molecular tools for a genus under the constant threat of Dutch elm disease

    PubMed Central

    Perdiguero, Pedro; Venturas, Martin; Cervera, María Teresa; Gil, Luis; Collada, Carmen

    2015-01-01

    Elms, especially Ulmus minor and U. americana, are carrying out a hard battle against Dutch elm disease (DED). This vascular wilt disease, caused by Ophiostoma ulmi and O. novo-ulmi, appeared in the twentieth century and killed millions of elms across North America and Europe. Elm breeding and conservation programmes have identified a reduced number of DED tolerant genotypes. In this study, three U. minor genotypes with contrasted levels of tolerance to DED were exposed to several biotic and abiotic stresses in order to (i) obtain a de novo assembled transcriptome of U. minor using 454 pyrosequencing, (ii) perform a functional annotation of the assembled transcriptome, (iii) identify genes potentially involved in the molecular response to environmental stress, and (iv) develop gene-based markers to support breeding programmes. A total of 58,429 putative unigenes were identified after assembly and filtering of the transcriptome. 32,152 of these unigenes showed homology with proteins identified in the genome from the most common plant model species. Well-known family proteins and transcription factors involved in abiotic, biotic or both stresses were identified after functional annotation. A total of 30,693 polymorphisms were identified in 7,125 isotigs, a large number of them corresponding to single nucleotide polymorphisms (SNPs; 27,359). In a subset randomly selected for validation, 87% of the SNPs were confirmed. The material generated may be valuable for future Ulmus gene expression, population genomics and association genetics studies, especially taking into account the scarce molecular information available for this genus and the great impact that DED has on elm populations. PMID:26257751

  9. Sugarcane Giant Borer Transcriptome Analysis and Identification of Genes Related to Digestion

    PubMed Central

    de Assis Fonseca, Fernando Campos; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima

    2015-01-01

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that presents a long life cycle and which efforts to control it using pesticides have been inefficient. Although its economical relevance, only a few DNA sequences are available for this species in the GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect’s biology and to guide the development of new strategies for insect-pest control. PMID:25706301

  10. Transcriptome analysis of the honey bee fungal pathogen, Ascosphaera apis: implications for host pathogenesis

    PubMed Central

    2012-01-01

    Background We present a comprehensive transcriptome analysis of the fungus Ascosphaera apis, an economically important pathogen of the Western honey bee (Apis mellifera) that causes chalkbrood disease. Our goals were to further annotate the A. apis reference genome and to identify genes that are candidates for being differentially expressed during host infection versus axenic culture. Results We compared A. apis transcriptome sequence from mycelia grown on liquid or solid media with that dissected from host-infected tissue. 454 pyrosequencing provided 252 Mb of filtered sequence reads from both culture types that were assembled into 10,087 contigs. Transcript contigs, protein sequences from multiple fungal species, and ab initio gene predictions were included as evidence sources in the Maker gene prediction pipeline, resulting in 6,992 consensus gene models. A phylogeny based on 12 of these protein-coding loci further supported the taxonomic placement of Ascosphaera as sister to the core Onygenales. Several common protein domains were less abundant in A. apis compared with related ascomycete genomes, particularly cytochrome p450 and protein kinase domains. A novel gene family was identified that has expanded in some ascomycete lineages, but not others. We manually annotated genes with homologs in other fungal genomes that have known relevance to fungal virulence and life history. Functional categories of interest included genes involved in mating-type specification, intracellular signal transduction, and stress response. Computational and manual annotations have been made publicly available on the Bee Pests and Pathogens website. Conclusions This comprehensive transcriptome analysis substantially enhances our understanding of the A. apis genome and its expression during infection of honey bee larvae. It also provides resources for future molecular studies of chalkbrood disease and ultimately improved disease management. PMID:22747707

  11. New approach for the study of mite reproduction: The first transcriptome analysis of a mite, Phytoseiulus persimilis (Acari: Phytoseiidae).

    PubMed

    Cabrera, Ana R; Donohue, Kevin V; Khalil, Sayed M S; Scholl, Elizabeth; Opperman, Charles; Sonenshine, Daniel E; Roe, R Michael

    2011-01-01

    Many species of mites and ticks are of agricultural and medical importance. Much can be learned from the study of transcriptomes of acarines which can generate DNA-sequence information of potential target genes for the control of acarine pests. High throughput transcriptome sequencing can also yield sequences of genes critical during physiological processes poorly understood in acarines, i.e., the regulation of female reproduction in mites. The predatory mite, Phytoseiulus persimilis, was selected to conduct a transcriptome analysis using 454 pyrosequencing. The objective of this project was to obtain DNA-sequence information of expressed genes from P. persimilis with special interest in sequences corresponding to vitellogenin (Vg) and the vitellogenin receptor (VgR). These genes are critical to the understanding of vitellogenesis, and they will facilitate the study of the regulation of mite female reproduction. A total of 12,556 contiguous sequences (contigs) were assembled with an average size of 935bp. From these sequences, the putative translated peptides of 11 contigs were similar in amino acid sequences to other arthropod Vgs, while 6 were similar to VgRs. We selected some of these sequences to conduct stage-specific expression studies to further determine their function. 2010 Elsevier Ltd. All rights reserved.

  12. 5'-Serial Analysis of Gene Expression studies reveal a transcriptomic switch during fruiting body development in Coprinopsis cinerea

    PubMed Central

    2013-01-01

    Background The transition from the vegetative mycelium to the primordium during fruiting body development is the most complex and critical developmental event in the life cycle of many basidiomycete fungi. Understanding the molecular mechanisms underlying this process has long been a goal of research on basidiomycetes. Large scale assessment of the expressed transcriptomes of these developmental stages will facilitate the generation of a more comprehensive picture of the mushroom fruiting process. In this study, we coupled 5'-Serial Analysis of Gene Expression (5'-SAGE) to high-throughput pyrosequencing from 454 Life Sciences to analyze the transcriptomes and identify up-regulated genes among vegetative mycelium (Myc) and stage 1 primordium (S1-Pri) of Coprinopsis cinerea during fruiting body development. Results We evaluated the expression of >3,000 genes in the two respective growth stages and discovered that almost one-third of these genes were preferentially expressed in either stage. This identified a significant turnover of the transcriptome during the course of fruiting body development. Additionally, we annotated more than 79,000 transcription start sites (TSSs) based on the transcriptomes of the mycelium and stage 1 primoridum stages. Patterns of enrichment based on gene annotations from the GO and KEGG databases indicated that various structural and functional protein families were uniquely employed in either stage and that during primordial growth, cellular metabolism is highly up-regulated. Various signaling pathways such as the cAMP-PKA, MAPK and TOR pathways were also identified as up-regulated, consistent with the model that sensing of nutrient levels and the environment are important in this developmental transition. More than 100 up-regulated genes were also found to be unique to mushroom forming basidiomycetes, highlighting the novelty of fruiting body development in the fungal kingdom. Conclusions We implicated a wealth of new candidate genes important to early stages of mushroom fruiting development, though their precise molecular functions and biological roles are not yet fully known. This study serves to advance our understanding of the molecular mechanisms of fruiting body development in the model mushroom C. cinerea. PMID:23514374

  13. Transcriptome sequencing of the Antarctic vascular plant Deschampsia antarctica Desv. under abiotic stress.

    PubMed

    Lee, Jungeun; Noh, Eun Kyeung; Choi, Hyung-Seok; Shin, Seung Chul; Park, Hyun; Lee, Hyoungseok

    2013-03-01

    Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been studied as an extremophile that has successfully adapted to marginal land with the harshest environment for terrestrial plants. However, limited genetic research has focused on this species due to the lack of genomic resources. Here, we present the first de novo assembly of its transcriptome by massive parallel sequencing and its expression profile using D. antarctica grown under various stress conditions. Total sequence reads generated by pyrosequencing were assembled into 60,765 unigenes (28,177 contigs and 32,588 singletons). A total of 29,173 unique protein-coding genes were identified based on sequence similarities to known proteins. The combined results from all three stress conditions indicated differential expression of 3,110 genes. Quantitative reverse transcription polymerase chain reaction showed that several well-known stress-responsive genes encoding late embryogenesis abundant protein, dehydrin 1, and ice recrystallization inhibition protein were induced dramatically and that genes encoding U-box-domain-containing protein, electron transfer flavoprotein-ubiquinone, and F-box-containing protein were induced by abiotic stressors in a manner conserved with other plant species. We identified more than 2,000 simple sequence repeats that can be developed as functional molecular markers. This dataset is the most comprehensive transcriptome resource currently available for D. antarctica and is therefore expected to be an important foundation for future genetic studies of grasses and extremophiles.

  14. A physiologically-oriented transcriptomic analysis of the midgut of Tenebrio molitor.

    PubMed

    Moreira, Nathalia R; Cardoso, Christiane; Dias, Renata O; Ferreira, Clelia; Terra, Walter R

    2017-05-01

    Physiological data showed that T. molitor midgut is buffered at pH 5.6 at the two anterior thirds and at 7.9 at the posterior third. Furthermore, water is absorbed and secreted at the anterior and posterior midgut, respectively, driving a midgut counter flux of fluid. To look for the molecular mechanisms underlying these phenomena and nutrient absorption as well, a transcriptomic approach was used. For this, 11 types of transporters were chosen from the midgut transcriptome obtained by pyrosequencing (Roche 454). After annotation with the aid of databanks and manual curation, the sequences were validated by RT-PCR. The expression level of each gene at anterior, middle and posterior midgut and carcass (larva less midgut) was evaluated by RNA-seq taking into account reference sequences based on 454 contigs and reads obtained by Illumina sequencing. The data showed that sugar and amino acid uniporters and symporters are expressed along the whole midgut. In the anterior midgut are found transporters for NH 3 and NH 4 + that with a chloride channel may be responsible for acidifying the lumen. At the posterior midgut, bicarbonate-Cl - antiporter with bicarbonate supplied by carbonic anhydrase may alkalinize the lumen. Water absorption caused mainly by an anterior Na + -K + -2Cl - symporter and water secretion caused by a posterior K + -Cl - may drive the midgut counter flux. Transporters that complement the action of those described were also found. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).

    PubMed

    Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie

    2016-01-01

    The mechanisms underlying sexual reproduction and sex ratio determination remains unclear in turbot, a flatfish of great commercial value. And there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, two gonad libraries representing sex differences in Scophthalmus maximus yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons by using 454 pyrosequencing, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.

  16. Transcriptome Analysis of the Entomopathogenic Oomycete Lagenidium giganteum Reveals Putative Virulence Factors

    PubMed Central

    Quiroz Velasquez, Paula F.; Abiff, Sumayyah K.; Fins, Katrina C.; Conway, Quincy B.; Salazar, Norma C.; Delgado, Ana Paula; Dawes, Jhanelle K.; Douma, Lauren G.

    2014-01-01

    A combination of 454 pyrosequencing and Sanger sequencing was used to sample and characterize the transcriptome of the entomopathogenic oomycete Lagenidium giganteum. More than 50,000 high-throughput reads were annotated through homology searches. Several selected reads served as seeds for the amplification and sequencing of full-length transcripts. Phylogenetic analyses inferred from full-length cellulose synthase alignments revealed that L giganteum is nested within the peronosporalean galaxy and as such appears to have evolved from a phytopathogenic ancestor. In agreement with the phylogeny reconstructions, full-length L. giganteum oomycete effector orthologs, corresponding to the cellulose-binding elicitor lectin (CBEL), crinkler (CRN), and elicitin proteins, were characterized by domain organizations similar to those of pathogenicity factors of plant-pathogenic oomycetes. Importantly, the L. giganteum effectors provide a basis for detailing the roles of canonical CRN, CBEL, and elicitin proteins in the infectious process of an oomycete known principally as an animal pathogen. Finally, phylogenetic analyses and genome mining identified members of glycoside hydrolase family 5 subfamily 27 (GH5_27) as putative virulence factors active on the host insect cuticle, based in part on the fact that GH5_27 genes are shared by entomopathogenic oomycetes and fungi but are underrepresented in nonentomopathogenic genomes. The genomic resources gathered from the L. giganteum transcriptome analysis strongly suggest that filamentous entomopathogens (oomycetes and fungi) exhibit convergent evolution: they have evolved independently from plant-associated microbes, have retained genes indicative of plant associations, and may share similar cores of virulence factors, such as GH5_27 enzymes, that are absent from the genomes of their plant-pathogenic relatives. PMID:25107973

  17. Complete genome sequence of a novel Plum pox virus strain W isolate determined by 454 pyrosequencing.

    PubMed

    Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei

    2013-10-01

    The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of cDNA library using TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. The Pk has been found in a wild plum (Prunus domestica) in a new region of Russia indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.

  18. Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development

    PubMed Central

    Alagna, Fiammetta; D'Agostino, Nunzio; Torchia, Laura; Servili, Maurizio; Rao, Rosa; Pietrella, Marco; Giuliano, Giovanni; Chiusano, Maria Luisa; Baldoni, Luciana; Perrotta, Gaetano

    2009-01-01

    Background Despite its primary economic importance, genomic information on olive tree is still lacking. 454 pyrosequencing was used to enrich the very few sequence data currently available for the Olea europaea species and to identify genes involved in expression of fruit quality traits. Results Fruits of Coratina, a widely cultivated variety characterized by a very high phenolic content, and Tendellone, an oleuropein-lacking natural variant, were used as starting material for monitoring the transcriptome. Four different cDNA libraries were sequenced, respectively at the beginning and at the end of drupe development. A total of 261,485 reads were obtained, for an output of about 58 Mb. Raw sequence data were processed using a four step pipeline procedure and data were stored in a relational database with a web interface. Conclusion Massively parallel sequencing of different fruit cDNA collections has provided large scale information about the structure and putative function of gene transcripts accumulated during fruit development. Comparative transcript profiling allowed the identification of differentially expressed genes with potential relevance in regulating the fruit metabolism and phenolic content during ripening. PMID:19709400

  19. Transcriptome profiling of Diachasmimorpha longicaudata towards useful molecular tools for population management.

    PubMed

    Mannino, M Constanza; Rivarola, Máximo; Scannapieco, Alejandra C; González, Sergio; Farber, Marisa; Cladera, Jorge L; Lanzavecchia, Silvia B

    2016-10-12

    Diachasmimorpha longicaudata (Hymenoptera: Braconidae) is a solitary parasitoid of Tephritidae (Diptera) fruit flies of economic importance currently being mass-reared in bio-factories and successfully used worldwide. A peculiar biological aspect of Hymenoptera is its haplo-diploid life cycle, where females (diploid) develop from fertilized eggs and males (haploid) from unfertilized eggs. Diploid males were described in many species and recently evidenced in D. longicaudata by mean of inbreeding studies. Sex determination in this parasitoid is based on the Complementary Sex Determination (CSD) system, with alleles from at least one locus involved in early steps of this pathway. Since limited information is available about genetics of this parasitoid species, a deeper analysis on D. longicaudata's genomics is required to provide molecular tools for achieving a more cost effective production under artificial rearing conditions. We report here the first transcriptome analysis of male-larvae, adult females and adult males of D. longicaudata using 454-pyrosequencing. A total of 469766 reads were analyzed and 8483 high-quality isotigs were assembled. After functional annotation, a total of 51686 unigenes were produced, from which, 7021 isotigs and 20227 singletons had at least one BLAST hit against the NCBI non-redundant protein database. A preliminary comparison of adult female and male evidenced that 98 transcripts showed differential expression profiles, with at least a 10-fold difference. Among the functionally annotated transcripts we detected four sequences potentially involved in sex determination and three homologues to two known genes involved in the sex determination cascade. Finally, a total of 4674SimpleSequence Repeats (SSRs) were in silico identified and characterized. The information obtained here will significantly contribute to the development of D. longicaudata functional genomics, genetics and population-based genome studies. Thousands of new microsatellite markers were identified as toolkits for population genetics analysis. The transcriptome characterized here is the starting point to elucidate the molecular bases of the sex determination mechanism in this species.

  20. Sympatric ecological speciation meets pyrosequencing: sampling the transcriptome of the apple maggot Rhagoletis pomonella

    PubMed Central

    2009-01-01

    Background The full power of modern genetics has been applied to the study of speciation in only a small handful of genetic model species - all of which speciated allopatrically. Here we report the first large expressed sequence tag (EST) study of a candidate for ecological sympatric speciation, the apple maggot Rhagoletis pomonella, using massively parallel pyrosequencing on the Roche 454-FLX platform. To maximize transcript diversity we created and sequenced separate libraries from larvae, pupae, adult heads, and headless adult bodies. Results We obtained 239,531 sequences which assembled into 24,373 contigs. A total of 6810 unique protein coding genes were identified among the contigs and long singletons, corresponding to 48% of all known Drosophila melanogaster protein-coding genes. Their distribution across GO classes suggests that we have obtained a representative sample of the transcriptome. Among these sequences are many candidates for potential R. pomonella "speciation genes" (or "barrier genes") such as those controlling chemosensory and life-history timing processes. Furthermore, we identified important marker loci including more than 40,000 single nucleotide polymorphisms (SNPs) and over 100 microsatellites. An initial search for SNPs at which the apple and hawthorn host races differ suggested at least 75 loci warranting further work. We also determined that developmental expression differences remained even after normalization; transcripts expected to show different expression levels between larvae and pupae in D. melanogaster also did so in R. pomonella. Preliminary comparative analysis of transcript presences and absences revealed evidence of gene loss in Drosophila and gain in the higher dipteran clade Schizophora. Conclusions These data provide a much needed resource for exploring mechanisms of divergence in this important model for sympatric ecological speciation. Our description of ESTs from a substantial portion of the R. pomonella transcriptome will facilitate future functional studies of candidate genes for olfaction and diapause-related life history timing, and will enable large scale expression studies. Similarly, the identification of new SNP and microsatellite markers will facilitate future population and quantitative genetic studies of divergence between the apple and hawthorn-infesting host races. PMID:20035631

  1. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests.

    PubMed

    Firmino, Alexandre Augusto Pereira; Fonseca, Fernando Campos de Assis; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; Antonino de Souza, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects.

  2. Transcriptome Analysis in Cotton Boll Weevil (Anthonomus grandis) and RNA Interference in Insect Pests

    PubMed Central

    Coelho, Roberta Ramos; Antonino de Souza Jr, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas-Jr, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families’ data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects. PMID:24386449

  3. Bio-crude transcriptomics: gene discovery and metabolic network reconstruction for the biosynthesis of the terpenome of the hydrocarbon oil-producing green alga, Botryococcus braunii race B (Showa).

    PubMed

    Molnár, István; Lopez, David; Wisecaver, Jennifer H; Devarenne, Timothy P; Weiss, Taylor L; Pellegrini, Matteo; Hackett, Jeremiah D

    2012-10-30

    Microalgae hold promise for yielding a biofuel feedstock that is sustainable, carbon-neutral, distributed, and only minimally disruptive for the production of food and feed by traditional agriculture. Amongst oleaginous eukaryotic algae, the B race of Botryococcus braunii is unique in that it produces large amounts of liquid hydrocarbons of terpenoid origin. These are comparable to fossil crude oil, and are sequestered outside the cells in a communal extracellular polymeric matrix material. Biosynthetic engineering of terpenoid bio-crude production requires identification of genes and reconstruction of metabolic pathways responsible for production of both hydrocarbons and other metabolites of the alga that compete for photosynthetic carbon and energy. A de novo assembly of 1,334,609 next-generation pyrosequencing reads form the Showa strain of the B race of B. braunii yielded a transcriptomic database of 46,422 contigs with an average length of 756 bp. Contigs were annotated with pathway, ontology, and protein domain identifiers. Manual curation allowed the reconstruction of pathways that produce terpenoid liquid hydrocarbons from primary metabolites, and pathways that divert photosynthetic carbon into tetraterpenoid carotenoids, diterpenoids, and the prenyl chains of meroterpenoid quinones and chlorophyll. Inventories of machine-assembled contigs are also presented for reconstructed pathways for the biosynthesis of competing storage compounds including triacylglycerol and starch. Regeneration of S-adenosylmethionine, and the extracellular localization of the hydrocarbon oils by active transport and possibly autophagy are also investigated. The construction of an annotated transcriptomic database, publicly available in a web-based data depository and annotation tool, provides a foundation for metabolic pathway and network reconstruction, and facilitates further omics studies in the absence of a genome sequence for the Showa strain of B. braunii, race B. Further, the transcriptome database empowers future biosynthetic engineering approaches for strain improvement and the transfer of desirable traits to heterologous hosts.

  4. Bio-crude transcriptomics: Gene discovery and metabolic network reconstruction for the biosynthesis of the terpenome of the hydrocarbon oil-producing green alga, Botryococcus braunii race B (Showa)*

    PubMed Central

    2012-01-01

    Background Microalgae hold promise for yielding a biofuel feedstock that is sustainable, carbon-neutral, distributed, and only minimally disruptive for the production of food and feed by traditional agriculture. Amongst oleaginous eukaryotic algae, the B race of Botryococcus braunii is unique in that it produces large amounts of liquid hydrocarbons of terpenoid origin. These are comparable to fossil crude oil, and are sequestered outside the cells in a communal extracellular polymeric matrix material. Biosynthetic engineering of terpenoid bio-crude production requires identification of genes and reconstruction of metabolic pathways responsible for production of both hydrocarbons and other metabolites of the alga that compete for photosynthetic carbon and energy. Results A de novo assembly of 1,334,609 next-generation pyrosequencing reads form the Showa strain of the B race of B. braunii yielded a transcriptomic database of 46,422 contigs with an average length of 756 bp. Contigs were annotated with pathway, ontology, and protein domain identifiers. Manual curation allowed the reconstruction of pathways that produce terpenoid liquid hydrocarbons from primary metabolites, and pathways that divert photosynthetic carbon into tetraterpenoid carotenoids, diterpenoids, and the prenyl chains of meroterpenoid quinones and chlorophyll. Inventories of machine-assembled contigs are also presented for reconstructed pathways for the biosynthesis of competing storage compounds including triacylglycerol and starch. Regeneration of S-adenosylmethionine, and the extracellular localization of the hydrocarbon oils by active transport and possibly autophagy are also investigated. Conclusions The construction of an annotated transcriptomic database, publicly available in a web-based data depository and annotation tool, provides a foundation for metabolic pathway and network reconstruction, and facilitates further omics studies in the absence of a genome sequence for the Showa strain of B. braunii, race B. Further, the transcriptome database empowers future biosynthetic engineering approaches for strain improvement and the transfer of desirable traits to heterologous hosts. PMID:23110428

  5. 454 Pyrosequencing of Olive (Olea europaea L.) Transcriptome in Response to Salinity

    PubMed Central

    Bazakos, Christos; Manioudaki, Maria E.; Sarropoulou, Elena; Spano, Thodhoraq; Kalaitzis, Panagiotis

    2015-01-01

    Olive (Olea europaea L.) is one of the most important crops in the Mediterranean region. The expansion of cultivation in areas irrigated with low quality and saline water has negative effects on growth and productivity however the investigation of the molecular basis of salt tolerance in olive trees has been only recently initiated. To this end, we investigated the molecular response of cultivar Kalamon to salinity stress using next-generation sequencing technology to explore the transcriptome profile of olive leaves and roots and identify differentially expressed genes that are related to salt tolerance response. Out of 291,958 obtained trimmed reads, 28,270 unique transcripts were identified of which 35% are annotated, a percentage that is comparable to similar reports on non-model plants. Among the 1,624 clusters in roots that comprise more than one read, 24 were differentially expressed comprising 9 down- and 15 up-regulated genes. Respectively, inleaves, among the 2,642 clusters, 70 were identified as differentially expressed, with 14 down- and 56 up-regulated genes. Using next-generation sequencing technology we were able to identify salt-response-related transcripts. Furthermore we provide an annotated transcriptome of olive as well as expression data, which are both significant tools for further molecular studies in olive. PMID:26576008

  6. 454 Pyrosequencing of Olive (Olea europaea L.) Transcriptome in Response to Salinity.

    PubMed

    Bazakos, Christos; Manioudaki, Maria E; Sarropoulou, Elena; Spano, Thodhoraq; Kalaitzis, Panagiotis

    2015-01-01

    Olive (Olea europaea L.) is one of the most important crops in the Mediterranean region. The expansion of cultivation in areas irrigated with low quality and saline water has negative effects on growth and productivity however the investigation of the molecular basis of salt tolerance in olive trees has been only recently initiated. To this end, we investigated the molecular response of cultivar Kalamon to salinity stress using next-generation sequencing technology to explore the transcriptome profile of olive leaves and roots and identify differentially expressed genes that are related to salt tolerance response. Out of 291,958 obtained trimmed reads, 28,270 unique transcripts were identified of which 35% are annotated, a percentage that is comparable to similar reports on non-model plants. Among the 1,624 clusters in roots that comprise more than one read, 24 were differentially expressed comprising 9 down- and 15 up-regulated genes. Respectively, inleaves, among the 2,642 clusters, 70 were identified as differentially expressed, with 14 down- and 56 up-regulated genes. Using next-generation sequencing technology we were able to identify salt-response-related transcripts. Furthermore we provide an annotated transcriptome of olive as well as expression data, which are both significant tools for further molecular studies in olive.

  7. Pyrosequencing-based quantitative measurement of CALR mutation allele burdens and their clinical implications in patients with myeloproliferative neoplasms.

    PubMed

    Oh, Yejin; Song, Ik-Chan; Kim, Jimyung; Kwon, Gye Cheol; Koo, Sun Hoe; Kim, Seon Young

    2018-05-01

    We developed a pyrosequencing-based method for the quantification of CALR mutations and compared the results using Sanger sequencing, fragment length analysis (FLA), digital-droplet PCR (ddPCR), and next-generation sequencing (NGS). Method validation studies were performed using cloned plasmid controls. Samples from 24 patients with myeloproliferative neoplasms were evaluated. Among the 24 patients, 15 had CALR mutations (7 type 1, 2 type 2, and 6 other mutations). The type 1 or type 2 mutation-positive results from pyrosequencing exhibited 100% concordance with the Sanger sequencing results. One novel CALR mutation was not detected by pyrosequencing. The CALR mutation allele burdens measured by pyrosequencing were slightly lower than those measured by FLA but slightly higher than the results obtained using ddPCR. Pyrosequencing exhibited high correlations with both methods. The mutation allele burdens estimated by NGS were significantly lower than those measured by pyrosequencing. An increased CALR mutation allele burden was associated with overt primary myelofibrosis. Patients with >70% mutation allele burdens in myeloid cells had a significantly longer time from diagnosis (P = 0.007), more bone marrow fibrosis (P = 0.010), and lower hemoglobin (P = 0.007). Pyrosequencing was a useful rapid sequencing method to determine the burden of CALR mutations. Copyright © 2018 Elsevier B.V. All rights reserved.

  8. De novo sequencing and characterization of floral transcriptome in two species of buckwheat (Fagopyrum)

    PubMed Central

    2011-01-01

    Background Transcriptome sequencing data has become an integral component of modern genetics, genomics and evolutionary biology. However, despite advances in the technologies of DNA sequencing, such data are lacking for many groups of living organisms, in particular, many plant taxa. We present here the results of transcriptome sequencing for two closely related plant species. These species, Fagopyrum esculentum and F. tataricum, belong to the order Caryophyllales - a large group of flowering plants with uncertain evolutionary relationships. F. esculentum (common buckwheat) is also an important food crop. Despite these practical and evolutionary considerations Fagopyrum species have not been the subject of large-scale sequencing projects. Results Normalized cDNA corresponding to genes expressed in flowers and inflorescences of F. esculentum and F. tataricum was sequenced using the 454 pyrosequencing technology. This resulted in 267 (for F. esculentum) and 229 (F. tataricum) thousands of reads with average length of 341-349 nucleotides. De novo assembly of the reads produced about 25 thousands of contigs for each species, with 7.5-8.2× coverage. Comparative analysis of two transcriptomes demonstrated their overall similarity but also revealed genes that are presumably differentially expressed. Among them are retrotransposon genes and genes involved in sugar biosynthesis and metabolism. Thirteen single-copy genes were used for phylogenetic analysis; the resulting trees are largely consistent with those inferred from multigenic plastid datasets. The sister relationships of the Caryophyllales and asterids now gained high support from nuclear gene sequences. Conclusions 454 transcriptome sequencing and de novo assembly was performed for two congeneric flowering plant species, F. esculentum and F. tataricum. As a result, a large set of cDNA sequences that represent orthologs of known plant genes as well as potential new genes was generated. PMID:21232141

  9. Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae)

    PubMed Central

    Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M

    2013-01-01

    Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455

  10. Comparative transcriptome analysis of the Asteraceae halophyte Karelinia caspica under salt stress.

    PubMed

    Zhang, Xia; Liao, Maoseng; Chang, Dan; Zhang, Fuchun

    2014-12-17

    Much attention has been given to the potential of halophytes as sources of tolerance traits for introduction into cereals. However, a great deal remains unknown about the diverse mechanisms employed by halophytes to cope with salinity. To characterize salt tolerance mechanisms underlying Karelinia caspica, an Asteraceae halophyte, we performed Large-scale transcriptomic analysis using a high-throughput Illumina sequencing platform. Comparative gene expression analysis was performed to correlate the effects of salt stress and ABA regulation at the molecular level. Total sequence reads generated by pyrosequencing were assembled into 287,185 non-redundant transcripts with an average length of 652 bp. Using the BLAST function in the Swiss-Prot, NCBI nr, GO, KEGG, and KOG databases, a total of 216,416 coding sequences associated with known proteins were annotated. Among these, 35,533 unigenes were classified into 69 gene ontology categories, and 18,378 unigenes were classified into 202 known pathways. Based on the fold changes observed when comparing the salt stress and control samples, 60,127 unigenes were differentially expressed, with 38,122 and 22,005 up- and down-regulated, respectively. Several of the differentially expressed genes are known to be involved in the signaling pathway of the plant hormone ABA, including ABA metabolism, transport, and sensing as well as the ABA signaling cascade. Transcriptome profiling of K. caspica contribute to a comprehensive understanding of K. caspica at the molecular level. Moreover, the global survey of differentially expressed genes in this species under salt stress and analyses of the effects of salt stress and ABA regulation will contribute to the identification and characterization of genes and molecular mechanisms underlying salt stress responses in Asteraceae plants.

  11. High-throughput transcriptome sequencing and preliminary functional analysis in four Neotropical tree species.

    PubMed

    Brousseau, Louise; Tinaut, Alexandra; Duret, Caroline; Lang, Tiange; Garnier-Gere, Pauline; Scotti, Ivan

    2014-03-27

    The Amazonian rainforest is predicted to suffer from ongoing environmental changes. Despite the need to evaluate the impact of such changes on tree genetic diversity, we almost entirely lack genomic resources. In this study, we analysed the transcriptome of four tropical tree species (Carapa guianensis, Eperua falcata, Symphonia globulifera and Virola michelii) with contrasting ecological features, belonging to four widespread botanical families (respectively Meliaceae, Fabaceae, Clusiaceae and Myristicaceae). We sequenced cDNA libraries from three organs (leaves, stems, and roots) using 454 pyrosequencing. We have developed an R and bioperl-based bioinformatic procedure for de novo assembly, gene functional annotation and marker discovery. Mismatch identification takes into account single-base quality values as well as the likelihood of false variants as a function of contig depth and number of sequenced chromosomes. Between 17103 (for Symphonia globulifera) and 23390 (for Eperua falcata) contigs were assembled. Organs varied in the numbers of unigenes they apparently express, with higher number in roots. Patterns of gene expression were similar across species, with metabolism of aromatic compounds standing out as an overrepresented gene function. Transcripts corresponding to several gene functions were found to be over- or underrepresented in each organ. We identified between 4434 (for Symphonia globulifera) and 9076 (for Virola surinamensis) well-supported mismatches. The resulting overall mismatch density was comprised between 0.89 (S. globulifera) and 1.05 (V. surinamensis) mismatches/100 bp in variation-containing contigs. The relative representation of gene functions in the four transcriptomes suggests that secondary metabolism may be particularly important in tropical trees. The differential representation of transcripts among tissues suggests differential gene expression, which opens the way to functional studies in these non-model, ecologically important species. We found substantial amounts of mismatches in the four species. These newly identified putative variants are a first step towards acquiring much needed genomic resources for tropical tree species.

  12. Microbial metatranscriptomics in a permanent marine oxygen minimum zone.

    PubMed

    Stewart, Frank J; Ulloa, Osvaldo; DeLong, Edward F

    2012-01-01

    Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycline and into the upper OMZ. Shotgun pyrosequencing of cDNA yielded 180,000 to 550,000 transcript sequences per depth. Based on functional gene representation, transcriptome samples clustered apart from corresponding metagenome samples from the same depth, highlighting the discrepancies between metabolic potential and actual transcription. BLAST-based characterizations of non-ribosomal RNA sequences revealed a dominance of genes involved with both oxidative (nitrification) and reductive (anammox, denitrification) components of the marine nitrogen cycle. Using annotations of protein-coding genes as proxies for taxonomic affiliation, we observed depth-specific changes in gene expression by key functional taxonomic groups. Notably, transcripts most closely matching the genome of the ammonia-oxidizing archaeon Nitrosopumilus maritimus dominated the transcriptome in the upper three depths, representing one in five protein-coding transcripts at 85 m. In contrast, transcripts matching the anammox bacterium Kuenenia stuttgartiensis dominated at the core of the OMZ (200 m; 1 in 12 protein-coding transcripts). The distribution of N. maritimus-like transcripts paralleled that of transcripts matching ammonia monooxygenase genes, which, despite being represented by both bacterial and archaeal sequences in the community DNA, were dominated (> 99%) by archaeal sequences in the RNA, suggesting a substantial role for archaeal nitrification in the upper OMZ. These data, as well as those describing other key OMZ metabolic processes (e.g. sulfur oxidation), highlight gene-specific expression patterns in the context of the entire community transcriptome, as well as identify key functional groups for taxon-specific genomic profiling. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.

  13. Pyrosequencing and de novo assembly of Antarctic krill (Euphausia superba) transcriptome to study the adaptability of krill to climate-induced environmental changes

    PubMed Central

    Meyer, B; Martini, P; Biscontin, A; De Pittà, C; Romualdi, C; Teschke, M; Frickenhaus, S; Harms, L; Freier, U; Jarman, S; Kawaguchi, S

    2015-01-01

    The Antarctic krill, Euphausia superba, has a key position in the Southern Ocean food web by serving as direct link between primary producers and apex predators. The south-west Atlantic sector of the Southern Ocean, where the majority of the krill population is located, is experiencing one of the most profound environmental changes worldwide. Up to now, we have only cursory information about krill’s genomic plasticity to cope with the ongoing environmental changes induced by anthropogenic CO2 emission. The genome of krill is not yet available due to its large size (about 48 Gbp). Here, we present two cDNA normalized libraries from whole krill and krill heads sampled in different seasons that were combined with two data sets of krill transcriptome projects, already published, to produce the first knowledgebase krill ‘master’ transcriptome. The new library produced 25% more E. superba transcripts and now includes nearly all the enzymes involved in the primary oxidative metabolism (Glycolysis, Krebs cycle and oxidative phosphorylation) as well as all genes involved in glycogenesis, glycogen breakdown, gluconeogenesis, fatty acid synthesis and fatty acids β-oxidation. With these features, the ‘master’ transcriptome provides the most complete picture of metabolic pathways in Antarctic krill and will provide a major resource for future physiological and molecular studies. This will be particularly valuable for characterizing the molecular networks that respond to stressors caused by the anthropogenic CO2 emissions and krill’s capacity to cope with the ongoing environmental changes in the Atlantic sector of the Southern Ocean. PMID:25818178

  14. Transcriptome Analysis of Sarracenia, an Insectivorous Plant

    PubMed Central

    Srivastava, Anuj; Rogers, Willie L.; Breton, Catherine M.; Cai, Liming; Malmberg, Russell L.

    2011-01-01

    Sarracenia species (pitcher plants) are carnivorous plants which obtain a portion of their nutrients from insects captured in the pitchers. To investigate these plants, we sequenced the transcriptome of two species, Sarracenia psittacina and Sarracenia purpurea, using Roche 454 pyrosequencing technology. We obtained 46 275 and 36 681 contigs by de novo assembly methods for S. psittacina and S. purpurea, respectively, and further identified 16 163 orthologous contigs between them. Estimation of synonymous substitution rates between orthologous and paralogous contigs indicates the events of genome duplication and speciation within the Sarracenia genus both occurred ∼2 million years ago. The ratios of synonymous and non-synonymous substitution rates indicated that 491 contigs have been under positive selection (Ka/Ks > 1). Significant proportions of these contigs were involved in functions related to binding activity. We also found that the greatest sequence similarity for both of these species was to Vitis vinifera, which is most consistent with a non-current classification of the order Ericales as an asterid. This study has provided new insights into pitcher plants and will contribute greatly to future research on this genus and its distinctive ecological adaptations. PMID:21676972

  15. Transcriptome analysis of sarracenia, an insectivorous plant.

    PubMed

    Srivastava, Anuj; Rogers, Willie L; Breton, Catherine M; Cai, Liming; Malmberg, Russell L

    2011-08-01

    Sarracenia species (pitcher plants) are carnivorous plants which obtain a portion of their nutrients from insects captured in the pitchers. To investigate these plants, we sequenced the transcriptome of two species, Sarracenia psittacina and Sarracenia purpurea, using Roche 454 pyrosequencing technology. We obtained 46 275 and 36 681 contigs by de novo assembly methods for S. psittacina and S. purpurea, respectively, and further identified 16 163 orthologous contigs between them. Estimation of synonymous substitution rates between orthologous and paralogous contigs indicates the events of genome duplication and speciation within the Sarracenia genus both occurred ∼2 million years ago. The ratios of synonymous and non-synonymous substitution rates indicated that 491 contigs have been under positive selection (K(a)/K(s) > 1). Significant proportions of these contigs were involved in functions related to binding activity. We also found that the greatest sequence similarity for both of these species was to Vitis vinifera, which is most consistent with a non-current classification of the order Ericales as an asterid. This study has provided new insights into pitcher plants and will contribute greatly to future research on this genus and its distinctive ecological adaptations.

  16. Introduction of the hybcell-based compact sequencing technology and comparison to state-of-the-art methodologies for KRAS mutation detection.

    PubMed

    Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus

    2015-03-01

    The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.

  17. Transcriptomic analysis reveals numerous diverse protein kinases and transcription factors involved in desiccation tolerance in the resurrection plant Myrothamnus flabellifolia

    PubMed Central

    Ma, Chao; Wang, Hong; Macnish, Andrew J; Estrada-Melo, Alejandro C; Lin, Jing; Chang, Youhong; Reid, Michael S; Jiang, Cai-Zhong

    2015-01-01

    The woody resurrection plant Myrothamnus flabellifolia has remarkable tolerance to desiccation. Pyro-sequencing technology permitted us to analyze the transcriptome of M. flabellifolia during both dehydration and rehydration. We identified a total of 8287 and 8542 differentially transcribed genes during dehydration and rehydration treatments respectively. Approximately 295 transcription factors (TFs) and 484 protein kinases (PKs) were up- or down-regulated in response to desiccation stress. Among these, the transcript levels of 53 TFs and 91 PKs increased rapidly and peaked early during dehydration. These regulators transduce signal cascades of molecular pathways, including the up-regulation of ABA-dependent and independent drought stress pathways and the activation of protective mechanisms for coping with oxidative damage. Antioxidant systems are up-regulated, and the photosynthetic system is modified to reduce ROS generation. Secondary metabolism may participate in the desiccation tolerance of M. flabellifolia as indicated by increases in transcript abundance of genes involved in isopentenyl diphosphate biosynthesis. Up-regulation of genes encoding late embryogenesis abundant proteins and sucrose phosphate synthase is also associated with increased tolerance to desiccation. During rehydration, the transcriptome is also enriched in transcripts of genes encoding TFs and PKs, as well as genes involved in photosynthesis, and protein synthesis. The data reported here contribute comprehensive insights into the molecular mechanisms of desiccation tolerance in M. flabellifolia. PMID:26504577

  18. Multiplex pyrosequencing of InDel markers for forensic DNA analysis.

    PubMed

    Bus, Magdalena M; Karas, Ognjen; Allen, Marie

    2016-12-01

    The capillary electrophoresis (CE) technology is commonly used for fragment length separation of markers in forensic DNA analysis. In this study, pyrosequencing technology was used as an alternative and rapid tool for the analysis of biallelic InDel (insertion/deletion) markers for individual identification. The DNA typing is based on a subset of the InDel markers that are included in the Investigator ® DIPplex Kit, which are sequenced in a multiplex pyrosequencing analysis. To facilitate the analysis of degraded DNA, the polymerase chain reaction (PCR) fragments were kept short in the primer design. Samples from individuals of Swedish origin were genotyped using the pyrosequencing strategy and analysis of the Investigator ® DIPplex markers with CE. A comparison between the pyrosequencing and CE data revealed concordant results demonstrating a robust and correct genotyping by pyrosequencing. Using optimal marker combination and a directed dispensation strategy, five markers could be multiplexed and analyzed simultaneously. In this proof-of-principle study, we demonstrate that multiplex InDel pyrosequencing analysis is possible. However, further studies on degraded samples, lower DNA quantities, and mixtures will be required to fully optimize InDel analysis by pyrosequencing for forensic applications. Overall, although CE analysis is implemented in most forensic laboratories, multiplex InDel pyrosequencing offers a cost-effective alternative for some applications. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Transcriptome analysis of Bupleurum chinense focusing on genes involved in the biosynthesis of saikosaponins

    PubMed Central

    2011-01-01

    Abstract Background Bupleurum chinense DC. is a widely used traditional Chinese medicinal plant. Saikosaponins are the major bioactive constituents of B. chinense, but relatively little is known about saikosaponin biosynthesis. The 454 pyrosequencing technology provides a promising opportunity for finding novel genes that participate in plant metabolism. Consequently, this technology may help to identify the candidate genes involved in the saikosaponin biosynthetic pathway. Results One-quarter of the 454 pyrosequencing runs produced a total of 195, 088 high-quality reads, with an average read length of 356 bases (NCBI SRA accession SRA039388). A de novo assembly generated 24, 037 unique sequences (22, 748 contigs and 1, 289 singletons), 12, 649 (52.6%) of which were annotated against three public protein databases using a basic local alignment search tool (E-value ≤1e-10). All unique sequences were compared with NCBI expressed sequence tags (ESTs) (237) and encoding sequences (44) from the Bupleurum genus, and with a Sanger-sequenced EST dataset (3, 111). The 23, 173 (96.4%) unique sequences obtained in the present study represent novel Bupleurum genes. The ESTs of genes related to saikosaponin biosynthesis were found to encode known enzymes that catalyze the formation of the saikosaponin backbone; 246 cytochrome P450 (P450s) and 102 glycosyltransferases (GTs) unique sequences were also found in the 454 dataset. Full length cDNAs of 7 P450s and 7 uridine diphosphate GTs (UGTs) were verified by reverse transcriptase polymerase chain reaction or by cloning using 5' and/or 3' rapid amplification of cDNA ends. Two P450s and three UGTs were identified as the most likely candidates involved in saikosaponin biosynthesis. This finding was based on the coordinate up-regulation of their expression with β-AS in methyl jasmonate-treated adventitious roots and on their similar expression patterns with β-AS in various B. chinense tissues. Conclusions A collection of high-quality ESTs for B. chinense obtained by 454 pyrosequencing is provided here for the first time. These data should aid further research on the functional genomics of B. chinense and other Bupleurum species. The candidate genes for enzymes involved in saikosaponin biosynthesis, especially the P450s and UGTs, that were revealed provide a substantial foundation for follow-up research on the metabolism and regulation of the saikosaponins. PMID:22047182

  20. Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

    PubMed

    Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

    2016-12-01

    The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in response to B. ostreae through massively sequencing and has aided to improve our knowledge of the immune mechanisms of flat oyster. The validated oligo-microarray and the establishment of a reference transcriptome will be useful for large-scale gene expression studies in this species. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Pyrosequencing for Microbial Identification and Characterization

    PubMed Central

    Cummings, Patrick J.; Ahmed, Ray; Durocher, Jeffrey A.; Jessen, Adam; Vardi, Tamar; Obom, Kristina M.

    2013-01-01

    Pyrosequencing is a versatile technique that facilitates microbial genome sequencing that can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing of DNA by synthesizing the complementary strand a single base at a time, while determining the specific nucleotide being incorporated during the synthesis reaction. The reaction occurs on immobilized single stranded template DNA where the four deoxyribonucleotides (dNTP) are added sequentially and the unincorporated dNTPs are enzymatically degraded before addition of the next dNTP to the synthesis reaction. Detection of the specific base incorporated into the template is monitored by generation of chemiluminescent signals. The order of dNTPs that produce the chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument, can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns. PMID:23995536

  2. Pyrosequencing for microbial identification and characterization.

    PubMed

    Cummings, Patrick J; Ahmed, Ray; Durocher, Jeffrey A; Jessen, Adam; Vardi, Tamar; Obom, Kristina M

    2013-08-22

    Pyrosequencing is a versatile technique that facilitates microbial genome sequencing that can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing of DNA by synthesizing the complementary strand a single base at a time, while determining the specific nucleotide being incorporated during the synthesis reaction. The reaction occurs on immobilized single stranded template DNA where the four deoxyribonucleotides (dNTP) are added sequentially and the unincorporated dNTPs are enzymatically degraded before addition of the next dNTP to the synthesis reaction. Detection of the specific base incorporated into the template is monitored by generation of chemiluminescent signals. The order of dNTPs that produce the chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument, can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns.

  3. Transcriptome Characterization of Cymbidium sinense 'Dharma' Using 454 Pyrosequencing and Its Application in the Identification of Genes Associated with Leaf Color Variation.

    PubMed

    Zhu, Genfa; Yang, Fengxi; Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun

    2015-01-01

    The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species.

  4. Transcriptome Characterization of Cymbidium sinense 'Dharma' Using 454 Pyrosequencing and Its Application in the Identification of Genes Associated with Leaf Color Variation

    PubMed Central

    Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun

    2015-01-01

    The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species. PMID:26042676

  5. Pyrosequencing Analysis of Bench-Scale Nitrifying BiofiltersRemoving Trihalomethanes

    EPA Science Inventory

    The bacterial biofilm communities in four nitrifying biofilters degrading regulated drinking water trihalomethanes were characterized by 454 pyrosequencing. The three most abundant phylotypes based on total diversity were Nitrosomonas (70%), Nitrobacter (14%), and Chitinophagace...

  6. Transcriptome Analysis in Sheepgrass (Leymus chinensis): A Dominant Perennial Grass of the Eurasian Steppe

    PubMed Central

    Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe

    2013-01-01

    Background Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. Results The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. Conclusions This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species. PMID:23861841

  7. Transcriptome analysis in sheepgrass (Leymus chinensis): a dominant perennial grass of the Eurasian Steppe.

    PubMed

    Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe

    2013-01-01

    Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.

  8. Pyrosequencing and de novo assembly of Antarctic krill (Euphausia superba) transcriptome to study the adaptability of krill to climate-induced environmental changes.

    PubMed

    Meyer, B; Martini, P; Biscontin, A; De Pittà, C; Romualdi, C; Teschke, M; Frickenhaus, S; Harms, L; Freier, U; Jarman, S; Kawaguchi, S

    2015-11-01

    The Antarctic krill, Euphausia superba, has a key position in the Southern Ocean food web by serving as direct link between primary producers and apex predators. The south-west Atlantic sector of the Southern Ocean, where the majority of the krill population is located, is experiencing one of the most profound environmental changes worldwide. Up to now, we have only cursory information about krill's genomic plasticity to cope with the ongoing environmental changes induced by anthropogenic CO2 emission. The genome of krill is not yet available due to its large size (about 48 Gbp). Here, we present two cDNA normalized libraries from whole krill and krill heads sampled in different seasons that were combined with two data sets of krill transcriptome projects, already published, to produce the first knowledgebase krill 'master' transcriptome. The new library produced 25% more E. superba transcripts and now includes nearly all the enzymes involved in the primary oxidative metabolism (Glycolysis, Krebs cycle and oxidative phosphorylation) as well as all genes involved in glycogenesis, glycogen breakdown, gluconeogenesis, fatty acid synthesis and fatty acids β-oxidation. With these features, the 'master' transcriptome provides the most complete picture of metabolic pathways in Antarctic krill and will provide a major resource for future physiological and molecular studies. This will be particularly valuable for characterizing the molecular networks that respond to stressors caused by the anthropogenic CO2 emissions and krill's capacity to cope with the ongoing environmental changes in the Atlantic sector of the Southern Ocean. © 2015 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.

  9. Transcriptome analysis reveals the time of the fourth round of genome duplication in common carp (Cyprinus carpio)

    PubMed Central

    2012-01-01

    Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280

  10. SNP-based real-time pyrosequencing as a sensitive and specific tool for identification and differentiation of Rickettsia species in Ixodes ricinus ticks.

    PubMed

    Janecek, Elisabeth; Streichan, Sabine; Strube, Christina

    2012-10-18

    Rickettsioses are caused by pathogenic species of the genus Rickettsia and play an important role as emerging diseases. The bacteria are transmitted to mammal hosts including humans by arthropod vectors. Since detection, especially in tick vectors, is usually based on PCR with genus-specific primers to include different occurring Rickettsia species, subsequent species identification is mainly achieved by Sanger sequencing. In the present study a real-time pyrosequencing approach was established with the objective to differentiate between species occurring in German Ixodes ticks, which are R. helvetica, R. monacensis, R. massiliae, and R. felis. Tick material from a quantitative real-time PCR (qPCR) based study on Rickettsia-infections in I. ricinus allowed direct comparison of both sequencing techniques, Sanger and real-time pyrosequencing. A sequence stretch of rickettsial citrate synthase (gltA) gene was identified to contain divergent single nucleotide polymorphism (SNP) sites suitable for Rickettsia species differentiation. Positive control plasmids inserting the respective target sequence of each Rickettsia species of interest were constructed for initial establishment of the real-time pyrosequencing approach using Qiagen's PSQ 96MA Pyrosequencing System operating in a 96-well format. The approach included an initial amplification reaction followed by the actual pyrosequencing, which is traceable by pyrograms in real-time. Afterwards, real-time pyrosequencing was applied to 263 Ixodes tick samples already detected Rickettsia-positive in previous qPCR experiments. Establishment of real-time pyrosequencing using positive control plasmids resulted in accurate detection of all SNPs in all included Rickettsia species. The method was then applied to 263 Rickettsia-positive Ixodes ricinus samples, of which 153 (58.2%) could be identified for their species (151 R. helvetica and 2 R. monacensis) by previous custom Sanger sequencing. Real-time pyrosequencing identified all Sanger-determined ticks as well as 35 previously undifferentiated ticks resulting in a total number of 188 (71.5%) identified samples. Pyrosequencing sensitivity was found to be strongly dependent on gltA copy numbers in the reaction setup. Whereas less than 101 copies in the initial amplification reaction resulted in identification of 15.1% of the samples only, the percentage increased to 54.2% at 101-102 copies, to 95.6% at >102-103 copies and reached 100% samples identified for their Rickettsia species if more than 103 copies were present in the template. The established real-time pyrosequencing approach represents a reliable method for detection and differentiation of Rickettsia spp. present in I. ricinus diagnostic material and prevalence studies. Furthermore, the method proved to be faster, more cost-effective as well as more sensitive than custom Sanger sequencing with simultaneous high specificity.

  11. Evaluation of culture-based techniques and 454 pyrosequencing for the analysis of fungal diversity in potting media and organic fertilizers.

    PubMed

    Al-Sadi, A M; Al-Mazroui, S S; Phillips, A J L

    2015-08-01

    Potting media and organic fertilizers (OFs) are commonly used in agricultural systems. However, there is a lack of studies on the efficiency of culture-based techniques in assessing the level of fungal diversity in these products. A study was conducted to investigate the efficiency of seven culture-based techniques and pyrosequencing for characterizing fungal diversity in potting media and OFs. Fungal diversity was evaluated using serial dilution, direct plating and baiting with carrot slices, potato slices, radish seeds, cucumber seeds and cucumber cotyledons. Identity of all the isolates was confirmed on the basis of the internal transcribed spacer region of the ribosomal RNA (ITS rRNA) sequence data. The direct plating technique was found to be superior over other culture-based techniques in the number of fungal species detected. It was also found to be simple and the least time consuming technique. Comparing the efficiency of direct plating with 454 pyrosequencing revealed that pyrosequencing detected 12 and 15 times more fungal species from potting media and OFs respectively. Analysis revealed that there were differences between potting media and OFs in the dominant phyla, classes, orders, families, genera and species detected. Zygomycota (52%) and Chytridiomycota (60%) were the predominant phyla in potting media and OFs respectively. The superiority of pyrosequencing over cultural methods could be related to the ability to detect obligate fungi, slow growing fungi and fungi that exist at low population densities. The evaluated methods in this study, especially direct plating and pyrosequencing, may be used as tools to help detect and reduce movement of unwanted fungi between countries and regions. © 2015 The Society for Applied Microbiology.

  12. Evidence for trade-offs in detoxification and chemosensation gene signatures in Plutella xylostella.

    PubMed

    Bautista, Ma Anita M; Bhandary, Binny; Wijeratne, Asela J; Michel, Andrew P; Hoy, Casey W; Mittapalli, Omprakash

    2015-03-01

    Detoxification genes have been associated with insecticide adaptation in the diamondback moth, Plutella xylostella. The link between chemosensation genes and adaptation, however, remains unexplored. To gain a better understanding of the involvement of these genes in insecticide adaptation, the authors exposed lines of P. xylostella to either high uniform (HU) or low heterogeneous (LH) concentrations of permethrin, expecting primarily physiological or behavioral selection respectively. Initially, 454 pyrosequencing was applied, followed by an examination of expression profiles of candidate genes that responded to selection [cytochrome P450 (CYP), glutathione S-transferase (GST), carboxylesterase (CarE), chemosensory protein (CSP) and odorant-binding protein (OBP)] by quantitative PCR in the larvae. Toxicity and behavioral assays were also conducted to document the effects of the two forms of exposure. Pyrosequencing of the P. xylostella transcriptome from adult heads and third instars produced 198,753 reads with 52,752,486 bases. Quantitative PCR revealed overexpression of CYP4M14, CYP305B1 and CSP8 in HU larvae. OBP13, however, was highest in LH. Larvae from LH and HU lines had up to five- and 752-fold resistance levels respectively, which could be due to overexpression of P450s. However, the behavioral responses of all lines to a series of permethrin concentrations did not vary significantly in any of the generations examined, in spite of the observed upregulation of CSP8 and OBP13. Expression patterns from the target genes provide insights into behavioral and physiological responses to permethrin and suggest a new avenue of research on the role of chemosensation genes in insect adaptation to toxins. © 2014 Society of Chemical Industry.

  13. Deep sequencing analysis of the transcriptomes of peanut aerial and subterranean young pods identifies candidate genes related to early embryo abortion.

    PubMed

    Chen, Xiaoping; Zhu, Wei; Azam, Sarwar; Li, Heying; Zhu, Fanghe; Li, Haifen; Hong, Yanbin; Liu, Haiyan; Zhang, Erhua; Wu, Hong; Yu, Shanlin; Zhou, Guiyuan; Li, Shaoxiong; Zhong, Ni; Wen, Shijie; Li, Xingyu; Knapp, Steve J; Ozias-Akins, Peggy; Varshney, Rajeev K; Liang, Xuanqiang

    2013-01-01

    The failure of peg penetration into the soil leads to seed abortion in peanut. Knowledge of genes involved in these processes is comparatively deficient. Here, we used RNA-seq to gain insights into transcriptomes of aerial and subterranean pods. More than 2 million transcript reads with an average length of 396 bp were generated from one aerial (AP) and two subterranean (SP1 and SP2) pod libraries using pyrosequencing technology. After assembly, sets of 49 632, 49 952 and 50 494 from a total of 74 974 transcript assembly contigs (TACs) were identified in AP, SP1 and SP2, respectively. A clear linear relationship in the gene expression level was observed between these data sets. In brief, 2194 differentially expressed TACs with a 99.0% true-positive rate were identified, among which 859 and 1068 TACs were up-regulated in aerial and subterranean pods, respectively. Functional analysis showed that putative function based on similarity with proteins catalogued in UniProt and gene ontology term classification could be determined for 59 342 (79.2%) and 42 955 (57.3%) TACs, respectively. A total of 2968 TACs were mapped to 174 KEGG pathways, of which 168 were shared by aerial and subterranean transcriptomes. TACs involved in photosynthesis were significantly up-regulated and enriched in the aerial pod. In addition, two senescence-associated genes were identified as significantly up-regulated in the aerial pod, which potentially contribute to embryo abortion in aerial pods, and in turn, to cessation of swelling. The data set generated in this study provides evidence for some functional genes as robust candidates underlying aerial and subterranean pod development and contributes to an elucidation of the evolutionary implications resulting from fruit development under light and dark conditions. © 2012 The Authors Plant Biotechnology Journal © 2012 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.

  14. Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

    PubMed

    Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

    2011-09-01

    The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.

  15. Transcriptomes analysis of Aeromonas molluscorum Av27 cells exposed to tributyltin (TBT): Unravelling the effects from the molecular level to the organism

    PubMed Central

    Cruz, Andreia; Rodrigues, Raquel; Pinheiro, Miguel; Mendo, Sónia

    2015-01-01

    Aeromonas molluscorum Av27 cells were exposed to 0, 5 and 50 μM of TBT and the respective transcriptomes were obtained by pyrosequencing. Gene Ontology revealed that exposure to 5 μM TBT results in a higher number of repressed genes in contrast with 50 μM of TBT, where the number of over-expressed genes is greater. At both TBT concentrations, higher variations in gene expression were found in the functional categories associated with enzymatic activities, transport/binding and oxidation-reduction. A number of proteins are affected by TBT, such as the acriflavin resistance protein, several transcription-related proteins, several Hsps, ABC transporters, CorA and ZntB and other outer membrane efflux proteins, all of these involved in cellular metabolic processes, important to maintain overall cell viability. Using the STRING tool, several proteins with unknown function were related with others involved in degradation processes, such as the pyoverdine chromophore biosynthetic protein, that has been described as playing a role in the Sn–C cleavage of organotins. This approach has allowed a better understanding of the molecular effects of exposure of bacterial cells to TBT. Furthermore it contributes to the knowledge of the functional genomic aspects of bacteria exposed to this pollutant. Furthermore, the transcriptomic data gathered, and now publically available, constitute a valuable resource for comparative genome analysis. PMID:26171931

  16. Transcriptomes analysis of Aeromonas molluscorum Av27 cells exposed to tributyltin (TBT): Unravelling the effects from the molecular level to the organism.

    PubMed

    Cruz, Andreia; Rodrigues, Raquel; Pinheiro, Miguel; Mendo, Sónia

    2015-08-01

    Aeromonas molluscorum Av27 cells were exposed to 0, 5 and 50 μM of TBT and the respective transcriptomes were obtained by pyrosequencing. Gene Ontology revealed that exposure to 5 μM TBT results in a higher number of repressed genes in contrast with 50 μM of TBT, where the number of over-expressed genes is greater. At both TBT concentrations, higher variations in gene expression were found in the functional categories associated with enzymatic activities, transport/binding and oxidation-reduction. A number of proteins are affected by TBT, such as the acriflavin resistance protein, several transcription-related proteins, several Hsps, ABC transporters, CorA and ZntB and other outer membrane efflux proteins, all of these involved in cellular metabolic processes, important to maintain overall cell viability. Using the STRING tool, several proteins with unknown function were related with others involved in degradation processes, such as the pyoverdine chromophore biosynthetic protein, that has been described as playing a role in the Sn-C cleavage of organotins. This approach has allowed a better understanding of the molecular effects of exposure of bacterial cells to TBT. Furthermore it contributes to the knowledge of the functional genomic aspects of bacteria exposed to this pollutant. Furthermore, the transcriptomic data gathered, and now publically available, constitute a valuable resource for comparative genome analysis. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  17. Development and validation of a mixed-tissue oligonucleotide DNA microarray for Atlantic bluefin tuna, Thunnus thynnus (Linnaeus, 1758).

    PubMed

    Trumbić, Željka; Bekaert, Michaël; Taggart, John B; Bron, James E; Gharbi, Karim; Mladineo, Ivona

    2015-11-25

    The largest of the tuna species, Atlantic bluefin tuna (Thunnus thynnus), inhabits the North Atlantic Ocean and the Mediterranean Sea and is considered to be an endangered species, largely a consequence of overfishing. T. thynnus aquaculture, referred to as fattening or farming, is a capture based activity dependent on yearly renewal from the wild. Thus, the development of aquaculture practices independent of wild resources can provide an important contribution towards ensuring security and sustainability of this species in the longer-term. The development of such practices is today greatly assisted by large scale transcriptomic studies. We have used pyrosequencing technology to sequence a mixed-tissue normalised cDNA library, derived from adult T. thynnus. A total of 976,904 raw sequence reads were assembled into 33,105 unique transcripts having a mean length of 893 bases and an N50 of 870. Of these, 33.4% showed similarity to known proteins or gene transcripts and 86.6% of them were matched to the congeneric Pacific bluefin tuna (Thunnus orientalis) genome, compared to 70.3% for the more distantly related Nile tilapia (Oreochromis niloticus) genome. Transcript sequences were used to develop a novel 15 K Agilent oligonucleotide DNA microarray for T. thynnus and comparative tissue gene expression profiles were inferred for gill, heart, liver, ovaries and testes. Functional contrasts were strongest between gills and ovaries. Gills were particularly associated with immune system, signal transduction and cell communication, while ovaries displayed signatures of glycan biosynthesis, nucleotide metabolism, transcription, translation, replication and repair. Sequence data generated from a novel mixed-tissue T. thynnus cDNA library provide an important transcriptomic resource that can be further employed for study of various aspects of T. thynnus ecology and genomics, with strong applications in aquaculture. Tissue-specific gene expression profiles inferred through the use of novel oligo-microarray can serve in the design of new and more focused transcriptomic studies for future research of tuna physiology and assessment of the welfare in a production environment.

  18. Transcriptome-wide investigation of genomic imprinting in chicken.

    PubMed

    Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique

    2014-04-01

    Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken.

  19. De Novo Assembly and Functional Annotation of the Olive (Olea europaea) Transcriptome

    PubMed Central

    Muñoz-Mérida, Antonio; González-Plaza, Juan José; Cañada, Andrés; Blanco, Ana María; García-López, Maria del Carmen; Rodríguez, José Manuel; Pedrola, Laia; Sicardo, M. Dolores; Hernández, M. Luisa; De la Rosa, Raúl; Belaj, Angjelina; Gil-Borja, Mayte; Luque, Francisco; Martínez-Rivas, José Manuel; Pisano, David G.; Trelles, Oswaldo; Valpuesta, Victoriano; Beuzón, Carmen R.

    2013-01-01

    Olive breeding programmes are focused on selecting for traits as short juvenile period, plant architecture suited for mechanical harvest, or oil characteristics, including fatty acid composition, phenolic, and volatile compounds to suit new markets. Understanding the molecular basis of these characteristics and improving the efficiency of such breeding programmes require the development of genomic information and tools. However, despite its economic relevance, genomic information on olive or closely related species is still scarce. We have applied Sanger and 454 pyrosequencing technologies to generate close to 2 million reads from 12 cDNA libraries obtained from the Picual, Arbequina, and Lechin de Sevilla cultivars and seedlings from a segregating progeny of a Picual × Arbequina cross. The libraries include fruit mesocarp and seeds at three relevant developmental stages, young stems and leaves, active juvenile and adult buds as well as dormant buds, and juvenile and adult roots. The reads were assembled by library or tissue and then assembled together into 81 020 unigenes with an average size of 496 bases. Here, we report their assembly and their functional annotation. PMID:23297299

  20. De novo assembly and characterization of transcriptomes of early-stage fruit from two genotypes of Annona squamosa L. with contrast in seed number.

    PubMed

    Gupta, Yogesh; Pathak, Ashish K; Singh, Kashmir; Mantri, Shrikant S; Singh, Sudhir P; Tuli, Rakesh

    2015-02-14

    Annona squamosa L., a popular fruit tree, is the most widely cultivated species of the genus Annona. The lack of transcriptomic and genomic information limits the scope of genome investigations in this important shrub. It bears aggregate fruits with numerous seeds. A few rare accessions with very few seeds have been reported for Annona. A massive pyrosequencing (Roche, 454 GS FLX+) of transcriptome from early stages of fruit development (0, 4, 8 and 12 days after pollination) was performed to produce expression datasets in two genotypes, Sitaphal and NMK-1, that show a contrast in the number of seeds set in fruits. The data reported here is the first source of genome-wide differential transcriptome sequence in two genotypes of A. squamosa, and identifies several candidate genes related to seed development. Approximately 1.9 million high-quality clean reads were obtained in the cDNA library from the developing fruits of both the genotypes, with an average length of about 568 bp. Quality-reads were assembled de novo into 2074 to 11004 contigs in the developing fruit samples at different stages of development. The contig sequence data of all the four stages of each genotype were combined into larger units resulting into 14921 (Sitaphal) and 14178 (NMK-1) unigenes, with a mean size of more than 1 Kb. Assembled unigenes were functionally annotated by querying against the protein sequences of five different public databases (NCBI non redundant, Prunus persica, Vitis vinifera, Fragaria vesca, and Amborella trichopoda), with an E-value cut-off of 10(-5). A total of 4588 (Sitaphal) and 2502 (NMK-1) unigenes did not match any known protein in the NR database. These sequences could be genes specific to Annona sp. or belong to untranslated regions. Several of the unigenes representing pathways related to primary and secondary metabolism, and seed and fruit development expressed at a higher level in Sitaphal, the densely seeded cultivar in comparison to the poorly seeded NMK-1. A total of 2629 (Sitaphal) and 3445 (NMK-1) Simple Sequence Repeat (SSR) motifs were identified respectively in the two genotypes. These could be potential candidates for transcript based microsatellite analysis in A. squamosa. The present work provides early-stage fruit specific transcriptome sequence resource for A. squamosa. This repository will serve as a useful resource for investigating the molecular mechanisms of fruit development, and improvement of fruit related traits in A. squamosa and related species.

  1. Bioinformatic prediction of G protein-coupled receptor encoding sequences from the transcriptome of the foreleg, including the Haller’s organ, of the cattle tick, Rhipicephalus australis

    PubMed Central

    Munoz, Sergio; Guerrero, Felix D.; Kellogg, Anastasia; Heekin, Andrew M.

    2017-01-01

    The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller’s organ, located in the tick’s forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor. PMID:28231302

  2. Bioinformatic prediction of G protein-coupled receptor encoding sequences from the transcriptome of the foreleg, including the Haller's organ, of the cattle tick, Rhipicephalus australis.

    PubMed

    Munoz, Sergio; Guerrero, Felix D; Kellogg, Anastasia; Heekin, Andrew M; Leung, Ming-Ying

    2017-01-01

    The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller's organ, located in the tick's forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor.

  3. Transcriptomic survey of the midgut of Anthonomus grandis (Coleoptera: Curculionidae).

    PubMed

    Salvador, Ricardo; Príncipi, Darío; Berretta, Marcelo; Fernández, Paula; Paniego, Norma; Sciocco-Cap, Alicia; Hopp, Esteban

    2014-01-01

    Anthonomus grandis Boheman is a key pest in cotton crops in the New World. Its larval stage develops within the flower bud using it as food and as protection against its predators. This behavior limits the effectiveness of its control using conventional insecticide applications and biocontrol techniques. In spite of its importance, little is known about its genome sequence and, more important, its specific expression in key organs like the midgut. Total mRNA isolated from larval midguts was used for pyrosequencing. Sequence reads were assembled and annotated to generate a unigene data set. In total, 400,000 reads from A. grandis midgut with an average length of 237 bp were assembled and combined into 20,915 contigs. The assembled reads fell into 6,621 genes models. BlastX search using the NCBI-NR database showed that 3,006 unigenes had significant matches to known sequences. Gene Ontology (GO) mapping analysis evidenced that A. grandis is able to transcripts coding for proteins involved in catalytic processing of macromolecules that allows its adaptation to very different feeding source scenarios. Furthermore, transcripts encoding for proteins involved in detoxification mechanisms such as p450 genes, glutathione-S-transferase, and carboxylesterases are also expressed. This is the first report of a transcriptomic study in A. grandis and the largest set of sequence data reported for this species. These data are valuable resources to expand the knowledge of this insect group and could be used in the design of new control strategies based in molecular information. © The Author 2014. Published by Oxford University Press on behalf of the Entomological Society of America.

  4. IgM Repertoire Biodiversity is Reduced in HIV-1 Infection and Systemic Lupus Erythematosus.

    PubMed

    Yin, Li; Hou, Wei; Liu, Li; Cai, Yunpeng; Wallet, Mark Andrew; Gardner, Brent Paul; Chang, Kaifen; Lowe, Amanda Catherine; Rodriguez, Carina Adriana; Sriaroon, Panida; Farmerie, William George; Sleasman, John William; Goodenow, Maureen Michels

    2013-01-01

    HIV-1 infection or systemic lupus erythematosus (SLE) disrupt B cell homeostasis, reduce memory B cells, and impair function of IgG and IgM antibodies. To determine how disturbances in B cell populations producing polyclonal antibodies relate to the IgM repertoire, the IgM transcriptome in health and disease was explored at the complementarity determining region 3 (CDRH3) sequence level. 454-deep pyrosequencing in combination with a novel analysis pipeline was applied to define populations of IGHM CDRH3 sequences based on absence or presence of somatic hypermutations (SHM) in peripheral blood B cells. HIV or SLE subjects have reduced biodiversity within their IGHM transcriptome compared to healthy subjects, mainly due to a significant decrease in the number of unique combinations of alleles, although recombination machinery was intact. While major differences between sequences without or with SHM occurred among all groups, IGHD and IGHJ allele use, CDRH3 length distribution, or generation of SHM were similar among study cohorts. Antiretroviral therapy failed to normalize IGHM biodiversity in HIV-infected individuals. All subjects had a low frequency of allelic combinations within the IGHM repertoire similar to known broadly neutralizing HIV-1 antibodies. Polyclonal expansion would decrease overall IgM biodiversity independent of other mechanisms for development of the B cell repertoire. Applying deep sequencing as a strategy to follow development of the IgM repertoire in health and disease provides a novel molecular assessment of multiple points along the B cell differentiation pathway that is highly sensitive for detecting perturbations within the repertoire at the population level.

  5. Transcriptomic Survey of the Midgut of Anthonomus grandis (Coleoptera: Curculionidae)

    PubMed Central

    Salvador, Ricardo; Príncipi, Darío; Berretta, Marcelo; Fernández, Paula; Paniego, Norma; Sciocco-Cap, Alicia; Hopp, Esteban

    2014-01-01

    Abstract Anthonomus grandis Boheman is a key pest in cotton crops in the New World. Its larval stage develops within the flower bud using it as food and as protection against its predators. This behavior limits the effectiveness of its control using conventional insecticide applications and biocontrol techniques. In spite of its importance, little is known about its genome sequence and, more important, its specific expression in key organs like the midgut. Total mRNA isolated from larval midguts was used for pyrosequencing. Sequence reads were assembled and annotated to generate a unigene data set. In total, 400,000 reads from A. grandis midgut with an average length of 237 bp were assembled and combined into 20,915 contigs. The assembled reads fell into 6,621 genes models. BlastX search using the NCBI-NR database showed that 3,006 unigenes had significant matches to known sequences. Gene Ontology (GO) mapping analysis evidenced that A. grandis is able to transcripts coding for proteins involved in catalytic processing of macromolecules that allows its adaptation to very different feeding source scenarios. Furthermore, transcripts encoding for proteins involved in detoxification mechanisms such as p450 genes, glutathione-S-transferase , and carboxylesterases are also expressed. This is the first report of a transcriptomic study in A. grandis and the largest set of sequence data reported for this species. These data are valuable resources to expand the knowledge of this insect group and could be used in the design of new control strategies based in molecular information. PMID:25473064

  6. Functional similarity and molecular divergence of a novel reproductive transcriptome in two male-pregnant Syngnathus pipefish species

    PubMed Central

    Small, Clayton M; Harlin-Cognato, April D; Jones, Adam G

    2013-01-01

    Evolutionary studies have revealed that reproductive proteins in animals and plants often evolve more rapidly than the genome-wide average. The causes of this pattern, which may include relaxed purifying selection, sexual selection, sexual conflict, pathogen resistance, reinforcement, or gene duplication, remain elusive. Investigative expansions to additional taxa and reproductive tissues have the potential to shed new light on this unresolved problem. Here, we embark on such an expansion, in a comparison of the brood-pouch transcriptome between two male-pregnant species of the pipefish genus Syngnathus. Male brooding tissues in syngnathid fishes represent a novel, nonurogenital reproductive trait, heretofore mostly uncharacterized from a molecular perspective. We leveraged next-generation sequencing (Roche 454 pyrosequencing) to compare transcript abundance in the male brooding tissues of pregnant with nonpregnant samples from Gulf (S. scovelli) and dusky (S. floridae) pipefish. A core set of protein-coding genes, including multiple members of astacin metalloprotease and c-type lectin gene families, is consistent between species in both the direction and magnitude of expression bias. As predicted, coding DNA sequence analysis of these putative “male pregnancy proteins” suggests rapid evolution relative to nondifferentially expressed genes and reflects signatures of adaptation similar in magnitude to those reported from Drosophila male accessory gland proteins. Although the precise drivers of male pregnancy protein divergence remain unknown, we argue that the male pregnancy transcriptome in syngnathid fishes, a clade diverse with respect to brooding morphology and mating system, represents a unique and promising object of study for understanding the perplexing evolutionary nature of reproductive molecules. PMID:24324861

  7. De Novo Assembly, Functional Annotation and Comparative Analysis of Withania somnifera Leaf and Root Transcriptomes to Identify Putative Genes Involved in the Withanolides Biosynthesis

    PubMed Central

    Gupta, Parul; Goel, Ridhi; Pathak, Sumya; Srivastava, Apeksha; Singh, Surya Pratap; Sangwan, Rajender Singh; Asif, Mehar Hasan; Trivedi, Prabodh Kumar

    2013-01-01

    Withania somnifera is one of the most valuable medicinal plants used in Ayurvedic and other indigenous medicine systems due to bioactive molecules known as withanolides. As genomic information regarding this plant is very limited, little information is available about biosynthesis of withanolides. To facilitate the basic understanding about the withanolide biosynthesis pathways, we performed transcriptome sequencing for Withania leaf (101L) and root (101R) which specifically synthesize withaferin A and withanolide A, respectively. Pyrosequencing yielded 8,34,068 and 7,21,755 reads which got assembled into 89,548 and 1,14,814 unique sequences from 101L and 101R, respectively. A total of 47,885 (101L) and 54,123 (101R) could be annotated using TAIR10, NR, tomato and potato databases. Gene Ontology and KEGG analyses provided a detailed view of all the enzymes involved in withanolide backbone synthesis. Our analysis identified members of cytochrome P450, glycosyltransferase and methyltransferase gene families with unique presence or differential expression in leaf and root and might be involved in synthesis of tissue-specific withanolides. We also detected simple sequence repeats (SSRs) in transcriptome data for use in future genetic studies. Comprehensive sequence resource developed for Withania, in this study, will help to elucidate biosynthetic pathway for tissue-specific synthesis of secondary plant products in non-model plant organisms as well as will be helpful in developing strategies for enhanced biosynthesis of withanolides through biotechnological approaches. PMID:23667511

  8. Position-specific automated processing of V3 env ultra-deep pyrosequencing data for predicting HIV-1 tropism

    PubMed Central

    Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre

    2015-01-01

    HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env allows to detect low levels of CXCR4-using variants that current genotypic approaches miss. However, the computation of the mass of sequence data and the need to identify true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing is rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used but the errors generated during ultra-deep pyrosequencing are sequence-dependant rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences followed by statistical filters to determine position-specific sensitivity thresholds, rather than arbitrary fixed cut-offs. It allows to retain authentic sequences with point mutations at V3 positions of interest and discard artifactual ones with accurate sensitivity thresholds. PMID:26585833

  9. Position-specific automated processing of V3 env ultra-deep pyrosequencing data for predicting HIV-1 tropism.

    PubMed

    Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre

    2015-11-20

    HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env allows to detect low levels of CXCR4-using variants that current genotypic approaches miss. However, the computation of the mass of sequence data and the need to identify true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing is rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used but the errors generated during ultra-deep pyrosequencing are sequence-dependant rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences followed by statistical filters to determine position-specific sensitivity thresholds, rather than arbitrary fixed cut-offs. It allows to retain authentic sequences with point mutations at V3 positions of interest and discard artifactual ones with accurate sensitivity thresholds.

  10. Transcriptome Analysis in Sheepgrass (Leymus chinensis). A Dominant Perennial Grass of the Eurasian Steppe

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Shuangyan; Huang, Xin; Yang, Xiaohan

    BACKGROUND: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS: The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resultedmore » in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS: This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.« less

  11. The first genome-level transcriptome of the wood-degrading fungus Phanerochaete chrysosporium grown on red oak.

    PubMed

    Sato, Shin; Feltus, F Alex; Iyer, Prashanti; Tien, Ming

    2009-06-01

    As part of an effort to determine all the gene products involved in wood degradation, we have performed massively parallel pyrosequencing on an expression library from the white rot fungus Phanerochaete chrysosporium grown in shallow stationary cultures with red oak as the carbon source. Approximately 48,000 high quality sequence tags (246 bp average length) were generated. 53% of the sequence tags aligned to 4,262 P. chrysosporium gene models, and an additional 18.5% of the tags reliably aligned to the P. chrysosporium genome providing evidence for 961 putative novel fragmented gene models. Due to their role in lignocellulose degradation, the secreted proteins were focused upon. Our results show that the four enzymes required for cellulose degradation: endocellulase, exocellulase CBHI, exocellulase CBHII, and beta-glucosidase are all produced. For hemicellulose degradation, not all known enzymes were produced, but endoxylanases, acetyl xylan esterases and mannosidases were detected. For lignin degradation, the role of peroxidases has been questioned; however, our results show that lignin peroxidase is highly expressed along with the H(2)O(2) generating enzyme, alcohol oxidase. The transcriptome snapshot reveals that H(2)O(2) generation and utilization are central in wood degradation. Our results also reveal new transcripts that encode extracellular proteins with no known function.

  12. The transcriptome recipe for the venom cocktail of Tityus bahiensis scorpion.

    PubMed

    de Oliveira, Ursula Castro; Candido, Denise Maria; Dorce, Valquíria Abrão Coronado; Junqueira-de-Azevedo, Inácio de Loiola Meirelles

    2015-03-01

    Scorpion venom is a mixture of peptides, including antimicrobial, bradykinin-potentiating and anionic peptides and small to medium proteins, such as ion channel toxins, metalloproteinases and phospholipases that together cause severe clinical manifestation. Tityus bahiensis is the second most medically important scorpion species in Brazil and it is widely distributed in the country with the exception of the North Region. Here we sequenced and analyzed the transcripts from the venom glands of T. bahiensis, aiming at identifying and annotating venom gland expressed genes. A total of 116,027 long reads were generated by pyrosequencing and assembled in 2891 isotigs. An annotation process identified transcripts by similarity to known toxins, revealing that putative venom components represent 7.4% of gene expression. The major toxins identified are potassium and sodium channel toxins, whereas metalloproteinases showed an unexpected high abundance. Phylogenetic analysis of deduced metalloproteinases from T. bahiensis and other scorpions revealed a pattern of ancient and intraspecific gene expansions. Other venom molecules identified include antimicrobial, anionic and bradykinin-potentiating peptides, besides several putative new venom components. This report provides the first attempt to massively identify the venom components of this species and constitutes one of the few transcriptomic efforts on the genus Tityus. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. Transcriptome-wide investigation of genomic imprinting in chicken

    PubMed Central

    Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique

    2014-01-01

    Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken. PMID:24452801

  14. Picoliter DNA Sequencing Chemistry on an Electrowetting-based Digital Microfluidic Platform

    PubMed Central

    Ferguson Welch, Erin R.; Lin, Yan-You; Madison, Andrew; Fair, R.B.

    2011-01-01

    The results of investigations into performing DNA sequencing chemistry on a picoliter-scale electrowetting digital microfluidic platform are reported. Pyrosequencing utilizes pyrophosphate produced during nucleotide base addition to initiate a process ending with detection through a chemiluminescence reaction using firefly luciferase. The intensity of light produced during the reaction can be quantified to determine the number of bases added to the DNA strand. The logic-based control and discrete fluid droplets of a digital microfluidic device lend themselves well to the pyrosequencing process. Bead-bound DNA is magnetically held in a single location, and wash or reagent droplets added or split from it to circumvent product dilution. Here we discuss the dispensing, control, and magnetic manipulation of the paramagnetic beads used to hold target DNA. We also demonstrate and characterize the picoliter-scale reaction of luciferase with adenosine triphosphate to represent the detection steps of pyrosequencing and all necessary alterations for working on this scale. PMID:21298802

  15. Development of Reference Transcriptomes for the Major Field Insect Pests of Cowpea: A Toolbox for Insect Pest Management Approaches in West Africa

    PubMed Central

    Agunbiade, Tolulope A.; Sun, Weilin; Coates, Brad S.; Djouaka, Rousseau; Tamò, Manuele; Ba, Malick N.; Binso-Dabire, Clementine; Baoua, Ibrahim; Olds, Brett P.; Pittendrigh, Barry R.

    2013-01-01

    Cowpea is a widely cultivated and major nutritional source of protein for many people that live in West Africa. Annual yields and longevity of grain storage is greatly reduced by feeding damage caused by a complex of insect pests that include the pod sucking bugs, Anoplocnemis curvipes Fabricius (Hemiptera: Coreidae) and Clavigralla tomentosicollis Stål (Hemiptera: Coreidae); as well as phloem-feeding cowpea aphids, Aphis craccivora Koch (Hemiptera: Aphididae) and flower thrips, Megalurothrips sjostedti Trybom (Thysanoptera: Thripidae). Efforts to control these pests remain a challenge and there is a need to understand the structure and movement of these pest populations in order to facilitate the development of integrated pest management strategies (IPM). Molecular tools have the potential to help facilitate a better understanding of pest populations. Towards this goal, we used 454 pyrosequencing technology to generate 319,126, 176,262, 320,722 and 227,882 raw reads from A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti, respectively. The reads were de novo assembled into 11,687, 7,647, 10,652 and 7,348 transcripts for A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti, respectively. Functional annotation of the resulting transcripts identified genes putatively involved in insecticide resistance, pathogen defense and immunity. Additionally, sequences that matched the primary aphid endosymbiont, Buchnera aphidicola, were identified among A. craccivora transcripts. Furthermore, 742, 97, 607 and 180 single nucleotide polymorphisms (SNPs) were respectively predicted among A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti transcripts, and will likely be valuable tools for future molecular genetic marker development. These results demonstrate that Roche 454-based transcriptome sequencing could be useful for the development of genomic resources for cowpea pest insects in West Africa. PMID:24278221

  16. Transcriptome analysis of Pacific white shrimp (Litopenaeus vannamei) hepatopancreas in response to Taura syndrome Virus (TSV) experimental infection.

    PubMed

    Zeng, Digang; Chen, Xiuli; Xie, Daxiang; Zhao, Yongzhen; Yang, Chunling; Li, Yongmei; Ma, Ning; Peng, Min; Yang, Qiong; Liao, Zhenping; Wang, Hui; Chen, Xiaohan

    2013-01-01

    The Pacific white shrimp, Litopenaeus vannamei, is a worldwide cultured crustacean species with important commercial value. Over the last two decades, Taura syndrome virus (TSV) has seriously threatened the shrimp aquaculture industry in the Western Hemisphere. To better understand the interaction between shrimp immune and TSV, we performed a transcriptome analysis in the hepatopancreas of L. vannamei challenged with TSV, using the 454 pyrosequencing (Roche) technology. We obtained 126919 and 102181 high-quality reads from TSV-infected and non-infected (control) L. vannamei cDNA libraries, respectively. The overall de novo assembly of cDNA sequence data generated 15004 unigenes, with an average length of 507 bp. Based on BLASTX search (E-value <10-5) against NR, Swissprot, GO, COG and KEGG databases, 10425 unigenes (69.50% of all unigenes) were annotated with gene descriptions, gene ontology terms, or metabolic pathways. In addition, we identified 770 microsatellites and designed 497 sets of primers. Comparative genomic analysis revealed that 1311 genes differentially expressed in the infected shrimp compared to the controls, including 559 up- and 752 down- regulated genes. Among the differentially expressed genes, several are involved in various animal immune functions, such as antiviral, antimicrobial, proteases, protease inhibitors, signal transduction, transcriptional control, cell death and cell adhesion. This study provides valuable information on shrimp gene activities against TSV infection. Results can contribute to the in-depth study of candidate genes in shrimp immunity, and improves our current understanding of this host-virus interaction. In addition, the large amount of transcripts reported in this study provide a rich source for identification of novel genes in shrimp.

  17. Tumor Heterogeneity, Single-Cell Sequencing, and Drug Resistance.

    PubMed

    Schmidt, Felix; Efferth, Thomas

    2016-06-16

    Tumor heterogeneity has been compared with Darwinian evolution and survival of the fittest. The evolutionary ecosystem of tumors consisting of heterogeneous tumor cell populations represents a considerable challenge to tumor therapy, since all genetically and phenotypically different subpopulations have to be efficiently killed by therapy. Otherwise, even small surviving subpopulations may cause repopulation and refractory tumors. Single-cell sequencing allows for a better understanding of the genomic principles of tumor heterogeneity and represents the basis for more successful tumor treatments. The isolation and sequencing of single tumor cells still represents a considerable technical challenge and consists of three major steps: (1) single cell isolation (e.g., by laser-capture microdissection), fluorescence-activated cell sorting, micromanipulation, whole genome amplification (e.g., with the help of Phi29 DNA polymerase), and transcriptome-wide next generation sequencing technologies (e.g., 454 pyrosequencing, Illumina sequencing, and other systems). Data demonstrating the feasibility of single-cell sequencing for monitoring the emergence of drug-resistant cell clones in patient samples are discussed herein. It is envisioned that single-cell sequencing will be a valuable asset to assist the design of regimens for personalized tumor therapies based on tumor subpopulation-specific genetic alterations in individual patients.

  18. Transcriptome mining of immune-related genes in the muricid snail Concholepas concholepas.

    PubMed

    Détrée, Camille; López-Landavery, Edgar; Gallardo-Escárate, Cristian; Lafarga-De la Cruz, Fabiola

    2017-12-01

    The population of the Chilean endemic marine gastropod Concholepas concholepas locally called "loco" has dramatically decreased in the past 50 years as a result of intense activity of local fisheries and high environmental variability observed along the Chilean coast, including episodes of hypoxia, changes in sea surface temperature, ocean acidification and diseases. In this study, we set out to explore the molecular basis of C. concholepas to cope with biotic stressors such as exposure to the pathogenic bacterium Vibrio anguillarum. Here, 454pyrosequencing was conducted and 61 transcripts related to the immune response in this muricid species were identified. Among these, the expression of six genes (CcNFκβ, CcIκβ, CcLITAF, CcTLR, CcCas8 and CcCath) involved in the regulation of inflammatory, apoptotic and immune processes upon stimuli, were evaluated during the first 33 h post challenge (hpc). The results showed that CcTLR, CcCas8 and CcCath have an initial response at 4 hpc, evidencing an up-regulation from 4 to 24 hpc. Notably, the response of CcNFKB occurred 2 h later with a statistically significant up-regulation at 6 hpc and 10 hpc. Furthermore, the challenge with V. anguillarum induced a statistically significant down-regulation of CcIKB between 2 and 10 hpc as well as a down-regulation of CcLITAF between 2 and 4 hpc followed in both cases by an up-regulation between 24 and 33 hpc. This work describes the first transcriptomic effort to characterize the immune response of C. concholepas and constitutes a valuable transcriptomic resource for future efforts to develop sustainable aquaculture and conservations tools for this endemic marine snail species. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Comparison of Microbiomes between Red Poultry Mite Populations (Dermanyssus gallinae): Predominance of Bartonella-like Bacteria.

    PubMed

    Hubert, Jan; Erban, Tomas; Kopecky, Jan; Sopko, Bruno; Nesvorna, Marta; Lichovnikova, Martina; Schicht, Sabine; Strube, Christina; Sparagano, Olivier

    2017-11-01

    Blood feeding red poultry mites (RPM) serve as vectors of pathogenic bacteria and viruses among vertebrate hosts including wild birds, poultry hens, mammals, and humans. The microbiome of RPM has not yet been studied by high-throughput sequencing. RPM eggs, larvae, and engorged adult/nymph samples obtained in four poultry houses in Czechia were used for microbiome analyses by Illumina amplicon sequencing of the 16S ribosomal RNA (rRNA) gene V4 region. A laboratory RPM population was used as positive control for transcriptome analysis by pyrosequencing with identification of sequences originating from bacteria. The samples of engorged adult/nymph stages had 100-fold more copies of 16S rRNA gene copies than the samples of eggs and larvae. The microbiome composition showed differences among the four poultry houses and among observed developmental stadia. In the adults' microbiome 10 OTUs comprised 90 to 99% of all sequences. Bartonella-like bacteria covered between 30 and 70% of sequences in RPM microbiome and 25% bacterial sequences in transcriptome. The phylogenetic analyses of 16S rRNA gene sequences revealed two distinct groups of Bartonella-like bacteria forming sister groups: (i) symbionts of ants; (ii) Bartonella genus. Cardinium, Wolbachia, and Rickettsiella sp. were found in the microbiomes of all tested stadia, while Spiroplasma eriocheiris and Wolbachia were identified in the laboratory RPM transcriptome. The microbiomes from eggs, larvae, and engorged adults/nymphs differed. Bartonella-like symbionts were found in all stadia and sampling sites. Bartonella-like bacteria was the most diversified group within the RPM microbiome. The presence of identified putative pathogenic bacteria is relevant with respect to human and animal health issues while the identification of symbiontic bacteria can lead to new control methods targeting them to destabilize the arthropod host.

  20. Fungal Diversity in Tomato Rhizosphere Soil under Conventional and Desert Farming Systems

    PubMed Central

    Kazerooni, Elham A.; Maharachchikumbura, Sajeewa S. N.; Rethinasamy, Velazhahan; Al-Mahrouqi, Hamed; Al-Sadi, Abdullah M.

    2017-01-01

    This study examined fungal diversity and composition in conventional (CM) and desert farming (DE) systems in Oman. Fungal diversity in the rhizosphere of tomato was assessed using 454-pyrosequencing and culture-based techniques. Both techniques produced variable results in terms of fungal diversity, with 25% of the fungal classes shared between the two techniques. In addition, pyrosequencing recovered more taxa compared to direct plating. These findings could be attributed to the ability of pyrosequencing to recover taxa that cannot grow or are slow growing on culture media. Both techniques showed that fungal diversity in the conventional farm was comparable to that in the desert farm. However, the composition of fungal classes and taxa in the two farming systems were different. Pyrosequencing revealed that Microsporidetes and Dothideomycetes are the two most common fungal classes in CM and DE, respectively. However, the culture-based technique revealed that Eurotiomycetes was the most abundant class in both farming systems and some classes, such as Microsporidetes, were not detected by the culture-based technique. Although some plant pathogens (e.g., Pythium or Fusarium) were detected in the rhizosphere of tomato, the majority of fungal species in the rhizosphere of tomato were saprophytes. Our study shows that the cultivation system may have an impact on fungal diversity. The factors which affected fungal diversity in both farms are discussed. PMID:28824590

  1. Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides.

    PubMed

    Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

    2015-09-01

    Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.

  2. Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides1

    PubMed Central

    Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

    2015-01-01

    Premise of the study: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag–simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. Methods and Results: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. Conclusions: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species. PMID:26421250

  3. Mitochondrial sequence analysis for forensic identification using pyrosequencing technology.

    PubMed

    Andréasson, H; Asp, A; Alderborn, A; Gyllensten, U; Allen, M

    2002-01-01

    Over recent years, requests for mtDNA analysis in the field of forensic medicine have notably increased, and the results of such analyses have proved to be very useful in forensic cases where nuclear DNA analysis cannot be performed. Traditionally, mtDNA has been analyzed by DNA sequencing of the two hypervariable regions, HVI and HVII, in the D-loop. DNA sequence analysis using the conventional Sanger sequencing is very robust but time consuming and labor intensive. By contrast, mtDNA analysis based on the pyrosequencing technology provides fast and accurate results from the human mtDNA present in many types of evidence materials in forensic casework. The assay has been developed to determine polymorphic sites in the mitochondrial D-loop as well as the coding region to further increase the discrimination power of mtDNA analysis. The pyrosequencing technology for analysis of mtDNA polymorphisms has been tested with regard to sensitivity, reproducibility, and success rate when applied to control samples and actual casework materials. The results show that the method is very accurate and sensitive; the results are easily interpreted and provide a high success rate on casework samples. The panel of pyrosequencing reactions for the mtDNA polymorphisms were chosen to result in an optimal discrimination power in relation to the number of bases determined.

  4. Pyrosequencing analysis of the gyrB gene to differentiate bacteria responsible for diarrheal diseases.

    PubMed

    Hou, X-L; Cao, Q-Y; Jia, H-Y; Chen, Z

    2008-07-01

    Pathogens causing acute diarrhea include a large variety of species from Enterobacteriaceae and Vibrionaceae. A method based on pyrosequencing was used here to differentiate bacteria commonly associated with diarrhea in China; the method is targeted to a partial amplicon of the gyrB gene, which encodes the B subunit of DNA gyrase. Twenty-eight specific polymorphic positions were identified from sequence alignment of a large sequence dataset and targeted using 17 sequencing primers. Of 95 isolates tested, belonging to 13 species within 7 genera, most could be identified to the species level; O157 type could be differentiated from other E. coli types; Salmonella enterica subsp. enterica could be identified at the serotype level; the genus Shigella, except for S. boydii and S. dysenteriae, could also be identified. All these isolates were also subjected to conventional sequencing of a relatively long ( approximately1.2 kb) region of gyrB DNA; these results confirmed those with pyrosequencing. Twenty-two fecal samples were surveyed, the results of which were concordant with culture-based bacterial identification, and the pathogen detection limit with simulated stool specimens was 10(4) CFU/ml. DNA from different pathogens was also mixed to simulate a case of multibacterial infection, and the generated signals correlated well with the mix ratio. In summary, the gyrB-based pyrosequencing approach proved to have significant reliability and discriminatory power for enteropathogenic bacterial identification and provided a fast and effective method for clinical diagnosis.

  5. Performance of Different Analytical Software Packages in Quantification of DNA Methylation by Pyrosequencing.

    PubMed

    Grasso, Chiara; Trevisan, Morena; Fiano, Valentina; Tarallo, Valentina; De Marco, Laura; Sacerdote, Carlotta; Richiardi, Lorenzo; Merletti, Franco; Gillio-Tos, Anna

    2016-01-01

    Pyrosequencing has emerged as an alternative method of nucleic acid sequencing, well suited for many applications which aim to characterize single nucleotide polymorphisms, mutations, microbial types and CpG methylation in the target DNA. The commercially available pyrosequencing systems can harbor two different types of software which allow analysis in AQ or CpG mode, respectively, both widely employed for DNA methylation analysis. Aim of the study was to assess the performance for DNA methylation analysis at CpG sites of the two pyrosequencing software which allow analysis in AQ or CpG mode, respectively. Despite CpG mode having been specifically generated for CpG methylation quantification, many investigations on this topic have been carried out with AQ mode. As proof of equivalent performance of the two software for this type of analysis is not available, the focus of this paper was to evaluate if the two modes currently used for CpG methylation assessment by pyrosequencing may give overlapping results. We compared the performance of the two software in quantifying DNA methylation in the promoter of selected genes (GSTP1, MGMT, LINE-1) by testing two case series which include DNA from paraffin embedded prostate cancer tissues (PC study, N = 36) and DNA from blood fractions of healthy people (DD study, N = 28), respectively. We found discrepancy in the two pyrosequencing software-based quality assignment of DNA methylation assays. Compared to the software for analysis in the AQ mode, less permissive criteria are supported by the Pyro Q-CpG software, which enables analysis in CpG mode. CpG mode warns the operators about potential unsatisfactory performance of the assay and ensures a more accurate quantitative evaluation of DNA methylation at CpG sites. The implementation of CpG mode is strongly advisable in order to improve the reliability of the methylation analysis results achievable by pyrosequencing.

  6. Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.

    PubMed

    Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul

    2012-11-20

    Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.

  7. De novo assembly and characterization of tissue specific transcriptomes in the emerald notothen, Trematomus bernacchii.

    PubMed

    Huth, Troy J; Place, Sean P

    2013-11-20

    The notothenioids comprise a diverse group of fishes that rapidly radiated after isolation by the Antarctic Circumpolar Current approximately 14-25 million years ago. Given that evolutionary adaptation has led to finely tuned traits with narrow physiological limits in these organisms, this system provides a unique opportunity to examine physiological trade-offs and limits of adaptive responses to environmental perturbation. As such, notothenioids have a rich history with respect to studies attempting to understand the vulnerability of polar ecosystems to the negative impacts associated with global climate change. Unfortunately, despite being a model system for understanding physiological adaptations to extreme environments, we still lack fundamental molecular tools for much of the Nototheniidae family. Specimens of the emerald notothen, Trematomus bernacchii, were acclimated for 28 days in flow-through seawater tanks maintained near ambient seawater temperatures (-1.5°C) or at +4°C. Following acclimation, tissue specific cDNA libraries for liver, gill and brain were created by pooling RNA from n = 5 individuals per temperature treatment. The tissue specific libraries were bar-coded and used for 454 pyrosequencing, which yielded over 700 thousand sequencing reads. A de novo assembly and annotation of these reads produced a functional transcriptome library of T. bernacchii containing 30,107 unigenes, 13,003 of which possessed significant homology to a known protein product. Digital gene expression analysis of these extremely cold adapted fish reinforced the loss of an inducible heat shock response and allowed the preliminary exploration into other elements of the cellular stress response. Preliminary exploration of the transcriptome of T. bernacchii under elevated temperatures enabled a semi-quantitative comparison to prior studies aimed at characterizing the thermal response of this endemic fish whose size, abundance and distribution has established it as a pivotal species in polar research spanning several decades. The comparison of these findings to previous studies demonstrates the efficacy of transcriptomics and digital gene expression analysis as tools in future studies of polar organisms and has greatly increased the available genomic resources for the suborder Notothenioidei, particularly in the Trematominae subfamily.

  8. De novo assembly and characterization of tissue specific transcriptomes in the emerald notothen, Trematomus bernacchii

    PubMed Central

    2013-01-01

    Background The notothenioids comprise a diverse group of fishes that rapidly radiated after isolation by the Antarctic Circumpolar Current approximately 14–25 million years ago. Given that evolutionary adaptation has led to finely tuned traits with narrow physiological limits in these organisms, this system provides a unique opportunity to examine physiological trade-offs and limits of adaptive responses to environmental perturbation. As such, notothenioids have a rich history with respect to studies attempting to understand the vulnerability of polar ecosystems to the negative impacts associated with global climate change. Unfortunately, despite being a model system for understanding physiological adaptations to extreme environments, we still lack fundamental molecular tools for much of the Nototheniidae family. Results Specimens of the emerald notothen, Trematomus bernacchii, were acclimated for 28 days in flow-through seawater tanks maintained near ambient seawater temperatures (−1.5°C) or at +4°C. Following acclimation, tissue specific cDNA libraries for liver, gill and brain were created by pooling RNA from n = 5 individuals per temperature treatment. The tissue specific libraries were bar-coded and used for 454 pyrosequencing, which yielded over 700 thousand sequencing reads. A de novo assembly and annotation of these reads produced a functional transcriptome library of T. bernacchii containing 30,107 unigenes, 13,003 of which possessed significant homology to a known protein product. Digital gene expression analysis of these extremely cold adapted fish reinforced the loss of an inducible heat shock response and allowed the preliminary exploration into other elements of the cellular stress response. Conclusions Preliminary exploration of the transcriptome of T. bernacchii under elevated temperatures enabled a semi-quantitative comparison to prior studies aimed at characterizing the thermal response of this endemic fish whose size, abundance and distribution has established it as a pivotal species in polar research spanning several decades. The comparison of these findings to previous studies demonstrates the efficacy of transcriptomics and digital gene expression analysis as tools in future studies of polar organisms and has greatly increased the available genomic resources for the suborder Notothenioidei, particularly in the Trematominae subfamily. PMID:24252228

  9. Integration of deep transcriptome and proteome analyses reveals the components of alkaloid metabolism in opium poppy cell cultures

    PubMed Central

    2010-01-01

    Background Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, the codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. Results A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. Conclusions The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. The establishment of corresponding transcript and protein databases generated by next-generation technologies in a system with a well-defined metabolite profile facilitates an improved linkage between genes, enzymes, and pathway components. The proteome database represents the most relevant alkaloid-producing enzymes, compared with the much deeper and more complete transcriptome library. The transcript database contained full-length mRNAs encoding most alkaloid biosynthetic enzymes, which is a key requirement for the functional characterization of novel gene candidates. PMID:21083930

  10. Analysis of genetically modified organisms by pyrosequencing on a portable photodiode-based bioluminescence sequencer.

    PubMed

    Song, Qinxin; Wei, Guijiang; Zhou, Guohua

    2014-07-01

    A portable bioluminescence analyser for detecting the DNA sequence of genetically modified organisms (GMOs) was developed by using a photodiode (PD) array. Pyrosequencing on eight genes (zSSIIb, Bt11 and Bt176 gene of genetically modified maize; Lectin, 35S-CTP4, CP4EPSPS, CaMV35S promoter and NOS terminator of the genetically modified Roundup ready soya) was successfully detected with this instrument. The corresponding limit of detection (LOD) was 0.01% with 35 PCR cycles. The maize and soya available from three different provenances in China were detected. The results indicate that pyrosequencing using the small size of the detector is a simple, inexpensive, and reliable way in a farm/field test of GMO analysis. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Multiplex pyrosequencing assay using AdvISER-MH-PYRO algorithm: a case for rapid and cost-effective genotyping analysis of prostate cancer risk-associated SNPs.

    PubMed

    Ambroise, Jérôme; Butoescu, Valentina; Robert, Annie; Tombal, Bertrand; Gala, Jean-Luc

    2015-06-25

    Single Nucleotide Polymorphisms (SNPs) identified in Genome Wide Association Studies (GWAS) have generally moderate association with related complex diseases. Accordingly, Multilocus Genetic Risk Scores (MGRSs) have been computed in previous studies in order to assess the cumulative association of multiple SNPs. When several SNPs have to be genotyped for each patient, using successive uniplex pyrosequencing reactions increases analytical reagent expenses and Turnaround Time (TAT). While a set of several pyrosequencing primers could theoretically be used to analyze multiplex amplicons, this would generate overlapping primer-specific pyro-signals that are visually uninterpretable. In the current study, two multiplex assays were developed consisting of a quadruplex (n=4) and a quintuplex (n=5) polymerase chain reaction (PCR) each followed by multiplex pyrosequencing analysis. The aim was to reliably but rapidly genotype a set of prostate cancer-related SNPs (n=9). The nucleotide dispensation order was selected using SENATOR software. Multiplex pyro-signals were analyzed using the new AdvISER-MH-PYRO software based on a sparse representation of the signal. Using uniplex assays as gold standard, the concordance between multiplex and uniplex assays was assessed on DNA extracted from patient blood samples (n = 10). All genotypes (n=90) generated with the quadruplex and the quintuplex pyroquencing assays were perfectly (100 %) concordant with uniplex pyrosequencing. Using multiplex genotyping approach for analyzing a set of 90 patients allowed reducing TAT by approximately 75 % (i.e., from 2025 to 470 min) while reducing reagent consumption and cost by approximately 70 % (i.e., from ~229 US$ /patient to ~64 US$ /patient). This combination of quadruplex and quintuplex pyrosequencing and PCR assays enabled to reduce the amount of DNA required for multi-SNP analysis, and to lower the global TAT and costs of SNP genotyping while providing results as reliable as uniplex analysis. Using this combined multiplex approach also substantially reduced the production of waste material. These genotyping assays appear therefore to be biologically, economically and ecologically highly relevant, being worth to be integrated in genetic-based predictive strategies for better selecting patients at risk for prostate cancer. In addition, the same approach could now equally be transposed to other clinical/research applications relying on the computation of MGRS based on multi-SNP genotyping.

  12. An ovary transcriptome for all maturational stages of the striped bass (Morone saxatilis), a highly advanced perciform fish.

    PubMed

    Reading, Benjamin J; Chapman, Robert W; Schaff, Jennifer E; Scholl, Elizabeth H; Opperman, Charles H; Sullivan, Craig V

    2012-02-21

    The striped bass and its relatives (genus Morone) are important fisheries and aquaculture species native to estuaries and rivers of the Atlantic coast and Gulf of Mexico in North America. To open avenues of gene expression research on reproduction and breeding of striped bass, we generated a collection of expressed sequence tags (ESTs) from a complementary DNA (cDNA) library representative of their ovarian transcriptome. Sequences of a total of 230,151 ESTs (51,259,448 bp) were acquired by Roche 454 pyrosequencing of cDNA pooled from ovarian tissues obtained at all stages of oocyte growth, at ovulation (eggs), and during preovulatory atresia. Quality filtering of ESTs allowed assembly of 11,208 high-quality contigs ≥ 100 bp, including 2,984 contigs 500 bp or longer (average length 895 bp). Blastx comparisons revealed 5,482 gene orthologues (E-value < 10-3), of which 4,120 (36.7% of total contigs) were annotated with Gene Ontology terms (E-value < 10-6). There were 5,726 remaining unknown unique sequences (51.1% of total contigs). All of the high-quality EST sequences are available in the National Center for Biotechnology Information (NCBI) Short Read Archive (GenBank: SRX007394). Informative contigs were considered to be abundant if they were assembled from groups of ESTs comprising ≥ 0.15% of the total short read sequences (≥ 345 reads/contig). Approximately 52.5% of these abundant contigs were predicted to have predominant ovary expression through digital differential display in silico comparisons to zebrafish (Danio rerio) UniGene orthologues. Over 1,300 Gene Ontology terms from Biological Process classes of Reproduction, Reproductive process, and Developmental process were assigned to this collection of annotated contigs. This first large reference sequence database available for the ecologically and economically important temperate basses (genus Morone) provides a foundation for gene expression studies in these species. The predicted predominance of ovary gene expression and assignment of directly relevant Gene Ontology classes suggests a powerful utility of this dataset for analysis of ovarian gene expression related to fundamental questions of oogenesis. Additionally, a high definition Agilent 60-mer oligo ovary 'UniClone' microarray with 8 × 15,000 probe format has been designed based on this striped bass transcriptome (eArray Group: Striper Group, Design ID: 029004).

  13. [Sensitivity and specificity of nested PCR pyrosequencing in hepatitis B virus drug resistance gene testing].

    PubMed

    Sun, Shumei; Zhou, Hao; Zhou, Bin; Hu, Ziyou; Hou, Jinlin; Sun, Jian

    2012-05-01

    To evaluate the sensitivity and specificity of nested PCR combined with pyrosequencing in the detection of HBV drug-resistance gene. RtM204I (ATT) mutant and rtM204 (ATG) nonmutant plasmids mixed at different ratios were detected for mutations using nested-PCR combined with pyrosequencing, and the results were compared with those by conventional PCR pyrosequencing to analyze the linearity and consistency of the two methods. Clinical specimens with different viral loads were examined for drug-resistant mutations using nested PCR pyrosequencing and nested PCR combined with dideoxy sequencing (Sanger) for comparison of the detection sensitivity and specificity. The fitting curves demonstrated good linearity of both conventional PCR pyrosequencing and nested PCR pyrosequencing (R(2)>0.99, P<0.05). Nested PCR showed a better consistency with the predicted value than conventional PCR, and was superior to conventional PCR for detection of samples containing 90% mutant plasmid. In the detection of clinical specimens, Sanger sequencing had a significantly lower sensitivity than nested PCR pyrosequencing (92% vs 100%, P<0.01). The detection sensitivity of Sanger sequencing varied with the viral loads, especially in samples with low viral copies (HBV DNA ≤3log10 copies/ml), where the sensitivity was 78%, significantly lower than that of pyrosequencing (100%, P<0.01). Neither of the two methods yielded positive results for the negative control samples, suggesting their good specificity. Compared with nested PCR and Sanger sequencing method, nested PCR pyrosequencing has a higher sensitivity especially in clinical specimens with low viral copies, which can be important for early detection of HBV mutant strains and hence more effective clinical management.

  14. Determination of multiple-clone infection at allelic dimorphism site of Plasmodium vivax merozoite surface protein-1 in the Republic of Korea by pyrosequencing assay.

    PubMed

    Dinzouna-Boutamba, Sylvatrie-Danne; Lee, Sanghyun; Son, Ui-Han; Yun, Hae Soo; Joo, So-Young; Jeong, Sookwan; Rhee, Man Hee; Kwak, Dongmi; Xuan, Xuenan; Hong, Yeonchul; Chung, Dong-Il; Goo, Youn-Kyoung

    2017-12-01

    Allelic diversity leading to multiple gene polymorphisms of vivax malaria parasites has been shown to greatly contribute to antigenic variation and drug resistance, increasing the potential for multiple-clone infections within the host. Therefore, to identify multiple-clone infections and the predominant haplotype of Plasmodium vivax in a South Korean population, P. vivax merozoite surface protein-1 (PvMSP-1) was analyzed by pyrosequencing. Pyrosequencing of 156 vivax malaria-infected samples yielded 97 (62.18%) output pyrograms showing two main types of peak patterns of the dimorphic allele for threonine and alanine (T1476A). Most of the samples evaluated (88.66%) carried multiple-clone infections (wild- and mutant-types), whereas 11.34% of the same population carried only the mutant-type (1476A). In addition, each allele showed a high frequency of guanine (G) base substitution at both the first and third positions (86.07% and 81.13%, respectively) of the nucleotide combinations. Pyrosequencing of the PvMSP-1 42-kDa fragment revealed a heterogeneous parasite population, with the mutant-type dominant compared to the wild-type. Understanding the genetic diversity and multiple-clone infection rates may lead to improvements in vivax malaria prevention and strategic control plans. Further studies are needed to improve the efficacy of the pyrosequencing assay with large sample sizes and additional nucleotide positions. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Insights into the development and evolution of exaggerated traits using de novo transcriptomes of two species of horned scarab beetles.

    PubMed

    Warren, Ian A; Vera, J Cristobal; Johns, Annika; Zinna, Robert; Marden, James H; Emlen, Douglas J; Dworkin, Ian; Lavine, Laura C

    2014-01-01

    Scarab beetles exhibit an astonishing variety of rigid exo-skeletal outgrowths, known as "horns". These traits are often sexually dimorphic and vary dramatically across species in size, shape, location, and allometry with body size. In many species, the horn exhibits disproportionate growth resulting in an exaggerated allometric relationship with body size, as compared to other traits, such as wings, that grow proportionately with body size. Depending on the species, the smallest males either do not produce a horn at all, or they produce a disproportionately small horn for their body size. While the diversity of horn shapes and their behavioural ecology have been reasonably well studied, we know far less about the proximate mechanisms that regulate horn growth. Thus, using 454 pyrosequencing, we generated transcriptome profiles, during horn growth and development, in two different scarab beetle species: the Asian rhinoceros beetle, Trypoxylus dichotomus, and the dung beetle, Onthophagus nigriventris. We obtained over half a million reads for each species that were assembled into over 6,000 and 16,000 contigs respectively. We combined these data with previously published studies to look for signatures of molecular evolution. We found a small subset of genes with horn-biased expression showing evidence for recent positive selection, as is expected with sexual selection on horn size. We also found evidence of relaxed selection present in genes that demonstrated biased expression between horned and horn-less morphs, consistent with the theory of developmental decoupling of phenotypically plastic traits.

  16. Droplet-based pyrosequencing using digital microfluidics.

    PubMed

    Boles, Deborah J; Benton, Jonathan L; Siew, Germaine J; Levy, Miriam H; Thwar, Prasanna K; Sandahl, Melissa A; Rouse, Jeremy L; Perkins, Lisa C; Sudarsan, Arjun P; Jalili, Roxana; Pamula, Vamsee K; Srinivasan, Vijay; Fair, Richard B; Griffin, Peter B; Eckhardt, Allen E; Pollack, Michael G

    2011-11-15

    The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., "sample-to-sequence" capability) could eventually be achieved using this low-cost platform.

  17. Pyrosequencing-based assessment of microbial community shifts in leachate from animal carcass burial lysimeter.

    PubMed

    Kim, Hyun Young; Seo, Jiyoung; Kim, Tae-Hun; Shim, Bomi; Cha, Seok Mun; Yu, Seungho

    2017-06-01

    This study examined the use of microbial community structure as a bio-indicator of decomposition levels. High-throughput pyrosequencing technology was used to assess the shift in microbial community of leachate from animal carcass lysimeter. The leachate samples were collected monthly for one year and a total of 164,639 pyrosequencing reads were obtained and used in the taxonomic classification and operational taxonomy units (OTUs) distribution analysis based on sequence similarity. Our results show considerable changes in the phylum-level bacterial composition, suggesting that the microbial community is a sensitive parameter affected by the burial environment. The phylum classification results showed that Proteobacteria (Pseudomonas) were the most influential taxa in earlier decomposition stage whereas Firmicutes (Clostridium, Sporanaerobacter, and Peptostreptococcus) were dominant in later stage under anaerobic conditions. The result of this study can provide useful information on a time series of leachate profiles of microbial community structures and suggest patterns of microbial diversity in livestock burial sites. In addition, this result can be applicable to predict the decomposition stages under clay loam based soil conditions of animal livestock. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Droplet-Based Pyrosequencing Using Digital Microfluidics

    PubMed Central

    Boles, Deborah J.; Benton, Jonathan L.; Siew, Germaine J.; Levy, Miriam H.; Thwar, Prasanna K.; Sandahl, Melissa A.; Rouse, Jeremy L.; Perkins, Lisa C.; Sudarsan, Arjun P.; Jalili, Roxana; Pamula, Vamsee K.; Srinivasan, Vijay; Fair, Richard B.; Griffin, Peter B.; Eckhardt, Allen E.; Pollack, Michael G.

    2013-01-01

    The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., “sample-to-sequence” capability) could eventually be achieved using this low-cost platform. PMID:21932784

  19. Microbial analysis in primary and persistent endodontic infections by using pyrosequencing.

    PubMed

    Hong, Bo-Young; Lee, Tae-Kwon; Lim, Sang-Min; Chang, Seok Woo; Park, Joonhong; Han, Seung Hyun; Zhu, Qiang; Safavi, Kamran E; Fouad, Ashraf F; Kum, Kee Yeon

    2013-09-01

    The aim of this study was to investigate the bacterial community profile of intracanal microbiota in primary and persistent endodontic infections associated with asymptomatic chronic apical periodontitis by using GS-FLX Titanium pyrosequencing. The null hypothesis was that there is no difference in diversity of overall bacterial community profiles between primary and persistent infections. Pyrosequencing analysis from 10 untreated and 8 root-filled samples was conducted. Analysis from 18 samples yielded total of 124,767 16S rRNA gene sequences (with a mean of 6932 reads per sample) that were taxonomically assigned into 803 operational taxonomic units (3% distinction), 148 genera, and 10 phyla including unclassified. Bacteroidetes was the most abundant phylum in both primary and persistent infections. There were no significant differences in bacterial diversity between the 2 infection groups (P > .05). The bacterial community profile that was based on dendrogram showed that bacterial population in both infections was not significantly different in their structure and composition (P > .05). The present pyrosequencing study demonstrates that persistent infections have as diverse bacterial community as primary infections. Copyright © 2013 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  20. Rapid phylogenetic dissection of prokaryotic community structure in tidal flat using pyrosequencing.

    PubMed

    Kim, Bong-Soo; Kim, Byung Kwon; Lee, Jae-Hak; Kim, Myungjin; Lim, Young Woon; Chun, Jongsik

    2008-08-01

    Dissection of prokaryotic community structure is prerequisite to understand their ecological roles. Various methods are available for such a purpose which amplification and sequencing of 16S rRNA genes gained its popularity. However, conventional methods based on Sanger sequencing technique require cloning process prior to sequencing, and are expensive and labor-intensive. We investigated prokaryotic community structure in tidal flat sediments, Korea, using pyrosequencing and a subsequent automated bioinformatic pipeline for the rapid and accurate taxonomic assignment of each amplicon. The combination of pyrosequencing and bioinformatic analysis showed that bacterial and archaeal communities were more diverse than previously reported in clone library studies. Pyrosequencing analysis revealed 21 bacterial divisions and 37 candidate divisions. Proteobacteria was the most abundant division in the bacterial community, of which Gamma-and Delta-Proteobacteria were the most abundant. Similarly, 4 archaeal divisions were found in tidal flat sediments. Euryarchaeota was the most abundant division in the archaeal sequences, which were further divided into 8 classes and 11 unclassified euryarchaeota groups. The system developed here provides a simple, in-depth and automated way of dissecting a prokaryotic community structure without extensive pretreatment such as cloning.

  1. Clinical Neuropathology practice news 1-2014: Pyrosequencing meets clinical and analytical performance criteria for routine testing of MGMT promoter methylation status in glioblastoma

    PubMed Central

    Preusser, Matthias; Berghoff, Anna S.; Manzl, Claudia; Filipits, Martin; Weinhäusel, Andreas; Pulverer, Walter; Dieckmann, Karin; Widhalm, Georg; Wöhrer, Adelheid; Knosp, Engelbert; Marosi, Christine; Hainfellner, Johannes A.

    2014-01-01

    Testing of the MGMT promoter methylation status in glioblastoma is relevant for clinical decision making and research applications. Two recent and independent phase III therapy trials confirmed a prognostic and predictive value of the MGMT promoter methylation status in elderly glioblastoma patients. Several methods for MGMT promoter methylation testing have been proposed, but seem to be of limited test reliability. Therefore, and also due to feasibility reasons, translation of MGMT methylation testing into routine use has been protracted so far. Pyrosequencing after prior DNA bisulfite modification has emerged as a reliable, accurate, fast and easy-to-use method for MGMT promoter methylation testing in tumor tissues (including formalin-fixed and paraffin-embedded samples). We performed an intra- and inter-laboratory ring trial which demonstrates a high analytical performance of this technique. Thus, pyrosequencing-based assessment of MGMT promoter methylation status in glioblastoma meets the criteria of high analytical test performance and can be recommended for clinical application, provided that strict quality control is performed. Our article summarizes clinical indications, practical instructions and open issues for MGMT promoter methylation testing in glioblastoma using pyrosequencing. PMID:24359605

  2. First comparative transcriptomic analysis of wild adult male and female Lutzomyia longipalpis, vector of visceral leishmaniasis.

    PubMed

    McCarthy, Christina B; Santini, María Soledad; Pimenta, Paulo F P; Diambra, Luis A

    2013-01-01

    Leishmaniasis is a vector-borne disease with a complex epidemiology and ecology. Visceral leishmaniasis (VL) is its most severe clinical form as it results in death if not treated. In Latin America VL is caused by the protist parasite Leishmania infantum (syn. chagasi) and transmitted by Lutzomyia longipalpis. This phlebotomine sand fly is only found in the New World, from Mexico to Argentina. However, due to deforestation, migration and urbanisation, among others, VL in Latin America is undergoing an evident geographic expansion as well as dramatic changes in its transmission patterns. In this context, the first VL outbreak was recently reported in Argentina, which has already caused 7 deaths and 83 reported cases. Insect vector transcriptomic analyses enable the identification of molecules involved in the insect's biology and vector-parasite interaction. Previous studies on laboratory reared Lu. longipalpis have provided a descriptive repertoire of gene expression in the whole insect, midgut, salivary gland and male reproductive organs. Nevertheless, the study of wild specimens would contribute a unique insight into the development of novel bioinsecticides. Given the recent VL outbreak in Argentina and the compelling need to develop appropriate control strategies, this study focused on wild male and female Lu. longipalpis from an Argentine endemic (Posadas, Misiones) and a Brazilian non-endemic (Lapinha Cave, Minas Gerais) VL location. In this study, total RNA was extracted from the sand flies, submitted to sequence independent amplification and high-throughput pyrosequencing. This is the first time an unbiased and comprehensive transcriptomic approach has been used to analyse an infectious disease vector in its natural environment. Transcripts identified in the sand flies showed characteristic profiles which correlated with the environment of origin and with taxa previously identified in these same specimens. Among these, various genes represented putative targets for vector control via RNA interference (RNAi).

  3. Transcriptome Analysis of an Insecticide Resistant Housefly Strain: Insights about SNPs and Regulatory Elements in Cytochrome P450 Genes.

    PubMed

    Mahmood, Khalid; Højland, Dorte H; Asp, Torben; Kristensen, Michael

    2016-01-01

    Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome with focus on P450 genes. The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad resistant strain. The 3,648 sequences were annotated with an enzyme code EC number and were mapped to 124 KEGG pathways with metabolic processes as most highly represented pathway. One hundred and twenty contigs were annotated as P450s covering 44 different P450 genes of housefly. Eight differentially expressed P450s genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolic related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis resulted in 12 SNPs and eight of them found in cyp6d1. There is variation in location, size and frequency of CpG islands and specific motifs were also identified in these P450s. Moreover, identified motifs were associated to GO terms and transcription factors using bioinformatic tools. Transcriptome data of a spinosad resistant strain provide together with genome data fundamental support for future research to understand evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. Taken together our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s in xenobiotic detoxification.

  4. Transcriptome Analysis of Yellow Horn (Xanthoceras sorbifolia Bunge): A Potential Oil-Rich Seed Tree for Biodiesel in China

    PubMed Central

    Liu, Yulin; Huang, Zhedong; Ao, Yan; Li, Wei; Zhang, Zhixiang

    2013-01-01

    Background Yellow horn (Xanthoceras sorbifolia Bunge) is an oil-rich seed shrub that grows well in cold, barren environments and has great potential for biodiesel production in China. However, the limited genetic data means that little information about the key genes involved in oil biosynthesis is available, which limits further improvement of this species. In this study, we describe sequencing and de novo transcriptome assembly to produce the first comprehensive and integrated genomic resource for yellow horn and identify the pathways and key genes related to oil accumulation. In addition, potential molecular markers were identified and compiled. Methodology/Principal Findings Total RNA was isolated from 30 plants from two regions, including buds, leaves, flowers and seeds. Equal quantities of RNA from these tissues were pooled to construct a cDNA library for 454 pyrosequencing. A total of 1,147,624 high-quality reads with total and average lengths of 530.6 Mb and 462 bp, respectively, were generated. These reads were assembled into 51,867 unigenes, corresponding to a total of 36.1 Mb with a mean length, N50 and median of 696, 928 and 570 bp, respectively. Of the unigenes, 17,541 (33.82%) were unmatched in any public protein databases. We identified 281 unigenes that may be involved in de novo fatty acid (FA) and triacylglycerol (TAG) biosynthesis and metabolism. Furthermore, 6,707 SSRs, 16,925 SNPs and 6,201 InDels with high-confidence were also identified in this study. Conclusions This transcriptome represents a new functional genomics resource and a foundation for further studies on the metabolic engineering of yellow horn to increase oil content and modify oil composition. The potential molecular markers identified in this study provide a basis for polymorphism analysis of Xanthoceras, and even Sapindaceae; they will also accelerate the process of breeding new varieties with better agronomic characteristics. PMID:24040247

  5. Transcriptome and proteome profiling of adventitious root development in hybrid larch (Larix kaempferi × Larix olgensis).

    PubMed

    Han, Hua; Sun, Xiaomei; Xie, Yunhui; Feng, Jian; Zhang, Shougong

    2014-11-26

    Hybrids of larch (Larix kaempferi × Larix olgensis) are important afforestation species in northeastern China. They are routinely propagated via rooted stem cuttings. Despite the importance of rooting, little is known about the regulation of adventitious root development in larch hybrids. 454 GS FLX Titanium technology represents a new method for characterizing the transcriptomes of non-model species. This method can be used to identify differentially expressed genes, and then two-dimensional difference gel electrophoresis (2D-DIGE) and matrix-assisted laser desorption-ionization time-of-flight mass spectrometry (MALDI-TOF/TOF MS) analyses can be used to analyze their corresponding proteins. In this study, we analyzed semi-lignified cuttings of two clones of L. kaempferi × L. olgensis with different rooting capacities to study the molecular basis of adventitious root development. We analyzed two clones; clone 25-5, with strong rooting capacity, and clone 23-12, with weak rooting capacity. We constructed four cDNA libraries from 25-5 and 23-12 at two development stages. Sequencing was conducted using the 454 pyrosequencing platform. A total of 957832 raw reads was produced; 95.07% were high-quality reads, and were assembled into 45137 contigs and 61647 singletons. The functions of the unigenes, as indicated by their Gene Ontology annotation, included diverse roles in the molecular functions, biological processes, and cellular component categories. We analyzed 75 protein spots (-fold change ≥ 2, P ≤ 0.05) by 2D-DIGE, and identified the differentially expressed proteins using MALDI-TOF/TOF MS. A joint analysis of transcriptome and proteome showed genes related to two pathways, polyamine synthesis and stress response, might play an important role on adventitious root development. These results provide fundamental and important information for research on the molecular mechanism of adventitious root development. We also demonstrated for the first time the combined use of two important technologies as a powerful approach to advance research on non-model, but otherwise important, larch species.

  6. First Comparative Transcriptomic Analysis of Wild Adult Male and Female Lutzomyia longipalpis, Vector of Visceral Leishmaniasis

    PubMed Central

    McCarthy, Christina B.; Santini, María Soledad; Pimenta, Paulo F. P.; Diambra, Luis A.

    2013-01-01

    Leishmaniasis is a vector-borne disease with a complex epidemiology and ecology. Visceral leishmaniasis (VL) is its most severe clinical form as it results in death if not treated. In Latin America VL is caused by the protist parasite Leishmania infantum (syn. chagasi) and transmitted by Lutzomyia longipalpis. This phlebotomine sand fly is only found in the New World, from Mexico to Argentina. However, due to deforestation, migration and urbanisation, among others, VL in Latin America is undergoing an evident geographic expansion as well as dramatic changes in its transmission patterns. In this context, the first VL outbreak was recently reported in Argentina, which has already caused 7 deaths and 83 reported cases. Insect vector transcriptomic analyses enable the identification of molecules involved in the insect's biology and vector-parasite interaction. Previous studies on laboratory reared Lu. longipalpis have provided a descriptive repertoire of gene expression in the whole insect, midgut, salivary gland and male reproductive organs. Nevertheless, the study of wild specimens would contribute a unique insight into the development of novel bioinsecticides. Given the recent VL outbreak in Argentina and the compelling need to develop appropriate control strategies, this study focused on wild male and female Lu. longipalpis from an Argentine endemic (Posadas, Misiones) and a Brazilian non-endemic (Lapinha Cave, Minas Gerais) VL location. In this study, total RNA was extracted from the sand flies, submitted to sequence independent amplification and high-throughput pyrosequencing. This is the first time an unbiased and comprehensive transcriptomic approach has been used to analyse an infectious disease vector in its natural environment. Transcripts identified in the sand flies showed characteristic profiles which correlated with the environment of origin and with taxa previously identified in these same specimens. Among these, various genes represented putative targets for vector control via RNA interference (RNAi). PMID:23554910

  7. Antarctic krill 454 pyrosequencing reveals chaperone and stress transcriptome.

    PubMed

    Clark, Melody S; Thorne, Michael A S; Toullec, Jean-Yves; Meng, Yan; Guan, Le Luo; Peck, Lloyd S; Moore, Stephen

    2011-01-06

    The Antarctic krill Euphausia superba is a keystone species in the Antarctic food chain. Not only is it a significant grazer of phytoplankton, but it is also a major food item for charismatic megafauna such as whales and seals and an important Southern Ocean fisheries crop. Ecological data suggest that this species is being affected by climate change and this will have considerable consequences for the balance of the Southern Ocean ecosystem. Hence, understanding how this organism functions is a priority area and will provide fundamental data for life history studies, energy budget calculations and food web models. The assembly of the 454 transcriptome of E. superba resulted in 22,177 contigs with an average size of 492bp (ranging between 137 and 8515bp). In depth analysis of the data revealed an extensive catalogue of the cellular chaperone systems and the major antioxidant proteins. Full length sequences were characterised for the chaperones HSP70, HSP90 and the super-oxide dismutase antioxidants, with the discovery of potentially novel duplications of these genes. The sequence data contained 41,470 microsatellites and 17,776 Single Nucleotide Polymorphisms (SNPs/INDELS), providing a resource for population and also gene function studies. This paper details the first 454 generated data for a pelagic Antarctic species or any pelagic crustacean globally. The classical "stress proteins", such as HSP70, HSP90, ferritin and GST were all highly expressed. These genes were shown to be over expressed in the transcriptomes of Antarctic notothenioid fish and hypothesized as adaptations to living in the cold, with the associated problems of decreased protein folding efficiency and increased vulnerability to damage by reactive oxygen species. Hence, these data will provide a major resource for future physiological work on krill, but in particular a suite of "stress" genes for studies understanding marine ectotherms' capacities to cope with environmental change.

  8. High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus

    PubMed Central

    2010-01-01

    Background Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity we carried out a high-throughput sequence analysis of freshly collected B. azoricus transcriptome using gills tissues as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne potentially infectious microorganisms. Additionally, a substantial EST data set was produced and from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent" the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results A normalized cDNA library from gills tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino-sequences of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned with Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative quantitative Reverse-Transcription-Polymerase Chain Reactions (RT-PCR) and their RNA transcription level by quantitative PCR (qPCR) experiments. Conclusions We have established the first tissue transcriptional analysis of a deep-sea hydrothermal vent animal and generated a searchable catalog of genes that provides a direct method of identifying and retrieving vast numbers of novel coding sequences which can be applied in gene expression profiling experiments from a non-conventional model organism. This provides the most comprehensive sequence resource for identifying novel genes currently available for a deep-sea vent organism, in particular, genes putatively involved in immune and inflammatory reactions in vent mussels. The characterization of the B. azoricus transcriptome will facilitate research into biological processes underlying physiological adaptations to hydrothermal vent environments and will provide a basis for expanding our understanding of genes putatively involved in adaptations processes during post-capture long term acclimatization experiments, at "sea-level" conditions, using B. azoricus as a model organism. PMID:20937131

  9. Development of an ELA-DRA gene typing method based on pyrosequencing technology.

    PubMed

    Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G

    2008-11-01

    The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.

  10. RNA-Seq reveals genotype-specific molecular responses to water deficit in eucalyptus

    PubMed Central

    2011-01-01

    Background In a context of climate change, phenotypic plasticity provides long-lived species, such as trees, with the means to adapt to environmental variations occurring within a single generation. In eucalyptus plantations, water availability is a key factor limiting productivity. However, the molecular mechanisms underlying the adaptation of eucalyptus to water shortage remain unclear. In this study, we compared the molecular responses of two commercial eucalyptus hybrids during the dry season. Both hybrids differ in productivity when grown under water deficit. Results Pyrosequencing of RNA extracted from shoot apices provided extensive transcriptome coverage - a catalog of 129,993 unigenes (49,748 contigs and 80,245 singletons) was generated from 398 million base pairs, or 1.14 million reads. The pyrosequencing data enriched considerably existing Eucalyptus EST collections, adding 36,985 unigenes not previously represented. Digital analysis of read abundance in 14,460 contigs identified 1,280 that were differentially expressed between the two genotypes, 155 contigs showing differential expression between treatments (irrigated vs. non irrigated conditions during the dry season), and 274 contigs with significant genotype-by-treatment interaction. The more productive genotype displayed a larger set of genes responding to water stress. Moreover, stress signal transduction seemed to involve different pathways in the two genotypes, suggesting that water shortage induces distinct cellular stress cascades. Similarly, the response of functional proteins also varied widely between genotypes: the most productive genotype decreased expression of genes related to photosystem, transport and secondary metabolism, whereas genes related to primary metabolism and cell organisation were over-expressed. Conclusions For the most productive genotype, the ability to express a broader set of genes in response to water availability appears to be a key characteristic in the maintenance of biomass growth during the dry season. Its strategy may involve a decrease of photosynthetic activity during the dry season associated with resources reallocation through major changes in the expression of primary metabolism associated genes. Further efforts will be needed to assess the adaptive nature of the genes highlighted in this study. PMID:22047139

  11. Rapid Molecular Identification of Pathogenic Yeasts by Pyrosequencing Analysis of 35 Nucleotides of Internal Transcribed Spacer 2 ▿

    PubMed Central

    Borman, Andrew M.; Linton, Christopher J.; Oliver, Debra; Palmer, Michael D.; Szekely, Adrien; Johnson, Elizabeth M.

    2010-01-01

    Rapid identification of yeast species isolates from clinical samples is particularly important given their innately variable antifungal susceptibility profiles. Here, we have evaluated the utility of pyrosequencing analysis of a portion of the internal transcribed spacer 2 region (ITS2) for identification of pathogenic yeasts. A total of 477 clinical isolates encompassing 43 different fungal species were subjected to pyrosequencing analysis in a strictly blinded study. The molecular identifications produced by pyrosequencing were compared with those obtained using conventional biochemical tests (AUXACOLOR2) and following PCR amplification and sequencing of the D1-D2 portion of the nuclear 28S large rRNA gene. More than 98% (469/477) of isolates encompassing 40 of the 43 fungal species tested were correctly identified by pyrosequencing of only 35 bp of ITS2. Moreover, BLAST searches of the public synchronized databases with the ITS2 pyrosequencing signature sequences revealed that there was only minimal sequence redundancy in the ITS2 under analysis. In all cases, the pyrosequencing signature sequences were unique to the yeast species (or species complex) under investigation. Finally, when pyrosequencing was combined with the Whatman FTA paper technology for the rapid extraction of fungal genomic DNA, molecular identification could be accomplished within 6 h from the time of starting from pure cultures. PMID:20702674

  12. Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

    PubMed Central

    2012-01-01

    Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289

  13. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    PubMed Central

    2011-01-01

    Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms. PMID:21791039

  14. PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.

    PubMed

    Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus

    2016-12-22

    Next-generation sequencing promises the de novo genomic and transcriptomic analysis of samples of interests. However, there are only a few organisms having reference genomic sequences and even fewer having well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and quantification methods for quantifying expression level which not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the searched result, a set of virtual transcripts are generated and served as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning from the comparison among transcriptomes, PARRoT further provides linkage between these virtual transcripts and their potential function through showing best hits in SwissProt, NR database, assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours. In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms which offers the opportunity to quantify and compare transcriptome profiles through a homolog based virtual transcriptome reference. By using the homolog based reference, our strategy effectively avoids the problems that may cause from inconsistencies among transcriptomes. This strategy will shed lights on the field of comparative genomics for non-model organism. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw .

  15. Bacterial Population in Intestines of the Black Tiger Shrimp (Penaeus monodon) under Different Growth Stages

    PubMed Central

    Rungrassamee, Wanilada; Klanchui, Amornpan; Chaiyapechara, Sage; Maibunkaew, Sawarot; Tangphatsornruang, Sithichoke; Jiravanichpaisal, Pikul; Karoonuthaisiri, Nitsara

    2013-01-01

    Intestinal bacterial communities in aquaculture have been drawn to attention due to potential benefit to their hosts. To identify core intestinal bacteria in the black tiger shrimp (Penaeus monodon), bacterial populations of disease-free shrimp were characterized from intestines of four developmental stages (15-day-old post larvae (PL15), 1- (J1), 2- (J2), and 3-month-old (J3) juveniles) using pyrosequencing, real-time PCR and denaturing gradient gel electrophoresis (DGGE) approaches. A total of 25,121 pyrosequencing reads (reading length = 442±24 bases) were obtained, which were categorized by barcode for PL15 (7,045 sequences), J1 (3,055 sequences), J2 (13,130 sequences) and J3 (1,890 sequences). Bacteria in the phyla Bacteroides, Firmicutes and Proteobacteria were found in intestines at all four growth stages. There were 88, 14, 27, and 20 bacterial genera associated with the intestinal tract of PL15, J1, J2 and J3, respectively. Pyrosequencing analysis revealed that Proteobacteria (class Gammaproteobacteria) was a dominant bacteria group with a relative abundance of 89% for PL15 and 99% for J1, J2 and J3. Real-time PCR assay also confirmed that Gammaproteobacteria had the highest relative abundance in intestines from all growth stages. Intestinal bacterial communities from the three juvenile stages were more similar to each other than that of the PL shrimp based on PCA analyses of pyrosequencing results and their DGGE profiles. This study provides descriptive bacterial communities associated to the black tiger shrimp intestines during these growth development stages in rearing facilities. PMID:23577162

  16. Bacterial population in intestines of the black tiger shrimp (Penaeus monodon) under different growth stages.

    PubMed

    Rungrassamee, Wanilada; Klanchui, Amornpan; Chaiyapechara, Sage; Maibunkaew, Sawarot; Tangphatsornruang, Sithichoke; Jiravanichpaisal, Pikul; Karoonuthaisiri, Nitsara

    2013-01-01

    Intestinal bacterial communities in aquaculture have been drawn to attention due to potential benefit to their hosts. To identify core intestinal bacteria in the black tiger shrimp (Penaeus monodon), bacterial populations of disease-free shrimp were characterized from intestines of four developmental stages (15-day-old post larvae (PL15), 1- (J1), 2- (J2), and 3-month-old (J3) juveniles) using pyrosequencing, real-time PCR and denaturing gradient gel electrophoresis (DGGE) approaches. A total of 25,121 pyrosequencing reads (reading length = 442±24 bases) were obtained, which were categorized by barcode for PL15 (7,045 sequences), J1 (3,055 sequences), J2 (13,130 sequences) and J3 (1,890 sequences). Bacteria in the phyla Bacteroides, Firmicutes and Proteobacteria were found in intestines at all four growth stages. There were 88, 14, 27, and 20 bacterial genera associated with the intestinal tract of PL15, J1, J2 and J3, respectively. Pyrosequencing analysis revealed that Proteobacteria (class Gammaproteobacteria) was a dominant bacteria group with a relative abundance of 89% for PL15 and 99% for J1, J2 and J3. Real-time PCR assay also confirmed that Gammaproteobacteria had the highest relative abundance in intestines from all growth stages. Intestinal bacterial communities from the three juvenile stages were more similar to each other than that of the PL shrimp based on PCA analyses of pyrosequencing results and their DGGE profiles. This study provides descriptive bacterial communities associated to the black tiger shrimp intestines during these growth development stages in rearing facilities.

  17. The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE

    PubMed Central

    2011-01-01

    Background The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. Results We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress. Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. Conclusions This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE. PMID:21320317

  18. The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE.

    PubMed

    Molina, Carlos; Zaman-Allah, Mainassara; Khan, Faheema; Fatnassi, Nadia; Horres, Ralf; Rotter, Björn; Steinhauer, Diana; Amenc, Laurie; Drevon, Jean-Jacques; Winter, Peter; Kahl, Günter

    2011-02-14

    The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress.Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE.

  19. Insights into the Development and Evolution of Exaggerated Traits Using De Novo Transcriptomes of Two Species of Horned Scarab Beetles

    PubMed Central

    Warren, Ian A.; Vera, J. Cristobal; Johns, Annika; Zinna, Robert; Marden, James H.; Emlen, Douglas J.; Dworkin, Ian; Lavine, Laura C.

    2014-01-01

    Scarab beetles exhibit an astonishing variety of rigid exo-skeletal outgrowths, known as “horns”. These traits are often sexually dimorphic and vary dramatically across species in size, shape, location, and allometry with body size. In many species, the horn exhibits disproportionate growth resulting in an exaggerated allometric relationship with body size, as compared to other traits, such as wings, that grow proportionately with body size. Depending on the species, the smallest males either do not produce a horn at all, or they produce a disproportionately small horn for their body size. While the diversity of horn shapes and their behavioural ecology have been reasonably well studied, we know far less about the proximate mechanisms that regulate horn growth. Thus, using 454 pyrosequencing, we generated transcriptome profiles, during horn growth and development, in two different scarab beetle species: the Asian rhinoceros beetle, Trypoxylus dichotomus, and the dung beetle, Onthophagus nigriventris. We obtained over half a million reads for each species that were assembled into over 6,000 and 16,000 contigs respectively. We combined these data with previously published studies to look for signatures of molecular evolution. We found a small subset of genes with horn-biased expression showing evidence for recent positive selection, as is expected with sexual selection on horn size. We also found evidence of relaxed selection present in genes that demonstrated biased expression between horned and horn-less morphs, consistent with the theory of developmental decoupling of phenotypically plastic traits. PMID:24586317

  20. Evolution of the BBAA Component of Bread Wheat during Its History at the Allohexaploid Level[C][W][OPEN

    PubMed Central

    Zhang, Huakun; Zhu, Bo; Qi, Bao; Gou, Xiaowan; Dong, Yuzhu; Xu, Chunming; Zhang, Bangjiao; Huang, Wei; Liu, Chang; Wang, Xutong; Yang, Chunwu; Zhou, Hao; Kashkush, Khalil; Feldman, Moshe; Wendel, Jonathan F.; Liu, Bao

    2014-01-01

    Subgenome integrity in bread wheat (Triticum aestivum; BBAADD) makes possible the extraction of its BBAA component to restitute a novel plant type. The availability of such a ploidy-reversed wheat (extracted tetraploid wheat [ETW]) provides a unique opportunity to address whether and to what extent the BBAA component of bread wheat has been modified in phenotype, karyotype, and gene expression during its evolutionary history at the allohexaploid level. We report here that ETW was anomalous in multiple phenotypic traits but maintained a stable karyotype. Microarray-based transcriptome profiling identified a large number of differentially expressed genes between ETW and natural tetraploid wheat (Triticum turgidum), and the ETW-downregulated genes were enriched for distinct Gene Ontology categories. Quantitative RT-PCR analysis showed that gene expression differences between ETW and a set of diverse durum wheat (T. turgidum subsp durum) cultivars were distinct from those characterizing tetraploid cultivars per se. Pyrosequencing revealed that the expression alterations may occur to either only one or both of the B and A homoeolog transcripts in ETW. A majority of the genes showed additive expression in a resynthesized allohexaploid wheat. Analysis of a synthetic allohexaploid wheat and diverse bread wheat cultivars revealed the rapid occurrence of expression changes to the BBAA subgenomes subsequent to allohexaploidization and their evolutionary persistence. PMID:24989045

  1. Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data.

    PubMed

    Rask, Thomas S; Petersen, Bent; Chen, Donald S; Day, Karen P; Pedersen, Anders Gorm

    2016-04-22

    Amplicon pyrosequencing targets a known genetic region and thus inherently produces reads highly anticipated to have certain features, such as conserved nucleotide sequence, and in the case of protein coding DNA, an open reading frame. Pyrosequencing errors, consisting mainly of nucleotide insertions and deletions, are on the other hand likely to disrupt open reading frames. Such an inverse relationship between errors and expectation based on prior knowledge can be used advantageously to guide the process known as basecalling, i.e. the inference of nucleotide sequence from raw sequencing data. The new basecalling method described here, named Multipass, implements a probabilistic framework for working with the raw flowgrams obtained by pyrosequencing. For each sequence variant Multipass calculates the likelihood and nucleotide sequence of several most likely sequences given the flowgram data. This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria virulence gene family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon sequencing data, particularly where extensive prior knowledge is available about the obtained sequences, for example in analysis of the immunoglobulin VDJ region where Multipass can be combined with a model for the known recombining germline genes. Multipass is available for Roche 454 data at http://www.cbs.dtu.dk/services/MultiPass-1.0 , and the concept can potentially be implemented for other sequencing technologies as well.

  2. Diversity, Biogeography, and Biodegradation Potential of Actinobacteria in the Deep-Sea Sediments along the Southwest Indian Ridge

    PubMed Central

    Chen, Ping; Zhang, Limin; Guo, Xiaoxuan; Dai, Xin; Liu, Li; Xi, Lijun; Wang, Jian; Song, Lei; Wang, Yuezhu; Zhu, Yaxin; Huang, Li; Huang, Ying

    2016-01-01

    The phylum Actinobacteria has been reported to be common or even abundant in deep marine sediments, however, knowledge about the diversity, distribution, and function of actinobacteria is limited. In this study, actinobacterial diversity in the deep sea along the Southwest Indian Ridge (SWIR) was investigated using both 16S rRNA gene pyrosequencing and culture-based methods. The samples were collected at depths of 1662–4000 m below water surface. Actinobacterial sequences represented 1.2–9.1% of all microbial 16S rRNA gene amplicon sequences in each sample. A total of 5 actinobacterial classes, 17 orders, 28 families, and 52 genera were detected by pyrosequencing, dominated by the classes Acidimicrobiia and Actinobacteria. Differences in actinobacterial community compositions were found among the samples. The community structure showed significant correlations to geochemical factors, notably pH, calcium, total organic carbon, total phosphorus, and total nitrogen, rather than to spatial distance at the scale of the investigation. In addition, 176 strains of the Actinobacteria class, belonging to 9 known orders, 18 families, and 29 genera, were isolated. Among these cultivated taxa, 8 orders, 13 families, and 15 genera were also recovered by pyrosequencing. At a 97% 16S rRNA gene sequence similarity, the pyrosequencing data encompassed 77.3% of the isolates but the isolates represented only 10.3% of the actinobacterial reads. Phylogenetic analysis of all the representative actinobacterial sequences and isolates indicated that at least four new orders within the phylum Actinobacteria were detected by pyrosequencing. More than half of the isolates spanning 23 genera and all samples demonstrated activity in the degradation of refractory organics, including polycyclic aromatic hydrocarbons and polysaccharides, suggesting their potential ecological functions and biotechnological applications for carbon recycling. PMID:27621725

  3. Diversity and homogeneity of oral microbiota in healthy Korean pre-school children using pyrosequencing.

    PubMed

    Lee, Soo Eon; Nam, Ok Hyung; Lee, Hyo-Seol; Choi, Sung Chul

    2016-07-01

    Objectives The purpose of this study was designed to identify the oral microbiota in healthy Korean pre-school children using pyrosequencing. Materials and methods Dental plaque samples were obtained form 10 caries-free pre-school children. The samples were analysed using pyrosequencing. Results The pyrosequencing analysis revealed that, at the phylum level, Proteobacteria, Firmicutes, Bacteroidetes, Actinobacteria and Fusobacteria showed high abundance. Also, predominant genera were identified as core microbiome, such as Streptococcus, Neisseria, Capnocytophaga, Haemophilus and Veilonella. Conclusions The diversity and homogeneity was shown in the dental plaque microbiota in healthy Korean pre-school children.

  4. Validation of the VE1 Immunostain for the BRAF V600E Mutation in Melanoma

    PubMed Central

    Pearlstein, Michelle V.; Zedek, Daniel C.; Ollila, David W.; Treece, Amanda; Gulley, Margaret L.; Groben, Pamela A.; Thomas, Nancy E.

    2014-01-01

    BACKGROUND BRAF mutation status, and therefore eligibility for BRAF inhibitors, is currently determined by sequencing methods. We assessed the validity of VE1, a monoclonal antibody against the BRAF V600E mutant protein, in the detection of mutant BRAF V600E melanomas as classified by DNA pyrosequencing. METHODS The cases were 76 metastatic melanoma patients with only one known primary melanoma who had had BRAF codon 600 pyrosequencing of either their primary (n=19), metastatic (n=57) melanoma, or both (n=17). All melanomas (n=93) were immunostained with the BRAF VE1 antibody using a red detection system. The staining intensity of these specimens was scored from 0 – 3+ by a dermatopathologist. Scores of 0 and 1+ were considered as negative staining while scores of 2+ and 3+ were considered positive. RESULTS The VE1 antibody demonstrated a sensitivity of 85% and a specificity of 100% as compared to DNA pyrosequencing results. There was 100% concordance between VE1 immunostaining of primary and metastatic melanomas from the same patient. V600K, V600Q, and V600R BRAF melanomas did not positively stain with VE1. CONCLUSIONS This hospital-based study finds high sensitivity and specificity for the BRAF VE1 immunostain in comparison to pyrosequencing in detection of BRAF V600E in melanomas. PMID:24917033

  5. A pyrosequencing assay for the quantitative methylation analysis of the PCDHB gene cluster, the major factor in neuroblastoma methylator phenotype.

    PubMed

    Banelli, Barbara; Brigati, Claudio; Di Vinci, Angela; Casciano, Ida; Forlani, Alessandra; Borzì, Luana; Allemanni, Giorgio; Romani, Massimo

    2012-03-01

    Epigenetic alterations are hallmarks of cancer and powerful biomarkers, whose clinical utilization is made difficult by the absence of standardization and of common methods of data interpretation. The coordinate methylation of many loci in cancer is defined as 'CpG island methylator phenotype' (CIMP) and identifies clinically distinct groups of patients. In neuroblastoma (NB), CIMP is defined by a methylation signature, which includes different loci, but its predictive power on outcome is entirely recapitulated by the PCDHB cluster only. We have developed a robust and cost-effective pyrosequencing-based assay that could facilitate the clinical application of CIMP in NB. This assay permits the unbiased simultaneous amplification and sequencing of 17 out of 19 genes of the PCDHB cluster for quantitative methylation analysis, taking into account all the sequence variations. As some of these variations were at CpG doublets, we bypassed the data interpretation conducted by the methylation analysis software to assign the corrected methylation value at these sites. The final result of the assay is the mean methylation level of 17 gene fragments in the protocadherin B cluster (PCDHB) cluster. We have utilized this assay to compare the methylation levels of the PCDHB cluster between high-risk and very low-risk NB patients, confirming the predictive value of CIMP. Our results demonstrate that the pyrosequencing-based assay herein described is a powerful instrument for the analysis of this gene cluster that may simplify the data comparison between different laboratories and, in perspective, could facilitate its clinical application. Furthermore, our results demonstrate that, in principle, pyrosequencing can be efficiently utilized for the methylation analysis of gene clusters with high internal homologies.

  6. A molecular gram stain using broad range PCR and pyrosequencing technology: a potentially useful tool for diagnosing orthopaedic infections.

    PubMed

    Kobayashi, Naomi; Bauer, Thomas W; Togawa, Daisuke; Lieberman, Isador H; Sakai, Hiroshige; Fujishiro, Takaaki; Tuohy, Marion J; Procop, Gary W

    2005-06-01

    The bacteria associated with orthopaedic infections are usually common gram-positive and gram-negative bacteria. This fundamental grouping of bacteria is a necessary first step in the selection of appropriate antibiotics. Since polymerase chain reaction (PCR) is more rapid and may be more sensitive than culture, we developed a postamplification pyrosequencing method to subcategorize bacteria based on a few nucleotide polymorphisms in the 16S rRNA gene. We validated this method using well-characterized strains of bacteria and applied it to specimens from spinal surgery cases with suspected infections. Lysates of 114 bacteria including 75 species were created following standard cultivation to obtain DNA. The DNA was amplified by a broad-range real-time PCR. The amplicons were evaluated by pyrosequencing and were classified as gram-positive, gram-negative, or acid-fast bacilli based on the first three to five nucleotides sequenced. In addition, clinical cases of suspected infection were obtained from spinal surgery. The results of the "molecular Gram stain" were compared with the results of traditional Gram stain and culture. The lysates of 107 (93.9%) of the bacteria extracts tested were appropriately categorized as gram-positive and gram-negative or as acid-fast bacilli on the basis of this assay. The sensitivity and specificity of this assay were 100% and 97.4% for gram-positive and 88.3% and 100% for gram-negative isolates. All of the five clinical samples were appropriately categorized as containing gram-positive or gram-negative bacteria with this assay. This study demonstrates that high sensitivity and specificity of a molecular gram stain may be achieved using broad-range real-time PCR and pyrosequencing.

  7. Swine transcriptome characterization by combined Iso-Seq and RNA-seq for annotating the emerging long read-based reference genome

    USDA-ARS?s Scientific Manuscript database

    PacBio long-read sequencing technology is increasingly popular in genome sequence assembly and transcriptome cataloguing. Recently, a new-generation pig reference genome was assembled based on long reads from this technology. To finely annotate this genome assembly, transcriptomes of nine tissues fr...

  8. Novel multiplex qualitative detection using universal primer-multiplex-PCR combined with pyrosequencing.

    PubMed

    Shang, Ying; Xu, Wentao; Wang, Yong; Xu, Yuancong; Huang, Kunlun

    2017-12-15

    This study described a novel multiplex qualitative detection method using pyrosequencing. Based on the principle of the universal primer-multiplex-PCR, only one sequencing primer was employed to realize the detection of the multiple targets. Samples containing three genetically modified (GM) crops in different proportions were used to validate the method. The dNTP dispensing order was designed based on the product sequences. Only 12 rounds (ATCTGATCGACT) of dNTPs addition and, often, as few as three rounds (CAT) under ideal conditions, were required to detect the GM events qualitatively, and sensitivity was as low as 1% of a mixture. However, when considering a mixture, calculating signal values allowed the proportion of each GM to be estimated. Based on these results, we concluded that our novel method not only realized detection but also allowed semi-quantitative detection of individual events. Copyright © 2017. Published by Elsevier Ltd.

  9. EST sequencing and gene expression profiling of defence-related genes from Persea americana infected with Phytophthora cinnamomi

    PubMed Central

    2011-01-01

    Background Avocado (Persea americana) belongs to the Lauraceae family and is an important commercial fruit crop in over 50 countries. The most serious pathogen affecting avocado production is Phytophthora cinnamomi which causes Phytophthora root rot (PRR). Root pathogens such as P. cinnamomi and their interactions with hosts are poorly understood and despite the importance of both the avocado crop and the effect Phytophthora has on its cultivation, there is a lack of molecular knowledge underpinning our understanding of defence strategies against the pathogen. In order to initiate a better understanding of host-specific defence we have generated EST data using 454 pyrosequencing and profiled nine defence-related genes from Pc-infected avocado roots. Results 2.0 Mb of data was generated consisting of ~10,000 reads on a single lane of the GS FLX platform. Using the Newbler assembler 371 contigs were assembled, of which 367 are novel for Persea americana. Genes were classified according to Gene Ontology terms. In addition to identifying root-specific ESTs we were also able to identify and quantify the expression of nine defence-related genes that were differentially regulated in response to P. cinnamomi. Genes such as metallothionein, thaumatin and the pathogenesis related PsemI, mlo and profilin were found to be differentially regulated. Conclusions This is the first study in elucidating the avocado root transcriptome as well as identifying defence responses of avocado roots to the root pathogen P. cinnamomi. Our data is currently the only EST data that has been generated for avocado rootstocks, and the ESTs identified in this study have already been useful in identifying defence-related genes as well as providing gene information for other studies looking at processes such as ROS regulation as well as hypoxia in avocado roots. Our EST data will aid in the elucidation of the avocado transcriptome and identification of markers for improved rootstock breeding and screening. The characterization of the avocado transcriptome will furthermore form a basis for functional genomics of basal angiosperms. PMID:22108245

  10. EST sequencing and gene expression profiling of defence-related genes from Persea americana infected with Phytophthora cinnamomi.

    PubMed

    Mahomed, Waheed; Berg, Noëlani van den

    2011-11-23

    Avocado (Persea americana) belongs to the Lauraceae family and is an important commercial fruit crop in over 50 countries. The most serious pathogen affecting avocado production is Phytophthora cinnamomi which causes Phytophthora root rot (PRR). Root pathogens such as P. cinnamomi and their interactions with hosts are poorly understood and despite the importance of both the avocado crop and the effect Phytophthora has on its cultivation, there is a lack of molecular knowledge underpinning our understanding of defence strategies against the pathogen. In order to initiate a better understanding of host-specific defence we have generated EST data using 454 pyrosequencing and profiled nine defence-related genes from Pc-infected avocado roots. 2.0 Mb of data was generated consisting of ~10,000 reads on a single lane of the GS FLX platform. Using the Newbler assembler 371 contigs were assembled, of which 367 are novel for Persea americana. Genes were classified according to Gene Ontology terms. In addition to identifying root-specific ESTs we were also able to identify and quantify the expression of nine defence-related genes that were differentially regulated in response to P. cinnamomi. Genes such as metallothionein, thaumatin and the pathogenesis related PsemI, mlo and profilin were found to be differentially regulated. This is the first study in elucidating the avocado root transcriptome as well as identifying defence responses of avocado roots to the root pathogen P. cinnamomi. Our data is currently the only EST data that has been generated for avocado rootstocks, and the ESTs identified in this study have already been useful in identifying defence-related genes as well as providing gene information for other studies looking at processes such as ROS regulation as well as hypoxia in avocado roots. Our EST data will aid in the elucidation of the avocado transcriptome and identification of markers for improved rootstock breeding and screening. The characterization of the avocado transcriptome will furthermore form a basis for functional genomics of basal angiosperms.

  11. Transcriptome Analysis of an Insecticide Resistant Housefly Strain: Insights about SNPs and Regulatory Elements in Cytochrome P450 Genes

    PubMed Central

    Asp, Torben; Kristensen, Michael

    2016-01-01

    Background Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome with focus on P450 genes. Results The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad resistant strain. The 3,648 sequences were annotated with an enzyme code EC number and were mapped to 124 KEGG pathways with metabolic processes as most highly represented pathway. One hundred and twenty contigs were annotated as P450s covering 44 different P450 genes of housefly. Eight differentially expressed P450s genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolic related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis resulted in 12 SNPs and eight of them found in cyp6d1. There is variation in location, size and frequency of CpG islands and specific motifs were also identified in these P450s. Moreover, identified motifs were associated to GO terms and transcription factors using bioinformatic tools. Conclusion Transcriptome data of a spinosad resistant strain provide together with genome data fundamental support for future research to understand evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. Taken together our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s in xenobiotic detoxification. PMID:27019205

  12. The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.).

    PubMed

    Yang, Meng; Zhang, Xiaowei; Liu, Guiming; Yin, Yuxin; Chen, Kaifu; Yun, Quanzheng; Zhao, Duojun; Al-Mssallem, Ibrahim S; Yu, Jun

    2010-09-15

    Date palm (Phoenix dactylifera L.), a member of Arecaceae family, is one of the three major economically important woody palms--the two other palms being oil palm and coconut tree--and its fruit is a staple food among Middle East and North African nations, as well as many other tropical and subtropical regions. Here we report a complete sequence of the data palm chloroplast (cp) genome based on pyrosequencing. After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes--atpF, trnA-UGC, and rrn23. Unlike most monocots, date palm has a typical cp genome similar to that of tobacco--with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts.

  13. PIVOT: platform for interactive analysis and visualization of transcriptomics data.

    PubMed

    Zhu, Qin; Fisher, Stephen A; Dueck, Hannah; Middleton, Sarah; Khaladkar, Mugdha; Kim, Junhyong

    2018-01-05

    Many R packages have been developed for transcriptome analysis but their use often requires familiarity with R and integrating results of different packages requires scripts to wrangle the datatypes. Furthermore, exploratory data analyses often generate multiple derived datasets such as data subsets or data transformations, which can be difficult to track. Here we present PIVOT, an R-based platform that wraps open source transcriptome analysis packages with a uniform user interface and graphical data management that allows non-programmers to interactively explore transcriptomics data. PIVOT supports more than 40 popular open source packages for transcriptome analysis and provides an extensive set of tools for statistical data manipulations. A graph-based visual interface is used to represent the links between derived datasets, allowing easy tracking of data versions. PIVOT further supports automatic report generation, publication-quality plots, and program/data state saving, such that all analysis can be saved, shared and reproduced. PIVOT will allow researchers with broad background to easily access sophisticated transcriptome analysis tools and interactively explore transcriptome datasets.

  14. Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing.

    PubMed

    Vega-Arreguín, Julio C; Ibarra-Laclette, Enrique; Jiménez-Moraila, Beatriz; Martínez, Octavio; Vielle-Calzada, Jean Philippe; Herrera-Estrella, Luis; Herrera-Estrella, Alfredo

    2009-07-06

    In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage. To explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20-454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20-454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads (30.09 Mb) from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands (MAGIs) provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% (86,069 sequences) do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20-454 sequences and corresponding levels of gene expression. A protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, that can also be used to estimate transcript abundance of specific genes. This data suggests that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.

  15. Pyrosequencing for Rapid Detection of Mycobacterium tuberculosis Resistance to Rifampin, Isoniazid, and Fluoroquinolones ▿

    PubMed Central

    Bravo, Lulette Tricia C.; Tuohy, Marion J.; Ang, Concepcion; Destura, Raul V.; Mendoza, Myrna; Procop, Gary W.; Gordon, Steven M.; Hall, Geraldine S.; Shrestha, Nabin K.

    2009-01-01

    After isoniazid and rifampin (rifampicin), the next pivotal drug class in Mycobacterium tuberculosis treatment is the fluoroquinolone class. Mutations in resistance-determining regions (RDR) of the rpoB, katG, and gyrA genes occur with frequencies of 97%, 50%, and 85% among M. tuberculosis isolates resistant to rifampin, isoniazid, and fluoroquinolones, respectively. Sequences are highly conserved, and certain mutations correlate well with phenotypic resistance. We developed a pyrosequencing assay to determine M. tuberculosis genotypic resistance to rifampin, isoniazid, and fluoroquinolones. We characterized 102 M. tuberculosis clinical isolates from the Philippines for susceptibility to rifampin, isoniazid, and ofloxacin by using the conventional submerged-disk proportion method and validated our pyrosequencing assay using these isolates. DNA was extracted and amplified by using PCR primers directed toward the RDR of the rpoB, katG, and gyrA genes, and pyrosequencing was performed on the extracts. The M. tuberculosis H37Rv strain (ATCC 25618) was used as the reference strain. The sensitivities and specificities of pyrosequencing were 96.7% and 97.3%, 63.8% and 100%, and 70.0% and 100% for the detection of resistance to rifampin, isoniazid, and ofloxacin, respectively. Pyrosequencing is thus a rapid and accurate method for detecting M. tuberculosis resistance to these three drugs. PMID:19846642

  16. An insight into the transcriptome of the digestive tract of the bloodsucking bug, Rhodnius prolixus.

    PubMed

    Ribeiro, José M C; Genta, Fernando A; Sorgine, Marcos H F; Logullo, Raquel; Mesquita, Rafael D; Paiva-Silva, Gabriela O; Majerowicz, David; Medeiros, Marcelo; Koerich, Leonardo; Terra, Walter R; Ferreira, Clélia; Pimentel, André C; Bisch, Paulo M; Leite, Daniel C; Diniz, Michelle M P; da S G V Junior, João Lídio; Da Silva, Manuela L; Araujo, Ricardo N; Gandara, Ana Caroline P; Brosson, Sébastien; Salmon, Didier; Bousbata, Sabrina; González-Caballero, Natalia; Silber, Ariel Mariano; Alves-Bezerra, Michele; Gondim, Katia C; Silva-Neto, Mário Alberto C; Atella, Georgia C; Araujo, Helena; Dias, Felipe A; Polycarpo, Carla; Vionette-Amaral, Raquel J; Fampa, Patrícia; Melo, Ana Claudia A; Tanaka, Aparecida S; Balczun, Carsten; Oliveira, José Henrique M; Gonçalves, Renata L S; Lazoski, Cristiano; Rivera-Pomar, Rolando; Diambra, Luis; Schaub, Günter A; Garcia, Elói S; Azambuja, Patrícia; Braz, Glória R C; Oliveira, Pedro L

    2014-01-01

    The bloodsucking hemipteran Rhodnius prolixus is a vector of Chagas' disease, which affects 7-8 million people today in Latin America. In contrast to other hematophagous insects, the triatomine gut is compartmentalized into three segments that perform different functions during blood digestion. Here we report analysis of transcriptomes for each of the segments using pyrosequencing technology. Comparison of transcript frequency in digestive libraries with a whole-body library was used to evaluate expression levels. All classes of digestive enzymes were highly expressed, with a predominance of cysteine and aspartic proteinases, the latter showing a significant expansion through gene duplication. Although no protein digestion is known to occur in the anterior midgut (AM), protease transcripts were found, suggesting secretion as pro-enzymes, being possibly activated in the posterior midgut (PM). As expected, genes related to cytoskeleton, protein synthesis apparatus, protein traffic, and secretion were abundantly transcribed. Despite the absence of a chitinous peritrophic membrane in hemipterans - which have instead a lipidic perimicrovillar membrane lining over midgut epithelia - several gut-specific peritrophin transcripts were found, suggesting that these proteins perform functions other than being a structural component of the peritrophic membrane. Among immunity-related transcripts, while lysozymes and lectins were the most highly expressed, several genes belonging to the Toll pathway - found at low levels in the gut of most insects - were identified, contrasting with a low abundance of transcripts from IMD and STAT pathways. Analysis of transcripts related to lipid metabolism indicates that lipids play multiple roles, being a major energy source, a substrate for perimicrovillar membrane formation, and a source for hydrocarbons possibly to produce the wax layer of the hindgut. Transcripts related to amino acid metabolism showed an unanticipated priority for degradation of tyrosine, phenylalanine, and tryptophan. Analysis of transcripts related to signaling pathways suggested a role for MAP kinases, GTPases, and LKBP1/AMP kinases related to control of cell shape and polarity, possibly in connection with regulation of cell survival, response of pathogens and nutrients. Together, our findings present a new view of the triatomine digestive apparatus and will help us understand trypanosome interaction and allow insights into hemipteran metabolic adaptations to a blood-based diet.

  17. Pyrosequencing analysis for detection of a BRAFV600E mutation in an FNAB specimen of thyroid nodules.

    PubMed

    Kim, Suk Kyeong; Kim, Dong-Lim; Han, Hye Seung; Kim, Wan Seop; Kim, Seung Ja; Moon, Won Jin; Oh, Seo Young; Hwang, Tae Sook

    2008-06-01

    Fine-needle aspiration biopsy (FNAB) is the primary means of distinguishing benign from malignant and of guiding therapeutic intervention in thyroid nodules. However, 10% to 30% of cases with indeterminate cytology in FNAB need other diagnostic tools to refine diagnosis. We compared the pyrosequencing method with the conventional direct DNA sequencing analysis and investigated the usefulness of preoperative BRAF mutation analysis as an adjunct diagnostic tool with routine FNAB. A total of 103 surgically confirmed patients' FNA slides were recruited and DNA was extracted after atypical cells were scraped from the slides. BRAF mutation was analyzed by pyrosequencing and direct DNA sequencing. Sixty-three (77.8%) of 81 histopathologically diagnosed malignant nodules revealed positive BRAF mutation on pyrosequencing analysis. In detail, 63 (84.0%) of 75 papillary thyroid carcinoma (PTC) samples showed positive BRAF mutation, whereas 3 follicular thyroid carcinomas, 1 anaplastic carcinoma, 1 medullary thyroid carcinoma, and 1 metastatic lung carcinoma did not show BRAF mutation. None of 22 benign nodules had BRAF mutation in both pyrosequencing and direct DNA sequencing. Out of 27 thyroid nodules classified as 'indeterminate' on cytologic examination preoperatively, 21 (77.8%) cases turned out to be malignant: 18 PTCs (including 2 follicular variant types) and 3 follicular thyroid carcinomas. Among these, 13 (61.9%) classic PTCs had BRAF mutation. None of 6 benign nodules, including 3 follicular adenomas and 3 nodular hyperplasias, had BRAF mutation. Among 63 PTCs with positive BRAF mutation detected by pyrosequencing analysis, 3 cases did not show BRAF mutation by direct DNA sequencing. Although it was not statistically significant, pyrosequencing was superior to direct DNA sequencing in detecting the BRAF mutation of thyroid nodules (P=0.25). Detecting BRAF mutation by pyrosequencing is more sensitive, faster, and less expensive than direct DNA sequencing and is proposed as an adjunct diagnostic tool in evaluating thyroid nodules of indeterminate cytology.

  18. Mucosal Transcriptomics Implicates Under Expression of BRINP3 in the Pathogenesis of Ulcerative Colitis

    PubMed Central

    Smith, Philip J.; Levine, Adam P.; Dunne, Jenny; Guilhamon, Paul; Turmaine, Mark; Sewell, Gavin W.; O'Shea, Nuala R.; Vega, Roser; Paterson, Jennifer C.; Oukrif, Dahmane; Beck, Stephan; Bloom, Stuart L.; Novelli, Marco; Rodriguez-Justo, Manuel; Smith, Andrew M.

    2014-01-01

    Background: Mucosal abnormalities are potentially important in the primary pathogenesis of ulcerative colitis (UC). We investigated the mucosal transcriptomic expression profiles of biopsies from patients with UC and healthy controls, taken from macroscopically noninflamed tissue from the terminal ileum and 3 colonic locations with the objective of identifying abnormal molecules that might be involved in disease development. Methods: Whole-genome transcriptional analysis was performed on intestinal biopsies taken from 24 patients with UC, 26 healthy controls, and 14 patients with Crohn's disease. Differential gene expression analysis was performed at each tissue location separately, and results were then meta-analyzed. Significantly, differentially expressed genes were validated using quantitative polymerase chain reaction. The location of gene expression within the colon was determined using immunohistochemistry, subcellular fractionation, electron and confocal microscopy. DNA methylation was quantified by pyrosequencing. Results: Only 4 probes were abnormally expressed throughout the colon in patients with UC with Bone morphogenetic protein/Retinoic acid Inducible Neural-specific 3 (BRINP3) being the most significantly underexpressed. Attenuated expression of BRINP3 in UC was independent of current inflammation, unrelated to phenotype or treatment, and remained low at rebiopsy an average of 22 months later. BRINP3 is localized to the brush border of the colonic epithelium and expression is influenced by DNA methylation within its promoter. Conclusions: Genome-wide expression analysis of noninflamed mucosal biopsies from patients with UC identified BRINP3 as significantly underexpressed throughout the colon in a large subset of patients with UC. Low levels of this gene could predispose or contribute to the maintenance of the characteristic mucosal inflammation seen in this condition. PMID:25171508

  19. Tissue-specific transcriptomics of the exotic invasive insect pest emerald ash borer (Agrilus planipennis).

    PubMed

    Mittapalli, Omprakash; Bai, Xiaodong; Mamidala, Praveen; Rajarapu, Swapna Priya; Bonello, Pierluigi; Herms, Daniel A

    2010-10-28

    The insect midgut and fat body represent major tissue interfaces that deal with several important physiological functions including digestion, detoxification and immune response. The emerald ash borer (Agrilus planipennis), is an exotic invasive insect pest that has killed millions of ash trees (Fraxinus spp.) primarily in the Midwestern United States and Ontario, Canada. However, despite its high impact status little knowledge exists for A. planipennis at the molecular level. Newer-generation Roche-454 pyrosequencing was used to obtain 126,185 reads for the midgut and 240,848 reads for the fat body, which were assembled into 25,173 and 37,661 high quality expressed sequence tags (ESTs) for the midgut and the fat body of A. planipennis larvae, respectively. Among these ESTs, 36% of the midgut and 38% of the fat body sequences showed similarity to proteins in the GenBank nr database. A high number of the midgut sequences contained chitin-binding peritrophin (248)and trypsin (98) domains; while the fat body sequences showed high occurrence of cytochrome P450s (85) and protein kinase (123) domains. Further, the midgut transcriptome of A. planipennis revealed putative microbial transcripts encoding for cell-wall degrading enzymes such as polygalacturonases and endoglucanases. A significant number of SNPs (137 in midgut and 347 in fat body) and microsatellite loci (317 in midgut and 571 in fat body) were predicted in the A. planipennis transcripts. An initial assessment of cytochrome P450s belonging to various CYP clades revealed distinct expression patterns at the tissue level. To our knowledge this study is one of the first to illuminate tissue-specific gene expression in an invasive insect of high ecological and economic consequence. These findings will lay the foundation for future gene expression and functional studies in A. planipennis.

  20. An Appropriate Cutoff Value for Determining the Colonization of Helicobacter pylori by the Pyrosequencing Method: Comparison with Conventional Methods.

    PubMed

    Kim, Jaeyeon; Kim, Nayoung; Jo, Hyun Jin; Park, Ji Hyun; Nam, Ryoung Hee; Seok, Yeong-Jae; Kim, Yeon-Ran; Kim, Joo Sung; Kim, Jung Mogg; Kim, Jung Min; Lee, Dong Ho; Jung, Hyun Chae

    2015-10-01

    Sequencing of 16S ribosomal RNA (rRNA) gene has improved the characterization of microbial communities. It enabled the detection of low abundance gastric Helicobacter pylori sequences even in subjects that were found to be H. pylori negative with conventional methods. The objective of this study was to obtain a cutoff value for H. pylori colonization in gastric mucosa samples by pyrosequencing method. Gastric mucosal biopsies were taken from 63 subjects whose H. pylori status was determined by a combination of serology, rapid urease test, culture, and histology. Microbial DNA from mucosal samples was amplified by PCR using universal bacterial primers. 16S rDNA amplicons were pyrosequenced. ROC curve analysis was performed to determine the cutoff value for H. pylori colonization by pyrosequencing. In addition, temporal changes in the stomach microbiota were observed in eight initially H. pylori-positive and eight H. pylori-negative subjects at a single time point 1-8 years later. Of the 63 subjects, the presence of H. pylori sequences was detected in all (28/28) conventionally H. pylori-positive samples and in 60% (21/35) of H. pylori-negative samples. The average percent of H. pylori reads in each sample was 0.67 ± 1.09% in the H. pylori-negative group. Cutoff value for clinically positive H. pylori status was approximately 1.22% based on ROC curve analysis (AUC = 0.957; p < .001). Helicobacter pylori was successfully eradicated in five of seven treated H. pylori-positive subjects (71.4%), and the percentage of H. pylori reads in these five subjects dropped from 1.3-95.18% to 0-0.16% after eradication. These results suggest that the cutoff value of H. pylori sequence percentage for H. pylori colonization by pyrosequencing could be set at approximately 1%. It might be helpful to analyze gastric microbiota related to H. pylori sequence status. © 2015 John Wiley & Sons Ltd.

  1. Rumen bacterial community evaluated by 454 pyrosequencing and terminal restriction fragment length polymorphism analyses in dairy sheep fed marine algae.

    PubMed

    Castro-Carrera, T; Toral, P G; Frutos, P; McEwan, N R; Hervás, G; Abecia, L; Pinloche, E; Girdwood, S E; Belenguer, A

    2014-03-01

    Developing novel strategies to increase the content of bioactive unsaturated fatty acids (FA) in ruminant-derived products requires a deeper understanding of rumen biohydrogenation and bacteria involved in this process. Although high-throughput pyrosequencing may allow for a great coverage of bacterial diversity, it has hardly been used to investigate the microbiology of ruminal FA metabolism. In this experiment, 454 pyrosequencing and a molecular fingerprinting technique (terminal restriction fragment length polymorphism; T-RFLP) were used concurrently to assess the effect of diet supplementation with marine algae (MA) on the rumen bacterial community of dairy sheep. Eleven lactating ewes were divided in 2 lots and offered a total mixed ration based on alfalfa hay and concentrate (40:60), supplemented with 0 (control) or 8 (MA) g of MA/kg of dry matter. After 54 d on treatments, animals were slaughtered and samples of rumen content and fluid were collected separately for microbial analysis. Pyrosequencing yielded a greater coverage of bacterial diversity than T-RFLP and allowed the identification of low abundant populations. Conversely, both molecular approaches pointed to similar conclusions and showed that relevant changes due to MA addition were observed within the major ruminal phyla, namely Bacteroidetes, Firmicutes, and Proteobacteria. Decreases in the abundance of unclassified Bacteroidales, Porphyromonadaceae, and Ruminococcaceae and increases in as-yet uncultured species of the family Succinivibrionaceae, might be related to a potential role of these groups in different pathways of rumen FA metabolism. Diet supplementation with MA, however, had no effect on the relative abundance of Butyrivibrio and Pseudobutyrivibrio genera. In addition, results from both 454 pyrosequencing and T-RFLP indicate that the effect of MA was rather consistent in rumen content or fluid samples, despite inherent differences between these fractions in their bacterial composition. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  2. Identification of Male Gametogenesis Expressed Genes from the Scallop Nodipecten subnodosus by Suppressive Subtraction Hybridization and Pyrosequencing

    PubMed Central

    Llera-Herrera, Raúl; García-Gasca, Alejandra; Abreu-Goodger, Cei; Huvet, Arnaud; Ibarra, Ana M.

    2013-01-01

    Despite the great advances in sequencing technologies, genomic and transcriptomic information for marine non-model species with ecological, evolutionary, and economical interest is still scarce. In this work we aimed to identify genes expressed during spermatogenesis in the functional hermaphrodite scallop Nodipecten subnodosus (Mollusca: Bivalvia: Pectinidae), with the purpose of obtaining a panel of genes that would allow for the study of differentially transcribed genes between diploid and triploid scallops in the context of meiotic arrest and reproductive sterility. Because our aim was to isolate genes involved in meiosis and other testis maturation-related processes, we generated suppressive subtractive hybridization libraries of testis vs. inactive gonad. We obtained 352 and 177 ESTs by clone sequencing, and using pyrosequencing (454-Roche) we maximized the identified ESTs to 34,276 reads. A total of 1,153 genes from the testis library had a blastx hit and GO annotation, including genes specific for meiosis, spermatogenesis, sex-differentiation, and transposable elements. Some of the identified meiosis genes function in chromosome pairing (scp2, scp3), recombination and DNA repair (dmc1, rad51, ccnb1ip1/hei10), and meiotic checkpoints (rad1, hormad1, dtl/cdt2). Gene expression analyses in different gametogenic stages in both sexual regions of the gonad of meiosis genes confirmed that the expression was specific or increased towards the maturing testis. Spermatogenesis genes included known testis-specific ones (kelch-10, shippo1, adad1), with some of these known to be associated to sterility. Sex differentiation genes included one of the most conserved genes at the bottom of the sex-determination cascade (dmrt1). Transcript from transposable elements, reverse transcriptase, and transposases in this library evidenced that transposition is an active process during spermatogenesis in N. subnodosus. In relation to the inactive library, we identified 833 transcripts with functional annotation related to activation of the transcription and translation machinery, as well as to germline control and maintenance. PMID:24066034

  3. Detection of polyoma virus in brain tissue of patients with progressive multifocal leukoencephalopathy by real-time PCR and pyrosequencing.

    PubMed

    Beck, Rose C; Kohn, Debra J; Tuohy, Marion J; Prayson, Richard A; Yen-Lieberman, Belinda; Procop, Gary W

    2004-03-01

    We evaluated 2 methods, a LightCycler PCR assay and pyrosequencing for the detection of the JC polyoma virus (JCV) in fixed brain tissue of 10 patients with and 3 control patients without progressive multifocal leukoencephalopathy (PML). Nucleic acid extraction was performed after deparaffinization and proteinase K digestion. The LightCycler assay differentiates the BK virus (BKV), JCV, and SV40 using melt curve analysis. Conventional PCR was used with the same primers to generate products for pyrosequencing. Two sequencing primers were used that differentiate the polyoma viruses. Seven of 11 biopsies (1 patient had 2 biopsies) with PML were positive for JCV by real-time PCR and/or PCR/pyrosequencing. Three of 4 remaining biopsies were positive by real-time PCR but had melting points between JCV and SV40. The 4 specimens that were negative or atypical by LightCycler PCR were positive by traditional PCR, but 1 had an amplicon of lower molecular weight by gel electrophoresis. These were shown to represent JCV by at least 1 of the 2 pyrosequencing primers. The biopsies from patients without PML were PCR negative. Both the LightCycler and pyrosequencing assays are useful for confirming JCV in brain biopsies from patients with PML, but variant JCVs may require supplementary methods to confirm JCV infection.

  4. Barcoded pyrosequencing analysis of the microbial community in a simulator of the human gastrointestinal tract showed a colon region-specific microbiota modulation for two plant-derived polysaccharide blends.

    PubMed

    Marzorati, Massimo; Maignien, Lois; Verhelst, An; Luta, Gabriela; Sinnott, Robert; Kerckhof, Frederiek Maarten; Boon, Nico; Van de Wiele, Tom; Possemiers, Sam

    2013-02-01

    The combination of a Simulator of the Human Intestinal Microbial Ecosystem with ad hoc molecular techniques (i.e. pyrosequencing, denaturing gradient gel electrophoresis and quantitative PCR) allowed an evaluation of the extent to which two plant polysaccharide supplements could modify a complex gut microbial community. The presence of Aloe vera gel powder and algae extract in product B as compared to the standard blend (product A) improved its fermentation along the entire simulated colon. The potential extended effect of product B in the simulated distal colon, as compared to product A, was confirmed by: (i) the separate clustering of the samples before and after the treatment in the phylogenetic-based dendrogram and OTU-based PCoA plot only for product B; (ii) a higher richness estimator (+33 vs. -36 % of product A); and (iii) a higher dynamic parameter (21 vs. 13 %). These data show that the combination of well designed in vitro simulators with barcoded pyrosequencing is a powerful tool for characterizing changes occurring in the gut microbiota following a treatment. However, for the quantification of low-abundance species-of interest because of their relationship to potential positive health effects (i.e. bifidobacteria or lactobacilli)-conventional molecular ecological approaches, such as PCR-DGGE and qPCR, still remain a very useful complementary tool.

  5. Cancer Transcriptome Dataset Analysis: Comparing Methods of Pathway and Gene Regulatory Network-Based Cluster Identification.

    PubMed

    Nam, Seungyoon

    2017-04-01

    Cancer transcriptome analysis is one of the leading areas of Big Data science, biomarker, and pharmaceutical discovery, not to forget personalized medicine. Yet, cancer transcriptomics and postgenomic medicine require innovation in bioinformatics as well as comparison of the performance of available algorithms. In this data analytics context, the value of network generation and algorithms has been widely underscored for addressing the salient questions in cancer pathogenesis. Analysis of cancer trancriptome often results in complicated networks where identification of network modularity remains critical, for example, in delineating the "druggable" molecular targets. Network clustering is useful, but depends on the network topology in and of itself. Notably, the performance of different network-generating tools for network cluster (NC) identification has been little investigated to date. Hence, using gastric cancer (GC) transcriptomic datasets, we compared two algorithms for generating pathway versus gene regulatory network-based NCs, showing that the pathway-based approach better agrees with a reference set of cancer-functional contexts. Finally, by applying pathway-based NC identification to GC transcriptome datasets, we describe cancer NCs that associate with candidate therapeutic targets and biomarkers in GC. These observations collectively inform future research on cancer transcriptomics, drug discovery, and rational development of new analysis tools for optimal harnessing of omics data.

  6. Evaluation of six primer pairs targeting the nuclear rRNA operon for characterization of arbuscular mycorrhizal fungal (AMF) communities using 454 pyrosequencing.

    PubMed

    Van Geel, Maarten; Busschaert, Pieter; Honnay, Olivier; Lievens, Bart

    2014-11-01

    In the last few years, 454 pyrosequencing-based analysis of arbuscular mycorrhizal fungal (AMF; Glomeromycota) communities has tremendously increased our knowledge of the distribution and diversity of AMF. Nonetheless, comparing results between different studies is difficult, as different target genes (or regions thereof) and primer combinations, with potentially dissimilar specificities and efficacies, are being utilized. In this study we evaluated six primer pairs that have previously been used in AMF studies (NS31-AM1, AMV4.5NF-AMDGR, AML1-AML2, NS31-AML2, FLR3-LSUmBr and Glo454-NDL22) for their use in 454 pyrosequencing based on both an in silico approach and 454 pyrosequencing of AMF communities from apple tree roots. Primers were evaluated in terms of (i) in silico coverage of Glomeromycota fungi, (ii) the number of high-quality sequences obtained, (iii) selectivity for AMF species, (iv) reproducibility and (v) ability to accurately describe AMF communities. We show that primer pairs AMV4.5NF-AMDGR, AML1-AML2 and NS31-AML2 outperformed the other tested primer pairs in terms of number of Glomeromycota reads (AMF specificity and coverage). Additionally, these primer pairs were found to have no or only few mismatches to AMF sequences and were able to consistently describe AMF communities from apple roots. However, whereas most high-quality AMF sequences were obtained for AMV4.5NF-AMDGR, our results also suggest that this primer pair favored amplification of Glomeraceae sequences at the expense of Ambisporaceae, Claroideoglomeraceae and Paraglomeraceae sequences. Furthermore, we demonstrate the complementary specificity of AMV4.5NF-AMDGR with AML1-AML2, and of AMV4.5NF-AMDGR with NS31-AML2, making these primer combinations highly suitable for tandem use in covering the diversity of AMF communities. Copyright © 2014 Elsevier B.V. All rights reserved.

  7. Preliminary characterization of the oral microbiota of Chinese adults with and without gingivitis

    PubMed Central

    2011-01-01

    Background Microbial communities inhabiting human mouth are associated with oral health and disease. Previous studies have indicated the general prevalence of adult gingivitis in China to be high. The aim of this study was to characterize in depth the oral microbiota of Chinese adults with or without gingivitis, by defining the microbial phylogenetic diversity and community-structure using highly paralleled pyrosequencing. Methods Six non-smoking Chinese, three with and three without gingivitis (age range 21-39 years, 4 females and 2 males) were enrolled in the present cross-sectional study. Gingival parameters of inflammation and bleeding on probing were characterized by a clinician using the Mazza Gingival Index (MGI). Plaque (sampled separately from four different oral sites) and salivary samples were obtained from each subject. Sequences and relative abundance of the bacterial 16 S rDNA PCR-amplicons were determined via pyrosequencing that produced 400 bp-long reads. The sequence data were analyzed via a computational pipeline customized for human oral microbiome analyses. Furthermore, the relative abundances of selected microbial groups were validated using quantitative PCR. Results The oral microbiomes from gingivitis and healthy subjects could be distinguished based on the distinct community structures of plaque microbiomes, but not the salivary microbiomes. Contributions of community members to community structure divergence were statistically accessed at the phylum, genus and species-like levels. Eight predominant taxa were found associated with gingivitis: TM7, Leptotrichia, Selenomonas, Streptococcus, Veillonella, Prevotella, Lautropia, and Haemophilus. Furthermore, 98 species-level OTUs were identified to be gingivitis-associated, which provided microbial features of gingivitis at a species resolution. Finally, for the two selected genera Streptococcus and Fusobacterium, Real-Time PCR based quantification of relative bacterial abundance validated the pyrosequencing-based results. Conclusions This methods study suggests that oral samples from this patient population of gingivitis can be characterized via plaque microbiome by pyrosequencing the 16 S rDNA genes. Further studies that characterize serial samples from subjects (longitudinal study design) with a larger population size may provide insight into the temporal and ecological features of oral microbial communities in clinically-defined states of gingivitis. PMID:22152152

  8. Application of Pyrosequencing® in Food Biodefense.

    PubMed

    Amoako, Kingsley Kwaku

    2015-01-01

    The perpetration of a bioterrorism attack poses a significant risk for public health with potential socioeconomic consequences. It is imperative that we possess reliable assays for the rapid and accurate identification of biothreat agents to make rapid risk-informed decisions on emergency response. The development of advanced methodologies for the detection of biothreat agents has been evolving rapidly since the release of the anthrax spores in the mail in 2001, and recent advances in detection and identification techniques could prove to be an essential component in the defense against biological attacks. Sequence-based approaches such as Pyrosequencing(®), which has the capability to determine short DNA stretches in real time using biotinylated PCR amplicons, have potential biodefense applications. Using markers from the virulence plasmids and chromosomal regions, my laboratory has demonstrated the power of this technology in the rapid, specific, and sensitive detection of B. anthracis spores and Yersinia pestis in food. These are the first applications for the detection of the two organisms in food. Furthermore, my lab has developed a rapid assay to characterize the antimicrobial resistance (AMR) gene profiles for Y. pestis using Pyrosequencing. Pyrosequencing is completed in about 60 min (following PCR amplification) and yields accurate and reliable results with an added layer of confidence, thus enabling rapid risk-informed decisions to be made. A typical run yields 40-84 bp reads with 94-100 % identity to the expected sequence. It also provides a rapid method for determining the AMR profile as compared to the conventional plate method which takes several days. The method described is proposed as a novel detection system for potential application in food biodefense.

  9. Field Monitoring of Avian Influenza Viruses: Whole-Genome Sequencing and Tracking of Neuraminidase Evolution Using 454 Pyrosequencing

    PubMed Central

    Croville, Guillaume; Soubies, Sébastien Mathieu; Barbieri, Johanna; Klopp, Christophe; Mariette, Jérôme; Bouchez, Olivier; Camus-Bouclainville, Christelle

    2012-01-01

    Adaptation of avian influenza viruses (AIVs) from waterfowl to domestic poultry with a deletion in the neuraminidase (NA) stalk has already been reported. The way the virus undergoes this evolution, however, is thus far unclear. We address this question using pyrosequencing of duck and turkey low-pathogenicity AIVs. Ducks and turkeys were sampled at the very beginning of an H6N1 outbreak, and turkeys were swabbed again 8 days later. NA stalk deletions were evidenced in turkeys by Sanger sequencing. To further investigate viral evolution, 454 pyrosequencing was performed: for each set of samples, up to 41,500 reads of ca. 400 bp were generated and aligned. Genetic polymorphisms between duck and turkey viruses were tracked on the whole genome. NA deletion was detected in less than 2% of reads in duck feces but in 100% of reads in turkey tracheal specimens collected at the same time. Further variations in length were observed in NA from turkeys 8 days later. Similarly, minority mutants emerged on the hemagglutinin (HA) gene, with substitutions mostly in the receptor binding site on the globular head. These critical changes suggest a strong evolutionary pressure in turkeys. The increasing performances of next-generation sequencing technologies should enable us to monitor the genomic diversity of avian influenza viruses and early emergence of potentially pathogenic variants within bird flocks. The present study, based on 454 pyrosequencing, suggests that NA deletion, an example of AIV adaptation from waterfowl to domestic poultry, occurs by selection rather than de novo emergence of viral mutants. PMID:22718944

  10. Fluorogenic DNA Sequencing in PDMS Microreactors

    PubMed Central

    Sims, Peter A.; Greenleaf, William J.; Duan, Haifeng; Xie, X. Sunney

    2012-01-01

    We have developed a multiplex sequencing-by-synthesis method combining terminal-phosphate labeled fluorogenic nucleotides (TPLFNs) and resealable microreactors. In the presence of phosphatase, the incorporation of a non-fluorescent TPLFN into a DNA primer by DNA polymerase results in a fluorophore. We immobilize DNA templates within polydimethylsiloxane (PDMS) microreactors, sequentially introduce one of the four identically labeled TPLFNs, seal the microreactors, allow template-directed TPLFN incorporation, and measure the signal from the fluorophores trapped in the microreactors. This workflow allows sequencing in a manner akin to pyrosequencing but without constant monitoring of each microreactor. With cycle times of <10 minutes, we demonstrate 30 base reads with ∼99% raw accuracy. “Fluorogenic pyrosequencing” combines benefits of pyrosequencing, such as rapid turn-around, native DNA generation, and single-color detection, with benefits of fluorescence-based approaches, such as highly sensitive detection and simple parallelization. PMID:21666670

  11. Pyrosequencing analysis of the microbial diversity of airag, khoormog and tarag, traditional fermented dairy products of mongolia.

    PubMed

    Oki, Kaihei; Dugersuren, Jamyan; Demberel, Shirchin; Watanabe, Koichi

    2014-01-01

    Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare's milk), 5 Khoormog (fermented camel's milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 phyla. The genus Lactobacillus is a core bacterial component of Mongolian fermented milks, and Lactobacillus helveticus, Lactobacillus kefiranofaciens and Lactobacillus delbrueckii were the predominant species of lactic acid bacteria (LAB) in the Airag, Khoormog and Tarag samples, respectively. By using this pyrosequencing approach, we successfully detected most LAB species that have been isolated as well as seven LAB species that have not been found in our previous culture-based study. A subsequent analysis of the principal components of the samples revealed that L. delbrueckii, L. helveticus, L. kefiranofaciens and Streptococcus thermophilus were the main factors influencing the microbial diversity of these Mongolian traditional fermented dairy products and that this diversity correlated with the animal species from which the milk was sourced.

  12. The Identification and Differentiation between Burkholderia mallei and Burkholderia pseudomallei Using One Gene Pyrosequencing.

    PubMed

    Gilling, Damian H; Luna, Vicki Ann; Pflugradt, Cori

    2014-01-01

    The etiologic agents for melioidosis and glanders, Burkholderia mallei and Burkholderia pseudomallei respectively, are genetically similar making identification and differentiation from other Burkholderia species and each other challenging. We used pyrosequencing to determine the presence or absence of an insertion sequence IS407A within the flagellin P (fliP) gene and to exploit the difference in orientation of this gene in the two species. Oligonucleotide primers were designed to selectively target the IS407A-fliP interface in B. mallei and the fliP gene specifically at the insertion point in B. pseudomallei. We then examined DNA from ten B. mallei, ten B. pseudomallei, 14 B. cepacia, eight other Burkholderia spp., and 17 other bacteria. Resultant pyrograms encompassed the target sequence that contained either the fliP gene with the IS407A interruption or the fully intact fliP gene with 100% sensitivity and 100% specificity. These pyrosequencing assays based upon a single gene enable investigators to reliably identify the two species. The information obtained by these assays provides more knowledge of the genomic reduction that created the new species B. mallei from B. pseudomallei and may point to new targets that can be exploited in the future.

  13. Molecular analysis of the bacterial microbiome in the forestomach fluid from the dromedary camel (Camelus dromedarius).

    PubMed

    Bhatt, Vaibhav D; Dande, Suchitra S; Patil, Nitin V; Joshi, Chaitanya G

    2013-04-01

    Rumen microorganisms play an important role in ruminant digestion and absorption of nutrients and have great potential applications in the field of rumen adjusting, food fermentation and biomass utilization etc. In order to investigate the composition of microorganisms in the rumen of camel (Camelus dromedarius), this study delves in the microbial diversity by culture-independent approach. It includes comparison of rumen samples investigated in the present study to other currently available metagenomes to reveal potential differences in rumen microbial systems. Pyrosequencing based metagenomics was applied to analyze phylogenetic and metabolic profiles by MG-RAST, a web based tool. Pyrosequencing of camel rumen sample yielded 8,979,755 nucleotides assembled to 41,905 sequence reads with an average read length of 214 nucleotides. Taxonomic analysis of metagenomic reads indicated Bacteroidetes (55.5 %), Firmicutes (22.7 %) and Proteobacteria (9.2 %) phyla as predominant camel rumen taxa. At a finer phylogenetic resolution, Bacteroides species dominated the camel rumen metagenome. Functional analysis revealed that clustering-based subsystem and carbohydrate metabolism were the most abundant SEED subsystem representing 17 and 13 % of camel metagenome, respectively. A high taxonomic and functional similarity of camel rumen was found with the cow metagenome which is not surprising given the fact that both are mammalian herbivores with similar digestive tract structures and functions. Combined pyrosequencing approach and subsystems-based annotations available in the SEED database allowed us access to understand the metabolic potential of these microbiomes. Altogether, these data suggest that agricultural and animal husbandry practices can impose significant selective pressures on the rumen microbiota regardless of rumen type. The present study provides a baseline for understanding the complexity of camel rumen microbial ecology while also highlighting striking similarities and differences when compared to other animal gastrointestinal environments.

  14. Pyrosequencing analysis of microbial communities in hollow fiber-membrane biofilm reactors system for treating high-strength nitrogen wastewater.

    PubMed

    Park, Jung-Hun; Choi, Okkyoung; Lee, Tae-Ho; Kim, Hyunook; Sang, Byoung-In

    2016-11-01

    Wastewaters from swine farms, nitrogen-dealing industries or side-stream processes of a wastewater treatment plant (e.g., anaerobic digesters, sludge thickening processes, etc.) are characterized by low C/N ratios and not easily treatable. In this study, a hollow fiber-membrane biofilm reactors (HF-MBfR) system consisting of an O2-based HF-MBfR and an H2-based HF-MBfR was applied for treating high-strength wastewater. The reactors were continuously operated with low supply of O2 and H2 and without any supply of organic carbon for 250 d. Gradual increase of ammonium and nitrate concentration in the influent showed stable and high nitrogen removal efficiency, and the maximum ammonium and nitrate removal rates were 0.48 kg NH4(+)-N m(-3) d(-1) and 0.55 kg NO3(-)-N m(-3) d(-1), respectively. The analysis of the microbial communities using pyrosequencing analysis indicated that Nitrosospira multiformis, ammonium-oxidizing bacteria, and Nitrobacter winogradskyi and Nitrobacter vulgaris, nitrite-oxidizing bacteria were highly enriched in the O2-based HF-MBfR. In the H2-based HF-MBfR, hydrogenotrophic denitrifying bacteria belonging to the family of Thiobacillus and Comamonadaceae were initially dominant, but were replaced to heterotrophic denitrifiers belonging to Rhodocyclaceae and Rhodobacteraceae utilizing by-products induced from autotrophic denitrifying bacteria. The pyrosequencing analysis of microbial communities indicates that the autotrophic HF-MBfRs system well developed autotrophic nitrifying and denitrifying bacteria within a relatively short period to accomplish almost complete nitrogen removal. Copyright © 2016 Elsevier Ltd. All rights reserved.

  15. Differential resistance of drinking water bacterial populations to monochloramine disinfection.

    PubMed

    Chiao, Tzu-Hsin; Clancy, Tara M; Pinto, Ameet; Xi, Chuanwu; Raskin, Lutgarde

    2014-04-01

    The impact of monochloramine disinfection on the complex bacterial community structure in drinking water systems was investigated using culture-dependent and culture-independent methods. Changes in viable bacterial diversity were monitored using culture-independent methods that distinguish between live and dead cells based on membrane integrity, providing a highly conservative measure of viability. Samples were collected from lab-scale and full-scale drinking water filters exposed to monochloramine for a range of contact times. Culture-independent detection of live cells was based on propidium monoazide (PMA) treatment to selectively remove DNA from membrane-compromised cells. Quantitative PCR (qPCR) and pyrosequencing of 16S rRNA genes was used to quantify the DNA of live bacteria and characterize the bacterial communities, respectively. The inactivation rate determined by the culture-independent PMA-qPCR method (1.5-log removal at 664 mg·min/L) was lower than the inactivation rate measured by the culture-based methods (4-log removal at 66 mg·min/L). Moreover, drastic changes in the live bacterial community structure were detected during monochloramine disinfection using PMA-pyrosequencing, while the community structure appeared to remain stable when pyrosequencing was performed on samples that were not subject to PMA treatment. Genera that increased in relative abundance during monochloramine treatment include Legionella, Escherichia, and Geobacter in the lab-scale system and Mycobacterium, Sphingomonas, and Coxiella in the full-scale system. These results demonstrate that bacterial populations in drinking water exhibit differential resistance to monochloramine, and that the disinfection process selects for resistant bacterial populations.

  16. Pyrosequencing for detection of lamivudine-resistant hepatitis B virus.

    PubMed

    Lindström, Anna; Odeberg, Jacob; Albert, Jan

    2004-10-01

    Chronic hepatitis B virus (HBV) infection can cause severe liver disease, including cirrhosis and hepatocellular carcinoma. Lamivudine is a relatively recent alternative to alpha interferon for the treatment of HBV infection, but unfortunately, resistance to lamivudine commonly develops during monotherapy. Lamivudine-resistant HBV mutants display specific mutations in the YMDD (tyrosine, methionine, aspartate, aspartate) motif of the viral polymerase (reverse transcriptase [rt]), which is the catalytic site of the enzyme, i.e., methionine 204 to isoleucine (rtM204I) or valine (rtM204V). The latter mutation is often accompanied by a compensatory leucine-to-methionine change at codon 180 (rtL180M). In the present study, a novel sequencing method, pyrosequencing, was applied to the detection of lamivudine resistance mutations and was compared with direct Sanger sequencing. The new pyrosequencing method had advantages in terms of throughput. Experiments with mixtures of wild-type and resistant viruses indicated that pyrosequencing can detect minor sequence variants in heterogeneous virus populations. The new pyrosequencing method was evaluated with a small number of patient samples, and the results showed that the method could be a useful tool for the detection of lamivudine resistance in the clinical setting.

  17. Comparison Study of MS-HRM and Pyrosequencing Techniques for Quantification of APC and CDKN2A Gene Methylation

    PubMed Central

    Migheli, Francesca; Stoccoro, Andrea; Coppedè, Fabio; Wan Omar, Wan Adnan; Failli, Alessandra; Consolini, Rita; Seccia, Massimo; Spisni, Roberto; Miccoli, Paolo; Mathers, John C.; Migliore, Lucia

    2013-01-01

    There is increasing interest in the development of cost-effective techniques for the quantification of DNA methylation biomarkers. We analyzed 90 samples of surgically resected colorectal cancer tissues for APC and CDKN2A promoter methylation using methylation sensitive-high resolution melting (MS-HRM) and pyrosequencing. MS-HRM is a less expensive technique compared with pyrosequencing but is usually more limited because it gives a range of methylation estimates rather than a single value. Here, we developed a method for deriving single estimates, rather than a range, of methylation using MS-HRM and compared the values obtained in this way with those obtained using the gold standard quantitative method of pyrosequencing. We derived an interpolation curve using standards of known methylated/unmethylated ratio (0%, 12.5%, 25%, 50%, 75%, and 100% of methylation) to obtain the best estimate of the extent of methylation for each of our samples. We observed similar profiles of methylation and a high correlation coefficient between the two techniques. Overall, our new approach allows MS-HRM to be used as a quantitative assay which provides results which are comparable with those obtained by pyrosequencing. PMID:23326336

  18. Ovine pedomics: the first study of the ovine foot 16S rRNA-based microbiome

    USDA-ARS?s Scientific Manuscript database

    We report the first study of the bacterial microbiome of ovine interdigital skin based on 16S rRNA by pyrosequencing and conventional cloning with Sanger-sequencing. Ovine foot rot is an infectious, contagious disease of sheep that causes severe lameness and economic loss from decreased flock produc...

  19. Sequencing the Black Aspergilli species complex

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kuo, Alan; Salamov, Asaf; Zhou, Kemin

    2011-03-11

    The ~15 members of the Aspergillus section Nigri species complex (the "Black Aspergilli") are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as food processing and spoilage agents and agricultural toxigens. Despite their utility and ubiquity, the morphological and metabolic distinctiveness of the complex's members, and thus their taxonomy, is poorly defined. We are using short read pyrosequencing technology (Roche/454 and Illumina/Solexa) to rapidly scale up genomic and transcriptomic analysis of this species complex. To date we predict 11197 genes in Aspergillus niger, 11624 genes inmore » A. carbonarius, and 10845 genes in A. aculeatus. A. aculeatus is our most recent genome, and was assembled primarily from 454-sequenced reads and annotated with the aid of >2 million 454 ESTs and >300 million Solexa ESTs. To most effectively deploy these very large numbers of ESTs we developed 2 novel methods for clustering the ESTs into assemblies. We have also developed a pipeline to propose orthologies and paralogies among genes in the species complex. In the near future we will apply these methods to additional species of Black Aspergilli that are currently in our sequencing pipeline.« less

  20. Combining flow cytometry and 16S rRNA gene pyrosequencing: a promising approach for drinking water monitoring and characterization.

    PubMed

    Prest, E I; El-Chakhtoura, J; Hammes, F; Saikaly, P E; van Loosdrecht, M C M; Vrouwenvelder, J S

    2014-10-15

    The combination of flow cytometry (FCM) and 16S rRNA gene pyrosequencing data was investigated for the purpose of monitoring and characterizing microbial changes in drinking water distribution systems. High frequency sampling (5 min intervals for 1 h) was performed at the outlet of a treatment plant and at one location in the full-scale distribution network. In total, 52 bulk water samples were analysed with FCM, pyrosequencing and conventional methods (adenosine-triphosphate, ATP; heterotrophic plate count, HPC). FCM and pyrosequencing results individually showed that changes in the microbial community occurred in the water distribution system, which was not detected with conventional monitoring. FCM data showed an increase in the total bacterial cell concentrations (from 345 ± 15 × 10(3) to 425 ± 35 × 10(3) cells mL(-1)) and in the percentage of intact bacterial cells (from 39 ± 3.5% to 53 ± 4.4%) during water distribution. This shift was also observed in the FCM fluorescence fingerprints, which are characteristic of each water sample. A similar shift was detected in the microbial community composition as characterized with pyrosequencing, showing that FCM and genetic fingerprints are congruent. FCM and pyrosequencing data were subsequently combined for the calculation of cell concentration changes for each bacterial phylum. The results revealed an increase in cell concentrations of specific bacterial phyla (e.g., Proteobacteria), along with a decrease in other phyla (e.g., Actinobacteria), which could not be concluded from the two methods individually. The combination of FCM and pyrosequencing methods is a promising approach for future drinking water quality monitoring and for advanced studies on drinking water distribution pipeline ecology. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Rapid detection of the CYP2A6*12 hybrid allele by Pyrosequencing technology.

    PubMed

    Koontz, Deborah A; Huckins, Jacqueline J; Spencer, Antonina; Gallagher, Margaret L

    2009-08-24

    Identification of CYP2A6 alleles associated with reduced enzyme activity is important in the study of inter-individual differences in drug metabolism. CYP2A6*12 is a hybrid allele that results from unequal crossover between CYP2A6 and CYP2A7 genes. The 5' regulatory region and exons 1-2 are derived from CYP2A7, and exons 3-9 are derived from CYP2A6. Conventional methods for detection of CYP2A6*12 consist of two-step PCR protocols that are laborious and unsuitable for high-throughput genotyping. We developed a rapid and accurate method to detect the CYP2A6*12 allele by Pyrosequencing technology. A single set of PCR primers was designed to specifically amplify both the CYP2A6*1 wild-type allele and the CYP2A6*12 hybrid allele. An internal Pyrosequencing primer was used to generate allele-specific sequence information, which detected homozygous wild-type, heterozygous hybrid, and homozygous hybrid alleles. We first validated the assay on 104 DNA samples that were also genotyped by conventional two-step PCR and by cycle sequencing. CYP2A6*12 allele frequencies were then determined using the Pyrosequencing assay on 181 multi-ethnic DNA samples from subjects of African American, European Caucasian, Pacific Rim, and Hispanic descent. Finally, we streamlined the Pyrosequencing assay by integrating liquid handling robotics into the workflow. Pyrosequencing results demonstrated 100% concordance with conventional two-step PCR and cycle sequencing methods. Allele frequency data showed slightly higher prevalence of the CYP2A6*12 allele in European Caucasians and Hispanics. This Pyrosequencing assay proved to be a simple, rapid, and accurate alternative to conventional methods, which can be easily adapted to the needs of higher-throughput studies.

  2. Multiplex pyrosequencing method to determine CYP2C9*3, VKORC1*2, and CYP4F2*3 polymorphisms simultaneously: its application to a Korean population and comparisons with other ethnic groups.

    PubMed

    Kim, Kyoung-Ah; Song, Wan-Geun; Lee, Hae-Mi; Joo, Hyun-Jin; Park, Ji-Young

    2014-11-01

    Warfarin is an anticoagulant that is difficult to administer because of the wide variation in dose requirements to achieve a therapeutic effect. CYP2C9, VKROC1, and CYP4F2 play important roles in warfarin metabolism, and their genetic polymorphisms are related to the variability in dose determination. In this study we describe a new multiplex pyrosequencing method to identify CYP2C9*3 (rs1057910), VKORC1*2 (rs9923231), and CYP4F2*3 (rs2108661) simultaneously. A multiplex pyrosequencing method to simultaneously detect CYP2C9*3, VKORC1*2, and CYP4F2*3 alleles was designed. We assessed the allele frequencies of the polymorphisms in 250 Korean subjects using the multiplex pyrosequencing method. The results showed 100 % concordance between single and multiplex pyrosequencing methods, and the polymorphisms identified by pyrosequencing were also validated with the direct sequencing method. The allele frequencies of these polymorphisms in this population were as follows: 0.040 for CYP2C9*3, 0.918 for VKORC1*2, and 0.416 for CYP4F2*3. Although the allele frequencies of the CYP2C9*3 and VKROC1*2 were comparable to those in Japanese and Chinese populations, their frequencies in this Korean population differed from those in other ethnic groups; the CYP4F2*3 frequency was the highest among other ethnic populations including Chinese and Japanese populations. The pyrosequencing methods developed were rapid and reliable for detecting CYP2C9*3, VKORC1*2, and CYP4F2*3. Large ethnic differences in the frequency of these genetic polymorphisms were noted among ethnic groups. CYP4F2*3 exhibited its highest allele frequency among other ethnic populations compared to that in a Korean population.

  3. Single-cell analysis of the transcriptome and its application in the characterization of stem cells and early embryos.

    PubMed

    Liu, Na; Liu, Lin; Pan, Xinghua

    2014-07-01

    Cellular heterogeneity within a cell population is a common phenomenon in multicellular organisms, tissues, cultured cells, and even FACS-sorted subpopulations. Important information may be masked if the cells are studied as a mass. Transcriptome profiling is a parameter that has been intensively studied, and relatively easier to address than protein composition. To understand the basis and importance of heterogeneity and stochastic aspects of the cell function and its mechanisms, it is essential to examine transcriptomes of a panel of single cells. High-throughput technologies, starting from microarrays and now RNA-seq, provide a full view of the expression of transcriptomes but are limited by the amount of RNA for analysis. Recently, several new approaches for amplification and sequencing the transcriptome of single cells or a limited low number of cells have been developed and applied. In this review, we summarize these major strategies, such as PCR-based methods, IVT-based methods, phi29-DNA polymerase-based methods, and several other methods, including their principles, characteristics, advantages, and limitations, with representative applications in cancer stem cells, early development, and embryonic stem cells. The prospects for development of future technology and application of transcriptome analysis in a single cell are also discussed.

  4. Investigating bacterial populations in styrene-degrading biofilters by 16S rDNA tag pyrosequencing.

    PubMed

    Portune, Kevin J; Pérez, M Carmen; Álvarez-Hornos, F Javier; Gabaldón, Carmen

    2015-01-01

    Microbial biofilms are essential components in the elimination of pollutants within biofilters, yet still little is known regarding the complex relationships between microbial community structure and biodegradation function within these engineered ecosystems. To further explore this relationship, 16S rDNA tag pyrosequencing was applied to samples taken at four time points from a styrene-degrading biofilter undergoing variable operating conditions. Changes in microbial structure were observed between different stages of biofilter operation, and the level of styrene concentration was revealed to be a critical factor affecting these changes. Bacterial genera Azoarcus and Pseudomonas were among the dominant classified genera in the biofilter. Canonical correspondence analysis (CCA) and correlation analysis revealed that the genera Brevundimonas, Hydrogenophaga, and Achromobacter may play important roles in styrene degradation under increasing styrene concentrations. No significant correlations (P > 0.05) could be detected between biofilter operational/functional parameters and biodiversity measurements, although biological heterogeneity within biofilms and/or technical variability within pyrosequencing may have considerably affected these results. Percentages of selected bacterial taxonomic groups detected by fluorescence in situ hybridization (FISH) were compared to results from pyrosequencing in order to assess the effectiveness and limitations of each method for identifying each microbial taxon. Comparison of results revealed discrepancies between the two methods in the detected percentages of numerous taxonomic groups. Biases and technical limitations of both FISH and pyrosequencing, such as the binding of FISH probes to non-target microbial groups and lack of classification of sequences for defined taxonomic groups from pyrosequencing, may partially explain some differences between the two methods.

  5. Developing High-Throughput HIV Incidence Assay with Pyrosequencing Platform

    PubMed Central

    Park, Sung Yong; Goeken, Nolan; Lee, Hyo Jin; Bolan, Robert; Dubé, Michael P.

    2014-01-01

    ABSTRACT Human immunodeficiency virus (HIV) incidence is an important measure for monitoring the epidemic and evaluating the efficacy of intervention and prevention trials. This study developed a high-throughput, single-measure incidence assay by implementing a pyrosequencing platform. We devised a signal-masking bioinformatics pipeline, which yielded a process error rate of 5.8 × 10−4 per base. The pipeline was then applied to analyze 18,434 envelope gene segments (HXB2 7212 to 7601) obtained from 12 incident and 24 chronic patients who had documented HIV-negative and/or -positive tests. The pyrosequencing data were cross-checked by using the single-genome-amplification (SGA) method to independently obtain 302 sequences from 13 patients. Using two genomic biomarkers that probe for the presence of similar sequences, the pyrosequencing platform correctly classified all 12 incident subjects (100% sensitivity) and 23 of 24 chronic subjects (96% specificity). One misclassified subject's chronic infection was correctly classified by conducting the same analysis with SGA data. The biomarkers were statistically associated across the two platforms, suggesting the assay's reproducibility and robustness. Sampling simulations showed that the biomarkers were tolerant of sequencing errors and template resampling, two factors most likely to affect the accuracy of pyrosequencing results. We observed comparable biomarker scores between AIDS and non-AIDS chronic patients (multivariate analysis of variance [MANOVA], P = 0.12), indicating that the stage of HIV disease itself does not affect the classification scheme. The high-throughput genomic HIV incidence marks a significant step toward determining incidence from a single measure in cross-sectional surveys. IMPORTANCE Annual HIV incidence, the number of newly infected individuals within a year, is the key measure of monitoring the epidemic's rise and decline. Developing reliable assays differentiating recent from chronic infections has been a long-standing quest in the HIV community. Over the past 15 years, these assays have traditionally measured various HIV-specific antibodies, but recent technological advancements have expanded the diversity of proposed accurate, user-friendly, and financially viable tools. Here we designed a high-throughput genomic HIV incidence assay based on the signature imprinted in the HIV gene sequence population. By combining next-generation sequencing techniques with bioinformatics analysis, we demonstrated that genomic fingerprints are capable of distinguishing recently infected patients from chronically infected patients with high precision. Our high-throughput platform is expected to allow us to process many patients' samples from a single experiment, permitting the assay to be cost-effective for routine surveillance. PMID:24371062

  6. Toxicogenomics in Environmental Science.

    PubMed

    Brinke, Alexandra; Buchinger, Sebastian

    This chapter reviews the current knowledge and recent progress in the field of environmental, aquatic ecotoxicogenomics with a focus on transcriptomic methods. In ecotoxicogenomics the omics technologies are applied for the detection and assessment of adverse effects in the environment, and thus are to be distinguished from omics used in human toxicology [Snape et al., Aquat Toxicol 67:143-154, 2004]. Transcriptomic methods in ecotoxicology are applied to gain a mechanistic understanding of toxic effects on organisms or populations, and thus aim to bridge the gap between cause and effect. A worthwhile effect-based interpretation of stressor induced changes on the transcriptome is based on the principle of phenotypic-anchoring [Paules, Environ Health Perspect 111:A338-A339, 2003]. Thereby, changes on the transcriptomic level can only be identified as effects if they are clearly linked to a specific stressor-induced effect on the macroscopic level. By integrating those macroscopic and transcriptomic effects, conclusions on the effect-inducing type of the stressor can be drawn. Stressor-specific effects on the transcriptomic level can be identified as stressor-specific induced pathways, transcriptomic patterns, or stressors-specific genetic biomarkers. In this chapter, examples of the combined application of macroscopic and transcriptional effects for the identification of environmental stressors, such as aquatic pollutants, are given and discussed. By means of these examples, challenges on the way to a standardized application of transcriptomics in ecotoxicology are discussed. This is also done against the background of the application of transcriptomic methods in environmental regulation such as the EU regulation Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH).

  7. ATGC transcriptomics: a web-based application to integrate, explore and analyze de novo transcriptomic data.

    PubMed

    Gonzalez, Sergio; Clavijo, Bernardo; Rivarola, Máximo; Moreno, Patricio; Fernandez, Paula; Dopazo, Joaquín; Paniego, Norma

    2017-02-22

    In the last years, applications based on massively parallelized RNA sequencing (RNA-seq) have become valuable approaches for studying non-model species, e.g., without a fully sequenced genome. RNA-seq is a useful tool for detecting novel transcripts and genetic variations and for evaluating differential gene expression by digital measurements. The large and complex datasets resulting from functional genomic experiments represent a challenge in data processing, management, and analysis. This problem is especially significant for small research groups working with non-model species. We developed a web-based application, called ATGC transcriptomics, with a flexible and adaptable interface that allows users to work with new generation sequencing (NGS) transcriptomic analysis results using an ontology-driven database. This new application simplifies data exploration, visualization, and integration for a better comprehension of the results. ATGC transcriptomics provides access to non-expert computer users and small research groups to a scalable storage option and simple data integration, including database administration and management. The software is freely available under the terms of GNU public license at http://atgcinta.sourceforge.net .

  8. Functional characterization of two concrete biofilms using pyrosequencing data

    EPA Science Inventory

    Phylogenetic studies of concrete biofilms using 16SrRNA-based approaches have demonstrated that concrete surfaces harbor a diverse microbial community. These approaches can provide information on the general taxonomical groups present in a sample but cannot shed light on the func...

  9. Rapid detection method for Bacillus anthracis using a combination of multiplexed real-time PCR and pyrosequencing and its application for food biodefense.

    PubMed

    Janzen, Timothy W; Thomas, Matthew C; Goji, Noriko; Shields, Michael J; Hahn, Kristen R; Amoako, Kingsley K

    2015-02-01

    Bacillus anthracis, the causative agent of anthrax, has the capacity to form highly resilient spores as part of its life cycle. The potential for the dissemination of these spores using food as a vehicle is a huge public health concern and, hence, requires the development of a foodborne bioterrorism response approach. In this work, we address a critical gap in food biodefense by presenting a novel, combined, sequential method involving the use of real-time PCR and pyrosequencing for the rapid, specific detection of B. anthracis spores in three food matrices: milk, apple juice, and bottled water. The food samples were experimentally inoculated with 40 CFU ml(-1), and DNA was extracted from the spores and analyzed after immunomagnetic separation. Applying the combination of multiplex real-time PCR and pyrosequencing, we successfully detected the presence of targets on both of the virulence plasmids and the chromosome. The results showed that DNA amplicons generated from a five-target multiplexed real-time PCR detection using biotin-labeled primers can be used for single-plex pyrosequencing detection. The combined use of multiplexed real-time PCR and pyrosequencing is a novel, rapid detection method for B. anthracis from food and provides a tool for accurate, quantitative identification with potential biodefense applications.

  10. Pyrosequencing Analysis of the Microbial Diversity of Airag, Khoormog and Tarag, Traditional Fermented Dairy Products of Mongolia

    PubMed Central

    OKI, Kaihei; DUGERSUREN, Jamyan; DEMBEREL, Shirchin; WATANABE, Koichi

    2014-01-01

    Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare’s milk), 5 Khoormog (fermented camel’s milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 phyla. The genus Lactobacillus is a core bacterial component of Mongolian fermented milks, and Lactobacillus helveticus, Lactobacillus kefiranofaciens and Lactobacillus delbrueckii were the predominant species of lactic acid bacteria (LAB) in the Airag, Khoormog and Tarag samples, respectively. By using this pyrosequencing approach, we successfully detected most LAB species that have been isolated as well as seven LAB species that have not been found in our previous culture-based study. A subsequent analysis of the principal components of the samples revealed that L. delbrueckii, L. helveticus, L. kefiranofaciens and Streptococcus thermophilus were the main factors influencing the microbial diversity of these Mongolian traditional fermented dairy products and that this diversity correlated with the animal species from which the milk was sourced. PMID:25003019

  11. [Effect of X-ray micro-computed tomography on the metabolic activity and diversity of soil microbial communities in two Chinese soils].

    PubMed

    Zu, Qianhui; Fang, Huan; Zhou, Hu; Zhang, Jianwei; Peng, Xinhua; Lin, Xiangui; Feng, Youzhi

    2016-01-04

    X-ray micro-computed tomography (micro-CT) technology, as used in the in situ and nondestructive analysis of soil physical structure, provides the opportunity of associating soil physical and biological assays. Due to the high heterogeneity of the soil matrix, X-ray micro-CT scanning and soil microbial assays should be conducted on the same soil sample. This raises the question whether X-ray micro-CT influences microbial function and diversity of the sample soil to be analyzed. To address this question, we used plate counting, microcalorimetry and pyrosequencing approaches to evaluate the effect of X-ray--at doses typically used in micro-CT--on soil microorganisms in a typical soil of North China Plain, Fluvo-aquic soil and in a typical soil of subtropical China, Ultisol soil, respectively. In both soils radiation decreased the number of viable soil bacteria and disturbed their thermogenic profiles. At DNA level, pyrosequencing revealed that alpha diversities of two soils biota were influenced in opposite ways, while beta diversity was not affected although the relative abundances of some guilds were changed. These findings indicate that the metabolically active aspects of soil biota are not compatible with X-ray micro-CT; while the beta molecular diversity based on pyrosequencing could be compatible.

  12. The Identification and Differentiation between Burkholderia mallei and Burkholderia pseudomallei Using One Gene Pyrosequencing

    PubMed Central

    Gilling, Damian H.; Luna, Vicki Ann; Pflugradt, Cori

    2014-01-01

    The etiologic agents for melioidosis and glanders, Burkholderia mallei and Burkholderia pseudomallei respectively, are genetically similar making identification and differentiation from other Burkholderia species and each other challenging. We used pyrosequencing to determine the presence or absence of an insertion sequence IS407A within the flagellin P (fliP) gene and to exploit the difference in orientation of this gene in the two species. Oligonucleotide primers were designed to selectively target the IS407A-fliP interface in B. mallei and the fliP gene specifically at the insertion point in B. pseudomallei. We then examined DNA from ten B. mallei, ten B. pseudomallei, 14 B. cepacia, eight other Burkholderia spp., and 17 other bacteria. Resultant pyrograms encompassed the target sequence that contained either the fliP gene with the IS407A interruption or the fully intact fliP gene with 100% sensitivity and 100% specificity. These pyrosequencing assays based upon a single gene enable investigators to reliably identify the two species. The information obtained by these assays provides more knowledge of the genomic reduction that created the new species B. mallei from B. pseudomallei and may point to new targets that can be exploited in the future. PMID:27350960

  13. Rapid detection and subtyping of human influenza A viruses and reassortants by pyrosequencing.

    PubMed

    Deng, Yi-Mo; Caldwell, Natalie; Barr, Ian G

    2011-01-01

    Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches.

  14. Rapid Detection and Subtyping of Human Influenza A Viruses and Reassortants by Pyrosequencing

    PubMed Central

    Deng, Yi-Mo; Caldwell, Natalie; Barr, Ian G.

    2011-01-01

    Background Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. Methodology/Principal Findings A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. Conclusions/Significance In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches. PMID:21886790

  15. Analysis of Gastric Microbiota by Pyrosequencing: Minor Role of Bacteria Other Than Helicobacter pylori in the Gastric Carcinogenesis.

    PubMed

    Jo, Hyun Jin; Kim, Jaeyeon; Kim, Nayoung; Park, Ji Hyun; Nam, Ryoung Hee; Seok, Yeong-Jae; Kim, Yeon-Ran; Kim, Joo Sung; Kim, Jung Mogg; Kim, Jung Min; Lee, Dong Ho; Jung, Hyun Chae

    2016-10-01

    Little is known about the role of gastric microbiota except for Helicobacter pylori (HP) in human health and disease. We compared the differences of human gastric microbiota according to gastric cancer or control and HP infection status and assessed the role of bacteria other than HP. Gastric microbiota of 63 antral mucosal and 18 corpus mucosal samples were analyzed by bar-coded 454 pyrosequencing of the 16S rRNA gene. Antral samples were divided into four subgroups based on HP positivity in pyrosequencing and the presence of cancer. The analysis was focused on bacteria other than HP, especially nitrosating or nitrate-reducing bacteria (NB). The changes of NB in antral mucosa of 16 subjects were followed up. The number of NB other than HP (non-HP-NB) was two times higher in the cancer groups than in the control groups, but it did not reach statistical significance. The number of non-HP-NB tends to increase over time, but this phenomenon was prevented by HP eradication in the HP-positive control group, but not in the HP-positive cancer group. We could not find the significant role of bacteria other than HP in the gastric carcinogenesis. © 2016 John Wiley & Sons Ltd.

  16. Adult Mouse Cortical Cell Taxonomy by Single Cell Transcriptomics

    PubMed Central

    Tasic, Bosiljka; Menon, Vilas; Nguyen, Thuc Nghi; Kim, Tae Kyung; Jarsky, Tim; Yao, Zizhen; Levi, Boaz; Gray, Lucas T.; Sorensen, Staci A.; Dolbeare, Tim; Bertagnolli, Darren; Goldy, Jeff; Shapovalova, Nadiya; Parry, Sheana; Lee, Changkyu; Smith, Kimberly; Bernard, Amy; Madisen, Linda; Sunkin, Susan M.; Hawrylycz, Michael; Koch, Christof; Zeng, Hongkui

    2016-01-01

    Nervous systems are composed of various cell types, but the extent of cell type diversity is poorly understood. Here, we construct a cellular taxonomy of one cortical region, primary visual cortex, in adult mice based on single cell RNA-sequencing. We identify 49 transcriptomic cell types including 23 GABAergic, 19 glutamatergic and seven non-neuronal types. We also analyze cell-type specific mRNA processing and characterize genetic access to these transcriptomic types by many transgenic Cre lines. Finally, we show that some of our transcriptomic cell types display specific and differential electrophysiological and axon projection properties, thereby confirming that the single cell transcriptomic signatures can be associated with specific cellular properties. PMID:26727548

  17. The diversity and structure of marine protists in the coastal waters of China revealed by morphological observation and 454 pyrosequencing

    NASA Astrophysics Data System (ADS)

    Liu, Yun; Song, Shuqun; Chen, Tiantian; Li, Caiwen

    2017-04-01

    Pyrosequencing of the 18S rRNA gene has been widely adopted to study the eukaryotic diversity in various types of environments, and has an advantage over traditional morphology methods in exploring unknown microbial communities. To comprehensively assess the diversity and community composition of marine protists in the coastal waters of China, we applied both morphological observations and high-throughput sequencing of the V2 and V3 regions of 18S rDNA simultaneously to analyze samples collected from the surface layer of the Yellow and East China Seas. Dinoflagellates, diatoms and ciliates were the three dominant protistan groups as revealed by the two methods. Diatoms were the first dominant protistan group in the microscopic observations, with Skeletonema mainly distributed in the nearshore eutrophic waters and Chaetoceros in higher temperature and higher pH waters. The mixotrophic dinoflagellates, Gymnodinium and Gyrodinium, were more competitive in the oligotrophic waters. The pyrosequencing method revealed an extensive diversity of dinoflagellates. Chaetoceros was the only dominant diatom group in the pyrosequencing dataset. Gyrodinium represented the most abundant reads and dominated the offshore oligotrophic protistan community as they were in the microscopic observations. The dominance of parasitic dinoflagellates in the pyrosequencing dataset, which were overlooked in the morphological observations, indicates more attention should be paid to explore the potential role of this group. Both methods provide coherent clustering of samples. Nutrient levels, salinity and pH were the main factors influencing the distribution of protists. This study demonstrates that different primer pairs used in the pyrosequencing will indicate different protistan community structures. A suitable marker may reveal more comprehensive composition of protists and provide valuable information on environmental drivers.

  18. Detection by real-time PCR and pyrosequencing of the cry1Ab and cry1Ac genes introduced in genetically modified (GM) constructs.

    PubMed

    Debode, Frederic; Janssen, Eric; Bragard, Claude; Berben, Gilbert

    2017-08-01

    The presence of genetically modified organisms (GMOs) in food and feed is mainly detected by the use of targets focusing on promoters and terminators. As some genes are frequently used in genetically modified (GM) construction, they also constitute excellent screening elements and their use is increasing. In this paper we propose a new target for the detection of cry1Ab and cry1Ac genes by real-time polymerase chain reaction (PCR) and pyrosequencing. The specificity, sensitivity and robustness of the real-time PCR method were tested following the recommendations of international guidelines and the method met the expected performance criteria. This paper also shows how the robustness testing was assessed. This new cry1Ab/Ac method can provide a positive signal with a larger number of GM events than do the other existing methods using double dye-probes. The method permits the analysis of results with less ambiguity than the SYBRGreen method recommended by the European Reference Laboratory (EURL) GM Food and Feed (GMFF). A pyrosequencing method was also developed to gain additional information thanks to the sequence of the amplicon. This method of sequencing-by-synthesis can determine the sequence between the primers used for PCR. Pyrosequencing showed that the sequences internal to the primers present differences following the GM events considered and three different sequences were observed. The sensitivity of the pyrosequencing was tested on reference flours with a low percentage GM content and different copy numbers. Improvements in the pyrosequencing protocol provided correct sequences with 50 copies of the target. Below this copy number, the quality of the sequence was more random.

  19. A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach.

    PubMed

    Melicher, Dacotah; Torson, Alex S; Dworkin, Ian; Bowsher, Julia H

    2014-03-12

    The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.

  20. High-throughput pyrosequencing used for the discovery of a novel cellulase from a thermophilic cellulose-degrading microbial consortium.

    PubMed

    Zhao, Chao; Chu, Yanan; Li, Yanhong; Yang, Chengfeng; Chen, Yuqing; Wang, Xumin; Liu, Bin

    2017-01-01

    To analyze the microbial diversity and gene content of a thermophilic cellulose-degrading consortium from hot springs in Xiamen, China using 454 pyrosequencing for discovering cellulolytic enzyme resources. A thermophilic cellulose-degrading consortium, XM70 that was isolated from a hot spring, used sugarcane bagasse as sole carbon and energy source. DNA sequencing of the XM70 sample resulted in 349,978 reads with an average read length of 380 bases, accounting for 133,896,867 bases of sequence information. The characterization of sequencing reads and assembled contigs revealed that most microbes were derived from four phyla: Geobacillus (Firmicutes), Thermus, Bacillus, and Anoxybacillus. Twenty-eight homologous genes belonging to 15 glycoside hydrolase families were detected, including several cellulase genes. A novel hot spring metagenome-derived thermophilic cellulase was expressed and characterized. The application value of thermostable sugarcane bagasse-degrading enzymes is shown for production of cellulosic biofuel. The practical power of using a short-read-based metagenomic approach for harvesting novel microbial genes is also demonstrated.

  1. Potential forensic biogeographic application of diatom colony consistency analysis employing pyrosequencing profiles of the 18S rDNA V7 region.

    PubMed

    Zhao, Yuancun; Chen, Xiaogang; Yang, Yiwen; Zhao, Xiaohong; Zhang, Shu; Gao, Zehua; Fang, Ting; Wang, Yufang; Zhang, Ji

    2018-05-07

    Diatom examination has always been used for the diagnosis of drowning in forensic practice. However, traditional examination of the microscopic features of diatom frustules is time-consuming and requires taxonomic expertise. In this study, we demonstrate a potential DNA-based method of inferring suspected drowning site using pyrosequencing (PSQ) of the V7 region of 18S ribosome DNA (18S rDNA) as a diatom DNA barcode. By employing a sparse representation-based AdvISER-M-PYRO algorithm, the original PSQ signals of diatom DNA mixtures were deciphered to determine the corresponding taxa of the composite diatoms. Additionally, we evaluated the possibility of correlating water samples to collection sites by analyzing the PSQ signal profiles of diatom mixtures contained in the water samples via multidimensional scaling. The results suggest that diatomaceous PSQ profile analysis could be used as a cost-effective method to deduce the geographical origin of an environmental bio-sample.

  2. Transcriptional pathway and de novo network-based approaches to effects-based monitoring in the Great Lakes

    EPA Science Inventory

    Transcriptomics provides unique solutions for understanding the impact of complex mixtures and their components on aquatic systems. Here we describe the application of transcriptomics analysis of in situ fathead minnow exposures for assessing biological impacts of wastewater trea...

  3. Utilizing Pyrosequencing and Quantitative pCR to Characterize Fungal Populations among House Dust Samples

    EPA Science Inventory

    Molecular techniques are an alternative to culturing and counting methods in quantifying indoor fungal contamination. Pyrosequencing offers the possibility of identifying unexpected indoor fungi. In this study, 50 house dust samples were collected from homes in the Yakima Valley,...

  4. Transcriptome of the Caribbean stony coral Porites astreoides from three developmental stages.

    PubMed

    Mansour, Tamer A; Rosenthal, Joshua J C; Brown, C Titus; Roberson, Loretta M

    2016-08-02

    Porites astreoides is a ubiquitous species of coral on modern Caribbean reefs that is resistant to increasing temperatures, overfishing, and other anthropogenic impacts that have threatened most other coral species. We assembled and annotated a transcriptome from this coral using Illumina sequences from three different developmental stages collected over several years: free-swimming larvae, newly settled larvae, and adults (>10 cm in diameter). This resource will aid understanding of coral calcification, larval settlement, and host-symbiont interactions. A de novo transcriptome for the P. astreoides holobiont (coral plus algal symbiont) was assembled using 594 Mbp of raw Illumina sequencing data generated from five age-specific cDNA libraries. The new transcriptome consists of 867 255 transcript elements with an average length of 685 bases. The isolated P. astreoides assembly consists of 129 718 transcript elements with an average length of 811 bases, and the isolated Symbiodinium sp. assembly had 186 177 transcript elements with an average length of 1105 bases. This contribution to coral transcriptome data provides a valuable resource for researchers studying the ontogeny of gene expression patterns within both the coral and its dinoflagellate symbiont.

  5. An Insight into the Transcriptome of the Digestive Tract of the Bloodsucking Bug, Rhodnius prolixus

    PubMed Central

    Ribeiro, José M. C.; Genta, Fernando A.; Sorgine, Marcos H. F.; Logullo, Raquel; Mesquita, Rafael D.; Paiva-Silva, Gabriela O.; Majerowicz, David; Medeiros, Marcelo; Koerich, Leonardo; Terra, Walter R.; Ferreira, Clélia; Pimentel, André C.; Bisch, Paulo M.; Leite, Daniel C.; Diniz, Michelle M. P.; Junior, João Lídio da S. G. V.; Da Silva, Manuela L.; Araujo, Ricardo N.; Gandara, Ana Caroline P.; Brosson, Sébastien; Salmon, Didier; Bousbata, Sabrina; González-Caballero, Natalia; Silber, Ariel Mariano; Alves-Bezerra, Michele; Gondim, Katia C.; Silva-Neto, Mário Alberto C.; Atella, Georgia C.; Araujo, Helena; Dias, Felipe A.; Polycarpo, Carla; Vionette-Amaral, Raquel J.; Fampa, Patrícia; Melo, Ana Claudia A.; Tanaka, Aparecida S.; Balczun, Carsten; Oliveira, José Henrique M.; Gonçalves, Renata L. S.; Lazoski, Cristiano; Rivera-Pomar, Rolando; Diambra, Luis; Schaub, Günter A.; Garcia, Elói S.; Azambuja, Patrícia; Braz, Glória R. C.; Oliveira, Pedro L.

    2014-01-01

    The bloodsucking hemipteran Rhodnius prolixus is a vector of Chagas' disease, which affects 7–8 million people today in Latin America. In contrast to other hematophagous insects, the triatomine gut is compartmentalized into three segments that perform different functions during blood digestion. Here we report analysis of transcriptomes for each of the segments using pyrosequencing technology. Comparison of transcript frequency in digestive libraries with a whole-body library was used to evaluate expression levels. All classes of digestive enzymes were highly expressed, with a predominance of cysteine and aspartic proteinases, the latter showing a significant expansion through gene duplication. Although no protein digestion is known to occur in the anterior midgut (AM), protease transcripts were found, suggesting secretion as pro-enzymes, being possibly activated in the posterior midgut (PM). As expected, genes related to cytoskeleton, protein synthesis apparatus, protein traffic, and secretion were abundantly transcribed. Despite the absence of a chitinous peritrophic membrane in hemipterans - which have instead a lipidic perimicrovillar membrane lining over midgut epithelia - several gut-specific peritrophin transcripts were found, suggesting that these proteins perform functions other than being a structural component of the peritrophic membrane. Among immunity-related transcripts, while lysozymes and lectins were the most highly expressed, several genes belonging to the Toll pathway - found at low levels in the gut of most insects - were identified, contrasting with a low abundance of transcripts from IMD and STAT pathways. Analysis of transcripts related to lipid metabolism indicates that lipids play multiple roles, being a major energy source, a substrate for perimicrovillar membrane formation, and a source for hydrocarbons possibly to produce the wax layer of the hindgut. Transcripts related to amino acid metabolism showed an unanticipated priority for degradation of tyrosine, phenylalanine, and tryptophan. Analysis of transcripts related to signaling pathways suggested a role for MAP kinases, GTPases, and LKBP1/AMP kinases related to control of cell shape and polarity, possibly in connection with regulation of cell survival, response of pathogens and nutrients. Together, our findings present a new view of the triatomine digestive apparatus and will help us understand trypanosome interaction and allow insights into hemipteran metabolic adaptations to a blood-based diet. PMID:24416461

  6. Isolation and genome-wide expression and methylation characterization of CD31+ cells from normal and malignant human prostate tissue

    PubMed Central

    Luo, Wei; Hu, Qiang; Wang, Dan; Deeb, Kristin K.; Ma, Yingyu; Morrison, Carl D.; Liu, Song; Johnson, Candace S.; Trump, Donald L.

    2013-01-01

    Endothelial cells (ECs) are an important component involved in the angiogenesis. Little is known about the global gene expression and epigenetic regulation in tumor endothelial cells. The identification of gene expression and epigenetic difference between human prostate tumor-derived endothelial cells (TdECs) and those in normal tissues may uncover unique biological features of TdEC and facilitate the discovery of new anti-angiogenic targets. We established a method for isolation of CD31+ endothelial cells from malignant and normal prostate tissues obtained at prostatectomy. TdECs and normal-derived ECs (NdECs) showed >90% enrichment in primary culture and demonstrated microvascular endothelial cell characteristics such as cobblestone morphology in monolayer culture, diI-acetyl-LDL uptake and capillary-tube like formation in Matrigel®. In vitro primary cultures of ECs maintained expression of endothelial markers such as CD31, von Willebrand factor, intercellular adhesion molecule, vascular endothelial growth factor receptor 1, and vascular endothelial growth factor receptor 2. We then conducted a pilot study of transcriptome and methylome analysis of TdECs and matched NdECs from patients with prostate cancer. We observed a wide spectrum of differences in gene expression and methylation patterns in endothelial cells, between malignant and normal prostate tissues. Array-based expression and methylation data were validated by qRT-PCR and bisulfite DNA pyrosequencing. Further analysis of transcriptome and methylome data revealed a number of differentially expressed genes with loci whose methylation change is accompanied by an inverse change in gene expression. Our study demonstrates the feasibility of isolation of ECs from histologically normal prostate and prostate cancer via CD31+ selection. The data, although preliminary, indicates that there exist widespread differences in methylation and transcription between TdECs and NdECs. Interestingly, only a small proportion of perturbed genes were overlapped between American (AA) and Caucasian American (CA) patients with prostate cancer. Our study indicates that identifying gene expression and/or epigenetic differences between TdECs and NdECs may provide us with new anti-angiogenic targets. Future studies will be required to further characterize the isolated ECs and determine the biological features that can be exploited in the prognosis and therapy of prostate cancer. PMID:23978847

  7. Solanum torvum responses to the root-knot nematode Meloidogyne incognita

    PubMed Central

    2013-01-01

    Background Solanum torvum Sw is worldwide employed as rootstock for eggplant cultivation because of its vigour and resistance/tolerance to the most serious soil-borne diseases as bacterial, fungal wilts and root-knot nematodes. The little information on Solanum torvum (hereafter Torvum) resistance mechanisms, is mostly attributable to the lack of genomic tools (e.g. dedicated microarray) as well as to the paucity of database information limiting high-throughput expression studies in Torvum. Results As a first step towards transcriptome profiling of Torvum inoculated with the nematode M. incognita, we built a Torvum 3’ transcript catalogue. One-quarter of a 454 full run resulted in 205,591 quality-filtered reads. De novo assembly yielded 24,922 contigs and 11,875 singletons. Similarity searches of the S. torvum transcript tags catalogue produced 12,344 annotations. A 30,0000 features custom combimatrix chip was then designed and microarray hybridizations were conducted for both control and 14 dpi (day post inoculation) with Meloidogyne incognita-infected roots samples resulting in 390 differentially expressed genes (DEG). We also tested the chip with samples from the phylogenetically-related nematode-susceptible eggplant species Solanum melongena. An in-silico validation strategy was developed based on assessment of sequence similarity among Torvum probes and eggplant expressed sequences available in public repositories. GO term enrichment analyses with the 390 Torvum DEG revealed enhancement of several processes as chitin catabolism and sesquiterpenoids biosynthesis, while no GO term enrichment was found with eggplant DEG. The genes identified from S. torvum catalogue, bearing high similarity to known nematode resistance genes, were further investigated in view of their potential role in the nematode resistance mechanism. Conclusions By combining 454 pyrosequencing and microarray technology we were able to conduct a cost-effective global transcriptome profiling in a non-model species. In addition, the development of an in silico validation strategy allowed to further extend the use of the custom chip to a related species and to assess by comparison the expression of selected genes without major concerns of artifacts. The expression profiling of S. torvum responses to nematode infection points to sesquiterpenoids and chitinases as major effectors of nematode resistance. The availability of the long sequence tags in S. torvum catalogue will allow precise identification of active nematocide/nematostatic compounds and associated enzymes posing the basis for exploitation of these resistance mechanisms in other species. PMID:23937585

  8. Optimized Probe Masking for Comparative Transcriptomics of Closely Related Species

    PubMed Central

    Poeschl, Yvonne; Delker, Carolin; Trenner, Jana; Ullrich, Kristian Karsten; Quint, Marcel; Grosse, Ivo

    2013-01-01

    Microarrays are commonly applied to study the transcriptome of specific species. However, many available microarrays are restricted to model organisms, and the design of custom microarrays for other species is often not feasible. Hence, transcriptomics approaches of non-model organisms as well as comparative transcriptomics studies among two or more species often make use of cost-intensive RNAseq studies or, alternatively, by hybridizing transcripts of a query species to a microarray of a closely related species. When analyzing these cross-species microarray expression data, differences in the transcriptome of the query species can cause problems, such as the following: (i) lower hybridization accuracy of probes due to mismatches or deletions, (ii) probes binding multiple transcripts of different genes, and (iii) probes binding transcripts of non-orthologous genes. So far, methods for (i) exist, but these neglect (ii) and (iii). Here, we propose an approach for comparative transcriptomics addressing problems (i) to (iii), which retains only transcript-specific probes binding transcripts of orthologous genes. We apply this approach to an Arabidopsis lyrata expression data set measured on a microarray designed for Arabidopsis thaliana, and compare it to two alternative approaches, a sequence-based approach and a genomic DNA hybridization-based approach. We investigate the number of retained probe sets, and we validate the resulting expression responses by qRT-PCR. We find that the proposed approach combines the benefit of sequence-based stringency and accuracy while allowing the expression analysis of much more genes than the alternative sequence-based approach. As an added benefit, the proposed approach requires probes to detect transcripts of orthologous genes only, which provides a superior base for biological interpretation of the measured expression responses. PMID:24260119

  9. The Complete Chloroplast Genome Sequence of Date Palm (Phoenix dactylifera L.)

    PubMed Central

    Yang, Meng; Zhang, Xiaowei; Liu, Guiming; Yin, Yuxin; Chen, Kaifu; Yun, Quanzheng; Zhao, Duojun; Al-Mssallem, Ibrahim S.; Yu, Jun

    2010-01-01

    Background Date palm (Phoenix dactylifera L.), a member of Arecaceae family, is one of the three major economically important woody palms—the two other palms being oil palm and coconut tree—and its fruit is a staple food among Middle East and North African nations, as well as many other tropical and subtropical regions. Here we report a complete sequence of the data palm chloroplast (cp) genome based on pyrosequencing. Methodology/Principal Findings After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes—atpF, trnA-UGC, and rrn23. Conclusions Unlike most monocots, date palm has a typical cp genome similar to that of tobacco—with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts. PMID:20856810

  10. Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease.

    PubMed

    Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru

    2018-01-01

    Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score ([Formula: see text]) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing [Formula: see text] >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions were analyzed. We extracted six candidate ASM SNPs around IBD susceptibility genes. The top of [Formula: see text] (0.23) was rs1130368 located on HLA-DQB1. ASM around rs36221701 ([Formula: see text] = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016). We confirmed the existence of cis-regulated ASM around IBD susceptibility genes and the association between ASM SNP (rs36221701) genotype and SMAD3 expression, a susceptibility gene for IBD. These results give us supporting evidence that DNA methylation mediates genetic effects on disease susceptibility.

  11. Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease

    PubMed Central

    Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru

    2018-01-01

    Background Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. Methods CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score (ΔRAS¯) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing ΔRAS¯ >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions were analyzed. Results We extracted six candidate ASM SNPs around IBD susceptibility genes. The top of ΔRAS¯ (0.23) was rs1130368 located on HLA-DQB1. ASM around rs36221701 (ΔRAS¯ = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016). Conclusions We confirmed the existence of cis-regulated ASM around IBD susceptibility genes and the association between ASM SNP (rs36221701) genotype and SMAD3 expression, a susceptibility gene for IBD. These results give us supporting evidence that DNA methylation mediates genetic effects on disease susceptibility. PMID:29547621

  12. Tissue-Specific Transcriptomics of the Exotic Invasive Insect Pest Emerald Ash Borer (Agrilus planipennis)

    PubMed Central

    Mittapalli, Omprakash; Bai, Xiaodong; Bonello, Pierluigi; Herms, Daniel A.

    2010-01-01

    Background The insect midgut and fat body represent major tissue interfaces that deal with several important physiological functions including digestion, detoxification and immune response. The emerald ash borer (Agrilus planipennis), is an exotic invasive insect pest that has killed millions of ash trees (Fraxinus spp.) primarily in the Midwestern United States and Ontario, Canada. However, despite its high impact status little knowledge exists for A. planipennis at the molecular level. Methodology and Principal Findings Newer-generation Roche-454 pyrosequencing was used to obtain 126,185 reads for the midgut and 240,848 reads for the fat body, which were assembled into 25,173 and 37,661 high quality expressed sequence tags (ESTs) for the midgut and the fat body of A. planipennis larvae, respectively. Among these ESTs, 36% of the midgut and 38% of the fat body sequences showed similarity to proteins in the GenBank nr database. A high number of the midgut sequences contained chitin-binding peritrophin (248)and trypsin (98) domains; while the fat body sequences showed high occurrence of cytochrome P450s (85) and protein kinase (123) domains. Further, the midgut transcriptome of A. planipennis revealed putative microbial transcripts encoding for cell-wall degrading enzymes such as polygalacturonases and endoglucanases. A significant number of SNPs (137 in midgut and 347 in fat body) and microsatellite loci (317 in midgut and 571 in fat body) were predicted in the A. planipennis transcripts. An initial assessment of cytochrome P450s belonging to various CYP clades revealed distinct expression patterns at the tissue level. Conclusions and Significance To our knowledge this study is one of the first to illuminate tissue-specific gene expression in an invasive insect of high ecological and economic consequence. These findings will lay the foundation for future gene expression and functional studies in A. planipennis. PMID:21060843

  13. Transcriptomic Signatures of Ash (Fraxinus spp.) Phloem

    PubMed Central

    Mamidala, Praveen; Bonello, Pierluigi; Herms, Daniel A.; Mittapalli, Omprakash

    2011-01-01

    Background Ash (Fraxinus spp.) is a dominant tree species throughout urban and forested landscapes of North America (NA). The rapid invasion of NA by emerald ash borer (Agrilus planipennis), a wood-boring beetle endemic to Eastern Asia, has resulted in the death of millions of ash trees and threatens billions more. Larvae feed primarily on phloem tissue, which girdles and kills the tree. While NA ash species including black (F. nigra), green (F. pennsylvannica) and white (F. americana) are highly susceptible, the Asian species Manchurian ash (F. mandshurica) is resistant to A. planipennis perhaps due to their co-evolutionary history. Little is known about the molecular genetics of ash. Hence, we undertook a functional genomics approach to identify the repertoire of genes expressed in ash phloem. Methodology and Principal Findings Using 454 pyrosequencing we obtained 58,673 high quality ash sequences from pooled phloem samples of green, white, black, blue and Manchurian ash. Intriguingly, 45% of the deduced proteins were not significantly similar to any sequences in the GenBank non-redundant database. KEGG analysis of the ash sequences revealed a high occurrence of defense related genes. Expression analysis of early regulators potentially involved in plant defense (i.e. transcription factors, calcium dependent protein kinases and a lipoxygenase 3) revealed higher mRNA levels in resistant ash compared to susceptible ash species. Lastly, we predicted a total of 1,272 single nucleotide polymorphisms and 980 microsatellite loci, among which seven microsatellite loci showed polymorphism between different ash species. Conclusions and Significance The current transcriptomic data provide an invaluable resource for understanding the genetic make-up of ash phloem, the target tissue of A. planipennis. These data along with future functional studies could lead to the identification/characterization of defense genes involved in resistance of ash to A. planipennis, and in future ash breeding programs for marker development. PMID:21283712

  14. Large scale parallel pyrosequencing technology: PRRSV strain VR-2332 nsp2 deletion mutant stability in swine

    USDA-ARS?s Scientific Manuscript database

    Genomes from fifteen porcine reproductive and respiratory syndrome virus (PRRSV) isolates were derived simultaneously using 454 pyrosequencing technology. The viral isolates sequenced were from a recent swine study, in which engineered Type 2 prototype PRRSV strain VR-2332 mutants, with 87, 184, 200...

  15. Characterization of chlorinated and chloraminated drinking water microbial communities in a distribution system simulator using pyrosequencing data analysis

    EPA Science Inventory

    The molecular analysis of drinking water microbial communities has focused primarily on 16S rRNA gene sequence analysis. Since this approach provides limited information on function potential of microbial communities, analysis of whole-metagenome pyrosequencing data was used to...

  16. Purification of Marek's disease virus DNA for 454 pyrosequencing using micrococcal nuclease digestion and polyethylene glycol precipitation

    USDA-ARS?s Scientific Manuscript database

    Marek’s disease virus (MDV-1) is a cell-associated alphaherpesvirus that induces rapid-onset T-cell lymphomas in poultry. The genomes of 6 strains have been sequenced using both Sanger didoxy sequencing and 454 Life Science pyrosequencing. These genomes largely represent cell culture adapted strains...

  17. Pyrosequencing analysis for characterization of bacterial diversity in a soil as affected by integrated livestock-cotton production systems

    USDA-ARS?s Scientific Manuscript database

    Impacts of integrated livestock-crop production systems compared to specialized systems on soil bacterial diversity have not been well documented. We used a bacterial tag encoded FLX amplicon pyrosequencing (bTEFAP) method to evaluate bacterial diversity of a clay loam soil (Fine, mixed, thermic To...

  18. The grapevine expression atlas reveals a deep transcriptome shift driving the entire plant into a maturation program.

    PubMed

    Fasoli, Marianna; Dal Santo, Silvia; Zenoni, Sara; Tornielli, Giovanni Battista; Farina, Lorenzo; Zamboni, Anita; Porceddu, Andrea; Venturini, Luca; Bicego, Manuele; Murino, Vittorio; Ferrarini, Alberto; Delledonne, Massimo; Pezzotti, Mario

    2012-09-01

    We developed a genome-wide transcriptomic atlas of grapevine (Vitis vinifera) based on 54 samples representing green and woody tissues and organs at different developmental stages as well as specialized tissues such as pollen and senescent leaves. Together, these samples expressed ∼91% of the predicted grapevine genes. Pollen and senescent leaves had unique transcriptomes reflecting their specialized functions and physiological status. However, microarray and RNA-seq analysis grouped all the other samples into two major classes based on maturity rather than organ identity, namely, the vegetative/green and mature/woody categories. This division represents a fundamental transcriptomic reprogramming during the maturation process and was highlighted by three statistical approaches identifying the transcriptional relationships among samples (correlation analysis), putative biomarkers (O2PLS-DA approach), and sets of strongly and consistently expressed genes that define groups (topics) of similar samples (biclustering analysis). Gene coexpression analysis indicated that the mature/woody developmental program results from the reiterative coactivation of pathways that are largely inactive in vegetative/green tissues, often involving the coregulation of clusters of neighboring genes and global regulation based on codon preference. This global transcriptomic reprogramming during maturation has not been observed in herbaceous annual species and may be a defining characteristic of perennial woody plants.

  19. Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

    PubMed Central

    Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

    2012-01-01

    ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136

  20. Polyploid Evolution of the Brassicaceae during the Cenozoic Era[C][W][OPEN

    PubMed Central

    Kagale, Sateesh; Robinson, Stephen J.; Nixon, John; Xiao, Rong; Huebert, Terry; Condie, Janet; Kessler, Dallas; Clarke, Wayne E.; Edger, Patrick P.; Links, Matthew G.; Sharpe, Andrew G.; Parkin, Isobel A.P.

    2014-01-01

    The Brassicaceae (Cruciferae) family, owing to its remarkable species, genetic, and physiological diversity as well as its significant economic potential, has become a model for polyploidy and evolutionary studies. Utilizing extensive transcriptome pyrosequencing of diverse taxa, we established a resolved phylogeny of a subset of crucifer species. We elucidated the frequency, age, and phylogenetic position of polyploidy and lineage separation events that have marked the evolutionary history of the Brassicaceae. Besides the well-known ancient α (47 million years ago [Mya]) and β (124 Mya) paleopolyploidy events, several species were shown to have undergone a further more recent (∼7 to 12 Mya) round of genome multiplication. We identified eight whole-genome duplications corresponding to at least five independent neo/mesopolyploidy events. Although the Brassicaceae family evolved from other eudicots at the beginning of the Cenozoic era of the Earth (60 Mya), major diversification occurred only during the Neogene period (0 to 23 Mya). Remarkably, the widespread species divergence, major polyploidy, and lineage separation events during Brassicaceae evolution are clustered in time around epoch transitions characterized by prolonged unstable climatic conditions. The synchronized diversification of Brassicaceae species suggests that polyploid events may have conferred higher adaptability and increased tolerance toward the drastically changing global environment, thus facilitating species radiation. PMID:25035408

  1. Polyploid evolution of the Brassicaceae during the Cenozoic era.

    PubMed

    Kagale, Sateesh; Robinson, Stephen J; Nixon, John; Xiao, Rong; Huebert, Terry; Condie, Janet; Kessler, Dallas; Clarke, Wayne E; Edger, Patrick P; Links, Matthew G; Sharpe, Andrew G; Parkin, Isobel A P

    2014-07-01

    The Brassicaceae (Cruciferae) family, owing to its remarkable species, genetic, and physiological diversity as well as its significant economic potential, has become a model for polyploidy and evolutionary studies. Utilizing extensive transcriptome pyrosequencing of diverse taxa, we established a resolved phylogeny of a subset of crucifer species. We elucidated the frequency, age, and phylogenetic position of polyploidy and lineage separation events that have marked the evolutionary history of the Brassicaceae. Besides the well-known ancient α (47 million years ago [Mya]) and β (124 Mya) paleopolyploidy events, several species were shown to have undergone a further more recent (∼7 to 12 Mya) round of genome multiplication. We identified eight whole-genome duplications corresponding to at least five independent neo/mesopolyploidy events. Although the Brassicaceae family evolved from other eudicots at the beginning of the Cenozoic era of the Earth (60 Mya), major diversification occurred only during the Neogene period (0 to 23 Mya). Remarkably, the widespread species divergence, major polyploidy, and lineage separation events during Brassicaceae evolution are clustered in time around epoch transitions characterized by prolonged unstable climatic conditions. The synchronized diversification of Brassicaceae species suggests that polyploid events may have conferred higher adaptability and increased tolerance toward the drastically changing global environment, thus facilitating species radiation. © 2014 American Society of Plant Biologists. All rights reserved.

  2. Integrated Analysis of Transcriptomic and Proteomic Data

    PubMed Central

    Haider, Saad; Pal, Ranadip

    2013-01-01

    Until recently, understanding the regulatory behavior of cells has been pursued through independent analysis of the transcriptome or the proteome. Based on the central dogma, it was generally assumed that there exist a direct correspondence between mRNA transcripts and generated protein expressions. However, recent studies have shown that the correlation between mRNA and Protein expressions can be low due to various factors such as different half lives and post transcription machinery. Thus, a joint analysis of the transcriptomic and proteomic data can provide useful insights that may not be deciphered from individual analysis of mRNA or protein expressions. This article reviews the existing major approaches for joint analysis of transcriptomic and proteomic data. We categorize the different approaches into eight main categories based on the initial algorithm and final analysis goal. We further present analogies with other domains and discuss the existing research problems in this area. PMID:24082820

  3. Brownian model of transcriptome evolution and phylogenetic network visualization between tissues.

    PubMed

    Gu, Xun; Ruan, Hang; Su, Zhixi; Zou, Yangyun

    2017-09-01

    While phylogenetic analysis of transcriptomes of the same tissue is usually congruent with the species tree, the controversy emerges when multiple tissues are included, that is, whether species from the same tissue are clustered together, or different tissues from the same species are clustered together. Recent studies have suggested that phylogenetic network approach may shed some lights on our understanding of multi-tissue transcriptome evolution; yet the underlying evolutionary mechanism remains unclear. In this paper we develop a Brownian-based model of transcriptome evolution under the phylogenetic network that can statistically distinguish between the patterns of species-clustering and tissue-clustering. Our model can be used as a null hypothesis (neutral transcriptome evolution) for testing any correlation in tissue evolution, can be applied to cancer transcriptome evolution to study whether two tumors of an individual appeared independently or via metastasis, and can be useful to detect convergent evolution at the transcriptional level. Copyright © 2017. Published by Elsevier Inc.

  4. Microbiome Analysis of Stool Samples from African Americans with Colon Polyps

    PubMed Central

    Brim, Hassan; Yooseph, Shibu; Zoetendal, Erwin G.; Lee, Edward; Torralbo, Manolito; Laiyemo, Adeyinka O.; Shokrani, Babak; Nelson, Karen; Ashktorab, Hassan

    2013-01-01

    Background Colonic polyps are common tumors occurring in ~50% of Western populations with ~10% risk of malignant progression. Dietary agents have been considered the primary environmental exposure to promote colorectal cancer (CRC) development. However, the colonic mucosa is permanently in contact with the microbiota and its metabolic products including toxins that also have the potential to trigger oncogenic transformation. Aim To analyze fecal DNA for microbiota composition and functional potential in African Americans with pre-neoplastic lesions. Materials & Methods We analyzed the bacterial composition of stool samples from 6 healthy individuals and 6 patients with colon polyps using 16S ribosomal RNA-based phylogenetic microarray; the Human intestinal Tract Chip (HITChip) and 16S rRNA gene barcoded 454 pyrosequencing. The functional potential was determined by sequence-based metagenomics using 454 pyrosequencing. Results Fecal microbiota profiling of samples from the healthy and polyp patients using both a phylogenetic microarraying (HITChip) and barcoded 454 pyrosequencing generated similar results. A distinction between both sets of samples was only obtained when the analysis was performed at the sub-genus level. Most of the species leading to the dissociation were from the Bacteroides group. The metagenomic analysis did not reveal major differences in bacterial gene prevalence/abundances between the two groups even when the analysis and comparisons were restricted to available Bacteroides genomes. Conclusion This study reveals that at the pre-neoplastic stages, there is a trend showing microbiota changes between healthy and colon polyp patients at the sub-genus level. These differences were not reflected at the genome/functions levels. Bacteria and associated functions within the Bacteroides group need to be further analyzed and dissected to pinpoint potential actors in the early colon oncogenic transformation in a large sample size. PMID:24376500

  5. Combining real-time PCR and next-generation DNA sequencing to provide quantitative comparisons of fungal aerosol populations

    NASA Astrophysics Data System (ADS)

    Dannemiller, Karen C.; Lang-Yona, Naama; Yamamoto, Naomichi; Rudich, Yinon; Peccia, Jordan

    2014-02-01

    We examined fungal communities associated with the PM10 mass of Rehovot, Israel outdoor air samples collected in the spring and fall seasons. Fungal communities were described by 454 pyrosequencing of the internal transcribed spacer (ITS) region of the fungal ribosomal RNA encoding gene. To allow for a more quantitative comparison of fungal exposure in humans, the relative abundance values of specific taxa were transformed to absolute concentrations through multiplying these values by the sample's total fungal spore concentration (derived from universal fungal qPCR). Next, the sequencing-based absolute concentrations for Alternaria alternata, Cladosporium cladosporioides, Epicoccum nigrum, and Penicillium/Aspergillus spp. were compared to taxon-specific qPCR concentrations for A. alternata, C. cladosporioides, E. nigrum, and Penicillium/Aspergillus spp. derived from the same spring and fall aerosol samples. Results of these comparisons showed that the absolute concentration values generated from pyrosequencing were strongly associated with the concentration values derived from taxon-specific qPCR (for all four species, p < 0.005, all R > 0.70). The correlation coefficients were greater for species present in higher concentrations. Our microbial aerosol population analyses demonstrated that fungal diversity (number of fungal operational taxonomic units) was higher in the spring compared to the fall (p = 0.02), and principal coordinate analysis showed distinct seasonal differences in taxa distribution (ANOSIM p = 0.004). Among genera containing allergenic and/or pathogenic species, the absolute concentrations of Alternaria, Aspergillus, Fusarium, and Cladosporium were greater in the fall, while Cryptococcus, Penicillium, and Ulocladium concentrations were greater in the spring. The transformation of pyrosequencing fungal population relative abundance data to absolute concentrations can improve next-generation DNA sequencing-based quantitative aerosol exposure assessment.

  6. 454 pyrosequencing project identifying expressed genes from the horn fly, Haematobia irritans

    USDA-ARS?s Scientific Manuscript database

    We used an EST approach to initiate a study of the genome of the horn fly, Haematobia irritans and have used 454 pyrosequencing techniques to sequence 73,512, 100,603, 71,550, and 85,769 expressed genes from the egg, first instar larvae, adult male, and adult female lifestages of the horn fly. cD...

  7. Pyrosequencing of the northern red oak (Quercus rubra L.) chloroplast genome reveals high quality polymorphisms for population management

    Treesearch

    Lisa W. Alexander; Keith E. Woeste

    2014-01-01

    Given the low intraspecific chloroplast diversity detected in northern red oak (Quercus rubra L.), more powerful genetic tools are necessary to accurately characterize Q. rubra chloroplast diversity and structure. We report the sequencing, assembly, and annotation of the chloroplast genome of northern red oak via pyrosequencing and...

  8. Assessment of bacterial contamination of lipstick using pyrosequencing.

    PubMed

    Lee, So Y; Lee, Si Y

    As soon as they are exposed to the environment, cosmetics become contaminated with microorganisms, and this contamination accumulates with increased use. In this study, we employed pyrosequencing to investigate the diversity of bacteria found on lipstick. Bacterial DNA was extracted from 20 lipstick samples and mixed in equal ratios for pyrosequencing analysis. As a result, 105 bacterial genera were detected, four of which ( Leifsonia , Methylobacterium , Streptococcus , and Haemophilus ) were predominant in 92% of the 19,863 total sequence reads. Potentially pathogenic genera such as Staphylococcus , Pseudomonas , Escherichia , Salmonella , Corynebacterium , Mycobacterium , and Neisseria accounted for 27.6% of the 105 genera. The most commonly identified oral bacteria belonged to the Streptococcus genus, although other oral genera such as Actinomyces , Fusobacterium , Porphyromonas , and Lactobacillus were also detected.

  9. Re-evaluating microglia expression profiles using RiboTag and cell isolation strategies.

    PubMed

    Haimon, Zhana; Volaski, Alon; Orthgiess, Johannes; Boura-Halfon, Sigalit; Varol, Diana; Shemer, Anat; Yona, Simon; Zuckerman, Binyamin; David, Eyal; Chappell-Maor, Louise; Bechmann, Ingo; Gericke, Martin; Ulitsky, Igor; Jung, Steffen

    2018-06-01

    Transcriptome profiling is widely used to infer functional states of specific cell types, as well as their responses to stimuli, to define contributions to physiology and pathophysiology. Focusing on microglia, the brain's macrophages, we report here a side-by-side comparison of classical cell-sorting-based transcriptome sequencing and the 'RiboTag' method, which avoids cell retrieval from tissue context and yields translatome sequencing information. Conventional whole-cell microglial transcriptomes were found to be significantly tainted by artifacts introduced by tissue dissociation, cargo contamination and transcripts sequestered from ribosomes. Conversely, our data highlight the added value of RiboTag profiling for assessing the lineage accuracy of Cre recombinase expression in transgenic mice. Collectively, this study indicates method-based biases, reveals observer effects and establishes RiboTag-based translatome profiling as a valuable complement to standard sorting-based profiling strategies.

  10. ST Spot Detector: a web-based application for automatic spot and tissue detection for spatial Transcriptomics image datasets.

    PubMed

    Wong, Kim; Navarro, José Fernández; Bergenstråhle, Ludvig; Ståhl, Patrik L; Lundeberg, Joakim

    2018-06-01

    Spatial Transcriptomics (ST) is a method which combines high resolution tissue imaging with high troughput transcriptome sequencing data. This data must be aligned with the images for correct visualization, a process that involves several manual steps. Here we present ST Spot Detector, a web tool that automates and facilitates this alignment through a user friendly interface. jose.fernandez.navarro@scilifelab.se. Supplementary data are available at Bioinformatics online.

  11. Transcriptome sequencing of different narrow-leafed lupin tissue types provides a comprehensive uni-gene assembly and extensive gene-based molecular markers

    PubMed Central

    Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B

    2015-01-01

    Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816

  12. Transcriptome analysis of carnation (Dianthus caryophyllus L.) based on next-generation sequencing technology.

    PubMed

    Tanase, Koji; Nishitani, Chikako; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Ohmiya, Akemi; Onozaki, Takashi

    2012-07-02

    Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. We constructed a normalized cDNA library and a 3'-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant.

  13. Transcriptome analysis of carnation (Dianthus caryophyllus L.) based on next-generation sequencing technology

    PubMed Central

    2012-01-01

    Background Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. Results We constructed a normalized cDNA library and a 3’-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. Conclusions We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant. PMID:22747974

  14. Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa

    PubMed Central

    Morin, Ryan D.; Aksay, Gozde; Dolgosheina, Elena; Ebhardt, H. Alexander; Magrini, Vincent; Mardis, Elaine R.; Sahinalp, S. Cenk; Unrau, Peter J.

    2008-01-01

    The diversity of microRNAs and small-interfering RNAs has been extensively explored within angiosperms by focusing on a few key organisms such as Oryza sativa and Arabidopsis thaliana. A deeper division of the plants is defined by the radiation of the angiosperms and gymnosperms, with the latter comprising the commercially important conifers. The conifers are expected to provide important information regarding the evolution of highly conserved small regulatory RNAs. Deep sequencing provides the means to characterize and quantitatively profile small RNAs in understudied organisms such as these. Pyrosequencing of small RNAs from O. sativa revealed, as expected, ∼21- and ∼24-nt RNAs. The former contained known microRNAs, and the latter largely comprised intergenic-derived sequences likely representing heterochromatin siRNAs. In contrast, sequences from Pinus contorta were dominated by 21-nt small RNAs. Using a novel sequence-based clustering algorithm, we identified sequences belonging to 18 highly conserved microRNA families in P. contorta as well as numerous clusters of conserved small RNAs of unknown function. Using multiple methods, including expressed sequence folding and machine learning algorithms, we found a further 53 candidate novel microRNA families, 51 appearing specific to the P. contorta library. In addition, alignment of small RNA sequences to the O. sativa genome revealed six perfectly conserved classes of small RNA that included chloroplast transcripts and specific types of genomic repeats. The conservation of microRNAs and other small RNAs between the conifers and the angiosperms indicates that important RNA silencing processes were highly developed in the earliest spermatophytes. Genomic mapping of all sequences to the O. sativa genome can be viewed at http://microrna.bcgsc.ca/cgi-bin/gbrowse/rice_build_3/. PMID:18323537

  15. Microbiota diversity and gene expression dynamics in human oral biofilms

    PubMed Central

    2014-01-01

    Background Micro-organisms inhabiting teeth surfaces grow on biofilms where a specific and complex succession of bacteria has been described by co-aggregation tests and DNA-based studies. Although the composition of oral biofilms is well established, the active portion of the bacterial community and the patterns of gene expression in vivo have not been studied. Results Using RNA-sequencing technologies, we present the first metatranscriptomic study of human dental plaque, performed by two different approaches: (1) A short-reads, high-coverage approach by Illumina sequencing to characterize the gene activity repertoire of the microbial community during biofilm development; (2) A long-reads, lower-coverage approach by pyrosequencing to determine the taxonomic identity of the active microbiome before and after a meal ingestion. The high-coverage approach allowed us to analyze over 398 million reads, revealing that microbial communities are individual-specific and no bacterial species was detected as key player at any time during biofilm formation. We could identify some gene expression patterns characteristic for early and mature oral biofilms. The transcriptomic profile of several adhesion genes was confirmed through qPCR by measuring expression of fimbriae-associated genes. In addition to the specific set of gene functions overexpressed in early and mature oral biofilms, as detected through the short-reads dataset, the long-reads approach detected specific changes when comparing the metatranscriptome of the same individual before and after a meal, which can narrow down the list of organisms responsible for acid production and therefore potentially involved in dental caries. Conclusions The bacteria changing activity during biofilm formation and after meal ingestion were person-specific. Interestingly, some individuals showed extreme homeostasis with virtually no changes in the active bacterial population after food ingestion, suggesting the presence of a microbial community which could be associated to dental health. PMID:24767457

  16. Microbiota diversity and gene expression dynamics in human oral biofilms.

    PubMed

    Benítez-Páez, Alfonso; Belda-Ferre, Pedro; Simón-Soro, Aurea; Mira, Alex

    2014-04-27

    Micro-organisms inhabiting teeth surfaces grow on biofilms where a specific and complex succession of bacteria has been described by co-aggregation tests and DNA-based studies. Although the composition of oral biofilms is well established, the active portion of the bacterial community and the patterns of gene expression in vivo have not been studied. Using RNA-sequencing technologies, we present the first metatranscriptomic study of human dental plaque, performed by two different approaches: (1) A short-reads, high-coverage approach by Illumina sequencing to characterize the gene activity repertoire of the microbial community during biofilm development; (2) A long-reads, lower-coverage approach by pyrosequencing to determine the taxonomic identity of the active microbiome before and after a meal ingestion. The high-coverage approach allowed us to analyze over 398 million reads, revealing that microbial communities are individual-specific and no bacterial species was detected as key player at any time during biofilm formation. We could identify some gene expression patterns characteristic for early and mature oral biofilms. The transcriptomic profile of several adhesion genes was confirmed through qPCR by measuring expression of fimbriae-associated genes. In addition to the specific set of gene functions overexpressed in early and mature oral biofilms, as detected through the short-reads dataset, the long-reads approach detected specific changes when comparing the metatranscriptome of the same individual before and after a meal, which can narrow down the list of organisms responsible for acid production and therefore potentially involved in dental caries. The bacteria changing activity during biofilm formation and after meal ingestion were person-specific. Interestingly, some individuals showed extreme homeostasis with virtually no changes in the active bacterial population after food ingestion, suggesting the presence of a microbial community which could be associated to dental health.

  17. Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures

    PubMed Central

    Park, Paul J.; Fuchs, Robert; Wei, Lai; Jorgensen, Brian G.; Redelman, Doug; Ward, Sean M.; Sanders, Kenton M.

    2017-01-01

    Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies. PMID:28426719

  18. Pyrosequencing vs. culture-dependent approaches to analyze lactic acid bacteria associated to chicha, a traditional maize-based fermented beverage from Northwestern Argentina.

    PubMed

    Elizaquível, Patricia; Pérez-Cataluña, Alba; Yépez, Alba; Aristimuño, Cecilia; Jiménez, Eugenia; Cocconcelli, Pier Sandro; Vignolo, Graciela; Aznar, Rosa

    2015-04-02

    The diversity of lactic acid bacteria (LAB) associated with chicha, a traditional maize-based fermented alcoholic beverage from Northwestern Argentina, was analyzed using culture-dependent and culture-independent approaches. Samples corresponding to 10 production steps were obtained from two local producers at Maimará (chicha M) and Tumbaya (chicha T). Whereas by culture-dependent approach a few number of species (Lactobacillus plantarum and Weissella viridescens in chicha M, and Enterococcus faecium and Leuconostoc mesenteroides in chicha T) were identified, a higher quantitative distribution of taxa was found in both beverages by pyrosequencing. The relative abundance of OTUs was higher in chicha M than in chicha T; six LAB genera were common for chicha M and T: Enterococcus, Lactococcus, Streptococcus, Weissella, Leuconostoc and Lactobacillus while Pediococcus only was detected in chicha M. Among the 46 identified LAB species, those of Lactobacillus were dominant in both chicha samples, exhibiting the highest diversity, whereas Enterococcus and Leuconostoc were recorded as the second dominant genera in chicha T and M, respectively. Identification at species level showed the predominance of Lb. plantarum, Lactobacillus rossiae, Leuconostoc lactis and W. viridescens in chicha M while Enterococcus hirae, E. faecium, Lc. mesenteroides and Weissella confusa predominated in chicha T samples. In parallel, when presumptive LAB isolates (chicha M: 146; chicha T: 246) recovered from the same samples were identified by ISR-PCR and RAPD-PCR profiles, species-specific PCR and 16S rRNA gene sequencing, most of them were assigned to the Leuconostoc genus (Lc. mesenteroides and Lc. lactis) in chicha M, Lactobacillus, Weissella and Enterococcus being also present. In contrast, chicha T exhibited the presence of Enterococcus and Leuconostoc, E. faecium being the most representative species. Massive sequencing approach was applied for the first time to study the diversity and evolution of microbial communities during chicha manufacture. Although differences in the LAB species profile between the two geographically different chicha productions were observed by culturing, a larger number for predominant LAB species as well as other minorities were revealed by pyrosequencing. The fine molecular inventory achieved by pyrosequencing provided more precise information on LAB population composition than culture-dependent analysis processes. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Spatial Segregation and Aggregation of Ectomycorrhizal and Root-Endophytic Fungi in the Seedlings of Two Quercus Species

    PubMed Central

    Yamamoto, Satoshi; Sato, Hirotoshi; Tanabe, Akifumi S.; Hidaka, Amane; Kadowaki, Kohmei; Toju, Hirokazu

    2014-01-01

    Diverse clades of mycorrhizal and endophytic fungi are potentially involved in competitive or facilitative interactions within host-plant roots. We investigated the potential consequences of these ecological interactions on the assembly process of root-associated fungi by examining the co-occurrence of pairs of fungi in host-plant individuals. Based on massively-parallel pyrosequencing, we analyzed the root-associated fungal community composition for each of the 249 Quercus serrata and 188 Quercus glauca seedlings sampled in a warm-temperate secondary forest in Japan. Pairs of fungi that co-occurred more or less often than expected by chance were identified based on randomization tests. The pyrosequencing analysis revealed that not only ectomycorrhizal fungi but also endophytic fungi were common in the root-associated fungal community. Intriguingly, specific pairs of these ectomycorrhizal and endophytic fungi showed spatially aggregated patterns, suggesting the existence of facilitative interactions between fungi in different functional groups. Due to the large number of fungal pairs examined, many of the observed aggregated/segregated patterns with very low P values (e.g., < 0.005) turned non-significant after the application of a multiple comparison method. However, our overall results imply that the community structures of ectomycorrhizal and endophytic fungi could influence each other through interspecific competitive/facilitative interactions in root. To test the potential of host-plants' control of fungus–fungus ecological interactions in roots, we further examined whether the aggregated/segregated patterns could vary depending on the identity of host plant species. Potentially due to the physiological properties shared between the congeneric host plant species, the sign of hosts' control was not detected in the present study. The pyrosequencing-based randomization analyses shown in this study provide a platform of the high-throughput investigation of fungus–fungus interactions in plant root systems. PMID:24801150

  20. 454 Pyrosequencing to Describe Microbial Eukaryotic Community Composition, Diversity and Relative Abundance: A Test for Marine Haptophytes

    PubMed Central

    Egge, Elianne; Bittner, Lucie; Andersen, Tom; Audic, Stéphane; de Vargas, Colomban; Edvardsen, Bente

    2013-01-01

    Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000–20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs) at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing. PMID:24069303

  1. N-ras Mutation Detection by Pyrosequencing in Adult Patients with Acute Myeloid Leukemia at a Single Institution

    PubMed Central

    Jeong, Ji Hun; Park, Soon Ho; Park, Mi Jung; Kim, Moon Jin; Kim, Kyung Hee; Park, Pil Whan; Seo, Yiel Hea; Lee, Jae Hoon; Park, Jinny; Hong, Junshik

    2013-01-01

    Background N-ras mutations are one of the most commonly detected abnormalities of myeloid origin. N-ras mutations result in a constitutively active N-ras protein that induces uncontrolled cell proliferation and inhibits apoptosis. We analyzed N-ras mutations in adult patients with AML at a particular institution and compared pyrosequencing analysis with a direct sequencing method for the detection of N-ras mutations. Methods We analyzed 90 bone marrow samples from 83 AML patients. We detected N-ras mutations in codons 12, 13, and 61 using the pyrosequencing method and subsequently confirmed all data by direct sequencing. Using these methods, we screened the N-ras mutation quantitatively and determined the incidence and characteristic of N-ras mutation. Results The incidence of N-ras mutation was 7.2% in adult AML patients. The patients with N-ras mutations showed significant higher hemoglobin levels (P=0.022) and an increased incidence of FLT3 mutations (P=0.003). We observed 3 cases with N-ras mutations in codon 12 (3.6%), 2 cases in codon 13 (2.4%), and 1 case in codon 61 (1.2%). All the mutations disappeared during chemotherapy. Conclusions There is a low incidence (7.2%) of N-ras mutations in AML patients compared with other populations. Similar data is obtained by both pyrosequencing and direct sequencing. This study showed the correlation between the N-ras mutation and the therapeutic response. However, pyrosequencing provides quantitative data and is useful for monitoring therapeutic responses. PMID:23667841

  2. 454-Pyrosequencing survey of microbiota in adult Spotted Wing Drosophila (SWD) corroborates a core microbiome and additional symbiotic and entomopathogenic bacterial associates

    USDA-ARS?s Scientific Manuscript database

    Complete surveys of insect endosymbionts including species of economic importance have until recently been hampered by a lack of high-throughput genetic assays. We used 454-pyrosequencing of the 16S rRNA gene amplicon of adult spotted wing Drosophila (SWD) Drosophila suzukii (Matsumura) from souther...

  3. Development of colonic microflora as assessed by pyrosequencing in dairy calves fed waste milk

    USDA-ARS?s Scientific Manuscript database

    The objective of the current study was to examine the effect of pasteurization of waste milk used to feed dairy calves on the bacterial diversity of their lower gut. Using 16S rDNA bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP), fecal samples from dairy calves aging from 1 week to 6 mon...

  4. Assessment of the microbial diversity of Brazilian kefir grains by PCR-DGGE and pyrosequencing analysis.

    PubMed

    Leite, A M O; Mayo, B; Rachid, C T C C; Peixoto, R S; Silva, J T; Paschoalin, V M F; Delgado, S

    2012-09-01

    The microbial diversity and community structure of three different kefir grains from different parts of Brazil were examined via the combination of two culture-independent methods: PCR-denaturing gradient gel electrophoresis (PCR-DGGE) and pyrosequencing. PCR-DGGE showed Lactobacillus kefiranofaciens and Lactobacillus kefiri to be the major bacterial populations in all three grains. The yeast community was dominated by Saccharomyces cerevisiae. Pyrosequencing produced a total of 14,314 partial 16S rDNA sequence reads from the three grains. Sequence analysis grouped the reads into three phyla, of which Firmicutes was dominant. Members of the genus Lactobacillus were the most abundant operational taxonomic units (OTUs) in all samples, accounting for up to 96% of the sequences. OTUs belonging to other lactic and acetic acid bacteria genera, such as Lactococcus, Leuconostoc, Streptococcus and Acetobacter, were also identified at low levels. Two of the grains showed identical DGGE profiles and a similar number of OTUs, while the third sample showed the highest diversity by both techniques. Pyrosequencing allowed the identification of bacteria that were present in small numbers and rarely associated with the microbial community of this complex ecosystem. Copyright © 2012 Elsevier Ltd. All rights reserved.

  5. High-confidence coding and noncoding transcriptome maps

    PubMed Central

    2017-01-01

    The advent of high-throughput RNA sequencing (RNA-seq) has led to the discovery of unprecedentedly immense transcriptomes encoded by eukaryotic genomes. However, the transcriptome maps are still incomplete partly because they were mostly reconstructed based on RNA-seq reads that lack their orientations (known as unstranded reads) and certain boundary information. Methods to expand the usability of unstranded RNA-seq data by predetermining the orientation of the reads and precisely determining the boundaries of assembled transcripts could significantly benefit the quality of the resulting transcriptome maps. Here, we present a high-performing transcriptome assembly pipeline, called CAFE, that significantly improves the original assemblies, respectively assembled with stranded and/or unstranded RNA-seq data, by orienting unstranded reads using the maximum likelihood estimation and by integrating information about transcription start sites and cleavage and polyadenylation sites. Applying large-scale transcriptomic data comprising 230 billion RNA-seq reads from the ENCODE, Human BodyMap 2.0, The Cancer Genome Atlas, and GTEx projects, CAFE enabled us to predict the directions of about 220 billion unstranded reads, which led to the construction of more accurate transcriptome maps, comparable to the manually curated map, and a comprehensive lncRNA catalog that includes thousands of novel lncRNAs. Our pipeline should not only help to build comprehensive, precise transcriptome maps from complex genomes but also to expand the universe of noncoding genomes. PMID:28396519

  6. Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas

    PubMed Central

    Hou, Yu; Guo, Huahu; Cao, Chen; Li, Xianlong; Hu, Boqiang; Zhu, Ping; Wu, Xinglong; Wen, Lu; Tang, Fuchou; Huang, Yanyi; Peng, Jirun

    2016-01-01

    Single-cell genome, DNA methylome, and transcriptome sequencing methods have been separately developed. However, to accurately analyze the mechanism by which transcriptome, genome and DNA methylome regulate each other, these omic methods need to be performed in the same single cell. Here we demonstrate a single-cell triple omics sequencing technique, scTrio-seq, that can be used to simultaneously analyze the genomic copy-number variations (CNVs), DNA methylome, and transcriptome of an individual mammalian cell. We show that large-scale CNVs cause proportional changes in RNA expression of genes within the gained or lost genomic regions, whereas these CNVs generally do not affect DNA methylation in these regions. Furthermore, we applied scTrio-seq to 25 single cancer cells derived from a human hepatocellular carcinoma tissue sample. We identified two subpopulations within these cells based on CNVs, DNA methylome, or transcriptome of individual cells. Our work offers a new avenue of dissecting the complex contribution of genomic and epigenomic heterogeneities to the transcriptomic heterogeneity within a population of cells. PMID:26902283

  7. FIT: statistical modeling tool for transcriptome dynamics under fluctuating field conditions

    PubMed Central

    Iwayama, Koji; Aisaka, Yuri; Kutsuna, Natsumaro

    2017-01-01

    Abstract Motivation: Considerable attention has been given to the quantification of environmental effects on organisms. In natural conditions, environmental factors are continuously changing in a complex manner. To reveal the effects of such environmental variations on organisms, transcriptome data in field environments have been collected and analyzed. Nagano et al. proposed a model that describes the relationship between transcriptomic variation and environmental conditions and demonstrated the capability to predict transcriptome variation in rice plants. However, the computational cost of parameter optimization has prevented its wide application. Results: We propose a new statistical model and efficient parameter optimization based on the previous study. We developed and released FIT, an R package that offers functions for parameter optimization and transcriptome prediction. The proposed method achieves comparable or better prediction performance within a shorter computational time than the previous method. The package will facilitate the study of the environmental effects on transcriptomic variation in field conditions. Availability and Implementation: Freely available from CRAN (https://cran.r-project.org/web/packages/FIT/). Contact: anagano@agr.ryukoku.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online PMID:28158396

  8. Plant stress biomarkers from biosimulations: the Transcriptome-To-Metabolome (TTM) technology - effects of drought stress on rice.

    PubMed

    Phelix, C F; Feltus, F A

    2015-01-01

    Measuring biomarkers from plant tissue samples is challenging and expensive when the desire is to integrate transcriptomics, fluxomics, metabolomics, lipidomics, proteomics, physiomics and phenomics. We present a computational biology method where only the transcriptome needs to be measured and is used to derive a set of parameters for deterministic kinetic models of metabolic pathways. The technology is called Transcriptome-To-Metabolome (TTM) biosimulations, currently under commercial development, but available for non-commercial use by researchers. The simulated results on metabolites of 30 primary and secondary metabolic pathways in rice (Oryza sativa) were used as the biomarkers to predict whether the transcriptome was from a plant that had been under drought conditions. The rice transcriptomes were accessed from public archives and each individual plant was simulated. This unique quality of the TTM technology allows standard analyses on biomarker assessments, i.e. sensitivity, specificity, positive and negative predictive values, accuracy, receiver operator characteristics (ROC) curve and area under the ROC curve (AUC). Two validation methods were also used, the holdout and 10-fold cross validations. Initially 17 metabolites were identified as candidate biomarkers based on either statistical significance on binary phenotype when compared with control samples or recognition from the literature. The top three biomarkers based on AUC were gibberellic acid 12 (0.89), trehalose (0.80) and sn1-palmitate-sn2-oleic-phosphatidylglycerol (0.70). Neither heat map analyses of transcriptomes nor all 300 metabolites clustered the stressed and control groups effectively. The TTM technology allows the emergent properties of the integrated system to generate unique and useful 'Omics' information. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.

  9. Analysis of RET promoter CpG island methylation using methylation-specific PCR (MSP), pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM): impact on stage II colon cancer patient outcome.

    PubMed

    Draht, Muriel X G; Smits, Kim M; Jooste, Valérie; Tournier, Benjamin; Vervoort, Martijn; Ramaekers, Chantal; Chapusot, Caroline; Weijenberg, Matty P; van Engeland, Manon; Melotte, Veerle

    2016-01-01

    Already since the 1990s, promoter CpG island methylation markers have been considered promising diagnostic, prognostic, and predictive cancer biomarkers. However, so far, only a limited number of DNA methylation markers have been introduced into clinical practice. One reason why the vast majority of methylation markers do not translate into clinical applications is lack of independent validation of methylation markers, often caused by differences in methylation analysis techniques. We recently described RET promoter CpG island methylation as a potential prognostic marker in stage II colorectal cancer (CRC) patients of two independent series. In the current study, we analyzed the RET promoter CpG island methylation of 241 stage II colon cancer patients by direct methylation-specific PCR (MSP), nested-MSP, pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM). All primers were designed as close as possible to the same genomic region. In order to investigate the effect of different DNA methylation assays on patient outcome, we assessed the clinical sensitivity and specificity as well as the association of RET methylation with overall survival for three and five years of follow-up. Using direct-MSP and nested-MSP, 12.0 % (25/209) and 29.6 % (71/240) of the patients showed RET promoter CpG island methylation. Methylation frequencies detected by pyrosequencing were related to the threshold for positivity that defined RET methylation. Methylation frequencies obtained by pyrosequencing (threshold for positivity at 20 %) and MS-HRM were 13.3 % (32/240) and 13.8 % (33/239), respectively. The pyrosequencing threshold for positivity of 20 % showed the best correlation with MS-HRM and direct-MSP results. Nested-MSP detected RET promoter CpG island methylation in deceased patients with a higher sensitivity (33.1 %) compared to direct-MSP (10.7 %), pyrosequencing (14.4 %), and MS-HRM (15.4 %). While RET methylation frequencies detected by nested-MSP, pyrosequencing, and MS-HRM varied, the prognostic effect seemed similar (HR 1.74, 95 % CI 0.97-3.15; HR 1.85, 95 % CI 0.93-3.86; HR 1.83, 95 % CI 0.92-3.65, respectively). Our results show that upon optimizing and aligning four RET methylation assays with regard to primer location and sensitivity, differences in methylation frequencies and clinical sensitivities are observed; however, the effect on the marker's prognostic outcome is minimal.

  10. Transcriptomic-based effects monitoring for endocrine active chemicals: Assessing relative contribution of treated wastewater to downstream pollution

    EPA Science Inventory

    The present study investigated whether combining of targeted analytical chemistry methods with unsupervised, data-rich methodologies (i.e. transcriptomics) can be utilized to evaluate relative contributions of wastewater treatment plant (WWTP) effluents to biological effects. The...

  11. KRAS Mutation Test in Korean Patients with Colorectal Carcinomas: A Methodological Comparison between Sanger Sequencing and a Real-Time PCR-Based Assay.

    PubMed

    Lee, Sung Hak; Chung, Arthur Minwoo; Lee, Ahwon; Oh, Woo Jin; Choi, Yeong Jin; Lee, Youn-Soo; Jung, Eun Sun

    2017-01-01

    Mutations in the KRAS gene have been identified in approximately 50% of colorectal cancers (CRCs). KRAS mutations are well established biomarkers in anti-epidermal growth factor receptor therapy. Therefore, assessment of KRAS mutations is needed in CRC patients to ensure appropriate treatment. We compared the analytical performance of the cobas test to Sanger sequencing in 264 CRC cases. In addition, discordant specimens were evaluated by 454 pyrosequencing. KRAS mutations for codons 12/13 were detected in 43.2% of cases (114/264) by Sanger sequencing. Of 257 evaluable specimens for comparison, KRAS mutations were detected in 112 cases (43.6%) by Sanger sequencing and 118 cases (45.9%) by the cobas test. Concordance between the cobas test and Sanger sequencing for each lot was 93.8% positive percent agreement (PPA) and 91.0% negative percent agreement (NPA) for codons 12/13. Results from the cobas test and Sanger sequencing were discordant for 20 cases (7.8%). Twenty discrepant cases were subsequently subjected to 454 pyrosequencing. After comprehensive analysis of the results from combined Sanger sequencing-454 pyrosequencing and the cobas test, PPA was 97.5% and NPA was 100%. The cobas test is an accurate and sensitive test for detecting KRAS -activating mutations and has analytical power equivalent to Sanger sequencing. Prescreening using the cobas test with subsequent application of Sanger sequencing is the best strategy for routine detection of KRAS mutations in CRC.

  12. Diversity patterns and activity of uncultured marine heterotrophic flagellates unveiled with pyrosequencing

    PubMed Central

    Logares, Ramiro; Audic, Stephane; Santini, Sebastien; Pernice, Massimo C; de Vargas, Colomban; Massana, Ramon

    2012-01-01

    Flagellated heterotrophic microeukaryotes have key roles for the functioning of marine ecosystems as they channel large amounts of organic carbon to the upper trophic levels and control the population sizes of bacteria and archaea. Still, we know very little on the diversity patterns of most groups constituting this evolutionary heterogeneous assemblage. Here, we investigate 11 groups of uncultured flagellates known as MArine STramenopiles (MASTs). MASTs are ecologically very important and branch at the base of stramenopiles. We explored the diversity patterns of MASTs using pyrosequencing (18S rDNA) in coastal European waters. We found that MAST groups range from highly to lowly diversified. Pyrosequencing (hereafter ‘454') allowed us to approach to the limits of taxonomic diversity for all MAST groups, which varied in one order of magnitude (tens to hundreds) in terms of operational taxonomic units (98% similarity). We did not evidence large differences in activity, as indicated by ratios of DNA:RNA-reads. Most groups were strictly planktonic, although we found some groups that were active in sediments and even in anoxic waters. The proportion of reads per size fraction indicated that most groups were composed of very small cells (∼2–5 μm). In addition, phylogenetically different assemblages appeared to be present in different size fractions, depths and geographic zones. Thus, MAST diversity seems to be highly partitioned in spatial scales. Altogether, our results shed light on these ecologically very important but poorly known groups of uncultured marine flagellates. PMID:22534609

  13. Pyrosequencing of prey DNA in reptile faeces: analysis of earthworm consumption by slow worms.

    PubMed

    Brown, David S; Jarman, Simon N; Symondson, William O C

    2012-03-01

    Little quantitative ecological information exists on the diets of most invertebrate feeding reptiles, particularly nocturnal or elusive species that are difficult to observe. In the UK and elsewhere, reptiles are legally required to be relocated before land development can proceed, but without knowledge of their dietary requirements, the suitability of receptor sites cannot be known. Here, we tested the ability of non-invasive DNA-based molecular diagnostics (454 pyrosequencing) to analyse reptile diets, with the specific aims of determining which earthworm species are exploited by slow worms (the legless lizard Anguis fragilis) and whether they feed on the deeper-living earthworm species that only come to the surface at night. Slow worm faecal samples from four different habitats were analysed using earthworm-specific PCR primers. We found that 86% of slow worms (N=80) had eaten earthworms. In lowland heath and marshy/acid grassland, Lumbricus rubellus, a surface-dwelling epigeic species, dominated slow worm diet. In two other habitats, riverside pasture and calciferous coarse grassland, diet was dominated by deeper-living anecic and endogeic species. We conclude that all species of earthworm are exploited by these reptiles and lack of specialization allows slow worms to thrive in a wide variety of habitats. Pyrosequencing of prey DNA in faeces showed promise as a practical, rapid and relatively inexpensive means of obtaining detailed and valuable ecological information on the diets of reptiles. © 2011 Blackwell Publishing Ltd.

  14. DOGMA: domain-based transcriptome and proteome quality assessment.

    PubMed

    Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten

    2016-09-01

    Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  15. Comparative transcriptomics of early dipteran development

    PubMed Central

    2013-01-01

    Background Modern sequencing technologies have massively increased the amount of data available for comparative genomics. Whole-transcriptome shotgun sequencing (RNA-seq) provides a powerful basis for comparative studies. In particular, this approach holds great promise for emerging model species in fields such as evolutionary developmental biology (evo-devo). Results We have sequenced early embryonic transcriptomes of two non-drosophilid dipteran species: the moth midge Clogmia albipunctata, and the scuttle fly Megaselia abdita. Our analysis includes a third, published, transcriptome for the hoverfly Episyrphus balteatus. These emerging models for comparative developmental studies close an important phylogenetic gap between Drosophila melanogaster and other insect model systems. In this paper, we provide a comparative analysis of early embryonic transcriptomes across species, and use our data for a phylogenomic re-evaluation of dipteran phylogenetic relationships. Conclusions We show how comparative transcriptomics can be used to create useful resources for evo-devo, and to investigate phylogenetic relationships. Our results demonstrate that de novo assembly of short (Illumina) reads yields high-quality, high-coverage transcriptomic data sets. We use these data to investigate deep dipteran phylogenetic relationships. Our results, based on a concatenation of 160 orthologous genes, provide support for the traditional view of Clogmia being the sister group of Brachycera (Megaselia, Episyrphus, Drosophila), rather than that of Culicomorpha (which includes mosquitoes and blackflies). PMID:23432914

  16. Targeted exploration and analysis of large cross-platform human transcriptomic compendia

    PubMed Central

    Zhu, Qian; Wong, Aaron K; Krishnan, Arjun; Aure, Miriam R; Tadych, Alicja; Zhang, Ran; Corney, David C; Greene, Casey S; Bongo, Lars A; Kristensen, Vessela N; Charikar, Moses; Li, Kai; Troyanskaya, Olga G.

    2016-01-01

    We present SEEK (http://seek.princeton.edu), a query-based search engine across very large transcriptomic data collections, including thousands of human data sets from almost 50 microarray and next-generation sequencing platforms. SEEK uses a novel query-level cross-validation-based algorithm to automatically prioritize data sets relevant to the query and a robust search approach to identify query-coregulated genes, pathways, and processes. SEEK provides cross-platform handling, multi-gene query search, iterative metadata-based search refinement, and extensive visualization-based analysis options. PMID:25581801

  17. Harnessing pain heterogeneity and RNA transcriptome to identify blood–based pain biomarkers: a novel correlational study design and bioinformatics approach in a graded chronic constriction injury model

    PubMed Central

    Grace, Peter M.; Hurley, Daniel; Barratt, Daniel T.; Tsykin, Anna; Watkins, Linda R.; Rolan, Paul E.; Hutchinson, Mark R.

    2017-01-01

    A quantitative, peripherally accessible biomarker for neuropathic pain has great potential to improve clinical outcomes. Based on the premise that peripheral and central immunity contribute to neuropathic pain mechanisms, we hypothesized that biomarkers could be identified from the whole blood of adult male rats, by integrating graded chronic constriction injury (CCI), ipsilateral lumbar dorsal quadrant (iLDQ) and whole blood transcriptomes, and pathway analysis with pain behavior. Correlational bioinformatics identified a range of putative biomarker genes for allodynia intensity, many encoding for proteins with a recognized role in immune/nociceptive mechanisms. A selection of these genes was validated in a separate replication study. Pathway analysis of the iLDQ transcriptome identified Fcγ and Fcε signaling pathways, among others. This study is the first to employ the whole blood transcriptome to identify pain biomarker panels. The novel correlational bioinformatics, developed here, selected such putative biomarkers based on a correlation with pain behavior and formation of signaling pathways with iLDQ genes. Future studies may demonstrate the predictive ability of these biomarker genes across other models and additional variables. PMID:22697386

  18. Modular organization of the white spruce (Picea glauca) transcriptome reveals functional organization and evolutionary signatures.

    PubMed

    Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J

    2015-07-01

    Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  19. Transcriptomic effects-based monitoring for endocrine active chemicals: Assessing relative contribution of treated wastewater to downstream pollution

    USGS Publications Warehouse

    Martinovic-Weigelt, Dalma; Mehinto, Alvine C.; Ankley, Gerald T.; Denslow, Nancy D.; Barber, Larry B.; Lee, Kathy E.; King, Ryan J.; Schoenfuss, Heiko L.; Schroeder, Anthony L.; Villeneuve, Daniel L.

    2014-01-01

    The present study investigated whether a combination of targeted analytical chemistry information with unsupervised, data-rich biological methodology (i.e., transcriptomics) could be utilized to evaluate relative contributions of wastewater treatment plant (WWTP) effluents to biological effects. The effects of WWTP effluents on fish exposed to ambient, receiving waters were studied at three locations with distinct WWTP and watershed characteristics. At each location, 4 d exposures of male fathead minnows to the WWTP effluent and upstream and downstream ambient waters were conducted. Transcriptomic analyses were performed on livers using 15 000 feature microarrays, followed by a canonical pathway and gene set enrichment analyses. Enrichment of gene sets indicative of teleost brain–pituitary–gonadal–hepatic (BPGH) axis function indicated that WWTPs serve as an important source of endocrine active chemicals (EACs) that affect the BPGH axis (e.g., cholesterol and steroid metabolism were altered). The results indicated that transcriptomics may even pinpoint pertinent adverse outcomes (i.e., liver vacuolization) and groups of chemicals that preselected chemical analytes may miss. Transcriptomic Effects-Based monitoring was capable of distinguishing sites, and it reflected chemical pollution gradients, thus holding promise for assessment of relative contributions of point sources to pollution and the efficacy of pollution remediation.

  20. Transcriptome analysis of tube foot and large scale marker discovery in sea cucumber, Apostichopus japonicus.

    PubMed

    Zhou, Xiaoxu; Wang, Hongdi; Cui, Jun; Qiu, Xuemei; Chang, Yaqing; Wang, Xiuli

    2016-12-01

    Tube foot as one of the ambulacral appendages types in Aspidochirote holothurioids, is known for their functions in locomotion, feeding, chemoreception, light sensitivity and respiration. In this study, we explored the characteristic of transcriptome in the tube foot of sea cucumber (Apostichopus japonicus). Our results showed that among 390 unigenes which specifically expressed in the tube foot, 190 of them were annotated. Based on the assembly transcriptome, we found 219,860 SNPs from 34,749 unigenes, 97,683, 53,624, 27,767 and 40,786 were located in CDSs, 5'-UTRs, 3'-UTRs and non-CDS separately. Furthermore, 12,114 SSRs were detected from 7394 unigenes. Target genes of four specifically expressed miRNAs (miR-29a, miR-29b, miR-278-3p and miR-2005) in tube foot were also predicted based on the transcriptome, which contain immune-related factors (MBL, VLRA, AjC3, MyD88, CFB), skin pigmentation (MITF), candidate regeneration factor (TRP) and holothurians autolysis-related factor (CL). These results develop a relatively large number of molecular markers and transcriptome resources, and will provide a foundation for further analyses on the function and molecular mechanisms underlying A. japonicas tube foot. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Prokaryotic microbiota in the digestive cavity of the jellyfish Cotylorhiza tuberculata.

    PubMed

    Cortés-Lara, Sara; Urdiain, Mercedes; Mora-Ruiz, Merit; Prieto, Laura; Rosselló-Móra, Ramon

    2015-10-01

    The microbiota associated to the gastric cavity of four exemplars of the jellyfish Cotylorhiza tuberculata has been studied by means of cultured-dependent and -independent methods. The pyrosequencing approach rendered a very reduced diversity of Bacteria with four major groups shared by the four exemplars that made up to 95% of the total diversity. The culturing approach recovered low abundant organisms and some of them also detected by the pyrosequencing approach. The major key organisms were related to the genera Spiroplasma, Thalassospira, Tenacibaculum (from the pyrosequencing data), and Vibrio (from the cultivable fraction). Altogether the results indicate that C. tuberculata harbors an associated microbiota of very reduced diversity. On the other hand, some of the major key players may be potential pathogens and the host may serve as dispersal mechanism. Copyright © 2015 Elsevier GmbH. All rights reserved.

  2. Elucidating and mining the Tulipa and Lilium transcriptomes.

    PubMed

    Moreno-Pachon, Natalia M; Leeggangers, Hendrika A C F; Nijveen, Harm; Severing, Edouard; Hilhorst, Henk; Immink, Richard G H

    2016-10-01

    Genome sequencing remains a challenge for species with large and complex genomes containing extensive repetitive sequences, of which the bulbous and monocotyledonous plants tulip and lily are examples. In such a case, sequencing of only the active part of the genome, represented by the transcriptome, is a good alternative to obtain information about gene content. In this study we aimed to generate a high quality transcriptome of tulip and lily and to make this data available as an open-access resource via a user-friendly web-based interface. The Illumina HiSeq 2000 platform was applied and the transcribed RNA was sequenced from a collection of different lily and tulip tissues, respectively. In order to obtain good transcriptome coverage and to facilitate effective data mining, assembly was done using different filtering parameters for clearing out contamination and noise of the RNAseq datasets. This analysis revealed limitations of commonly applied methods and parameter settings used in de novo transcriptome assembly. The final created transcriptomes are publicly available via a user friendly Transcriptome browser ( http://www.bioinformatics.nl/bulbs/db/species/index ). The usefulness of this resource has been exemplified by a search for all potential transcription factors in lily and tulip, with special focus on the TCP transcription factor family. This analysis and other quality parameters point out the quality of the transcriptomes, which can serve as a basis for further genomics studies in lily, tulip, and bulbous plants in general.

  3. Descriptive Biomarkers for Assessing Breast Cancer Risk

    DTIC Science & Technology

    2010-10-01

    and we are making significant progress on Tasks 6 and 7. We completed methylation analyses of three genes (RASSF1, SFRP1 and GSTP1 ) on all samples...promoter hypermethylation; RASSF1, GSTP1 , SFRP1 12 karcaro@nre.umass.edu Arcaro, Kathleen F Annual Report...methylation analysis by pyrosequencing. PCR amplification and pyrosequencing has been completed for three genes, RASSF1, SFRP1 and GSTP1 and have

  4. Pyrosequencing Analysis of Norovirus Genogroup II Distribution in Sewage and Oysters: First Detection of GII.17 Kawasaki 2014 in Oysters.

    PubMed

    Pu, Jian; Kazama, Shinobu; Miura, Takayuki; Azraini, Nabila Dhyan; Konta, Yoshimitsu; Ito, Hiroaki; Ueki, You; Cahyaningrum, Ermaya Eka; Omura, Tatsuo; Watanabe, Toru

    2016-12-01

    Norovirus GII.3, GII.4, and GII.17 were detected using pyrosequencing in sewage and oysters in January and February 2015, in Japan. The strains in sewage and oyster samples were genetically identical or similar, predominant strains belonging to GII.17 Kawasaki 2014 lineage. This is the first report of GII.17 Kawasaki 2014 in oysters.

  5. Combined Real-Time PCR and Pyrosequencing Strategy for Objective, Sensitive, Specific, and High-Throughput Identification of Reduced Susceptibility to Penicillins in Neisseria meningitidis▿

    PubMed Central

    Thulin, Sara; Olcén, Per; Fredlund, Hans; Unemo, Magnus

    2008-01-01

    A segment of penA in Neisseria meningitidis strains (n = 127), including two nucleotide sites closely associated to reduced susceptibility to penicillins, was amplified and pyrosequenced. All results were in concordance with Sanger sequencing, and a high correlation between alterations in the two Peni-specific sites and reduced susceptibility to penicillins was identified. PMID:18070955

  6. Identification and genotyping of molluscum contagiosum virus from genital swab samples by real-time PCR and Pyrosequencing.

    PubMed

    Trama, Jason P; Adelson, Martin E; Mordechai, Eli

    2007-12-01

    Laboratory diagnosis of molluscum contagiosum virus (MCV) is important as lesions can be confused with those caused by Cryptococcus neoformans, herpes simplex virus, human papillomavirus, and varicella-zoster virus. To develop a rapid method for identifying patients infected with MCV via swab sampling. Two dual-labeled probe real-time PCR assays, one homologous to the p43K gene and one to the MC080R gene, were designed. The p43K PCR was designed to be used in conjunction with Pyrosequencing for confirmation of PCR products and discrimination between MCV1 and MCV2. Both PCR assays were optimized with respect to reaction components, thermocycling parameters, and primer and probe concentrations. The specificities of both PCR assays were confirmed by non-amplification of 38 known human pathogens. Sensitivity assays demonstrated detection of as few as 10 copies per reaction. Testing 703 swabs, concordance between the two real-time PCR assays was 99.9%. Under the developed conditions, Pyrosequencing of the p43K PCR product was capable of providing enough nucleotide sequence to definitively differentiate MCV1 and MCV2. These real-time PCR assays can be used for the rapid, sensitive, and specific detection of MCV and, when combined with Pyrosequencing, can further discriminate between MCV1 and MCV2.

  7. Microbial Diversity Analysis of Fermented Mung Beans (Lu-Doh-Huang) by Using Pyrosequencing and Culture Methods

    PubMed Central

    Chao, Shiou-Huei; Huang, Hui-Yu; Chang, Chuan-Hsiung; Yang, Chih-Hsien; Cheng, Wei-Shen; Kang, Ya-Huei; Watanabe, Koichi; Tsai, Ying-Chieh

    2013-01-01

    In Taiwanese alternative medicine Lu-doh-huang (also called Pracparatum mungo), mung beans are mixed with various herbal medicines and undergo a 4-stage process of anaerobic fermentation. Here we used high-throughput sequencing of the 16S rRNA gene to profile the bacterial community structure of Lu-doh-huang samples. Pyrosequencing of samples obtained at 7 points during fermentation revealed 9 phyla, 264 genera, and 586 species of bacteria. While mung beans were inside bamboo sections (stages 1 and 2 of the fermentation process), family Lactobacillaceae and genus Lactobacillus emerged in highest abundance; Lactobacillus plantarum was broadly distributed among these samples. During stage 3, the bacterial distribution shifted to family Porphyromonadaceae, and Butyricimonas virosa became the predominant microbial component. Thereafter, bacterial counts decreased dramatically, and organisms were too few to be detected during stage 4. In addition, the microbial compositions of the liquids used for soaking bamboo sections were dramatically different: Exiguobacterium mexicanum predominated in the fermented soybean solution whereas B. virosa was predominant in running spring water. Furthermore, our results from pyrosequencing paralleled those we obtained by using the traditional culture method, which targets lactic acid bacteria. In conclusion, the microbial communities during Lu-doh-huang fermentation were markedly diverse, and pyrosequencing revealed a complete picture of the microbial consortium. PMID:23700436

  8. Production of a reference transcriptome and transcriptomic database (EdwardsiellaBase) for the lined sea anemone, Edwardsiella lineata, a parasitic cnidarian

    PubMed Central

    2014-01-01

    Background The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. Description We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215–364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs and revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. Conclusions The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a “non-model system.” PMID:24467778

  9. Production of a reference transcriptome and transcriptomic database (EdwardsiellaBase) for the lined sea anemone, Edwardsiella lineata, a parasitic cnidarian.

    PubMed

    Stefanik, Derek J; Lubinski, Tristan J; Granger, Brian R; Byrd, Allyson L; Reitzel, Adam M; DeFilippo, Lukas; Lorenc, Allison; Finnerty, John R

    2014-01-28

    The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215-364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs and revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a "non-model system."

  10. Construction of Pará rubber tree genome and multi-transcriptome database accelerates rubber researches.

    PubMed

    Makita, Yuko; Kawashima, Mika; Lau, Nyok Sean; Othman, Ahmad Sofiman; Matsui, Minami

    2018-01-19

    Natural rubber is an economically important material. Currently the Pará rubber tree, Hevea brasiliensis is the main commercial source. Little is known about rubber biosynthesis at the molecular level. Next-generation sequencing (NGS) technologies brought draft genomes of three rubber cultivars and a variety of RNA sequencing (RNA-seq) data. However, no current genome or transcriptome databases (DB) are organized by gene. A gene-oriented database is a valuable support for rubber research. Based on our original draft genome sequence of H. brasiliensis RRIM600, we constructed a rubber tree genome and transcriptome DB. Our DB provides genome information including gene functional annotations and multi-transcriptome data of RNA-seq, full-length cDNAs including PacBio Isoform sequencing (Iso-Seq), ESTs and genome wide transcription start sites (TSSs) derived from CAGE technology. Using our original and publically available RNA-seq data, we calculated co-expressed genes for identifying functionally related gene sets and/or genes regulated by the same transcription factor (TF). Users can access multi-transcriptome data through both a gene-oriented web page and a genome browser. For the gene searching system, we provide keyword search, sequence homology search and gene expression search; users can also select their expression threshold easily. The rubber genome and transcriptome DB provides rubber tree genome sequence and multi-transcriptomics data. This DB is useful for comprehensive understanding of the rubber transcriptome. This will assist both industrial and academic researchers for rubber and economically important close relatives such as R. communis, M. esculenta and J. curcas. The Rubber Transcriptome DB release 2017.03 is accessible at http://matsui-lab.riken.jp/rubber/ .

  11. Sequencing, Annotation and Analysis of the Syrian Hamster (Mesocricetus auratus) Transcriptome

    PubMed Central

    Tchitchek, Nicolas; Safronetz, David; Rasmussen, Angela L.; Martens, Craig; Virtaneva, Kimmo; Porcella, Stephen F.; Feldmann, Heinz

    2014-01-01

    Background The Syrian hamster (golden hamster, Mesocricetus auratus) is gaining importance as a new experimental animal model for multiple pathogens, including emerging zoonotic diseases such as Ebola. Nevertheless there are currently no publicly available transcriptome reference sequences or genome for this species. Results A cDNA library derived from mRNA and snRNA isolated and pooled from the brains, lungs, spleens, kidneys, livers, and hearts of three adult female Syrian hamsters was sequenced. Sequence reads were assembled into 62,482 contigs and 111,796 reads remained unassembled (singletons). This combined contig/singleton dataset, designated as the Syrian hamster transcriptome, represents a total of 60,117,204 nucleotides. Our Mesocricetus auratus Syrian hamster transcriptome mapped to 11,648 mouse transcripts representing 9,562 distinct genes, and mapped to a similar number of transcripts and genes in the rat. We identified 214 quasi-complete transcripts based on mouse annotations. Canonical pathways involved in a broad spectrum of fundamental biological processes were significantly represented in the library. The Syrian hamster transcriptome was aligned to the current release of the Chinese hamster ovary (CHO) cell transcriptome and genome to improve the genomic annotation of this species. Finally, our Syrian hamster transcriptome was aligned against 14 other rodents, primate and laurasiatheria species to gain insights about the genetic relatedness and placement of this species. Conclusions This Syrian hamster transcriptome dataset significantly improves our knowledge of the Syrian hamster's transcriptome, especially towards its future use in infectious disease research. Moreover, this library is an important resource for the wider scientific community to help improve genome annotation of the Syrian hamster and other closely related species. Furthermore, these data provide the basis for development of expression microarrays that can be used in functional genomics studies. PMID:25398096

  12. RNA-Skim: a rapid method for RNA-Seq quantification at transcript level

    PubMed Central

    Zhang, Zhaojun; Wang, Wei

    2014-01-01

    Motivation: RNA-Seq technique has been demonstrated as a revolutionary means for exploring transcriptome because it provides deep coverage and base pair-level resolution. RNA-Seq quantification is proven to be an efficient alternative to Microarray technique in gene expression study, and it is a critical component in RNA-Seq differential expression analysis. Most existing RNA-Seq quantification tools require the alignments of fragments to either a genome or a transcriptome, entailing a time-consuming and intricate alignment step. To improve the performance of RNA-Seq quantification, an alignment-free method, Sailfish, has been recently proposed to quantify transcript abundances using all k-mers in the transcriptome, demonstrating the feasibility of designing an efficient alignment-free method for transcriptome quantification. Even though Sailfish is substantially faster than alternative alignment-dependent methods such as Cufflinks, using all k-mers in the transcriptome quantification impedes the scalability of the method. Results: We propose a novel RNA-Seq quantification method, RNA-Skim, which partitions the transcriptome into disjoint transcript clusters based on sequence similarity, and introduces the notion of sig-mers, which are a special type of k-mers uniquely associated with each cluster. We demonstrate that the sig-mer counts within a cluster are sufficient for estimating transcript abundances with accuracy comparable with any state-of-the-art method. This enables RNA-Skim to perform transcript quantification on each cluster independently, reducing a complex optimization problem into smaller optimization tasks that can be run in parallel. As a result, RNA-Skim uses <4% of the k-mers and <10% of the CPU time required by Sailfish. It is able to finish transcriptome quantification in <10 min per sample by using just a single thread on a commodity computer, which represents >100 speedup over the state-of-the-art alignment-based methods, while delivering comparable or higher accuracy. Availability and implementation: The software is available at http://www.csbio.unc.edu/rs. Contact: weiwang@cs.ucla.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24931995

  13. Optimization of biostimulant for bioremediation of contaminated coastal sediment by response surface methodology (RSM) and evaluation of microbial diversity by pyrosequencing.

    PubMed

    Subha, Bakthavachallam; Song, Young Chae; Woo, Jung Hui

    2015-09-15

    The present study aims to optimize the slow release biostimulant ball (BSB) for bioremediation of contaminated coastal sediment using response surface methodology (RSM). Different bacterial communities were evaluated using a pyrosequencing-based approach in contaminated coastal sediments. The effects of BSB size (1-5cm), distance (1-10cm) and time (1-4months) on changes in chemical oxygen demand (COD) and volatile solid (VS) reduction were determined. Maximum reductions of COD and VS, 89.7% and 78.8%, respectively, were observed at a 3cm ball size, 5.5cm distance and 4months; these values are the optimum conditions for effective treatment of contaminated coastal sediment. Most of the variance in COD and VS (0.9291 and 0.9369, respectively) was explained in our chosen models. BSB is a promising method for COD and VS reduction and enhancement of SRB diversity. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Universal DNA-based methods for assessing the diet of grazing livestock and wildlife from feces.

    PubMed

    Pegard, Anthony; Miquel, Christian; Valentini, Alice; Coissac, Eric; Bouvier, Frédéric; François, Dominique; Taberlet, Pierre; Engel, Erwan; Pompanon, François

    2009-07-08

    Because of the demand for controlling livestock diets, two methods that characterize the DNA of plants present in feces were developed. After DNA extraction from fecal samples, a short fragment of the chloroplastic trnL intron was amplified by PCR using a universal primer pair for plants. The first method generates a signature that is the electrophoretic migration pattern of the PCR product. The second method consists of sequencing several hundred DNA fragments from the PCR product through pyrosequencing. These methods were validated with a blind analysis of feces from concentrate- and pasture-fed lambs. The signature method allowed differentiation of the two diets and confirmed the presence of concentrate in one of them. The pyrosequencing method allowed the identification of up to 25 taxa in a diet. These methods are complementary to the chemical methods already used. They could be applied to the control of diets and the study of food preferences.

  15. Prevalence of the genus Cladosporium on the integument of leaf-cutting ants characterized by 454 pyrosequencing.

    PubMed

    Duarte, A P M; Ferro, M; Rodrigues, A; Bacci, M; Nagamoto, N S; Forti, L C; Pagnocca, F C

    2016-09-01

    The relationship of attine ants with their mutualistic fungus and other microorganisms has been studied during the last two centuries. However, previous studies about the diversity of fungi in the ants' microenvironment are based mostly on culture-dependent approaches, lacking a broad characterization of the fungal ant-associated community. Here, we analysed the fungal diversity found on the integument of Atta capiguara and Atta laevigata alate ants using 454 pyrosequencing. We obtained 35,453 ITS reads grouped into 99 molecular operational taxonomic units (MOTUs). Data analysis revealed that A. capiguara drones had the highest diversity of MOTUs. Besides the occurrence of several uncultured fungi, the mycobiota analysis revealed that the most abundant taxa were the Cladosporium-complex, Cryptococcus laurentii and Epicoccum sp. Taxa in the genus Cladosporium were predominant in all samples, comprising 67.9 % of all reads. The remarkable presence of the genus Cladosporium on the integument of leaf-cutting ants alates from distinct ant species suggests that this fungus is favored in this microenvironment.

  16. SC3 - consensus clustering of single-cell RNA-Seq data

    PubMed Central

    Kiselev, Vladimir Yu.; Kirschner, Kristina; Schaub, Michael T.; Andrews, Tallulah; Yiu, Andrew; Chandra, Tamir; Natarajan, Kedar N; Reik, Wolf; Barahona, Mauricio; Green, Anthony R; Hemberg, Martin

    2017-01-01

    Single-cell RNA-seq (scRNA-seq) enables a quantitative cell-type characterisation based on global transcriptome profiles. We present Single-Cell Consensus Clustering (SC3), a user-friendly tool for unsupervised clustering which achieves high accuracy and robustness by combining multiple clustering solutions through a consensus approach. We demonstrate that SC3 is capable of identifying subclones based on the transcriptomes from neoplastic cells collected from patients. PMID:28346451

  17. Metagenomic Analyses Reveal the Involvement of Syntrophic Consortia in Methanol/Electricity Conversion in Microbial Fuel Cells

    PubMed Central

    Yamamuro, Ayaka; Kouzuma, Atsushi; Abe, Takashi; Watanabe, Kazuya

    2014-01-01

    Methanol is widely used in industrial processes, and as such, is discharged in large quantities in wastewater. Microbial fuel cells (MFCs) have the potential to recover electric energy from organic pollutants in wastewater; however, the use of MFCs to generate electricity from methanol has not been reported. In the present study, we developed single-chamber MFCs that generated electricity from methanol at the maximum power density of 220 mW m−2 (based on the projected area of the anode). In order to reveal how microbes generate electricity from methanol, pyrosequencing of 16S rRNA-gene amplicons and Illumina shotgun sequencing of metagenome were conducted. The pyrosequencing detected in abundance Dysgonomonas, Sporomusa, and Desulfovibrio in the electrolyte and anode and cathode biofilms, while Geobacter was detected only in the anode biofilm. Based on known physiological properties of these bacteria, it is considered that Sporomusa converts methanol into acetate, which is then utilized by Geobacter to generate electricity. This speculation is supported by results of shotgun metagenomics of the anode-biofilm microbes, which reconstructed relevant catabolic pathways in these bacteria. These results suggest that methanol is anaerobically catabolized by syntrophic bacterial consortia with electrodes as electron acceptors. PMID:24852573

  18. Metagenomic analyses reveal the involvement of syntrophic consortia in methanol/electricity conversion in microbial fuel cells.

    PubMed

    Yamamuro, Ayaka; Kouzuma, Atsushi; Abe, Takashi; Watanabe, Kazuya

    2014-01-01

    Methanol is widely used in industrial processes, and as such, is discharged in large quantities in wastewater. Microbial fuel cells (MFCs) have the potential to recover electric energy from organic pollutants in wastewater; however, the use of MFCs to generate electricity from methanol has not been reported. In the present study, we developed single-chamber MFCs that generated electricity from methanol at the maximum power density of 220 mW m(-2) (based on the projected area of the anode). In order to reveal how microbes generate electricity from methanol, pyrosequencing of 16S rRNA-gene amplicons and Illumina shotgun sequencing of metagenome were conducted. The pyrosequencing detected in abundance Dysgonomonas, Sporomusa, and Desulfovibrio in the electrolyte and anode and cathode biofilms, while Geobacter was detected only in the anode biofilm. Based on known physiological properties of these bacteria, it is considered that Sporomusa converts methanol into acetate, which is then utilized by Geobacter to generate electricity. This speculation is supported by results of shotgun metagenomics of the anode-biofilm microbes, which reconstructed relevant catabolic pathways in these bacteria. These results suggest that methanol is anaerobically catabolized by syntrophic bacterial consortia with electrodes as electron acceptors.

  19. Pyrosequencing Based Microbial Community Analysis of Stabilized Mine Soils

    NASA Astrophysics Data System (ADS)

    Park, J. E.; Lee, B. T.; Son, A.

    2015-12-01

    Heavy metals leached from exhausted mines have been causing severe environmental problems in nearby soils and groundwater. Environmental mitigation was performed based on the heavy metal stabilization using Calcite and steel slag in Korea. Since the soil stabilization only temporarily immobilizes the contaminants to soil matrix, the potential risk of re-leaching heavy metal still exists. Therefore the follow-up management of stabilized soils and the corresponding evaluation methods are required to avoid the consequent contamination from the stabilized soils. In this study, microbial community analysis using pyrosequencing was performed for assessing the potential leaching of the stabilized soils. As a result of rarefaction curve and Chao1 and Shannon indices, the stabilized soil has shown lower richness and diversity as compared to non-contaminated negative control. At the phyla level, as the degree of contamination increases, most of phyla decreased with only exception of increased proteobacteria. Among proteobacteria, gamma-proteobacteria increased against the heavy metal contamination. At the species level, Methylobacter tundripaludum of gamma-proteobacteria showed the highest relative portion of microbial community, indicating that methanotrophs may play an important role in either solubilization or immobilization of heavy metals in stabilized soils.

  20. Pyrosequencing reveals regional differences in fruit-associated fungal communities

    PubMed Central

    Taylor, Michael W; Tsai, Peter; Anfang, Nicole; Ross, Howard A; Goddard, Matthew R

    2014-01-01

    We know relatively little of the distribution of microbial communities generally. Significant work has examined a range of bacterial communities, but the distribution of microbial eukaryotes is less well characterized. Humans have an ancient association with grape vines (Vitis vinifera) and have been making wine since the dawn of civilization, and fungi drive this natural process. While the molecular biology of certain fungi naturally associated with vines and wines is well characterized, complementary investigations into the ecology of fungi associated with fruiting plants is largely lacking. DNA sequencing technologies allow the direct estimation of microbial diversity from a given sample, avoiding culture-based biases. Here, we use deep community pyrosequencing approaches, targeted at the 26S rRNA gene, to examine the richness and composition of fungal communities associated with grapevines and test for geographical community structure among four major regions in New Zealand (NZ). We find over 200 taxa using this approach, which is 10-fold more than previously recovered using culture-based methods. Our analyses allow us to reject the null hypothesis of homogeneity in fungal species richness and community composition across NZ and reveal significant differences between major areas. PMID:24650123

  1. Direct RNA-Based Detection and Differentiation of CTX-M-Type Extended-Spectrum β-Lactamases (ESBL)

    PubMed Central

    Stein, Claudia; Makarewicz, Oliwia; Pfeifer, Yvonne; Brandt, Christian; Ramos, João Costa; Klinger, Mareike; Pletz, Mathias W.

    2013-01-01

    The current global spread of multi-resistant Gram-negatives, particularly extended spectrum β-lactamases expressing bacteria, increases the likelihood of inappropriate empiric treatment of critically ill patients with subsequently increased mortality. From a clinical perspective, fast detection of resistant pathogens would allow a pre-emptive correction of an initially inappropriate treatment. Here we present diagnostic amplification-sequencing approach as proof of principal based on the fast molecular detection and correct discrimination of CTX-M-β-lactamases, the most frequent ESBL family. The workflow consists of the isolation of total mRNA and CTX-M-specific reverse transcription (RT), amplification and pyrosequencing. Due to the high variability of the CTX-M-β-lactamase-genes, degenerated primers for RT, qRT as well as for pyrosequencing, were used and the suitability and discriminatory performance of two conserved positions within the CTX-M genes were analyzed, using one protocol for all isolates and positions, respectively. Using this approach, no information regarding the expected CTX-M variant is needed since all sequences are covered by these degenerated primers. The presented workflow can be conducted within eight hours and has the potential to be expanded to other β-lactamase families. PMID:24224038

  2. Correlation between MGMT promoter methylation and response to temozolomide-based therapy in neuroendocrine neoplasms: an observational retrospective multicenter study.

    PubMed

    Campana, Davide; Walter, Thomas; Pusceddu, Sara; Gelsomino, Fabio; Graillot, Emmanuelle; Prinzi, Natalie; Spallanzani, Andrea; Fiorentino, Michelangelo; Barritault, Marc; Dall'Olio, Filippo; Brighi, Nicole; Biasco, Guido

    2018-06-01

    Temozolomide (TEM) based therapy has been reported being effective in the treatment of metastatic neuroendocrine neoplasms (NEN), with response rates ranging from 30 to 70%. Among patients affected by advanced glioblastoma or melanoma and treated with TEM, loss of tumoral O6-methylguanine DNA methyltransferase (MGMT) is correlated with improved survival. In NEN patients, the role of MGMT deficiency in predicting clinical outcomes of TEM treatment is still under debate. In this study we evaluated 95 patients with advanced NENs undergoing treatment with TEM-based therapy. MGMT promoter methylation status was evaluated with two techniques: methylation specific-polymerase chain reaction or pyrosequencing. Treatment with TEM-based therapy was associated with an overall response rate of 27.4% according to RECIST criteria (51.8% of patients with and 17.7% without MGMT promoter methylation). Response to therapy, progression free survival and overall survival was correlated to MGMT status at univariate and multivariate analysis. Methylation of MGMT promoter could be a strong predictive factor of objective response and an important prognostic factor of a longer PFS and OS. According to our results, MGMT methylation status, evaluated with methylation specific-polymerase chain reaction or pyrosequencing, should have an important role in patients with metastatic NENs, in order to guide therapeutic options. These results need further confirmation with prospective studies.

  3. Pyrosequencing®-Based Identification of Low-Frequency Mutations Enriched Through Enhanced-ice-COLD-PCR.

    PubMed

    How-Kit, Alexandre; Tost, Jörg

    2015-01-01

    A number of molecular diagnostic assays have been developed in the last years for mutation detection. Although these methods have become increasingly sensitive, most of them are incompatible with a sequencing-based readout and require prior knowledge of the mutation present in the sample. Consequently, coamplification at low denaturation (COLD)-PCR-based methods have been developed and combine a high analytical sensitivity due to mutation enrichment in the sample with the identification of known or unknown mutations by downstream sequencing experiments. Among these methods, the recently developed Enhanced-ice-COLD-PCR appeared as the most powerful method as it outperformed the other COLD-PCR-based methods in terms of the mutation enrichment and due to the simplicity of the experimental setup of the assay. Indeed, E-ice-COLD-PCR is very versatile as it can be used on all types of PCR platforms and is applicable to different types of samples including fresh frozen, FFPE, and plasma samples. The technique relies on the incorporation of an LNA containing blocker probe in the PCR reaction followed by selective heteroduplex denaturation enabling amplification of the mutant allele while amplification of the wild-type allele is prevented. Combined with Pyrosequencing(®), which is a very quantitative high-resolution sequencing technology, E-ice-COLD-PCR can detect and identify mutations with a limit of detection down to 0.01 %.

  4. Transcriptomes of Trypanosoma brucei rhodesiense from sleeping sickness patients, rodents and culture: Effects of strain, growth conditions and RNA preparation methods

    PubMed Central

    Mulindwa, Julius; Leiss, Kevin; Ibberson, David; Kamanyi Marucha, Kevin; Helbig, Claudia; Melo do Nascimento, Larissa; Silvester, Eleanor; Matthews, Keith; Matovu, Enock; Enyaru, John

    2018-01-01

    All of our current knowledge of African trypanosome metabolism is based on results from trypanosomes grown in culture or in rodents. Drugs against sleeping sickness must however treat trypanosomes in humans. We here compare the transcriptomes of Trypanosoma brucei rhodesiense from the blood and cerebrospinal fluid of human patients with those of trypanosomes from culture and rodents. The data were aligned and analysed using new user-friendly applications designed for Kinetoplastid RNA-Seq data. The transcriptomes of trypanosomes from human blood and cerebrospinal fluid did not predict major metabolic differences that might affect drug susceptibility. Usefully, there were relatively few differences between the transcriptomes of trypanosomes from patients and those of similar trypanosomes grown in rats. Transcriptomes of monomorphic laboratory-adapted parasites grown in in vitro culture closely resembled those of the human parasites, but some differences were seen. In poly(A)-selected mRNA transcriptomes, mRNAs encoding some protein kinases and RNA-binding proteins were under-represented relative to mRNA that had not been poly(A) selected; further investigation revealed that the selection tends to result in loss of longer mRNAs. PMID:29474390

  5. Single-cell transcriptomics for microbial eukaryotes.

    PubMed

    Kolisko, Martin; Boscaro, Vittorio; Burki, Fabien; Lynn, Denis H; Keeling, Patrick J

    2014-11-17

    One of the greatest hindrances to a comprehensive understanding of microbial genomics, cell biology, ecology, and evolution is that most microbial life is not in culture. Solutions to this problem have mainly focused on whole-community surveys like metagenomics, but these analyses inevitably loose information and present particular challenges for eukaryotes, which are relatively rare and possess large, gene-sparse genomes. Single-cell analyses present an alternative solution that allows for specific species to be targeted, while retaining information on cellular identity, morphology, and partitioning of activities within microbial communities. Single-cell transcriptomics, pioneered in medical research, offers particular potential advantages for uncultivated eukaryotes, but the efficiency and biases have not been tested. Here we describe a simple and reproducible method for single-cell transcriptomics using manually isolated cells from five model ciliate species; we examine impacts of amplification bias and contamination, and compare the efficacy of gene discovery to traditional culture-based transcriptomics. Gene discovery using single-cell transcriptomes was found to be comparable to mass-culture methods, suggesting single-cell transcriptomics is an efficient entry point into genomic data from the vast majority of eukaryotic biodiversity. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. Transcriptomes of Trypanosoma brucei rhodesiense from sleeping sickness patients, rodents and culture: Effects of strain, growth conditions and RNA preparation methods.

    PubMed

    Mulindwa, Julius; Leiss, Kevin; Ibberson, David; Kamanyi Marucha, Kevin; Helbig, Claudia; Melo do Nascimento, Larissa; Silvester, Eleanor; Matthews, Keith; Matovu, Enock; Enyaru, John; Clayton, Christine

    2018-02-01

    All of our current knowledge of African trypanosome metabolism is based on results from trypanosomes grown in culture or in rodents. Drugs against sleeping sickness must however treat trypanosomes in humans. We here compare the transcriptomes of Trypanosoma brucei rhodesiense from the blood and cerebrospinal fluid of human patients with those of trypanosomes from culture and rodents. The data were aligned and analysed using new user-friendly applications designed for Kinetoplastid RNA-Seq data. The transcriptomes of trypanosomes from human blood and cerebrospinal fluid did not predict major metabolic differences that might affect drug susceptibility. Usefully, there were relatively few differences between the transcriptomes of trypanosomes from patients and those of similar trypanosomes grown in rats. Transcriptomes of monomorphic laboratory-adapted parasites grown in in vitro culture closely resembled those of the human parasites, but some differences were seen. In poly(A)-selected mRNA transcriptomes, mRNAs encoding some protein kinases and RNA-binding proteins were under-represented relative to mRNA that had not been poly(A) selected; further investigation revealed that the selection tends to result in loss of longer mRNAs.

  7. Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak

    PubMed Central

    2010-01-01

    Background The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despite their ecological and economical importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. Results We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking lead us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoids biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified and allow to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. Conclusions This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. PMID:21092232

  8. Differential mantle transcriptomics and characterization of growth-related genes in the diploid and triploid pearl oyster Pinctada fucata.

    PubMed

    Guan, Yunyan; He, Maoxian; Wu, Houbo

    2017-06-01

    To explore the molecular mechanism of triploidy effect in the pearl oyster Pinctada fucata, two RNA-seq libraries were constructed from the mantle tissue of diploids and triploids by Roche-454 massive parallel pyrosequencing. The identification of differential expressed genes (DEGs) between diploid and triploid may reveal the molecular mechanism of triploidy effect. In this study, 230 down-regulated and 259 up-regulated DEGs were obtained by comparison between diploid and triploid libraries. The gene ontology and KEGG pathway analysis revealed more functional activation in triploids and it may due to the duplicated gene expression in transcriptional level during whole genome duplication (WGD). To confirm the sequencing data, a set of 11 up-regulated genes related to growth and development control and regulation were analyzed by RT-qPCR in independent experiment. According to the validation and annotation of these genes, it is hypothesized that the set of up-regulated expressed genes had the correlated expression pattern involved in shell building or other interactive probable functions during triploidization. The up- regulation of growth-related genes may support the classic hypotheses of 'energy redistribution' from early research. The results provide valuable resources to understand the molecular mechanism of triploidy effect in both shell building and producing high-quality seawater pearls. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Rapid Identification of Genetic Modifications in Bacillus anthracis Using Whole Genome Draft Sequences Generated by 454 Pyrosequencing

    DTIC Science & Technology

    2010-08-25

    or intentional genetic modifications that circumvent the targets of the detection assays or in the case of a biological attack using an antibiotic ...genetic changes conferring antibiotic resistance can be deciphered rapidly and accurately using WGS. We demonstrate the utility of Roche 454...Rapid Identification of Genetic Modifications in Bacillus anthracis Using Whole Genome Draft Sequences Generated by 454 Pyrosequencing Peter E. Chen1

  10. RNASeq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar 'Riesling'

    USDA-ARS?s Scientific Manuscript database

    The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in heterozygous species. This is a promising approach to improving the annotation of the reference genome sequence of grapevine (Vitis vinifera L.), a species of high-l...

  11. Characterization of adult transcriptomes from the omnivorous lady beetle Coleomegilla maculata fed pollen or insect egg diet

    USDA-ARS?s Scientific Manuscript database

    Diet, nutrition, and obesity are important topics of current research. While many insect genome and/or transcriptome models are based on dietary specialists, the lady beetle Coleomegilla maculata, a common New World species, is highly omnivorous. C. maculata feeds on plants, fungi, insects and other...

  12. Information Theoretical Analysis of a Bovine Gene Atlas Reveals Chromosomal Regions with Tissue Specific Gene Expression.

    USDA-ARS?s Scientific Manuscript database

    An essential step to understanding the genomic biology of any organism is to comprehensively survey its transcriptome. We present the Bovine Gene Atlas (BGA) a compendium of over 7.2 million unique 20 base Illumina DGE tags representing 100 tissue transcriptomes collected primarily from L1 Dominette...

  13. A General Framework for Interrogation of mRNA Stability Programs Identifies RNA-Binding Proteins that Govern Cancer Transcriptomes.

    PubMed

    Perron, Gabrielle; Jandaghi, Pouria; Solanki, Shraddha; Safisamghabadi, Maryam; Storoz, Cristina; Karimzadeh, Mehran; Papadakis, Andreas I; Arseneault, Madeleine; Scelo, Ghislaine; Banks, Rosamonde E; Tost, Jorg; Lathrop, Mark; Tanguay, Simon; Brazma, Alvis; Huang, Sidong; Brimo, Fadi; Najafabadi, Hamed S; Riazalhosseini, Yasser

    2018-05-08

    Widespread remodeling of the transcriptome is a signature of cancer; however, little is known about the post-transcriptional regulatory factors, including RNA-binding proteins (RBPs) that regulate mRNA stability, and the extent to which RBPs contribute to cancer-associated pathways. Here, by modeling the global change in gene expression based on the effect of sequence-specific RBPs on mRNA stability, we show that RBP-mediated stability programs are recurrently deregulated in cancerous tissues. Particularly, we uncovered several RBPs that contribute to the abnormal transcriptome of renal cell carcinoma (RCC), including PCBP2, ESRP2, and MBNL2. Modulation of these proteins in cancer cell lines alters the expression of pathways that are central to the disease and highlights RBPs as driving master regulators of RCC transcriptome. This study presents a framework for the screening of RBP activities based on computational modeling of mRNA stability programs in cancer and highlights the role of post-transcriptional gene dysregulation in RCC. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  14. CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

    PubMed

    Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun

    2012-09-15

    To address the impending need for exploring rapidly increased transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/ liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn Supplementary data are available at Bioinformatics online.

  15. Microbiome and ecotypic adaption of Holcus lanatus (L.) to extremes of its soil pH range, investigated through transcriptome sequencing.

    PubMed

    Young, Ellen; Carey, Manus; Meharg, Andrew A; Meharg, Caroline

    2018-03-20

    Plants can adapt to edaphic stress, such as nutrient deficiency, toxicity and biotic challenges, by controlled transcriptomic responses, including microbiome interactions. Traditionally studied in model plant species with controlled microbiota inoculation treatments, molecular plant-microbiome interactions can be functionally investigated via RNA-Seq. Complex, natural plant-microbiome studies are limited, typically focusing on microbial rRNA and omitting functional microbiome investigations, presenting a fundamental knowledge gap. Here, root and shoot meta-transcriptome analyses, in tandem with shoot elemental content and root staining, were employed to investigate transcriptome responses in the wild grass Holcus lanatus and its associated natural multi-species eukaryotic microbiome. A full factorial reciprocal soil transplant experiment was employed, using plant ecotypes from two widely contrasting natural habitats, acid bog and limestone quarry soil, to investigate naturally occurring, and ecologically meaningful, edaphically driven molecular plant-microbiome interactions. Arbuscular mycorrhizal (AM) and non-AM fungal colonization was detected in roots in both soils. Staining showed greater levels of non-AM fungi, and transcriptomics indicated a predominance of Ascomycota-annotated genes. Roots in acid bog soil were dominated by Phialocephala-annotated transcripts, a putative growth-promoting endophyte, potentially involved in N nutrition and ion homeostasis. Limestone roots in acid bog soil had greater expression of other Ascomycete genera and Oomycetes and lower expression of Phialocephala-annotated transcripts compared to acid ecotype roots, which corresponded with reduced induction of pathogen defense processes, particularly lignin biosynthesis in limestone ecotypes. Ascomycota dominated in shoots and limestone soil roots, but Phialocephala-annotated transcripts were insignificant, and no single Ascomycete genus dominated. Fusarium-annotated transcripts were the most common genus in shoots, with Colletotrichum and Rhizophagus (AM fungi) most numerous in limestone soil roots. The latter coincided with upregulation of plant genes involved in AM symbiosis initiation and AM-based P acquisition in an environment where P availability is low. Meta-transcriptome analyses provided novel insights into H. lanatus transcriptome responses, associated eukaryotic microbiota functions and taxonomic community composition. Significant edaphic and plant ecotype effects were identified, demonstrating that meta-transcriptome-based functional analysis is a powerful tool for the study of natural plant-microbiome interactions.

  16. Research Resource: A Reference Transcriptome for Constitutive Androstane Receptor and Pregnane X Receptor Xenobiotic Signaling

    PubMed Central

    Ochsner, Scott A.; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian

    2016-01-01

    The pregnane X receptor (PXR) (PXR/NR1I3) and constitutive androstane receptor (CAR) (CAR/NR1I2) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities. PMID:27409825

  17. Research Resource: A Reference Transcriptome for Constitutive Androstane Receptor and Pregnane X Receptor Xenobiotic Signaling.

    PubMed

    Ochsner, Scott A; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian; McKenna, Neil J

    2016-08-01

    The pregnane X receptor (PXR) (PXR/NR1I3) and constitutive androstane receptor (CAR) (CAR/NR1I2) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities.

  18. Transcriptomic and epigenetic regulation of disuse atrophy and the return to activity in skeletal muscle.

    PubMed

    Fisher, Andrew G; Seaborne, Robert A; Hughes, Thomas M; Gutteridge, Alex; Stewart, Claire; Coulson, Judy M; Sharples, Adam P; Jarvis, Jonathan C

    2017-12-01

    Physical inactivity and disuse are major contributors to age-related muscle loss. Denervation of skeletal muscle has been previously used as a model with which to investigate muscle atrophy following disuse. Although gene regulatory networks that control skeletal muscle atrophy after denervation have been established, the transcriptome in response to the recovery of muscle after disuse and the associated epigenetic mechanisms that may function to modulate gene expression during skeletal muscle atrophy or recovery have yet to be investigated. We report that silencing the tibialis anterior muscle in rats with tetrodotoxin (TTX)-administered to the common peroneal nerve-resulted in reductions in muscle mass of 7, 29, and 51% with corresponding reductions in muscle fiber cross-sectional area of 18, 42, and 69% after 3, 7, and 14 d of TTX, respectively. Of importance, 7 d of recovery, during which rodents resumed habitual physical activity, restored muscle mass from a reduction of 51% after 14 d TTX to a reduction of only 24% compared with sham control. Returning muscle mass to levels observed at 7 d TTX administration (29% reduction). Transcriptome-wide analysis demonstrated that 3714 genes were differentially expressed across all conditions at a significance of P ≤ 0.001 after disuse-induced atrophy. Of interest, after 7 d of recovery, the expression of genes that were most changed during TTX had returned to that of the sham control. The 20 most differentially expressed genes after microarray analysis were identified across all conditions and were cross-referenced with the most frequently occurring differentially expressed genes between conditions. This gene subset included myogenin (MyoG), Hdac4, Ampd3, Trim63 (MuRF1), and acetylcholine receptor subunit α1 (Chrna1). Transcript expression of these genes and Fboxo32 (MAFbx), because of its previously identified role in disuse atrophy together with Trim63 (MuRF1), were confirmed by real-time quantitative RT-PCR, and DNA methylation of their promoter regions was analyzed by PCR and pyrosequencing. MyoG, Trim63 (MuRF1), Fbxo32 (MAFbx), and Chrna1 demonstrated significantly decreased DNA methylation at key time points after disuse-induced atrophy that corresponded with significantly increased gene expression. Of importance, after TTX cessation and 7 d of recovery, there was a marked increase in the DNA methylation profiles of Trim63 (MuRF1) and Chrna1 back to control levels. This also corresponded with the return of gene expression in the recovery group back to baseline expression observed in sham-surgery controls. To our knowledge, this is the first study to demonstrate that skeletal muscle atrophy in response to disuse is accompanied by dynamic epigenetic modifications that are associated with alterations in gene expression, and that these epigenetic modifications and gene expression profiles are reversible after skeletal muscle returns to normal activity.-Fisher, A. G., Seaborne, R. A., Hughes, T. M., Gutteridge, A., Stewart, C., Coulson, J. M., Sharples, A. P., Jarvis, J. C. Transcriptomic and epigenetic regulation of disuse atrophy and the return to activity in skeletal muscle. © FASEB.

  19. Transcriptome characterization and polymorphism detection between subspecies of big sagebrush (Artemisia tridentata)

    PubMed Central

    2011-01-01

    Background Big sagebrush (Artemisia tridentata) is one of the most widely distributed and ecologically important shrub species in western North America. This species serves as a critical habitat and food resource for many animals and invertebrates. Habitat loss due to a combination of disturbances followed by establishment of invasive plant species is a serious threat to big sagebrush ecosystem sustainability. Lack of genomic data has limited our understanding of the evolutionary history and ecological adaptation in this species. Here, we report on the sequencing of expressed sequence tags (ESTs) and detection of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers in subspecies of big sagebrush. Results cDNA of A. tridentata sspp. tridentata and vaseyana were normalized and sequenced using the 454 GS FLX Titanium pyrosequencing technology. Assembly of the reads resulted in 20,357 contig consensus sequences in ssp. tridentata and 20,250 contigs in ssp. vaseyana. A BLASTx search against the non-redundant (NR) protein database using 29,541 consensus sequences obtained from a combined assembly resulted in 21,436 sequences with significant blast alignments (≤ 1e-15). A total of 20,952 SNPs and 119 polymorphic SSRs were detected between the two subspecies. SNPs were validated through various methods including sequence capture. Validation of SNPs in different individuals uncovered a high level of nucleotide variation in EST sequences. EST sequences of a third, tetraploid subspecies (ssp. wyomingensis) obtained by Illumina sequencing were mapped to the consensus sequences of the combined 454 EST assembly. Approximately one-third of the SNPs between sspp. tridentata and vaseyana identified in the combined assembly were also polymorphic within the two geographically distant ssp. wyomingensis samples. Conclusion We have produced a large EST dataset for Artemisia tridentata, which contains a large sample of the big sagebrush leaf transcriptome. SNP mapping among the three subspecies suggest the origin of ssp. wyomingensis via mixed ancestry. A large number of SNP and SSR markers provide the foundation for future research to address questions in big sagebrush evolution, ecological genetics, and conservation using genomic approaches. PMID:21767398

  20. Active site characterization and molecular cloning of Tenebrio molitor midgut trehalase and comments on their insect homologs.

    PubMed

    Gomez, Ana; Cardoso, Christiane; Genta, Fernando A; Terra, Walter R; Ferreira, Clélia

    2013-08-01

    The soluble midgut trehalase from Tenebrio molitor (TmTre1) was purified after several chromatographic steps, resulting in an enzyme with 58 kDa and pH optimum 5.3 (ionizing active groups in the free enzyme: pK(e1) = 3.8 ± 0.2 pK(e2) = 7.4 ± 0.2). The purified enzyme corresponds to the deduced amino acid sequence of a cloned cDNA (TmTre1-cDNA), because a single cDNA coding a soluble trehalase was found in the T. molitor midgut transcriptome. Furthermore, the mass of the protein predicted to be coded by TmTre1-cDNA agrees with that of the purified enzyme. TmTre1 has the essential catalytic groups Asp 315 and Glu 513 and the essential Arg residues R164, R217, R282. Carbodiimide inactivation of the purified enzyme at different pH values reveals an essential carboxyl group with pKa = 3.5 ± 0.3. Phenylglyoxal modified a single Arg residue with pKa = 7.5 ± 0.2, as observed in the soluble trehalase from Spodoptera frugiperda (SfTre1). Diethylpyrocarbonate modified a His residue that resulted in a less active enzyme with pK(e1) changed to 4.8 ± 0.2. In TmTre1 the modified His residue (putatively His 336) is more exposed than the His modified in SfTre1 (putatively His 210) and that affects the ionization of an Arg residue. The architecture of the active site of TmTre1 and SfTre1 is different, as shown by multiple inhibition analysis, the meaning of which demands further research. Trehalase sequences obtained from midgut transcriptomes (pyrosequencing and Illumina data) from 8 insects pertaining to 5 different orders were used in a cladogram, together with other representative sequences. The data suggest that the trehalase gene went duplication and divergence prior to the separation of the paraneopteran and holometabolan orders and that the soluble trehalase derived from the membrane-bound one by losing the C-terminal transmembrane loop. Copyright © 2013 Elsevier Ltd. All rights reserved.

  1. Next-Generation Genotyping by Digital PCR to Detect and Quantify the BRAF V600E Mutation in Melanoma Biopsies.

    PubMed

    Lamy, Pierre-Jean; Castan, Florence; Lozano, Nicolas; Montélion, Cécile; Audran, Patricia; Bibeau, Frédéric; Roques, Sylvie; Montels, Frédéric; Laberenne, Anne-Claire

    2015-07-01

    The detection of the BRAF V600E mutation in melanoma samples is used to select patients who should respond to BRAF inhibitors. Different techniques are routinely used to determine BRAF status in clinical samples. However, low tumor cellularity and tumor heterogeneity can affect the sensitivity of somatic mutation detection. Digital PCR (dPCR) is a next-generation genotyping method that clonally amplifies nucleic acids and allows the detection and quantification of rare mutations. Our aim was to evaluate the clinical routine performance of a new dPCR-based test to detect and quantify BRAF mutation load in 47 paraffin-embedded cutaneous melanoma biopsies. We compared the results obtained by dPCR with high-resolution melting curve analysis and pyrosequencing or with one of the allele-specific PCR methods available on the market. dPCR showed the lowest limit of detection. dPCR and allele-specific amplification detected the highest number of mutated samples. For the BRAF mutation load quantification both dPCR and pyrosequencing gave similar results with strong disparities in allele frequencies in the 47 tumor samples under study (from 0.7% to 79% of BRAF V600E mutations/sample). In conclusion, the four methods showed a high degree of concordance. dPCR was the more-sensitive method to reliably and easily detect mutations. Both pyrosequencing and dPCR could quantify the mutation load in heterogeneous tumor samples. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  2. Analysis of bacterial xylose isomerase gene diversity using gene-targeted metagenomics.

    PubMed

    Nurdiani, Dini; Ito, Michihiro; Maruyama, Toru; Terahara, Takeshi; Mori, Tetsushi; Ugawa, Shin; Takeyama, Haruko

    2015-08-01

    Bacterial xylose isomerases (XI) are promising resources for efficient biofuel production from xylose in lignocellulosic biomass. Here, we investigated xylose isomerase gene (xylA) diversity in three soil metagenomes differing in plant vegetation and geographical location, using an amplicon pyrosequencing approach and two newly-designed primer sets. A total of 158,555 reads from three metagenomic DNA replicates for each soil sample were classified into 1127 phylotypes, detected in triplicate and defined by 90% amino acid identity. The phylotype coverage was estimated to be within the range of 84.0-92.7%. The xylA gene phylotypes obtained were phylogenetically distributed across the two known xylA groups. They shared 49-100% identities with their closest-related XI sequences in GenBank. Phylotypes demonstrating <90% identity with known XIs in the database accounted for 89% of the total xylA phylotypes. The differences among xylA members and compositions within each soil sample were significantly smaller than they were between different soils based on a UniFrac distance analysis, suggesting soil-specific xylA genotypes and taxonomic compositions. The differences among xylA members and their compositions in the soil were strongly correlated with 16S rRNA variation between soil samples, also assessed by amplicon pyrosequencing. This is the first report of xylA diversity in environmental samples assessed by amplicon pyrosequencing. Our data provide information regarding xylA diversity in nature, and can be a basis for the screening of novel xylA genotypes for practical applications. Copyright © 2015. Published by Elsevier B.V.

  3. Maternal blood contamination of collected cord blood can be identified using DNA methylation at three CpGs.

    PubMed

    Morin, Alexander M; Gatev, Evan; McEwen, Lisa M; MacIsaac, Julia L; Lin, David T S; Koen, Nastassja; Czamara, Darina; Räikkönen, Katri; Zar, Heather J; Koenen, Karestan; Stein, Dan J; Kobor, Michael S; Jones, Meaghan J

    2017-01-01

    Cord blood is a commonly used tissue in environmental, genetic, and epigenetic population studies due to its ready availability and potential to inform on a sensitive period of human development. However, the introduction of maternal blood during labor or cross-contamination during sample collection may complicate downstream analyses. After discovering maternal contamination of cord blood in a cohort study of 150 neonates using Illumina 450K DNA methylation (DNAm) data, we used a combination of linear regression and random forest machine learning to create a DNAm-based screening method. We identified a panel of DNAm sites that could discriminate between contaminated and non-contaminated samples, then designed pyrosequencing assays to pre-screen DNA prior to being assayed on an array. Maternal contamination of cord blood was initially identified by unusual X chromosome DNA methylation patterns in 17 males. We utilized our DNAm panel to detect contaminated male samples and a proportional amount of female samples in the same cohort. We validated our DNAm screening method on an additional 189 sample cohort using both pyrosequencing and DNAm arrays, as well as 9 publically available cord blood 450K data sets. The rate of contamination varied from 0 to 10% within these studies, likely related to collection specific methods. Maternal blood can contaminate cord blood during sample collection at appreciable levels across multiple studies. We have identified a panel of markers that can be used to identify this contamination, either post hoc after DNAm arrays have been completed, or in advance using a targeted technique like pyrosequencing.

  4. Pyrosequencing analysis of oral microbiota shifting in various caries states in childhood.

    PubMed

    Jiang, Wen; Ling, Zongxin; Lin, Xiaolong; Chen, Yadong; Zhang, Jie; Yu, Jinjin; Xiang, Charlie; Chen, Hui

    2014-05-01

    Dental caries is one of the most prevalent childhood diseases worldwide, but little is known about the dynamic characteristics of oral microbiota in the development of dental caries. To investigate the shifting bacterial profiles in different caries states, 60 children (3-7-year-old) were enrolled in this study, including 30 caries-free subjects and 30 caries-active subjects. Supragingival plaques were collected from caries-active subjects on intact enamel, white spot lesions and carious dentin lesions. Plaques from caries-free subjects were used as a control. All samples were analyzed by 454 pyrosequencing based on 16S rRNA gene V1-V3 hypervariable regions. A total of 572,773 pyrosequencing reads passed the quality control and 25,444 unique phylotypes were identified, which represented 18 phyla and 145 genera. Reduced bacterial diversity in the cavitated dentin was observed as compared with the other groups. Thirteen genera (including Capnocytophaga, Fusobacterium, Porphyromonas, Abiotrophia, Comamonas, Tannerella, Eikenella, Paludibacter, Treponema, Actinobaculum, Stenotrophomonas, Aestuariimicrobium, and Peptococcus) were found to be associated with dental health, and the bacterial profiles differed considerably depending on caries status. Eight genera (including Cryptobacterium, Lactobacillus, Megasphaera, Olsenella, Scardovia, Shuttleworthia, Cryptobacterium, and Streptococcus) were increased significantly in cavitated dentin lesions, and Actinomyces and Corynebacterium were present at significant high levels in white spot lesions (P < 0.05), while Flavobacterium, Neisseria, Bergeyella, and Derxia were enriched in the intact surfaces of caries individuals (P < 0.05). Our results showed that oral bacteria were specific at different stages of caries progression, which contributes to informing the prevention and treatment of childhood dental caries.

  5. Biosynthesis of the active compounds of Isatis indigotica based on transcriptome sequencing and metabolites profiling

    PubMed Central

    2013-01-01

    Backgroud Isatis indigotica is a widely used herb for the clinical treatment of colds, fever, and influenza in Traditional Chinese Medicine (TCM). Various structural classes of compounds have been identified as effective ingredients. However, little is known at genetics level about these active metabolites. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive dataset of I. indigotica. Results A database of 36,367 unigenes (average length = 1,115.67 bases) was generated by performing transcriptome sequencing. Based on the gene annotation of the transcriptome, 104 unigenes were identified covering most of the catalytic steps in the general biosynthetic pathways of indole, terpenoid, and phenylpropanoid. Subsequently, the organ-specific expression patterns of the genes involved in these pathways, and their responses to methyl jasmonate (MeJA) induction, were investigated. Metabolites profile of effective phenylpropanoid showed accumulation pattern of secondary metabolites were mostly correlated with the transcription of their biosynthetic genes. According to the analysis of UDP-dependent glycosyltransferases (UGT) family, several flavonoids were indicated to exist in I. indigotica and further identified by metabolic profile using UPLC/Q-TOF. Moreover, applying transcriptome co-expression analysis, nine new, putative UGTs were suggested as flavonol glycosyltransferases and lignan glycosyltransferases. Conclusions This database provides a pool of candidate genes involved in biosynthesis of effective metabolites in I. indigotica. Furthermore, the comprehensive analysis and characterization of the significant pathways are expected to give a better insight regarding the diversity of chemical composition, synthetic characteristics, and the regulatory mechanism which operate in this medical herb. PMID:24308360

  6. Fine-Scale Bacterial Beta Diversity within a Complex Ecosystem (Zodletone Spring, OK, USA): The Role of the Rare Biosphere

    PubMed Central

    Youssef, Noha H.; Couger, M. B.; Elshahed, Mostafa S.

    2010-01-01

    Background The adaptation of pyrosequencing technologies for use in culture-independent diversity surveys allowed for deeper sampling of ecosystems of interest. One extremely well suited area of interest for pyrosequencing-based diversity surveys that has received surprisingly little attention so far, is examining fine scale (e.g. micrometer to millimeter) beta diversity in complex microbial ecosystems. Methodology/Principal Findings We examined the patterns of fine scale Beta diversity in four adjacent sediment samples (1mm apart) from the source of an anaerobic sulfide and sulfur rich spring (Zodletone spring) in southwestern Oklahoma, USA. Using pyrosequencing, a total of 292,130 16S rRNA gene sequences were obtained. The beta diversity patterns within the four datasets were examined using various qualitative and quantitative similarity indices. Low levels of Beta diversity (high similarity indices) were observed between the four samples at the phylum-level. However, at a putative species (OTU0.03) level, higher levels of beta diversity (lower similarity indices) were observed. Further examination of beta diversity patterns within dominant and rare members of the community indicated that at the putative species level, beta diversity is much higher within rare members of the community. Finally, sub-classification of rare members of Zodletone spring community based on patterns of novelty and uniqueness, and further examination of fine scale beta diversity of each of these subgroups indicated that members of the community that are unique, but non novel showed the highest beta diversity within these subgroups of the rare biosphere. Conclusions/Significance The results demonstrate the occurrence of high inter-sample diversity within seemingly identical samples from a complex habitat. We reason that such unexpected diversity should be taken into consideration when exploring gamma diversity of various ecosystems, as well as planning for sequencing-intensive metagenomic surveys of highly complex ecosystems. PMID:20865128

  7. Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.

    PubMed

    Kisand, Veljo; Lettieri, Teresa

    2013-04-01

    De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (<450 bps), which are presumed to aid in the analysis of uncharacterized genomes. The array of tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize unknown bacteria with modest effort.

  8. Improving transcriptome construction in non-model organisms: integrating manual and automated gene definition in Emiliania huxleyi.

    PubMed

    Feldmesser, Ester; Rosenwasser, Shilo; Vardi, Assaf; Ben-Dor, Shifra

    2014-02-22

    The advent of Next Generation Sequencing technologies and corresponding bioinformatics tools allows the definition of transcriptomes in non-model organisms. Non-model organisms are of great ecological and biotechnological significance, and consequently the understanding of their unique metabolic pathways is essential. Several methods that integrate de novo assembly with genome-based assembly have been proposed. Yet, there are many open challenges in defining genes, particularly where genomes are not available or incomplete. Despite the large numbers of transcriptome assemblies that have been performed, quality control of the transcript building process, particularly on the protein level, is rarely performed if ever. To test and improve the quality of the automated transcriptome reconstruction, we used manually defined and curated genes, several of them experimentally validated. Several approaches to transcript construction were utilized, based on the available data: a draft genome, high quality RNAseq reads, and ESTs. In order to maximize the contribution of the various data, we integrated methods including de novo and genome based assembly, as well as EST clustering. After each step a set of manually curated genes was used for quality assessment of the transcripts. The interplay between the automated pipeline and the quality control indicated which additional processes were required to improve the transcriptome reconstruction. We discovered that E. huxleyi has a very high percentage of non-canonical splice junctions, and relatively high rates of intron retention, which caused unique issues with the currently available tools. While individual tools missed genes and artificially joined overlapping transcripts, combining the results of several tools improved the completeness and quality considerably. The final collection, created from the integration of several quality control and improvement rounds, was compared to the manually defined set both on the DNA and protein levels, and resulted in an improvement of 20% versus any of the read-based approaches alone. To the best of our knowledge, this is the first time that an automated transcript definition is subjected to quality control using manually defined and curated genes and thereafter the process is improved. We recommend using a set of manually curated genes to troubleshoot transcriptome reconstruction.

  9. In silico mining and PCR-based approaches to transcription factor discovery in non-model plants: gene discovery of the WRKY transcription factors in conifers.

    PubMed

    Liu, Jun-Jun; Xiang, Yu

    2011-01-01

    WRKY transcription factors are key regulators of numerous biological processes in plant growth and development, as well as plant responses to abiotic and biotic stresses. Research on biological functions of plant WRKY genes has focused in the past on model plant species or species with largely characterized transcriptomes. However, a variety of non-model plants, such as forest conifers, are essential as feed, biofuel, and wood or for sustainable ecosystems. Identification of WRKY genes in these non-model plants is equally important for understanding the evolutionary and function-adaptive processes of this transcription factor family. Because of limited genomic information, the rarity of regulatory gene mRNAs in transcriptomes, and the sequence divergence to model organism genes, identification of transcription factors in non-model plants using methods similar to those generally used for model plants is difficult. This chapter describes a gene family discovery strategy for identification of WRKY transcription factors in conifers by a combination of in silico-based prediction and PCR-based experimental approaches. Compared to traditional cDNA library screening or EST sequencing at transcriptome scales, this integrated gene discovery strategy provides fast, simple, reliable, and specific methods to unveil the WRKY gene family at both genome and transcriptome levels in non-model plants.

  10. Harnessing pain heterogeneity and RNA transcriptome to identify blood-based pain biomarkers: a novel correlational study design and bioinformatics approach in a graded chronic constriction injury model.

    PubMed

    Grace, Peter M; Hurley, Daniel; Barratt, Daniel T; Tsykin, Anna; Watkins, Linda R; Rolan, Paul E; Hutchinson, Mark R

    2012-09-01

    A quantitative, peripherally accessible biomarker for neuropathic pain has great potential to improve clinical outcomes. Based on the premise that peripheral and central immunity contribute to neuropathic pain mechanisms, we hypothesized that biomarkers could be identified from the whole blood of adult male rats, by integrating graded chronic constriction injury (CCI), ipsilateral lumbar dorsal quadrant (iLDQ) and whole blood transcriptomes, and pathway analysis with pain behavior. Correlational bioinformatics identified a range of putative biomarker genes for allodynia intensity, many encoding for proteins with a recognized role in immune/nociceptive mechanisms. A selection of these genes was validated in a separate replication study. Pathway analysis of the iLDQ transcriptome identified Fcγ and Fcε signaling pathways, among others. This study is the first to employ the whole blood transcriptome to identify pain biomarker panels. The novel correlational bioinformatics, developed here, selected such putative biomarkers based on a correlation with pain behavior and formation of signaling pathways with iLDQ genes. Future studies may demonstrate the predictive ability of these biomarker genes across other models and additional variables. © 2012 The Authors. Journal of Neurochemistry © 2012 International Society for Neurochemistry.

  11. Multiplexed transcriptome analysis to detect ALK, ROS1 and RET rearrangements in lung cancer

    PubMed Central

    Rogers, Toni-Maree; Arnau, Gisela Mir; Ryland, Georgina L.; Huang, Stephen; Lira, Maruja E.; Emmanuel, Yvette; Perez, Omar D.; Irwin, Darryl; Fellowes, Andrew P.; Wong, Stephen Q.; Fox, Stephen B.

    2017-01-01

    ALK, ROS1 and RET gene fusions are important predictive biomarkers for tyrosine kinase inhibitors in lung cancer. Currently, the gold standard method for gene fusion detection is Fluorescence In Situ Hybridization (FISH) and while highly sensitive and specific, it is also labour intensive, subjective in analysis, and unable to screen a large numbers of gene fusions. Recent developments in high-throughput transcriptome-based methods may provide a suitable alternative to FISH as they are compatible with multiplexing and diagnostic workflows. However, the concordance between these different methods compared with FISH has not been evaluated. In this study we compared the results from three transcriptome-based platforms (Nanostring Elements, Agena LungFusion panel and ThermoFisher NGS fusion panel) to those obtained from ALK, ROS1 and RET FISH on 51 clinical specimens. Overall agreement of results ranged from 86–96% depending on the platform used. While all platforms were highly sensitive, both the Agena panel and Thermo Fisher NGS fusion panel reported minor fusions that were not detectable by FISH. Our proof–of–principle study illustrates that transcriptome-based analyses are sensitive and robust methods for detecting actionable gene fusions in lung cancer and could provide a robust alternative to FISH testing in the diagnostic setting. PMID:28181564

  12. Integrative structural annotation of de novo RNA-Seq provides an accurate reference gene set of the enormous genome of the onion (Allium cepa L.)

    PubMed Central

    Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil

    2015-01-01

    The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. PMID:25362073

  13. rnaQUAST: a quality assessment tool for de novo transcriptome assemblies.

    PubMed

    Bushmanova, Elena; Antipov, Dmitry; Lapidus, Alla; Suvorov, Vladimir; Prjibelski, Andrey D

    2016-07-15

    Ability to generate large RNA-Seq datasets created a demand for both de novo and reference-based transcriptome assemblers. However, while many transcriptome assemblers are now available, there is still no unified quality assessment tool for RNA-Seq assemblies. We present rnaQUAST-a tool for evaluating RNA-Seq assembly quality and benchmarking transcriptome assemblers using reference genome and gene database. rnaQUAST calculates various metrics that demonstrate completeness and correctness levels of the assembled transcripts, and outputs them in a user-friendly report. rnaQUAST is implemented in Python and is freely available at http://bioinf.spbau.ru/en/rnaquast ap@bioinf.spbau.ru Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. A comprehensive insight into bacterial virulence in drinking water using 454 pyrosequencing and Illumina high-throughput sequencing.

    PubMed

    Huang, Kailong; Zhang, Xu-Xiang; Shi, Peng; Wu, Bing; Ren, Hongqiang

    2014-11-01

    In order to comprehensively investigate bacterial virulence in drinking water, 454 pyrosequencing and Illumina high-throughput sequencing were used to detect potential pathogenic bacteria and virulence factors (VFs) in a full-scale drinking water treatment and distribution system. 16S rRNA gene pyrosequencing revealed high bacterial diversity in the drinking water (441-586 operational taxonomic units). Bacterial diversity decreased after chlorine disinfection, but increased after pipeline distribution. α-Proteobacteria was the most dominant taxonomic class. Alignment against the established pathogen database showed that several types of putative pathogens were present in the drinking water and Pseudomonas aeruginosa had the highest abundance (over 11‰ of total sequencing reads). Many pathogens disappeared after chlorine disinfection, but P. aeruginosa and Leptospira interrogans were still detected in the tap water. High-throughput sequencing revealed prevalence of various pathogenicity islands and virulence proteins in the drinking water, and translocases, transposons, Clp proteases and flagellar motor switch proteins were the predominant VFs. Both diversity and abundance of the detectable VFs increased after the chlorination, and decreased after the pipeline distribution. This study indicates that joint use of 454 pyrosequencing and Illumina sequencing can comprehensively characterize environmental pathogenesis, and several types of putative pathogens and various VFs are prevalent in drinking water. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Detection of novel NF1 mutations and rapid mutation prescreening with Pyrosequencing.

    PubMed

    Brinckmann, Anja; Mischung, Claudia; Bässmann, Ingelore; Kühnisch, Jirko; Schuelke, Markus; Tinschert, Sigrid; Nürnberg, Peter

    2007-12-01

    Neurofibromatosis type 1 (NF1) is caused by mutations in the neurofibromin (NF1) gene. Mutation analysis of NF1 is complicated by its large size, the lack of mutation hotspots, pseudogenes and frequent de novo mutations. Additionally, the search for NF1 mutations on the mRNA level is often hampered by nonsense-mediated mRNA decay (NMD) of the mutant allele. In this study we searched for mutations in a cohort of 38 patients and investigated the relationship between mutation type and allele-specific transcription from the wild-type versus mutant alleles. Quantification of relative mRNA transcript numbers was done by Pyrosequencing, a novel real-time sequencing method whose signals can be quantified very accurately. We identified 21 novel mutations comprising various mutation types. Pyrosequencing detected a definite relationship between allelic NF1 transcript imbalance due to NMD and mutation type in 24 of 29 patients who all carried frame-shift or nonsense mutations. NMD was absent in 5 patients with missense and silent mutations, as well as in 4 patients with splice-site mutations that did not disrupt the reading frame. Pyrosequencing was capable of detecting NMD even when the effects were only moderate. Diagnostic laboratories could thus exploit this effect for rapid prescreening for NF1 mutations as more than 60% of the mutations in this gene disrupt the reading frame and are prone to NMD.

  16. Pyrosequencing-based validation of a simple cell-suspension polymerase chain reaction assay for Campylobacter... of high-processivity polymerase with novel internal amplification controls for rapid and specific detection.

    USDA-ARS?s Scientific Manuscript database

    Although Campylobacter is an important food-borne human pathogen, there remains a lack of molecular diagnostic assays that are simple to use, cost-effective, and provide rapid results in research, clinical, or regulatory laboratories. Of the numerous Campylobacter assays that do exist, to our knowl...

  17. Comparison of Species Richness Estimates Obtained Using Nearly Complete Fragments and Simulated Pyrosequencing-Generated Fragments in 16S rRNA Gene-Based Environmental Surveys▿ †

    PubMed Central

    Youssef, Noha; Sheik, Cody S.; Krumholz, Lee R.; Najar, Fares Z.; Roe, Bruce A.; Elshahed, Mostafa S.

    2009-01-01

    Pyrosequencing-based 16S rRNA gene surveys are increasingly utilized to study highly diverse bacterial communities, with special emphasis on utilizing the large number of sequences obtained (tens to hundreds of thousands) for species richness estimation. However, it is not yet clear how the number of operational taxonomic units (OTUs) and, hence, species richness estimates determined using shorter fragments at different taxonomic cutoffs correlates with the number of OTUs assigned using longer, nearly complete 16S rRNA gene fragments. We constructed a 16S rRNA clone library from an undisturbed tallgrass prairie soil (1,132 clones) and used it to compare species richness estimates obtained using eight pyrosequencing candidate fragments (99 to 361 bp in length) and the nearly full-length fragment. Fragments encompassing the V1 and V2 (V1+V2) region and the V6 region (generated using primer pairs 8F-338R and 967F-1046R) overestimated species richness; fragments encompassing the V3, V7, and V7+V8 hypervariable regions (generated using primer pairs 338F-530R, 1046F-1220R, and 1046F-1392R) underestimated species richness; and fragments encompassing the V4, V5+V6, and V6+V7 regions (generated using primer pairs 530F-805R, 805F-1046R, and 967F-1220R) provided estimates comparable to those obtained with the nearly full-length fragment. These patterns were observed regardless of the alignment method utilized or the parameter used to gauge comparative levels of species richness (number of OTUs observed, slope of scatter plots of pairwise distance values for short and nearly complete fragments, and nonparametric and parametric species richness estimates). Similar results were obtained when analyzing three other datasets derived from soil, adult Zebrafish gut, and basaltic formations in the East Pacific Rise. Regression analysis indicated that these observed discrepancies in species richness estimates within various regions could readily be explained by the proportions of hypervariable, variable, and conserved base pairs within an examined fragment. PMID:19561178

  18. Characterization of the root transcriptome for iron and zinc homeostasis-related genes in indica rice (Oryza sativa L)

    USDA-ARS?s Scientific Manuscript database

    Micronutrient malnutrition is the most common form of nutrient deficiency among populations having a cereal based-diet. Rice is the staple food for one third of the world’s population, but is a poor source of iron and zinc concentration. We have characterized the root transcriptome of diverse indica...

  19. Gene Expression Analysis of Copper Tolerance and Wood Decay in the Brown Rot Fungus Fibroporia radiculosa

    Treesearch

    J. D. Tang; L. A. Parker; A. D. Perkins; T. S. Sonstegard; S. G. Schroeder; D. D. Nicholas; S. V. Diehl

    2013-01-01

    High-throughput transcriptomics was used to identify Fibroporia radiculosa genes that were differentially regulated during colonization of wood treated with a copper-based preservative. The transcriptome was profiled at two time points while the fungus was growing on wood treated with micronized copper quat (MCQ). A total of 917 transcripts were...

  20. 20180312 - Application of a Multiplexed High Content Imaging (HCI) Based Cell Viability and Apoptosis Chemical Screening Assay with Results in MCF-7 Cells (SOT)

    EPA Science Inventory

    The NCCT high throughput transcriptomics (HTTr) screening program uses whole transcriptome profiling assay in human-derived cells to collect concentration-response data for large numbers (100s-1000s) of environmental chemicals. To contextualize HTTr data, chemical effects on cell...

  1. Haematobia irritans dataset of raw sequence reads from Illumina-based transcriptome sequencing of specific tissues and life stages

    USDA-ARS?s Scientific Manuscript database

    Illumina HiSeq technology was used to sequence the transcriptome from various dissected tissues and life stages from the horn fly, Haematobia irritans. These samples include eggs (0, 2, 4, and 9 hours post-oviposition), adult fly gut, adult fly legs, adult fly malpighian tubule, adult fly ovary, adu...

  2. De novo Transcriptome Assembly of Common Wild Rice (Oryza rufipogon Griff.) and Discovery of Drought-Response Genes in Root Tissue Based on Transcriptomic Data.

    PubMed

    Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu

    2015-01-01

    The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.

  3. Impact of Transcriptomics on Our Understanding of Pulmonary Fibrosis

    PubMed Central

    Vukmirovic, Milica; Kaminski, Naftali

    2018-01-01

    Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease characterized by aberrant remodeling of the lung parenchyma with extensive changes to the phenotypes of all lung resident cells. The introduction of transcriptomics, genome scale profiling of thousands of RNA transcripts, caused a significant inversion in IPF research. Instead of generating hypotheses based on animal models of disease, or biological plausibility, with limited validation in humans, investigators were able to generate hypotheses based on unbiased molecular analysis of human samples and then use animal models of disease to test their hypotheses. In this review, we describe the insights made from transcriptomic analysis of human IPF samples. We describe how transcriptomic studies led to identification of novel genes and pathways involved in the human IPF lung such as: matrix metalloproteinases, WNT pathway, epithelial genes, role of microRNAs among others, as well as conceptual insights such as the involvement of developmental pathways and deep shifts in epithelial and fibroblast phenotypes. The impact of lung and transcriptomic studies on disease classification, endotype discovery, and reproducible biomarkers is also described in detail. Despite these impressive achievements, the impact of transcriptomic studies has been limited because they analyzed bulk tissue and did not address the cellular and spatial heterogeneity of the IPF lung. We discuss new emerging technologies and applications, such as single-cell RNAseq and microenvironment analysis that may address cellular and spatial heterogeneity. We end by making the point that most current tissue collections and resources are not amenable to analysis using the novel technologies. To take advantage of the new opportunities, we need new efforts of sample collections, this time focused on access to all the microenvironments and cells in the IPF lung. PMID:29670881

  4. Determining the optimal number of independent components for reproducible transcriptomic data analysis.

    PubMed

    Kairov, Ulykbek; Cantini, Laura; Greco, Alessandro; Molkenov, Askhat; Czerwinska, Urszula; Barillot, Emmanuel; Zinovyev, Andrei

    2017-09-11

    Independent Component Analysis (ICA) is a method that models gene expression data as an action of a set of statistically independent hidden factors. The output of ICA depends on a fundamental parameter: the number of components (factors) to compute. The optimal choice of this parameter, related to determining the effective data dimension, remains an open question in the application of blind source separation techniques to transcriptomic data. Here we address the question of optimizing the number of statistically independent components in the analysis of transcriptomic data for reproducibility of the components in multiple runs of ICA (within the same or within varying effective dimensions) and in multiple independent datasets. To this end, we introduce ranking of independent components based on their stability in multiple ICA computation runs and define a distinguished number of components (Most Stable Transcriptome Dimension, MSTD) corresponding to the point of the qualitative change of the stability profile. Based on a large body of data, we demonstrate that a sufficient number of dimensions is required for biological interpretability of the ICA decomposition and that the most stable components with ranks below MSTD have more chances to be reproduced in independent studies compared to the less stable ones. At the same time, we show that a transcriptomics dataset can be reduced to a relatively high number of dimensions without losing the interpretability of ICA, even though higher dimensions give rise to components driven by small gene sets. We suggest a protocol of ICA application to transcriptomics data with a possibility of prioritizing components with respect to their reproducibility that strengthens the biological interpretation. Computing too few components (much less than MSTD) is not optimal for interpretability of the results. The components ranked within MSTD range have more chances to be reproduced in independent studies.

  5. Comparison of the Nodule vs. Root Transcriptome of the Actinorhizal Plant Datisca glomerata: Actinorhizal Nodules Contain a Specific Class of Defensins

    PubMed Central

    Santos, Patricia; Plaszczyca, Marian; Pawlowski, Katharina

    2013-01-01

    Actinorhizal root nodule symbioses are very diverse, and the symbiosis of Datisca glomerata has previously been shown to have many unusual aspects. In order to gain molecular information on the infection mechanism, nodule development and nodule metabolism, we compared the transcriptomes of D. glomerata roots and nodules. Root and nodule libraries representing the 3′-ends of cDNAs were subjected to high-throughput parallel 454 sequencing. To identify the corresponding genes and to improve the assembly, Illumina sequencing of the nodule transcriptome was performed as well. The evaluation revealed 406 differentially regulated genes, 295 of which (72.7%) could be assigned a function based on homology. Analysis of the nodule transcriptome showed that genes encoding components of the common symbiosis signaling pathway were present in nodules of D. glomerata, which in combination with the previously established function of SymRK in D. glomerata nodulation suggests that this pathway is also active in actinorhizal Cucurbitales. Furthermore, comparison of the D. glomerata nodule transcriptome with nodule transcriptomes from actinorhizal Fagales revealed a new subgroup of nodule-specific defensins that might play a role specific to actinorhizal symbioses. The D. glomerata members of this defensin subgroup contain an acidic C-terminal domain that was never found in plant defensins before. PMID:24009681

  6. RNA-Seq Technology and Its Application in Fish Transcriptomics

    PubMed Central

    Ba, Yi; Zhuang, Qianfeng

    2014-01-01

    Abstract High-throughput sequencing technologies, also known as next-generation sequencing (NGS) technologies, have revolutionized the way that genomic research is advancing. In addition to the static genome, these state-of-art technologies have been recently exploited to analyze the dynamic transcriptome, and the resulting technology is termed RNA sequencing (RNA-seq). RNA-seq is free from many limitations of other transcriptomic approaches, such as microarray and tag-based sequencing method. Although RNA-seq has only been available for a short time, studies using this method have completely changed our perspective of the breadth and depth of eukaryotic transcriptomes. In terms of the transcriptomics of teleost fishes, both model and non-model species have benefited from the RNA-seq approach and have undergone tremendous advances in the past several years. RNA-seq has helped not only in mapping and annotating fish transcriptome but also in our understanding of many biological processes in fish, such as development, adaptive evolution, host immune response, and stress response. In this review, we first provide an overview of each step of RNA-seq from library construction to the bioinformatic analysis of the data. We then summarize and discuss the recent biological insights obtained from the RNA-seq studies in a variety of fish species. PMID:24380445

  7. Use of pyrosequencing and denaturing gradient gel electrophoresis to examine the effects of probiotics and essential oil blends on digestive microflora in broilers under mixed Eimeria infection.

    PubMed

    Hume, Michael E; Barbosa, Nei A; Dowd, Scot E; Sakomura, Nilva K; Nalian, Armen G; Martynova-Van Kley, Alexandra; Oviedo-Rondón, Edgar O

    2011-11-01

    A protective digestive microflora helps prevent and reduce broiler infection and colonization by enteropathogens. In the current experiment, broilers fed diets supplemented with probiotics and essential oil (EO) blends were infected with a standard mixed Eimeria spp. to determine effects of performance enhancers on ileal and cecal microbial communities (MCs). Eight treatment groups included four controls (uninfected-unmedicated [UU], unmedicated-infected, the antibiotic BMD plus the ionophore Coban as positive control, and the ionophore as negative control), and four treatments (probiotics BC-30 and Calsporin; and EO, Crina Poultry Plus, and Crina PoultryAF). Day-old broilers were raised to 14 days in floor pens on used litter and then were moved to Petersime batteries and inoculated at 15 days with mixed Eimeria spp. Ileal and cecal samples were collected at 14 days and 7 days postinfection. Digesta DNA was subjected to pyrosequencing for sequencing of individual cecal bacteria and denaturing gradient gel electrophoresis (DGGE) for determination of changes in ileal and cecal MC according to percentage similarity coefficient (%SC). Pyrosequencing is very sensitive detecting shifts in individual bacterial sequences, whereas DGGE is able to detect gross shifts in entire MC. These combined techniques offer versatility toward identifying feed additive and mild Eimeria infection modulation of broiler MC. Pyrosequencing detected 147 bacterial species sequences. Additionally, pyrosequencing revealed the presence of relatively low levels of the potential human enteropathogens Campylobacter sp. and four Shigella spp. as well as the potential poultry pathogen Clostridiun perfringens. Pre- and postinfection changes in ileal (56%SC) and cecal (78.5%SC) DGGE profiles resulted from the coccidia infection and with increased broiler age. Probiotics and EO changed MC from those seen in UU ilea and ceca. Results potentially reflect the performance enhancement above expectations in comparison to broilers not given the probiotics or the specific EO blends as feed supplements.

  8. 454 next generation-sequencing outperforms allele-specific PCR, Sanger sequencing, and pyrosequencing for routine KRAS mutation analysis of formalin-fixed, paraffin-embedded samples

    PubMed Central

    Altimari, Annalisa; de Biase, Dario; De Maglio, Giovanna; Gruppioni, Elisa; Capizzi, Elisa; Degiovanni, Alessio; D’Errico, Antonia; Pession, Annalisa; Pizzolitto, Stefano; Fiorentino, Michelangelo; Tallini, Giovanni

    2013-01-01

    Detection of KRAS mutations in archival pathology samples is critical for therapeutic appropriateness of anti-EGFR monoclonal antibodies in colorectal cancer. We compared the sensitivity, specificity, and accuracy of Sanger sequencing, ARMS-Scorpion (TheraScreen®) real-time polymerase chain reaction (PCR), pyrosequencing, chip array hybridization, and 454 next-generation sequencing to assess KRAS codon 12 and 13 mutations in 60 nonconsecutive selected cases of colorectal cancer. Twenty of the 60 cases were detected as wild-type KRAS by all methods with 100% specificity. Among the 40 mutated cases, 13 were discrepant with at least one method. The sensitivity was 85%, 90%, 93%, and 92%, and the accuracy was 90%, 93%, 95%, and 95% for Sanger sequencing, TheraScreen real-time PCR, pyrosequencing, and chip array hybridization, respectively. The main limitation of Sanger sequencing was its low analytical sensitivity, whereas TheraScreen real-time PCR, pyrosequencing, and chip array hybridization showed higher sensitivity but suffered from the limitations of predesigned assays. Concordance between the methods was k = 0.79 for Sanger sequencing and k > 0.85 for the other techniques. Tumor cell enrichment correlated significantly with the abundance of KRAS-mutated deoxyribonucleic acid (DNA), evaluated as ΔCt for TheraScreen real-time PCR (P = 0.03), percentage of mutation for pyrosequencing (P = 0.001), ratio for chip array hybridization (P = 0.003), and percentage of mutation for 454 next-generation sequencing (P = 0.004). Also, 454 next-generation sequencing showed the best cross correlation for quantification of mutation abundance compared with all the other methods (P < 0.001). Our comparison showed the superiority of next-generation sequencing over the other techniques in terms of sensitivity and specificity. Next-generation sequencing will replace Sanger sequencing as the reference technique for diagnostic detection of KRAS mutation in archival tumor tissues. PMID:23950653

  9. MGMT promoter methylation determined by HRM in comparison to MSP and pyrosequencing for predicting high-grade glioma response.

    PubMed

    Switzeny, Olivier J; Christmann, Markus; Renovanz, Mirjam; Giese, Alf; Sommer, Clemens; Kaina, Bernd

    2016-01-01

    The DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) causes resistance of cancer cells to alkylating agents and, therefore, is a well-established predictive marker for high-grade gliomas that are routinely treated with alkylating drugs. Since MGMT is highly epigenetically regulated, the MGMT promoter methylation status is taken as an indicator of MGMT silencing, predicting the outcome of glioma therapy. MGMT promoter methylation is usually determined by methylation specific PCR (MSP), which is a labor intensive and error-prone method often used semi-quantitatively. Searching for alternatives, we used closed-tube high resolution melt (HRM) analysis, which is a quantitative method, and compared it with MSP and pyrosequencing regarding its predictive value. We analyzed glioblastoma cell lines with known MGMT activity and formalin-fixed samples from IDH1 wild-type high-grade glioma patients (WHO grade III/IV) treated with radiation and temozolomide by HRM, MSP, and pyrosequencing. The data were compared as to progression-free survival (PFS) and overall survival (OS) of patients exhibiting the methylated and unmethylated MGMT status. A promoter methylation cut-off level relevant for PFS and OS was determined. In a multivariate Cox regression model, methylation of MGMT promoter of high-grade gliomas analyzed by HRM, but not MSP, was found to be an independent predictive marker for OS. Univariate Kaplan-Meier analyses revealed for PFS and OS a significant and better discrimination between methylated and unmethylated tumors when quantitative HRM was used instead of MSP. Compared to MSP and pyrosequencing, the HRM method is simple, cost effective, highly accurate and fast. HRM is at least equivalent to pyrosequencing in quantifying the methylation level. It is superior in predicting PFS and OS of high-grade glioma patients compared to MSP and, therefore, can be recommended being used routinely for determination of the MGMT status of gliomas.

  10. High throughput pyrosequencing technology for molecular differential detection of Babesia vogeli, Hepatozoon canis, Ehrlichia canis and Anaplasma platys in canine blood samples.

    PubMed

    Kaewkong, Worasak; Intapan, Pewpan M; Sanpool, Oranuch; Janwan, Penchom; Thanchomnang, Tongjit; Kongklieng, Amornmas; Tantrawatpan, Chairat; Boonmars, Thidarut; Lulitanond, Viraphong; Taweethavonsawat, Piyanan; Chungpivat, Sudchit; Maleewong, Wanchai

    2014-06-01

    Canine babesiosis, hepatozoonosis, ehrlichiosis, and anaplasmosis are tick-borne diseases caused by different hemopathogens. These diseases are causes of morbidity and mortality in dogs. The classic method for parasite detection and differentiation is based on microscopic observation of blood smears. The limitations of the microscopic method are that its performance requires a specially qualified person with professional competence, and it is ineffective in differentiating closely related species. This study applied PCR amplification with high throughput pyrosequencing for molecular differential detection of the following 4 hemoparasites common to tropical areas in dog blood samples: Babesia vogeli, Hepatozoon canis, Ehrlichia canis, and Anaplasma platys. PCR was initially used to amplify specific target regions of the ribosomal RNA genes of each parasite using 2 primer pairs that included 18S rRNA for protozoa (B. vogeli and H. canis) and 16S rRNA for rickettsia (E. canis and A. platys). Babesia vogeli and H. canis were discriminated using 9 nucleotide positions out of 30 base pairs, whereas E. canis and A. platys were differentiated using 15 nucleotide positions out of 34 base pairs that were determined from regions adjacent to 3' ends of the sequencing primers. This method provides a challenging alternative for a rapid diagnosis and surveillance of these tick-borne diseases in canines. Copyright © 2014 Elsevier GmbH. All rights reserved.

  11. A Bioluminometric Method of DNA Sequencing

    NASA Technical Reports Server (NTRS)

    Ronaghi, Mostafa; Pourmand, Nader; Stolc, Viktor; Arnold, Jim (Technical Monitor)

    2001-01-01

    Pyrosequencing is a bioluminometric single-tube DNA sequencing method that takes advantage of co-operativity between four enzymes to monitor DNA synthesis. In this sequencing-by-synthesis method, a cascade of enzymatic reactions yields detectable light, which is proportional to incorporated nucleotides. Pyrosequencing has the advantages of accuracy, flexibility and parallel processing. It can be easily automated. Furthermore, the technique dispenses with the need for labeled primers, labeled nucleotides and gel-electrophoresis. In this chapter, the use of this technique for different applications is discussed.

  12. Whole transcriptome analysis using next-generation sequencing of model species Setaria viridis to support C4 photosynthesis research.

    PubMed

    Xu, Jiajia; Li, Yuanyuan; Ma, Xiuling; Ding, Jianfeng; Wang, Kai; Wang, Sisi; Tian, Ye; Zhang, Hui; Zhu, Xin-Guang

    2013-09-01

    Setaria viridis is an emerging model species for genetic studies of C4 photosynthesis. Many basic molecular resources need to be developed to support for this species. In this paper, we performed a comprehensive transcriptome analysis from multiple developmental stages and tissues of S. viridis using next-generation sequencing technologies. Sequencing of the transcriptome from multiple tissues across three developmental stages (seed germination, vegetative growth, and reproduction) yielded a total of 71 million single end 100 bp long reads. Reference-based assembly using Setaria italica genome as a reference generated 42,754 transcripts. De novo assembly generated 60,751 transcripts. In addition, 9,576 and 7,056 potential simple sequence repeats (SSRs) covering S. viridis genome were identified when using the reference based assembled transcripts and the de novo assembled transcripts, respectively. This identified transcripts and SSR provided by this study can be used for both reverse and forward genetic studies based on S. viridis.

  13. Genomic and transcriptomic predictors of triglyceride response to regular exercise

    PubMed Central

    Sarzynski, Mark A; Davidsen, Peter K; Sung, Yun Ju; Hesselink, Matthijs K C; Schrauwen, Patrick; Rice, Treva K; Rao, D C; Falciani, Francesco; Bouchard, Claude

    2015-01-01

    Aim We performed genome-wide and transcriptome-wide profiling to identify genes and single nucleotide polymorphisms (SNPs) associated with the response of triglycerides (TG) to exercise training. Methods Plasma TG levels were measured before and after a 20-week endurance training programme in 478 white participants from the HERITAGE Family Study. Illumina HumanCNV370-Quad v3.0 BeadChips were genotyped using the Illumina BeadStation 500GX platform. Affymetrix HG-U133+2 arrays were used to quantitate gene expression levels from baseline muscle biopsies of a subset of participants (N=52). Genome-wide association study (GWAS) analysis was performed using MERLIN, while transcriptomic predictor models were developed using the R-package GALGO. Results The GWAS results showed that eight SNPs were associated with TG training-response (ΔTG) at p<9.9×10−6, while another 31 SNPs showed p values <1×10−4. In multivariate regression models, the top 10 SNPs explained 32.0% of the variance in ΔTG, while conditional heritability analysis showed that four SNPs statistically accounted for all of the heritability of ΔTG. A molecular signature based on the baseline expression of 11 genes predicted 27% of ΔTG in HERITAGE, which was validated in an independent study. A composite SNP score based on the top four SNPs, each from the genomic and transcriptomic analyses, was the strongest predictor of ΔTG (R2=0.14, p=3.0×10−68). Conclusions Our results indicate that skeletal muscle transcript abundance at 11 genes and SNPs at a number of loci contribute to TG response to exercise training. Combining data from genomics and transcriptomics analyses identified a SNP-based gene signature that should be further tested in independent samples. PMID:26491034

  14. Transcriptomic alterations during ageing reflect the shift from cancer to degenerative diseases in the elderly.

    PubMed

    Aramillo Irizar, Peer; Schäuble, Sascha; Esser, Daniela; Groth, Marco; Frahm, Christiane; Priebe, Steffen; Baumgart, Mario; Hartmann, Nils; Marthandan, Shiva; Menzel, Uwe; Müller, Julia; Schmidt, Silvio; Ast, Volker; Caliebe, Amke; König, Rainer; Krawczak, Michael; Ristow, Michael; Schuster, Stefan; Cellerino, Alessandro; Diekmann, Stephan; Englert, Christoph; Hemmerich, Peter; Sühnel, Jürgen; Guthke, Reinhard; Witte, Otto W; Platzer, Matthias; Ruppin, Eytan; Kaleta, Christoph

    2018-01-30

    Disease epidemiology during ageing shows a transition from cancer to degenerative chronic disorders as dominant contributors to mortality in the old. Nevertheless, it has remained unclear to what extent molecular signatures of ageing reflect this phenomenon. Here we report on the identification of a conserved transcriptomic signature of ageing based on gene expression data from four vertebrate species across four tissues. We find that ageing-associated transcriptomic changes follow trajectories similar to the transcriptional alterations observed in degenerative ageing diseases but are in opposite direction to the transcriptomic alterations observed in cancer. We confirm the existence of a similar antagonism on the genomic level, where a majority of shared risk alleles which increase the risk of cancer decrease the risk of chronic degenerative disorders and vice versa. These results reveal a fundamental trade-off between cancer and degenerative ageing diseases that sheds light on the pronounced shift in their epidemiology during ageing.

  15. Bacterial and diazotrophic diversities of endophytes in Dendrobium catenatum determined through barcoded pyrosequencing.

    PubMed

    Li, Ou; Xiao, Rong; Sun, Lihua; Guan, Chenglin; Kong, Dedong; Hu, Xiufang

    2017-01-01

    As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ) to 96.08% (YN), and Delftia (10.39-38.42%), Burkholderia (2.71-15.98%), Escherichia/Shigella (4.90-25.12%), Pseudomonas (2.68-30.72%) and Sphingomonas (1.83-2.05%) dominated in four planting bases. Pseudomonas (17.94-22.06%), Escherichia/Shigella (6.59-11.59%), Delftia (9.65-22.14%) and Burkholderia (3.12-11.05%) dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples.

  16. Impact of Nisin-Activated Packaging on Microbiota of Beef Burgers during Storage

    PubMed Central

    Ferrocino, Ilario; Greppi, Anna; La Storia, Antonietta; Rantsiou, Kalliopi; Ercolini, Danilo

    2015-01-01

    Beef burgers were stored at 4°C in a vacuum in nisin-activated antimicrobial packaging. Microbial ecology analyses were performed on samples collected between days 0 and 21 of storage to discover the population diversity. Two batches were analyzed using RNA-based denaturing gradient gel electrophoresis (DGGE) and pyrosequencing. The active packaging retarded the growth of the total viable bacteria and lactic acid bacteria. Culture-independent analysis by pyrosequencing of RNA extracted directly from meat showed that Photobacterium phosphoreum, Lactococcus piscium, Lactobacillus sakei, and Leuconostoc carnosum were the major operational taxonomic units (OTUs) shared between control and treated samples. Beta diversity analysis of the 16S rRNA sequence data and RNA-DGGE showed a clear separation between two batches based on the microbiota. Control samples from batch B showed a significant high abundance of some taxa sensitive to nisin, such as Kocuria rhizophila, Staphylococcus xylosus, Leuconostoc carnosum, and Carnobacterium divergens, compared to control samples from batch A. However, only from batch B was it possible to find a significant difference between controls and treated samples during storage due to the active packaging. Predicted metagenomes confirmed differences between the two batches and indicated that the use of nisin-based antimicrobial packaging can determine a reduction in the abundance of specific metabolic pathways related to spoilage. The present study aimed to assess the viable bacterial communities in beef burgers stored in nisin-based antimicrobial packaging, and it highlights the efficacy of this strategy to prolong beef burger shelf life. PMID:26546424

  17. Transcriptomics-based strain optimization tool for designing secondary metabolite overproducing strains of Streptomyces coelicolor.

    PubMed

    Kim, Minsuk; Yi, Jeong Sang; Lakshmanan, Meiyappan; Lee, Dong-Yup; Kim, Byung-Gee

    2016-03-01

    In silico model-driven analysis using genome-scale model of metabolism (GEM) has been recognized as a promising method for microbial strain improvement. However, most of the current GEM-based strain design algorithms based on flux balance analysis (FBA) heavily rely on the steady-state and optimality assumptions without considering any regulatory information. Thus, their practical usage is quite limited, especially in its application to secondary metabolites overproduction. In this study, we developed a transcriptomics-based strain optimization tool (tSOT) in order to overcome such limitations by integrating transcriptomic data into GEM. Initially, we evaluated existing algorithms for integrating transcriptomic data into GEM using Streptomyces coelicolor dataset, and identified iMAT algorithm as the only and the best algorithm for characterizing the secondary metabolism of S. coelicolor. Subsequently, we developed tSOT platform where iMAT is adopted to predict the reaction states, and successfully demonstrated its applicability to secondary metabolites overproduction by designing actinorhodin (ACT), a polyketide antibiotic, overproducing strain of S. coelicolor. Mutants overexpressing tSOT targets such as ribulose 5-phosphate 3-epimerase and NADP-dependent malic enzyme showed 2 and 1.8-fold increase in ACT production, thereby validating the tSOT prediction. It is expected that tSOT can be used for solving other metabolic engineering problems which could not be addressed by current strain design algorithms, especially for the secondary metabolite overproductions. © 2015 Wiley Periodicals, Inc.

  18. Rapid Molecular Identification of Human Taeniid Cestodes by Pyrosequencing Approach

    PubMed Central

    Thanchomnang, Tongjit; Tantrawatpan, Chairat; Intapan, Pewpan M.; Sanpool, Oranuch; Janwan, Penchom; Lulitanond, Viraphong; Tourtip, Somjintana; Yamasaki, Hiroshi; Maleewong, Wanchai

    2014-01-01

    Taenia saginata, T. solium, and T. asiatica are causative agents of taeniasis in humans. The difficulty of morphological identification of human taeniids can lead to misdiagnosis or confusion. To overcome this problem, several molecular methods have been developed, but use of these tends to be time-consuming. Here, a rapid and high-throughput pyrosequencing approach was developed for the identification of three human taeniids originating from various countries. Primers targeting the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene of the three Taenia species were designed. Variations in a 26-nucleotide target region were used for identification. The reproducibility and accuracy of the pyrosequencing technology was confirmed by Sanger sequencing. This technique will be a valuable tool to distinguish between sympatric human taeniids that occur in Thailand, Asia and Pacific countries. This method could potentially be used for the molecular identification of the taeniid species that might be associated with suspicious cysts and lesions, or cyst residues in humans or livestock at the slaughterhouse. PMID:24945530

  19. Identification of Methylated Genes Associated with Aggressive Bladder Cancer

    PubMed Central

    Marsit, Carmen J.; Houseman, E. Andres; Christensen, Brock C.; Gagne, Luc; Wrensch, Margaret R.; Nelson, Heather H.; Wiemels, Joseph; Zheng, Shichun; Wiencke, John K.; Andrew, Angeline S.; Schned, Alan R.; Karagas, Margaret R.; Kelsey, Karl T.

    2010-01-01

    Approximately 500,000 individuals diagnosed with bladder cancer in the U.S. require routine cystoscopic follow-up to monitor for disease recurrences or progression, resulting in over $2 billion in annual expenditures. Identification of new diagnostic and monitoring strategies are clearly needed, and markers related to DNA methylation alterations hold great promise due to their stability, objective measurement, and known associations with the disease and with its clinical features. To identify novel epigenetic markers of aggressive bladder cancer, we utilized a high-throughput DNA methylation bead-array in two distinct population-based series of incident bladder cancer (n = 73 and n = 264, respectively). We then validated the association between methylation of these candidate loci with tumor grade in a third population (n = 245) through bisulfite pyrosequencing of candidate loci. Array based analyses identified 5 loci for further confirmation with bisulfite pyrosequencing. We identified and confirmed that increased promoter methylation of HOXB2 is significantly and independently associated with invasive bladder cancer and methylation of HOXB2, KRT13 and FRZB together significantly predict high-grade non-invasive disease. Methylation of these genes may be useful as clinical markers of the disease and may point to genes and pathways worthy of additional examination as novel targets for therapeutic treatment. PMID:20808801

  20. Identification of methylated genes associated with aggressive bladder cancer.

    PubMed

    Marsit, Carmen J; Houseman, E Andres; Christensen, Brock C; Gagne, Luc; Wrensch, Margaret R; Nelson, Heather H; Wiemels, Joseph; Zheng, Shichun; Wiencke, John K; Andrew, Angeline S; Schned, Alan R; Karagas, Margaret R; Kelsey, Karl T

    2010-08-23

    Approximately 500,000 individuals diagnosed with bladder cancer in the U.S. require routine cystoscopic follow-up to monitor for disease recurrences or progression, resulting in over $2 billion in annual expenditures. Identification of new diagnostic and monitoring strategies are clearly needed, and markers related to DNA methylation alterations hold great promise due to their stability, objective measurement, and known associations with the disease and with its clinical features. To identify novel epigenetic markers of aggressive bladder cancer, we utilized a high-throughput DNA methylation bead-array in two distinct population-based series of incident bladder cancer (n = 73 and n = 264, respectively). We then validated the association between methylation of these candidate loci with tumor grade in a third population (n = 245) through bisulfite pyrosequencing of candidate loci. Array based analyses identified 5 loci for further confirmation with bisulfite pyrosequencing. We identified and confirmed that increased promoter methylation of HOXB2 is significantly and independently associated with invasive bladder cancer and methylation of HOXB2, KRT13 and FRZB together significantly predict high-grade non-invasive disease. Methylation of these genes may be useful as clinical markers of the disease and may point to genes and pathways worthy of additional examination as novel targets for therapeutic treatment.

  1. Pyrosequencing-based characterization of gastrointestinal bacteria of Atlantic salmon (Salmo salar L.) within a commercial mariculture system.

    PubMed

    Zarkasi, K Z; Abell, G C J; Taylor, R S; Neuman, C; Hatje, E; Tamplin, M L; Katouli, M; Bowman, J P

    2014-07-01

    The relationship of Atlantic salmon gastrointestinal (GI) tract bacteria to environmental factors, in particular water temperature within a commercial mariculture system, was investigated. Salmon GI tract bacterial communities commercially farmed in south-eastern Tasmania were analysed, over a 13-month period across a standard commercial production farm cycle, using 454 16S rRNA-based pyrosequencing. Faecal bacterial communities were highly dynamic but largely similar between randomly selected fish. In postsmolt, the faecal bacteria population was dominated by Gram-positive fermentative bacteria; however, by midsummer, members of the family Vibrionaceae predominated. As fish progressed towards harvest, a range of different bacterial genera became more prominent corresponding to a decline in Vibrionaceae. The sampled fish were fed two different commercial diet series with slightly different protein, lipid and digestible energy level; however, the effect of these differences was minimal. The overall data demonstrated dynamic hind gut communities in salmon that were related to season and fish growth phases but were less influenced by differences in commercial diets used routinely within the farm system studied. This study provides understanding of farmed salmon GI bacterial communities and describes the relative impact of diet, environmental and farm factors. © 2014 The Society for Applied Microbiology.

  2. ALOMYbase, a resource to investigate non-target-site-based resistance to herbicides inhibiting acetolactate-synthase (ALS) in the major grass weed Alopecurus myosuroides (black-grass).

    PubMed

    Gardin, Jeanne Aude Christiane; Gouzy, Jérôme; Carrère, Sébastien; Délye, Christophe

    2015-08-12

    Herbicide resistance in agrestal weeds is a global problem threatening food security. Non-target-site resistance (NTSR) endowed by mechanisms neutralising the herbicide or compensating for its action is considered the most agronomically noxious type of resistance. Contrary to target-site resistance, NTSR mechanisms are far from being fully elucidated. A part of weed response to herbicide stress, NTSR is considered to be largely driven by gene regulation. Our purpose was to establish a transcriptome resource allowing investigation of the transcriptomic bases of NTSR in the major grass weed Alopecurus myosuroides L. (Poaceae) for which almost no genomic or transcriptomic data was available. RNA-Seq was performed from plants in one F2 population that were sensitive or expressing NTSR to herbicides inhibiting acetolactate-synthase. Cloned plants were sampled over seven time-points ranging from before until 73 h after herbicide application. Assembly of over 159M high-quality Illumina reads generated a transcriptomic resource (ALOMYbase) containing 65,558 potentially active contigs (N50 = 1240 nucleotides) predicted to encode 32,138 peptides with 74% GO annotation, of which 2017 were assigned to protein families presumably involved in NTSR. Comparison with the fully sequenced grass genomes indicated good coverage and correct representation of A. myosuroides transcriptome in ALOMYbase. The part of the herbicide transcriptomic response common to the resistant and the sensitive plants was consistent with the expected effects of acetolactate-synthase inhibition, with striking similarities observed with published Arabidopsis thaliana data. A. myosuroides plants with NTSR were first affected by herbicide action like sensitive plants, but ultimately overcame it. Analysis of differences in transcriptomic herbicide response between resistant and sensitive plants did not allow identification of processes directly explaining NTSR. Five contigs associated to NTSR in the F2 population studied were tentatively identified. They were predicted to encode three cytochromes P450 (CYP71A, CYP71B and CYP81D), one peroxidase and one disease resistance protein. Our data confirmed that gene regulation is at the root of herbicide response and of NTSR. ALOMYbase proved to be a relevant resource to support NTSR transcriptomic studies, and constitutes a valuable tool for future research aiming at elucidating gene regulations involved in NTSR in A. myosuroides.

  3. TRAM (Transcriptome Mapper): database-driven creation and analysis of transcriptome maps from multiple sources

    PubMed Central

    2011-01-01

    Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005

  4. Integrative structural annotation of de novo RNA-Seq provides an accurate reference gene set of the enormous genome of the onion (Allium cepa L.).

    PubMed

    Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil

    2015-02-01

    The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  5. 16S rRNA Gene Pyrosequencing Reveals Bacterial Dysbiosis in the Duodenum of Dogs with Idiopathic Inflammatory Bowel Disease

    PubMed Central

    Suchodolski, Jan S.; Dowd, Scot E.; Wilke, Vicky; Steiner, Jörg M.; Jergens, Albert E.

    2012-01-01

    Background Canine idiopathic inflammatory bowel disease (IBD) is believed to be caused by a complex interaction of genetic, immunologic, and microbial factors. While mucosa-associated bacteria have been implicated in the pathogenesis of canine IBD, detailed studies investigating the enteric microbiota using deep sequencing techniques are lacking. The objective of this study was to evaluate mucosa-adherent microbiota in the duodenum of dogs with spontaneous idiopathic IBD using 16 S rRNA gene pyrosequencing. Methodology/Principal Findings Biopsy samples of small intestinal mucosa were collected endoscopically from healthy dogs (n = 6) and dogs with moderate IBD (n = 7) or severe IBD (n = 7) as assessed by a clinical disease activity index. Total RNA was extracted from biopsy specimens and 454-pyrosequencing of the 16 S rRNA gene was performed on aliquots of cDNA from each dog. Intestinal inflammation was associated with significant differences in the composition of the intestinal microbiota when compared to healthy dogs. PCoA plots based on the unweighted UniFrac distance metric indicated clustering of samples between healthy dogs and dogs with IBD (ANOSIM, p<0.001). Proportions of Fusobacteria (p = 0.010), Bacteroidaceae (p = 0.015), Prevotellaceae (p = 0.022), and Clostridiales (p = 0.019) were significantly more abundant in healthy dogs. In contrast, specific bacterial genera within Proteobacteria, including Diaphorobacter (p = 0.044) and Acinetobacter (p = 0.040), were either more abundant or more frequently identified in IBD dogs. Conclusions/Significance In conclusion, dogs with spontaneous IBD exhibit alterations in microbial groups, which bear resemblance to dysbiosis reported in humans with chronic intestinal inflammation. These bacterial groups may serve as useful targets for monitoring intestinal inflammation. PMID:22720094

  6. 16S rRNA gene pyrosequencing reveals bacterial dysbiosis in the duodenum of dogs with idiopathic inflammatory bowel disease.

    PubMed

    Suchodolski, Jan S; Dowd, Scot E; Wilke, Vicky; Steiner, Jörg M; Jergens, Albert E

    2012-01-01

    Canine idiopathic inflammatory bowel disease (IBD) is believed to be caused by a complex interaction of genetic, immunologic, and microbial factors. While mucosa-associated bacteria have been implicated in the pathogenesis of canine IBD, detailed studies investigating the enteric microbiota using deep sequencing techniques are lacking. The objective of this study was to evaluate mucosa-adherent microbiota in the duodenum of dogs with spontaneous idiopathic IBD using 16 S rRNA gene pyrosequencing. Biopsy samples of small intestinal mucosa were collected endoscopically from healthy dogs (n = 6) and dogs with moderate IBD (n = 7) or severe IBD (n = 7) as assessed by a clinical disease activity index. Total RNA was extracted from biopsy specimens and 454-pyrosequencing of the 16 S rRNA gene was performed on aliquots of cDNA from each dog. Intestinal inflammation was associated with significant differences in the composition of the intestinal microbiota when compared to healthy dogs. PCoA plots based on the unweighted UniFrac distance metric indicated clustering of samples between healthy dogs and dogs with IBD (ANOSIM, p<0.001). Proportions of Fusobacteria (p = 0.010), Bacteroidaceae (p = 0.015), Prevotellaceae (p = 0.022), and Clostridiales (p = 0.019) were significantly more abundant in healthy dogs. In contrast, specific bacterial genera within Proteobacteria, including Diaphorobacter (p = 0.044) and Acinetobacter (p = 0.040), were either more abundant or more frequently identified in IBD dogs. In conclusion, dogs with spontaneous IBD exhibit alterations in microbial groups, which bear resemblance to dysbiosis reported in humans with chronic intestinal inflammation. These bacterial groups may serve as useful targets for monitoring intestinal inflammation.

  7. Office space bacterial abundance and diversity in three metropolitan areas.

    PubMed

    Hewitt, Krissi M; Gerba, Charles P; Maxwell, Sheri L; Kelley, Scott T

    2012-01-01

    People in developed countries spend approximately 90% of their lives indoors, yet we know little about the source and diversity of microbes in built environments. In this study, we combined culture-based cell counting and multiplexed pyrosequencing of environmental ribosomal RNA (rRNA) gene sequences to investigate office space bacterial diversity in three metropolitan areas. Five surfaces common to all offices were sampled using sterile double-tipped swabs, one tip for culturing and one for DNA extraction, in 30 different offices per city (90 offices, 450 total samples). 16S rRNA gene sequences were PCR amplified using bar-coded "universal" bacterial primers from 54 of the surfaces (18 per city) and pooled for pyrosequencing. A three-factorial Analysis of Variance (ANOVA) found significant differences in viable bacterial abundance between offices inhabited by men or women, among the various surface types, and among cities. Multiplex pyrosequencing identified more than 500 bacterial genera from 20 different bacterial divisions. The most abundant of these genera tended to be common inhabitants of human skin, nasal, oral or intestinal cavities. Other commonly occurring genera appeared to have environmental origins (e.g., soils). There were no significant differences in the bacterial diversity between offices inhabited by men or women or among surfaces, but the bacterial community diversity of the Tucson samples was clearly distinguishable from that of New York and San Francisco, which were indistinguishable. Overall, our comprehensive molecular analysis of office building microbial diversity shows the potential of these methods for studying patterns and origins of indoor bacterial contamination. "[H]umans move through a sea of microbial life that is seldom perceived except in the context of potential disease and decay." - Feazel et al. (2009).

  8. The Landscape of long non-coding RNA classification

    PubMed Central

    St Laurent, Georges; Wahlestedt, Claes; Kapranov, Philipp

    2015-01-01

    Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long non-coding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the non-coding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual un-ambiguous classification framework results in a number of challenges in the annotation and interpretation of non-coding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets. PMID:25869999

  9. Insight into the bacterial diversity of fermentation woad dye vats as revealed by PCR-DGGE and pyrosequencing.

    PubMed

    Milanović, Vesna; Osimani, Andrea; Taccari, Manuela; Garofalo, Cristiana; Butta, Alessandro; Clementi, Francesca; Aquilanti, Lucia

    2017-07-01

    The bacterial diversity in fermenting dye vats with woad (Isatis tinctoria L.) prepared and maintained in a functional state for approximately 12 months was examined using a combination of culture-dependent and -independent PCR-DGGE analyses and next-generation sequencing of 16S rRNA amplicons. An extremely complex ecosystem including taxa potentially contributing to both indigo reduction and formation, as well as indigo degradation was found. PCR-DGGE analyses revealed the presence of Paenibacillus lactis, Sporosarcina koreensis, Bacillus licheniformis, and Bacillus thermoamylovorans, while Bacillus thermolactis, Bacillus pumilus and Bacillus megaterium were also identified but with sequence identities lower than 97%. Dominant operational taxonomic units (OTUs) identified by pyrosequencing included Clostridium ultunense, Tissierella spp., Alcaligenes faecalis, Erysipelothrix spp., Enterococcus spp., Virgibacillus spp. and Virgibacillus panthothenicus, while sub-dominant OTUs included clostridia, alkaliphiles, halophiles, bacilli, moderately thermophilic bacteria, lactic acid bacteria, Enterobacteriaceae, aerobes, and even photosynthetic bacteria. Based on the current knowledge of indigo-reducing bacteria, it is considered that indigo-reducing bacteria constituted only a small fraction in the unique microcosm detected in the natural indigo dye vats.

  10. Time-dependent effect of graphene on the structure, abundance, and function of the soil bacterial community.

    PubMed

    Ren, Wenjie; Ren, Gaidi; Teng, Ying; Li, Zhengao; Li, Lina

    2015-10-30

    The increased application of graphene raises concerns about its environmental impact, but little information is available on the effect of graphene on the soil microbial community. This study evaluated the impact of graphene on the structure, abundance and function of the soil bacterial community based on quantitative real-time polymerase chain reaction (qPCR), pyrosequencing and soil enzyme activities. The results show that the enzyme activities of dehydrogenase and fluorescein diacetate (FDA) esterase and the biomass of the bacterial populations were transiently promoted by the presence of graphene after 4 days of exposure, but these parameters recovered completely after 21 days. Pyrosequencing analysis suggested a significant shift in some bacterial populations after 4 days, and the shift became weaker or disappeared as the exposure time increased to 60 days. During the entire exposure process, the majority of bacterial phylotypes remained unaffected. Some bacterial populations involved in nitrogen biogeochemical cycles and the degradation of organic compounds can be affected by the presence of graphene. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. Pyrosequencing analysis of oral microbiota in children with severe early childhood dental caries.

    PubMed

    Jiang, Wen; Zhang, Jie; Chen, Hui

    2013-11-01

    Severe early childhood caries are a prevalent public health problem among preschool children throughout the world. However, little is known about the microbiota found in association with severe early childhood caries. Our study aimed to explore the bacterial microbiota of dental plaques to study the etiology of severe early childhood caries through pyrosequencing analysis based on 16S rRNA gene V1-V3 hypervariable regions. Forty participants were enrolled in the study, and we obtained twenty samples of supragingival plaque from caries-free subjects and twenty samples from subjects with severe early childhood caries. A total of 175,918 reads met the quality control standards, and the bacteria found belonged to fourteen phyla and sixty-three genera. Our results show the overall structure and microbial composition of oral bacterial communities, and they suggest that these bacteria may present a core microbiome in the dental plaque microbiota. Three genera, Streptococcus, Granulicatella, and Actinomyces, were increased significantly in children with severe dental cavities. These data may facilitate improvements in the prevention and treatment of severe early childhood caries.

  12. N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

    PubMed

    Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A

    2017-05-24

    Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.

  13. SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

    PubMed Central

    Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2014-01-01

    The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047

  14. CGDV: a webtool for circular visualization of genomics and transcriptomics data.

    PubMed

    Jha, Vineet; Singh, Gulzar; Kumar, Shiva; Sonawane, Amol; Jere, Abhay; Anamika, Krishanpal

    2017-10-24

    Interpretation of large-scale data is very challenging and currently there is scarcity of web tools which support automated visualization of a variety of high throughput genomics and transcriptomics data and for a wide variety of model organisms along with user defined karyotypes. Circular plot provides holistic visualization of high throughput large scale data but it is very complex and challenging to generate as most of the available tools need informatics expertise to install and run them. We have developed CGDV (Circos for Genomics and Transcriptomics Data Visualization), a webtool based on Circos, for seamless and automated visualization of a variety of large scale genomics and transcriptomics data. CGDV takes output of analyzed genomics or transcriptomics data of different formats, such as vcf, bed, xls, tab limited matrix text file, CNVnator raw output and Gene fusion raw output, to plot circular view of the sample data. CGDV take cares of generating intermediate files required for circos. CGDV is freely available at https://cgdv-upload.persistent.co.in/cgdv/ . The circular plot for each data type is tailored to gain best biological insights into the data. The inter-relationship between data points, homologous sequences, genes involved in fusion events, differential expression pattern, sequencing depth, types and size of variations and enrichment of DNA binding proteins can be seen using CGDV. CGDV thus helps biologists and bioinformaticians to visualize a variety of genomics and transcriptomics data seamlessly.

  15. Desiccation tolerance in bryophytes: The dehydration and rehydration transcriptomes in the desiccation-tolerant bryophyte Bryum argenteum.

    PubMed

    Gao, Bei; Li, Xiaoshuang; Zhang, Daoyuan; Liang, Yuqing; Yang, Honglan; Chen, Moxian; Zhang, Yuanming; Zhang, Jianhua; Wood, Andrew J

    2017-08-08

    The desiccation tolerant bryophyte Bryum argenteum is an important component of desert biological soil crusts (BSCs) and is emerging as a model system for studying vegetative desiccation tolerance. Here we present and analyze the hydration-dehydration-rehydration transcriptomes in B. argenteum to establish a desiccation-tolerance transcriptomic atlas. B. argenteum gametophores representing five different hydration stages (hydrated (H0), dehydrated for 2 h (D2), 24 h (D24), then rehydrated for 2 h (R2) and 48 h (R48)), were sampled for transcriptome analyses. Illumina high throughput RNA-Seq technology was employed and generated more than 488.46 million reads. An in-house de novo transcriptome assembly optimization pipeline based on Trinity assembler was developed to obtain a reference Hydration-Dehydration-Rehydration (H-D-R) transcriptome comprising of 76,206 transcripts, with an N50 of 2,016 bp and average length of 1,222 bp. Comprehensive transcription factor (TF) annotation discovered 978 TFs in 62 families, among which 404 TFs within 40 families were differentially expressed upon dehydration-rehydration. Pfam term enrichment analysis revealed 172 protein families/domains were significantly associated with the H-D-R cycle and confirmed early rehydration (i.e. the R2 stage) as exhibiting the maximum stress-induced changes in gene expression.

  16. KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella.

    PubMed

    Jouraku, Akiya; Yamamoto, Kimiko; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Narukawa, Junko; Miyamoto, Kazuhisa; Kurita, Kanako; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Noda, Hiroaki

    2013-07-09

    The diamondback moth (DBM), Plutella xylostella, is one of the most harmful insect pests for crucifer crops worldwide. DBM has rapidly evolved high resistance to most conventional insecticides such as pyrethroids, organophosphates, fipronil, spinosad, Bacillus thuringiensis, and diamides. Therefore, it is important to develop genomic and transcriptomic DBM resources for analysis of genes related to insecticide resistance, both to clarify the mechanism of resistance of DBM and to facilitate the development of insecticides with a novel mode of action for more effective and environmentally less harmful insecticide rotation. To contribute to this goal, we developed KONAGAbase, a genomic and transcriptomic database for DBM (KONAGA is the Japanese word for DBM). KONAGAbase provides (1) transcriptomic sequences of 37,340 ESTs/mRNAs and 147,370 RNA-seq contigs which were clustered and assembled into 84,570 unigenes (30,695 contigs, 50,548 pseudo singletons, and 3,327 singletons); and (2) genomic sequences of 88,530 WGS contigs with 246,244 degenerate contigs and 106,455 singletons from which 6,310 de novo identified repeat sequences and 34,890 predicted gene-coding sequences were extracted. The unigenes and predicted gene-coding sequences were clustered and 32,800 representative sequences were extracted as a comprehensive putative gene set. These sequences were annotated with BLAST descriptions, Gene Ontology (GO) terms, and Pfam descriptions, respectively. KONAGAbase contains rich graphical user interface (GUI)-based web interfaces for easy and efficient searching, browsing, and downloading sequences and annotation data. Five useful search interfaces consisting of BLAST search, keyword search, BLAST result-based search, GO tree-based search, and genome browser are provided. KONAGAbase is publicly available from our website (http://dbm.dna.affrc.go.jp/px/) through standard web browsers. KONAGAbase provides DBM comprehensive transcriptomic and draft genomic sequences with useful annotation information with easy-to-use web interfaces, which helps researchers to efficiently search for target sequences such as insect resistance-related genes. KONAGAbase will be continuously updated and additional genomic/transcriptomic resources and analysis tools will be provided for further efficient analysis of the mechanism of insecticide resistance and the development of effective insecticides with a novel mode of action for DBM.

  17. Comprehensive transcriptome analysis provides new insights into nutritional strategies and phylogenetic relationships of chrysophytes

    PubMed Central

    Graupner, Nadine; Bock, Christina; Wodniok, Sabina; Grossmann, Lars; Vos, Matthijs; Sures, Bernd

    2017-01-01

    Background Chrysophytes are protist model species in ecology and ecophysiology and important grazers of bacteria-sized microorganisms and primary producers. However, they have not yet been investigated in detail at the molecular level, and no genomic and only little transcriptomic information is available. Chrysophytes exhibit different trophic modes: while phototrophic chrysophytes perform only photosynthesis, mixotrophs can gain carbon from bacterial food as well as from photosynthesis, and heterotrophs solely feed on bacteria-sized microorganisms. Recent phylogenies and megasystematics demonstrate an immense complexity of eukaryotic diversity with numerous transitions between phototrophic and heterotrophic organisms. The question we aim to answer is how the diverse nutritional strategies, accompanied or brought about by a reduction of the plasmid and size reduction in heterotrophic strains, affect physiology and molecular processes. Results We sequenced the mRNA of 18 chrysophyte strains on the Illumina HiSeq platform and analysed the transcriptomes to determine relations between the trophic mode (mixotrophic vs. heterotrophic) and gene expression. We observed an enrichment of genes for photosynthesis, porphyrin and chlorophyll metabolism for phototrophic and mixotrophic strains that can perform photosynthesis. Genes involved in nutrient absorption, environmental information processing and various transporters (e.g., monosaccharide, peptide, lipid transporters) were present or highly expressed only in heterotrophic strains that have to sense, digest and absorb bacterial food. We furthermore present a transcriptome-based alignment-free phylogeny construction approach using transcripts assembled from short reads to determine the evolutionary relationships between the strains and the possible influence of nutritional strategies on the reconstructed phylogeny. We discuss the resulting phylogenies in comparison to those from established approaches based on ribosomal RNA and orthologous genes. Finally, we make functionally annotated reference transcriptomes of each strain available to the community, significantly enhancing publicly available data on Chrysophyceae. Conclusions Our study is the first comprehensive transcriptomic characterisation of a diverse set of Chrysophyceaen strains. In addition, we showcase the possibility of inferring phylogenies from assembled transcriptomes using an alignment-free approach. The raw and functionally annotated data we provide will prove beneficial for further examination of the diversity within this taxon. Our molecular characterisation of different trophic modes presents a first such example. PMID:28097055

  18. De novo assembling and primary analysis of genome and transcriptome of gray whale Eschrichtius robustus.

    PubMed

    Moskalev, Alexey А; Kudryavtseva, Anna V; Graphodatsky, Alexander S; Beklemisheva, Violetta R; Serdyukova, Natalya A; Krutovsky, Konstantin V; Sharov, Vadim V; Kulakovskiy, Ivan V; Lando, Andrey S; Kasianov, Artem S; Kuzmin, Dmitry A; Putintseva, Yuliya A; Feranchuk, Sergey I; Shaposhnikov, Mikhail V; Fraifeld, Vadim E; Toren, Dmitri; Snezhkina, Anastasia V; Sitnik, Vasily V

    2017-12-28

    Gray whale, Eschrichtius robustus (E. robustus), is a single member of the family Eschrichtiidae, which is considered to be the most primitive in the class Cetacea. Gray whale is often described as a "living fossil". It is adapted to extreme marine conditions and has a high life expectancy (77 years). The assembly of a gray whale genome and transcriptome will allow to carry out further studies of whale evolution, longevity, and resistance to extreme environment. In this work, we report the first de novo assembly and primary analysis of the E. robustus genome and transcriptome based on kidney and liver samples. The presented draft genome assembly is complete by 55% in terms of a total genome length, but only by 24% in terms of the BUSCO complete gene groups, although 10,895 genes were identified. Transcriptome annotation and comparison with other whale species revealed robust expression of DNA repair and hypoxia-response genes, which is expected for whales. This preliminary study of the gray whale genome and transcriptome provides new data to better understand the whale evolution and the mechanisms of their adaptation to the hypoxic conditions.

  19. Optimized approach for Ion Proton RNA sequencing reveals details of RNA splicing and editing features of the transcriptome.

    PubMed

    Brown, Roger B; Madrid, Nathaniel J; Suzuki, Hideaki; Ness, Scott A

    2017-01-01

    RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different types of transcriptome data. For example, some methods have bias for one end of transcripts or rely on low-efficiency steps that limit the complexity of the resulting library, making detection of rare transcripts less likely. We tested several commonly used methods of RNA-seq library preparation and found vast differences in the detection of advanced transcriptome features, such as alternatively spliced isoforms and RNA editing sites. By comparing several different protocols available for the Ion Proton sequencer and by utilizing detailed bioinformatics analysis tools, we were able to develop an optimized random primer based RNA-seq technique that is reliable at uncovering rare transcript isoforms and RNA editing features, as well as fusion reads from oncogenic chromosome rearrangements. The combination of optimized libraries and rapid Ion Proton sequencing provides a powerful platform for the transcriptome analysis of research and clinical samples.

  20. dropEst: pipeline for accurate estimation of molecular counts in droplet-based single-cell RNA-seq experiments.

    PubMed

    Petukhov, Viktor; Guo, Jimin; Baryawno, Ninib; Severe, Nicolas; Scadden, David T; Samsonova, Maria G; Kharchenko, Peter V

    2018-06-19

    Recent single-cell RNA-seq protocols based on droplet microfluidics use massively multiplexed barcoding to enable simultaneous measurements of transcriptomes for thousands of individual cells. The increasing complexity of such data creates challenges for subsequent computational processing and troubleshooting of these experiments, with few software options currently available. Here, we describe a flexible pipeline for processing droplet-based transcriptome data that implements barcode corrections, classification of cell quality, and diagnostic information about the droplet libraries. We introduce advanced methods for correcting composition bias and sequencing errors affecting cellular and molecular barcodes to provide more accurate estimates of molecular counts in individual cells.

  1. Characterization of microsatellite loci from two-spotted octopus Octopus bimaculatus Verrill 1883 from pyrosequencing reads

    USGS Publications Warehouse

    Domínguez-Contreras, J. F.; Munguía-Vega, A.; Ceballos-Vázquez, B. P.; Arellano-Martínez, M.; Culver, Melanie

    2014-01-01

    We characterized 22 novel microsatellite loci in the two-spotted octopus Octopus bimaculatus using 454 pyrosequencing reads. All loci were polymorphic and will be used in studies of marine connectivity aimed at increasing sustainability of the resource. The mean number alleles per locus was 13.09 (range 7–19) and observed heterozygosities ranged from 0.50 to 1.00. Four loci pairs were linked and three deviated from Hardy–Weinberg equilibrium. Eighteen and 12 loci were polymorphic in Octopus bimaculoides and Octopus hubbsorum, respectively.

  2. Transcriptional profiling unravels potential metabolic activities of the olive leaf non-glandular trichome

    PubMed Central

    Koudounas, Konstantinos; Manioudaki, Maria E.; Kourti, Anna; Banilas, Georgios; Hatzopoulos, Polydefkis

    2015-01-01

    The olive leaf trichomes are multicellular peltate hairs densely distributed mainly at the lower leaf epidermis. Although, non-glandular, they have gained much attention since they significantly contribute to abiotic and biotic stress tolerance of olive leaves. The exact mechanisms by which olive trichomes achieve these goals are not fully understood. They could act as mechanical barrier but they also accumulate high amounts of flavonoids among other secondary metabolites. However, little is currently known about the exact compounds they produce and the respective metabolic pathways. Here we present the first EST analysis from olive leaf trichomes by using 454-pyrosequencing. A total of 5368 unigenes were identified out of 7258 high quality reads with an average length of 262 bp. Blast search revealed that 27.5% of them had high homologies to known proteins. By using Blast2GO, 1079 unigenes (20.1%) were assigned at least one Gene Ontology (GO) term. Most of the genes were involved in cellular and metabolic processes and in binding functions followed by catalytic activity. A total of 521 transcripts were mapped to 67 KEGG pathways. Olive trichomes represent a tissue of highly unique transcriptome as per the genes involved in developmental processes and the secondary metabolism. The results indicate that mature olive trichomes are trancriptionally active, mainly through the potential production of enzymes that contribute to phenolic compounds with important roles in biotic and abiotic stress responses. PMID:26322070

  3. Characterization of Chicken Spleen Transcriptome after Infection with Salmonella enterica Serovar Enteritidis

    PubMed Central

    Matulova, Marta; Rajova, Jana; Vlasatikova, Lenka; Volf, Jiri; Stepanova, Hana; Havlickova, Hana; Sisak, Frantisek; Rychlik, Ivan

    2012-01-01

    In this study we were interested in identification of new markers of chicken response to Salmonella Enteritidis infection. To reach this aim, gene expression in the spleens of naive chickens and those intravenously infected with S. Enteritidis with or without previous oral vaccination was determined by 454 pyrosequencing of splenic mRNA/cDNA. Forty genes with increased expression at the level of transcription were identified. The most inducible genes encoded avidin (AVD), extracellular fatty acid binding protein (EXFABP), immune responsive gene 1 (IRG1), chemokine ah221 (AH221), trappin-6-like protein (TRAP6) and serum amyloid A (SAA). Using cDNA from sorted splenic B-lymphocytes, macrophages, CD4, CD8 and γδ T-lymphocytes, we found that the above mentioned genes were preferentially expressed in macrophages. AVD, EXFABP, IRG1, AH221, TRAP6 and SAA were induced also in the cecum of chickens orally infected with S. Enteritidis on day 1 of life or day 42 of life. Unusual results were obtained for the immunoglobulin encoding transcripts. Prior to the infection, transcripts coding for the constant parts of IgM, IgY, IgA and Ig light chain were detected in B-lymphocytes. However, after the infection, immunoglobulin encoding transcripts were expressed also by T-lymphocytes and macrophages. Expression of AVD, EXFABP, IRG1, AH221, TRAP6, SAA and all immunoglobulin genes can be therefore used for the characterization of the course of S. Enteritidis infection in chickens. PMID:23094107

  4. The transcriptional response to the olive fruit fly (Bactrocera oleae) reveals extended differences between tolerant and susceptible olive (Olea europaea L.) varieties

    PubMed Central

    Grasso, Filomena; Coppola, Mariangela; Carbone, Fabrizio; Baldoni, Luciana; Alagna, Fiammetta; Perrotta, Gaetano; Pérez-Pulido, Antonio J.; Garonna, Antonio; Facella, Paolo; Daddiego, Loretta; Lopez, Loredana; Vitiello, Alessia; Rao, Rosa

    2017-01-01

    The olive fruit fly Bactrocera oleae (Diptera: Tephritidae) is the most devastating pest of cultivated olive (Olea europaea L.). Intraspecific variation in plant resistance to B. oleae has been described only at phenotypic level. In this work, we used a transcriptomic approach to study the molecular response to the olive fruit fly in two olive cultivars with contrasting level of susceptibility. Using next-generation pyrosequencing, we first generated a catalogue of more than 80,000 sequences expressed in drupes from approximately 700k reads. The assembled sequences were used to develop a microarray layout with over 60,000 olive-specific probes. The differential gene expression analysis between infested (i.e. with II or III instar larvae) and control drupes indicated a significant intraspecific variation between the more tolerant and susceptible cultivar. Around 2500 genes were differentially regulated in infested drupes of the tolerant variety. The GO annotation of the differentially expressed genes implies that the inducible resistance to the olive fruit fly involves a number of biological functions, cellular processes and metabolic pathways, including those with a known role in defence, oxidative stress responses, cellular structure, hormone signalling, and primary and secondary metabolism. The difference in the induced transcriptional changes between the cultivars suggests a strong genetic role in the olive inducible defence, which can ultimately lead to the discovery of factors associated with a higher level of tolerance to B. oleae. PMID:28797083

  5. The transcriptional response to the olive fruit fly (Bactrocera oleae) reveals extended differences between tolerant and susceptible olive (Olea europaea L.) varieties.

    PubMed

    Grasso, Filomena; Coppola, Mariangela; Carbone, Fabrizio; Baldoni, Luciana; Alagna, Fiammetta; Perrotta, Gaetano; Pérez-Pulido, Antonio J; Garonna, Antonio; Facella, Paolo; Daddiego, Loretta; Lopez, Loredana; Vitiello, Alessia; Rao, Rosa; Corrado, Giandomenico

    2017-01-01

    The olive fruit fly Bactrocera oleae (Diptera: Tephritidae) is the most devastating pest of cultivated olive (Olea europaea L.). Intraspecific variation in plant resistance to B. oleae has been described only at phenotypic level. In this work, we used a transcriptomic approach to study the molecular response to the olive fruit fly in two olive cultivars with contrasting level of susceptibility. Using next-generation pyrosequencing, we first generated a catalogue of more than 80,000 sequences expressed in drupes from approximately 700k reads. The assembled sequences were used to develop a microarray layout with over 60,000 olive-specific probes. The differential gene expression analysis between infested (i.e. with II or III instar larvae) and control drupes indicated a significant intraspecific variation between the more tolerant and susceptible cultivar. Around 2500 genes were differentially regulated in infested drupes of the tolerant variety. The GO annotation of the differentially expressed genes implies that the inducible resistance to the olive fruit fly involves a number of biological functions, cellular processes and metabolic pathways, including those with a known role in defence, oxidative stress responses, cellular structure, hormone signalling, and primary and secondary metabolism. The difference in the induced transcriptional changes between the cultivars suggests a strong genetic role in the olive inducible defence, which can ultimately lead to the discovery of factors associated with a higher level of tolerance to B. oleae.

  6. A priori and a posteriori approaches for finding genes of evolutionary interest in non-model species: osmoregulatory genes in the kidney transcriptome of the desert rodent Dipodomys spectabilis (banner-tailed kangaroo rat).

    PubMed

    Marra, Nicholas J; Eo, Soo Hyung; Hale, Matthew C; Waser, Peter M; DeWoody, J Andrew

    2012-12-01

    One common goal in evolutionary biology is the identification of genes underlying adaptive traits of evolutionary interest. Recently next-generation sequencing techniques have greatly facilitated such evolutionary studies in species otherwise depauperate of genomic resources. Kangaroo rats (Dipodomys sp.) serve as exemplars of adaptation in that they inhabit extremely arid environments, yet require no drinking water because of ultra-efficient kidney function and osmoregulation. As a basis for identifying water conservation genes in kangaroo rats, we conducted a priori bioinformatics searches in model rodents (Mus musculus and Rattus norvegicus) to identify candidate genes with known or suspected osmoregulatory function. We then obtained 446,758 reads via 454 pyrosequencing to characterize genes expressed in the kidney of banner-tailed kangaroo rats (Dipodomys spectabilis). We also determined candidates a posteriori by identifying genes that were overexpressed in the kidney. The kangaroo rat sequences revealed nine different a priori candidate genes predicted from our Mus and Rattus searches, as well as 32 a posteriori candidate genes that were overexpressed in kidney. Mutations in two of these genes, Slc12a1 and Slc12a3, cause human renal diseases that result in the inability to concentrate urine. These genes are likely key determinants of physiological water conservation in desert rodents. Copyright © 2012 Elsevier Inc. All rights reserved.

  7. Transcriptome-Based Characterization of Interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus in Lactose-Grown Chemostat Cocultures

    PubMed Central

    Mendes, Filipa; Sieuwerts, Sander; de Hulster, Erik; Almering, Marinka J. H.; Luttik, Marijke A. H.; Pronk, Jack T.; Smid, Eddy J.; Bron, Peter A.

    2013-01-01

    Mixed populations of Saccharomyces cerevisiae yeasts and lactic acid bacteria occur in many dairy, food, and beverage fermentations, but knowledge about their interactions is incomplete. In the present study, interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus, two microorganisms that co-occur in kefir fermentations, were studied during anaerobic growth on lactose. By combining physiological and transcriptome analysis of the two strains in the cocultures, five mechanisms of interaction were identified. (i) Lb. delbrueckii subsp. bulgaricus hydrolyzes lactose, which cannot be metabolized by S. cerevisiae, to galactose and glucose. Subsequently, galactose, which cannot be metabolized by Lb. delbrueckii subsp. bulgaricus, is excreted and provides a carbon source for yeast. (ii) In pure cultures, Lb. delbrueckii subsp. bulgaricus grows only in the presence of increased CO2 concentrations. In anaerobic mixed cultures, the yeast provides this CO2 via alcoholic fermentation. (iii) Analysis of amino acid consumption from the defined medium indicated that S. cerevisiae supplied alanine to the bacterium. (iv) A mild but significant low-iron response in the yeast transcriptome, identified by DNA microarray analysis, was consistent with the chelation of iron by the lactate produced by Lb. delbrueckii subsp. bulgaricus. (v) Transcriptome analysis of Lb. delbrueckii subsp. bulgaricus in mixed cultures showed an overrepresentation of transcripts involved in lipid metabolism, suggesting either a competition of the two microorganisms for fatty acids or a response to the ethanol produced by S. cerevisiae. This study demonstrates that chemostat-based transcriptome analysis is a powerful tool to investigate microbial interactions in mixed populations. PMID:23872557

  8. Transcriptome-based characterization of interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus in lactose-grown chemostat cocultures.

    PubMed

    Mendes, Filipa; Sieuwerts, Sander; de Hulster, Erik; Almering, Marinka J H; Luttik, Marijke A H; Pronk, Jack T; Smid, Eddy J; Bron, Peter A; Daran-Lapujade, Pascale

    2013-10-01

    Mixed populations of Saccharomyces cerevisiae yeasts and lactic acid bacteria occur in many dairy, food, and beverage fermentations, but knowledge about their interactions is incomplete. In the present study, interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus, two microorganisms that co-occur in kefir fermentations, were studied during anaerobic growth on lactose. By combining physiological and transcriptome analysis of the two strains in the cocultures, five mechanisms of interaction were identified. (i) Lb. delbrueckii subsp. bulgaricus hydrolyzes lactose, which cannot be metabolized by S. cerevisiae, to galactose and glucose. Subsequently, galactose, which cannot be metabolized by Lb. delbrueckii subsp. bulgaricus, is excreted and provides a carbon source for yeast. (ii) In pure cultures, Lb. delbrueckii subsp. bulgaricus grows only in the presence of increased CO2 concentrations. In anaerobic mixed cultures, the yeast provides this CO2 via alcoholic fermentation. (iii) Analysis of amino acid consumption from the defined medium indicated that S. cerevisiae supplied alanine to the bacterium. (iv) A mild but significant low-iron response in the yeast transcriptome, identified by DNA microarray analysis, was consistent with the chelation of iron by the lactate produced by Lb. delbrueckii subsp. bulgaricus. (v) Transcriptome analysis of Lb. delbrueckii subsp. bulgaricus in mixed cultures showed an overrepresentation of transcripts involved in lipid metabolism, suggesting either a competition of the two microorganisms for fatty acids or a response to the ethanol produced by S. cerevisiae. This study demonstrates that chemostat-based transcriptome analysis is a powerful tool to investigate microbial interactions in mixed populations.

  9. The application of transcriptomic data in the authentication of beef derived from contrasting production systems.

    PubMed

    Sweeney, Torres; Lejeune, Alex; Moloney, Aidan P; Monahan, Frank J; Gettigan, Paul Mc; Downey, Gerard; Park, Stephen D E; Ryan, Marion T

    2016-09-21

    Differences between cattle production systems can influence the nutritional and sensory characteristics of beef, in particular its fatty acid (FA) composition. As beef products derived from pasture-based systems can demand a higher premium from consumers, there is a need to understand the biological characteristics of pasture produced meat and subsequently to develop methods of authentication for these products. Here, we describe an approach to authentication that focuses on differences in the transcriptomic profile of muscle from animals finished in different systems of production of practical relevance to the Irish beef industry. The objectives of this study were to identify a panel of differentially expressed (DE) genes/networks in the muscle of cattle raised outdoors on pasture compared to animals raised indoors on a concentrate based diet and to subsequently identify an optimum panel which can classify the meat based on a production system. A comparison of the muscle transcriptome of outdoor/pasture-fed and Indoor/concentrate-fed cattle resulted in the identification of 26 DE genes. Functional analysis of these genes identified two significant networks (1: Energy Production, Lipid Metabolism, Small Molecule Biochemistry; and 2: Lipid Metabolism, Molecular Transport, Small Molecule Biochemistry), both of which are involved in FA metabolism. The expression of selected up-regulated genes in the outdoor/pasture-fed animals correlated positively with the total n-3 FA content of the muscle. The pathway and network analysis of the DE genes indicate that peroxisome proliferator-activated receptor (PPAR) and FYN/AMPK could be implicit in the regulation of these alterations to the lipid profile. In terms of authentication, the expression profile of three DE genes (ALAD, EIF4EBP1 and NPNT) could almost completely separate the samples based on production system (95 % authentication for animals on pasture-based and 100 % for animals on concentrate- based diet) in this context. The majority of DE genes between muscle of the outdoor/pasture-fed and concentrate-fed cattle were related to lipid metabolism and in particular β-oxidation. In this experiment the combined expression profiles of ALAD, EIF4EBP1 and NPNT were optimal in classifying the muscle transcriptome based on production system. Given the overall lack of comparable studies and variable concordance with those that do exist, the use of transcriptomic data in authenticating production systems requires more exploration across a range of contexts and breeds.

  10. Adaptation and evolution of deep-sea scale worms (Annelida: Polynoidae): insights from transcriptome comparison with a shallow-water species

    NASA Astrophysics Data System (ADS)

    Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M. Y.; Qian, Pei-Yuan; Qiu, Jian-Wen

    2017-04-01

    Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.).

  11. Adaptation and evolution of deep-sea scale worms (Annelida: Polynoidae): insights from transcriptome comparison with a shallow-water species

    PubMed Central

    Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M.Y.; Qian, Pei-Yuan; Qiu, Jian-Wen

    2017-01-01

    Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.). PMID:28397791

  12. Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.

    PubMed

    Zuo, Chunman; Blow, Matthew; Sreedasyam, Avinash; Kuo, Rita C; Ramamoorthy, Govindarajan Kunde; Torres-Jerez, Ivone; Li, Guifen; Wang, Mei; Dilworth, David; Barry, Kerrie; Udvardi, Michael; Schmutz, Jeremy; Tang, Yuhong; Xu, Ying

    2018-01-01

    Switchgrass ( Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts. We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures. Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.

  13. Transcriptomics of cortical gray matter thickness decline during normal aging

    PubMed Central

    Kochunov, P; Charlesworth, J; Winkler, A; Hong, LE; Nichols, T; Curran, JE; Sprooten, E; Jahanshad, N; Thompson, PM; Johnson, MP; Kent, JW; Landman, BA; Mitchell, B; Cole, SA; Dyer, TD; Moses, EK; Goring, HHH; Almasy, L; Duggirala, R; Olvera, RL; Glahn, DC; Blangero, J

    2013-01-01

    Introduction We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathways analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging Methods Transcriptome and GMT data were availabe for 379 individuals (age range=28–85) community-dwelling members of large extended Mexican-American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800µm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Results Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10−6) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Conclusion Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. PMID:23707588

  14. Transcriptomics of cortical gray matter thickness decline during normal aging.

    PubMed

    Kochunov, P; Charlesworth, J; Winkler, A; Hong, L E; Nichols, T E; Curran, J E; Sprooten, E; Jahanshad, N; Thompson, P M; Johnson, M P; Kent, J W; Landman, B A; Mitchell, B; Cole, S A; Dyer, T D; Moses, E K; Goring, H H H; Almasy, L; Duggirala, R; Olvera, R L; Glahn, D C; Blangero, J

    2013-11-15

    We performed a whole-transcriptome correlation analysis, followed by the pathway enrichment and testing of innate immune response pathway analyses to evaluate the hypothesis that transcriptional activity can predict cortical gray matter thickness (GMT) variability during normal cerebral aging. Transcriptome and GMT data were available for 379 individuals (age range=28-85) community-dwelling members of large extended Mexican American families. Collection of transcriptome data preceded that of neuroimaging data by 17 years. Genome-wide gene transcriptome data consisted of 20,413 heritable lymphocytes-based transcripts. GMT measurements were performed from high-resolution (isotropic 800 μm) T1-weighted MRI. Transcriptome-wide and pathway enrichment analysis was used to classify genes correlated with GMT. Transcripts for sixty genes from seven innate immune pathways were tested as specific predictors of GMT variability. Transcripts for eight genes (IGFBP3, LRRN3, CRIP2, SCD, IDS, TCF4, GATA3, and HN1) passed the transcriptome-wide significance threshold. Four orthogonal factors extracted from this set predicted 31.9% of the variability in the whole-brain and between 23.4 and 35% of regional GMT measurements. Pathway enrichment analysis identified six functional categories including cellular proliferation, aggregation, differentiation, viral infection, and metabolism. The integrin signaling pathway was significantly (p<10(-6)) enriched with GMT. Finally, three innate immune pathways (complement signaling, toll-receptors and scavenger and immunoglobulins) were significantly associated with GMT. Expression activity for the genes that regulate cellular proliferation, adhesion, differentiation and inflammation can explain a significant proportion of individual variability in cortical GMT. Our findings suggest that normal cerebral aging is the product of a progressive decline in regenerative capacity and increased neuroinflammation. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. The Transcriptome Analysis and Comparison Explorer--T-ACE: a platform-independent, graphical tool to process large RNAseq datasets of non-model organisms.

    PubMed

    Philipp, E E R; Kraemer, L; Mountfort, D; Schilhabel, M; Schreiber, S; Rosenstiel, P

    2012-03-15

    Next generation sequencing (NGS) technologies allow a rapid and cost-effective compilation of large RNA sequence datasets in model and non-model organisms. However, the storage and analysis of transcriptome information from different NGS platforms is still a significant bottleneck, leading to a delay in data dissemination and subsequent biological understanding. Especially database interfaces with transcriptome analysis modules going beyond mere read counts are missing. Here, we present the Transcriptome Analysis and Comparison Explorer (T-ACE), a tool designed for the organization and analysis of large sequence datasets, and especially suited for transcriptome projects of non-model organisms with little or no a priori sequence information. T-ACE offers a TCL-based interface, which accesses a PostgreSQL database via a php-script. Within T-ACE, information belonging to single sequences or contigs, such as annotation or read coverage, is linked to the respective sequence and immediately accessible. Sequences and assigned information can be searched via keyword- or BLAST-search. Additionally, T-ACE provides within and between transcriptome analysis modules on the level of expression, GO terms, KEGG pathways and protein domains. Results are visualized and can be easily exported for external analysis. We developed T-ACE for laboratory environments, which have only a limited amount of bioinformatics support, and for collaborative projects in which different partners work on the same dataset from different locations or platforms (Windows/Linux/MacOS). For laboratories with some experience in bioinformatics and programming, the low complexity of the database structure and open-source code provides a framework that can be customized according to the different needs of the user and transcriptome project.

  16. Technical adequacy of bisulfite sequencing and pyrosequencing for detection of mitochondrial DNA methylation: Sources and avoidance of false-positive detection.

    PubMed

    Owa, Chie; Poulin, Matthew; Yan, Liying; Shioda, Toshi

    2018-01-01

    The existence of cytosine methylation in mammalian mitochondrial DNA (mtDNA) is a controversial subject. Because detection of DNA methylation depends on resistance of 5'-modified cytosines to bisulfite-catalyzed conversion to uracil, examined parameters that affect technical adequacy of mtDNA methylation analysis. Negative control amplicons (NCAs) devoid of cytosine methylation were amplified to cover the entire human or mouse mtDNA by long-range PCR. When the pyrosequencing template amplicons were gel-purified after bisulfite conversion, bisulfite pyrosequencing of NCAs did not detect significant levels of bisulfite-resistant cytosines (brCs) at ND1 (7 CpG sites) or CYTB (8 CpG sites) genes (CI95 = 0%-0.94%); without gel-purification, significant false-positive brCs were detected from NCAs (CI95 = 4.2%-6.8%). Bisulfite pyrosequencing of highly purified, linearized mtDNA isolated from human iPS cells or mouse liver detected significant brCs (~30%) in human ND1 gene when the sequencing primer was not selective in bisulfite-converted and unconverted templates. However, repeated experiments using a sequencing primer selective in bisulfite-converted templates almost completely (< 0.8%) suppressed brC detection, supporting the false-positive nature of brCs detected using the non-selective primer. Bisulfite-seq deep sequencing of linearized, gel-purified human mtDNA detected 9.4%-14.8% brCs for 9 CpG sites in ND1 gene. However, because all these brCs were associated with adjacent non-CpG brCs showing the same degrees of bisulfite resistance, DNA methylation in this mtDNA-encoded gene was not confirmed. Without linearization, data generated by bisulfite pyrosequencing or deep sequencing of purified mtDNA templates did not pass the quality control criteria. Shotgun bisulfite sequencing of human mtDNA detected extremely low levels of CpG methylation (<0.65%) over non-CpG methylation (<0.55%). Taken together, our study demonstrates that adequacy of mtDNA methylation analysis using methods dependent on bisulfite conversion needs to be established for each experiment, taking effects of incomplete bisulfite conversion and template impurity or topology into consideration.

  17. Lyssavirus Detection and Typing Using Pyrosequencing▿#‖

    PubMed Central

    De Benedictis, Paola; De Battisti, Cristian; Dacheux, Laurent; Marciano, Sabrina; Ormelli, Silvia; Salomoni, Angela; Caenazzo, Silvia Tiozzo; Lepelletier, Anthony; Bourhy, Hervé; Capua, Ilaria; Cattoli, Giovanni

    2011-01-01

    Rabies is a fatal zoonosis caused by a nonsegmented negative-strand RNA virus, namely, rabies virus (RABV). Apart from RABV, at least 10 additional species are known as rabies-related lyssaviruses (RRVs), and some of them are responsible for occasional spillovers into humans. More lyssaviruses have also been detected recently in different bat ecosystems, thanks to the application of molecular diagnostic methods. Due to the variety of the members of the genus Lyssavirus, there is the necessity to develop a reliable molecular assay for rabies diagnosis able to detect and differentiate among the existing rabies and rabies-related viruses. In the present study, a pyrosequencing protocol targeting the 3′ terminus of the nucleoprotein (N) gene was applied for the rapid characterization of lyssaviruses. Correct identification of species was achieved for each sample tested. Results from the pyrosequencing assay were also confirmed by those obtained using the Sanger sequencing method. A pan-lyssavirus one-step reverse transcription (RT)-PCR was developed within the framework of the pyrosequencing procedure. The sensitivity (Se) of the one-step RT-PCR assay was determined by using in vitro-transcribed RNA and serial dilutions of titrated viruses. The assay demonstrated high analytical and relative specificity (Sp) (98.94%) and sensitivity (99.71%). To date, this is the first case in which pyrosequencing has been applied for lyssavirus identification using a cheaper diagnostic approach than the one for all the other protocols for rapid typing that we are acquainted with. Results from this study indicate that this procedure is suitable for lyssavirus detection in samples of both human and animal origin. PMID:21389152

  18. CYP3A4 and CYP3A5 genotyping by Pyrosequencing

    PubMed Central

    Garsa, Adam A; McLeod, Howard L; Marsh, Sharon

    2005-01-01

    Background Human cytochrome P450 3A enzymes, particularly CYP3A4 and CYP3A5, play an important role in drug metabolism. CYP3A expression exhibits substantial interindividual variation, much of which may result from genetic variation. This study describes Pyrosequencing assays for key SNPs in CYP3A4 (CYP3A4*1B, CYP3A4*2, and CYP3A4*3) and CYP3A5 (CYP3A5*3C and CYP3A5*6). Methods Genotyping of 95 healthy European and 95 healthy African volunteers was performed using Pyrosequencing. Linkage disequilibrium, haplotype inference, Hardy-Weinberg equilibrium, and tag SNPs were also determined for these samples. Results CYP3A4*1B allele frequencies were 4% in Europeans and 82% in Africans. The CYP3A4*2 allele was found in neither population sample. CYP3A4*3 had an allele frequency of 2% in Europeans and 0% in Africans. The frequency of CYP3A5*3C was 94% in Europeans and 12% in Africans. No CYP3A5*6 variants were found in the European samples, but this allele had a frequency of 16% in the African samples. Allele frequencies and haplotypes show interethnic variation, highlighting the need to analyze clinically relevant SNPs and haplotypes in a variety of ethnic groups. Conclusion Pyrosequencing is a versatile technique that could improve the efficiency of SNP analysis for pharmacogenomic research with the ultimate goal of pre-screening patients for individual therapy selection. PMID:15882469

  19. Molecular characteristics of the KCNJ5 mutated aldosterone-producing adenomas.

    PubMed

    Murakami, Masanori; Yoshimoto, Takanobu; Nakabayashi, Kazuhiko; Nakano, Yujiro; Fukaishi, Takahiro; Tsuchiya, Kyoichiro; Minami, Isao; Bouchi, Ryotaro; Okamura, Kohji; Fujii, Yasuhisa; Hashimoto, Koshi; Hata, Ken-Ichiro; Kihara, Kazunori; Ogawa, Yoshihiro

    2017-10-01

    The pathophysiology of aldosterone-producing adenomas (APAs) has been investigated via genetic approaches and the pathogenic significance of a series of somatic mutations, including KCNJ5 , has been uncovered. However, how the mutational status of an APA is associated with its molecular characteristics, including its transcriptome and methylome, has not been fully understood. This study was undertaken to explore the molecular characteristics of APAs, specifically focusing on APAs with KCNJ5 mutations as opposed to those without KCNJ5 mutations, by comparing their transcriptome and methylome status. Cortisol-producing adenomas (CPAs) were used as reference. We conducted transcriptome and methylome analyses of 29 APAs with KCNJ5 mutations, 8 APAs without KCNJ5 mutations and 5 CPAs. Genome-wide gene expression and CpG methylation profiles were obtained from RNA and DNA samples extracted from these 42 adrenal tumors. Cluster analysis of the transcriptome and methylome revealed molecular heterogeneity in APAs depending on their mutational status. DNA hypomethylation and gene expression changes in Wnt signaling and inflammatory response pathways were characteristic of APAs with KCNJ5 mutations. Comparisons between transcriptome data from our APAs and that from normal adrenal cortex obtained from the Gene Expression Omnibus suggested similarities between APAs with KCNJ5 mutations and zona glomerulosa. The present study, which is based on transcriptome and methylome analyses, indicates the molecular heterogeneity of APAs depends on their mutational status. Here, we report the unique characteristics of APAs with KCNJ5 mutations. © 2017 Society for Endocrinology.

  20. De novo transcriptome assemblies of four xylem sap-feeding insects

    PubMed Central

    Tassone, Erica E.; Cowden, Charles C.

    2017-01-01

    Abstract Background: Spittle bugs and sharpshooters are well-known xylem sap-feeding insects and vectors of the phytopathogenic bacterium Xylella fastidiosa (Wells), a causal agent of Pierce's disease of grapevines and other crop diseases. Specialized feeding on nutrient-deficient xylem sap is relatively rare among insect herbivores, and only limited genomic and transcriptomic information has been generated for xylem-sap feeders. To develop a more comprehensive understanding of biochemical adaptations and symbiotic relationships that support survival on a nutritionally austere dietary source, transcriptome assemblies for three sharpshooter species and one spittlebug species were produced. Findings: Trinity-based de novo transcriptome assemblies were generated for all four xylem-sap feeders using raw sequencing data originating from whole-insect preps. Total transcripts for each species ranged from 91 384 for Cuerna arida to 106 998 for Homalodisca liturata with transcript totals for Graphocephala atropunctata and the spittlebug Clastoptera arizonana falling in between. The percentage of transcripts comprising complete open reading frames ranged from 60% for H. liturata to 82% for C. arizonana. Bench-marking universal single-copy orthologs analyses for each dataset indicated quality assemblies and a high degree of completeness for all four species. Conclusions: These four transcriptomes represent a significant expansion of data for insect herbivores that feed exclusively on xylem sap, a nutritionally deficient dietary source relative to other plant tissues and fluids. Comparison of transcriptome data with insect herbivores that utilize other dietary sources may illuminate fundamental differences in the biochemistry of dietary specialization. PMID:28327966

  1. Biologic Phenotyping of the Human Small Airway Epithelial Response to Cigarette Smoking

    PubMed Central

    Tilley, Ann E.; O'Connor, Timothy P.; Hackett, Neil R.; Strulovici-Barel, Yael; Salit, Jacqueline; Amoroso, Nancy; Zhou, Xi Kathy; Raman, Tina; Omberg, Larsson; Clark, Andrew; Mezey, Jason; Crystal, Ronald G.

    2011-01-01

    Background The first changes associated with smoking are in the small airway epithelium (SAE). Given that smoking alters SAE gene expression, but only a fraction of smokers develop chronic obstructive pulmonary disease (COPD), we hypothesized that assessment of SAE genome-wide gene expression would permit biologic phenotyping of the smoking response, and that a subset of healthy smokers would have a “COPD-like” SAE transcriptome. Methodology/Principal Findings SAE (10th–12th generation) was obtained via bronchoscopy of healthy nonsmokers, healthy smokers and COPD smokers and microarray analysis was used to identify differentially expressed genes. Individual responsiveness to smoking was quantified with an index representing the % of smoking-responsive genes abnormally expressed (ISAE), with healthy smokers grouped into “high” and “low” responders based on the proportion of smoking-responsive genes up- or down-regulated in each smoker. Smokers demonstrated significant variability in SAE transcriptome with ISAE ranging from 2.9 to 51.5%. While the SAE transcriptome of “low” responder healthy smokers differed from both “high” responders and smokers with COPD, the transcriptome of the “high” responder healthy smokers was indistinguishable from COPD smokers. Conclusion/Significance The SAE transcriptome can be used to classify clinically healthy smokers into subgroups with lesser and greater responses to cigarette smoking, even though these subgroups are indistinguishable by clinical criteria. This identifies a group of smokers with a “COPD-like” SAE transcriptome. PMID:21829517

  2. Impact of Nisin-Activated Packaging on Microbiota of Beef Burgers during Storage.

    PubMed

    Ferrocino, Ilario; Greppi, Anna; La Storia, Antonietta; Rantsiou, Kalliopi; Ercolini, Danilo; Cocolin, Luca

    2016-01-15

    Beef burgers were stored at 4°C in a vacuum in nisin-activated antimicrobial packaging. Microbial ecology analyses were performed on samples collected between days 0 and 21 of storage to discover the population diversity. Two batches were analyzed using RNA-based denaturing gradient gel electrophoresis (DGGE) and pyrosequencing. The active packaging retarded the growth of the total viable bacteria and lactic acid bacteria. Culture-independent analysis by pyrosequencing of RNA extracted directly from meat showed that Photobacterium phosphoreum, Lactococcus piscium, Lactobacillus sakei, and Leuconostoc carnosum were the major operational taxonomic units (OTUs) shared between control and treated samples. Beta diversity analysis of the 16S rRNA sequence data and RNA-DGGE showed a clear separation between two batches based on the microbiota. Control samples from batch B showed a significant high abundance of some taxa sensitive to nisin, such as Kocuria rhizophila, Staphylococcus xylosus, Leuconostoc carnosum, and Carnobacterium divergens, compared to control samples from batch A. However, only from batch B was it possible to find a significant difference between controls and treated samples during storage due to the active packaging. Predicted metagenomes confirmed differences between the two batches and indicated that the use of nisin-based antimicrobial packaging can determine a reduction in the abundance of specific metabolic pathways related to spoilage. The present study aimed to assess the viable bacterial communities in beef burgers stored in nisin-based antimicrobial packaging, and it highlights the efficacy of this strategy to prolong beef burger shelf life. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  3. Single nucleotide polymorphism discovery in cutthroat trout subspecies using genome reduction, barcoding, and 454 pyro-sequencing

    PubMed Central

    2012-01-01

    Background Salmonids are popular sport fishes, and as such have been subjected to widespread stocking throughout western North America. Historically, stocking was done with little regard for genetic variation among populations and has resulted in genetic mixing among species and subspecies in many areas, thus putting the genetic integrity of native salmonid populations at risk and creating a need to assess the genetic constitution of native salmonid populations. Cutthroat trout is a salmonid species with pronounced geographic structure (there are 10 extant subspecies) and a recent history of hybridization with introduced rainbow trout in many populations. Genetic admixture has also occurred among cutthroat trout subspecies in areas where introductions have brought two or more subspecies into contact. Consequently, management agencies have increased their efforts to evaluate the genetic composition of cutthroat trout populations to identify populations that remain uncompromised and manage them accordingly, but additional genetic markers are needed to do so effectively. Here we used genome reduction, MID-barcoding, and 454-pyrosequencing to discover single nucleotide polymorphisms that differentiate cutthroat trout subspecies and can be used as a rapid, cost-effective method to characterize the genetic composition of cutthroat trout populations. Results Thirty cutthroat and six rainbow trout individuals were subjected to genome reduction and next-generation sequencing. A total of 1,499,670 reads averaging 379 base pairs in length were generated by 454-pyrosequencing, resulting in 569,060,077 total base pairs sequenced. A total of 43,558 putative SNPs were identified, and of those, 125 SNP primers were developed that successfully amplified 96 cutthroat trout and rainbow trout individuals. These SNP loci were able to differentiate most cutthroat trout subspecies using distance methods and Structure analyses. Conclusions Genomic and bioinformatic protocols were successfully implemented to identify 125 nuclear SNPs that are capable of differentiating most subspecies of cutthroat trout from one another. The ability to use this suite of SNPs to identify individuals of unknown genetic background to subspecies can be a valuable tool for management agencies in their efforts to evaluate the genetic structure of cutthroat trout populations prior to constructing and implementing conservation plans. PMID:23259499

  4. Assessment of bacterial diversity in the cattle tick Rhipicephalus (Boophilus) microplus through tag-encoded pyrosequencing.

    PubMed

    Andreotti, Renato; Pérez de León, Adalberto A; Dowd, Scot E; Guerrero, Felix D; Bendele, Kylie G; Scoles, Glen A

    2011-01-06

    Ticks are regarded as the most relevant vectors of disease-causing pathogens in domestic and wild animals. The cattle tick, Rhipicephalus (Boophilus) microplus, hinders livestock production in tropical and subtropical parts of the world where it is endemic. Tick microbiomes remain largely unexplored. The objective of this study was to explore the R. microplus microbiome by applying the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP) technique to characterize its bacterial diversity. Pyrosequencing was performed on adult males and females, eggs, and gut and ovary tissues from adult females derived from samples of R. microplus collected during outbreaks in southern Texas. Raw data from bTEFAP were screened and trimmed based upon quality scores and binned into individual sample collections. Bacteria identified to the species level include Staphylococcus aureus, Staphylococcus chromogenes, Streptococcus dysgalactiae, Staphylococcus sciuri, Serratia marcescens, Corynebacterium glutamicum, and Finegoldia magna. One hundred twenty-one bacterial genera were detected in all the life stages and tissues sampled. The total number of genera identified by tick sample comprised: 53 in adult males, 61 in adult females, 11 in gut tissue, 7 in ovarian tissue, and 54 in the eggs. Notable genera detected in the cattle tick include Wolbachia, Coxiella, and Borrelia. The molecular approach applied in this study allowed us to assess the relative abundance of the microbiota associated with R. microplus. This report represents the first survey of the bacteriome in the cattle tick using non-culture based molecular approaches. Comparisons of our results with previous bacterial surveys provide an indication of geographic variation in the assemblages of bacteria associated with R. microplus. Additional reports on the identification of new bacterial species maintained in nature by R. microplus that may be pathogenic to its vertebrate hosts are expected as our understanding of its microbiota expands. Increased awareness of the role R. microplus can play in the transmission of pathogenic bacteria will enhance our ability to mitigate its economic impact on animal agriculture globally. This recognition should be included as part of analyses to assess the risk for re-invasion of areas like the United States of America where R. microplus was eradicated.

  5. 454-Pyrosequencing Analysis of Bacterial Communities from Autotrophic Nitrogen Removal Bioreactors Utilizing Universal Primers: Effect of Annealing Temperature

    PubMed Central

    Rodriguez-Sanchez, Alejandro; Rodelas, Belén; Abbas, Ben A.; Martinez-Toledo, Maria Victoria; van Loosdrecht, Mark C. M.; Osorio, F.; Gonzalez-Lopez, Jesus

    2015-01-01

    Identification of anaerobic ammonium oxidizing (anammox) bacteria by molecular tools aimed at the evaluation of bacterial diversity in autotrophic nitrogen removal systems is limited by the difficulty to design universal primers for the Bacteria domain able to amplify the anammox 16S rRNA genes. A metagenomic analysis (pyrosequencing) of total bacterial diversity including anammox population in five autotrophic nitrogen removal technologies, two bench-scale models (MBR and Low Temperature CANON) and three full-scale bioreactors (anammox, CANON, and DEMON), was successfully carried out by optimization of primer selection and PCR conditions (annealing temperature). The universal primer 530F was identified as the best candidate for total bacteria and anammox bacteria diversity coverage. Salt-adjusted optimum annealing temperature of primer 530F was calculated (47°C) and hence a range of annealing temperatures of 44–49°C was tested. Pyrosequencing data showed that annealing temperature of 45°C yielded the best results in terms of species richness and diversity for all bioreactors analyzed. PMID:26421306

  6. 454-Pyrosequencing Analysis of Bacterial Communities from Autotrophic Nitrogen Removal Bioreactors Utilizing Universal Primers: Effect of Annealing Temperature.

    PubMed

    Gonzalez-Martinez, Alejandro; Rodriguez-Sanchez, Alejandro; Rodelas, Belén; Abbas, Ben A; Martinez-Toledo, Maria Victoria; van Loosdrecht, Mark C M; Osorio, F; Gonzalez-Lopez, Jesus

    2015-01-01

    Identification of anaerobic ammonium oxidizing (anammox) bacteria by molecular tools aimed at the evaluation of bacterial diversity in autotrophic nitrogen removal systems is limited by the difficulty to design universal primers for the Bacteria domain able to amplify the anammox 16S rRNA genes. A metagenomic analysis (pyrosequencing) of total bacterial diversity including anammox population in five autotrophic nitrogen removal technologies, two bench-scale models (MBR and Low Temperature CANON) and three full-scale bioreactors (anammox, CANON, and DEMON), was successfully carried out by optimization of primer selection and PCR conditions (annealing temperature). The universal primer 530F was identified as the best candidate for total bacteria and anammox bacteria diversity coverage. Salt-adjusted optimum annealing temperature of primer 530F was calculated (47°C) and hence a range of annealing temperatures of 44-49°C was tested. Pyrosequencing data showed that annealing temperature of 45°C yielded the best results in terms of species richness and diversity for all bioreactors analyzed.

  7. Detection of Drug-Resistant Mycobacterium tuberculosis.

    PubMed

    Engström, Anna; Juréen, Pontus

    2015-01-01

    Tuberculosis (TB) remains a global health problem. The increasing prevalence of drug-resistant Mycobacterium tuberculosis, the causative agent of TB, demands new measures to combat the situation. Rapid and accurate diagnosis of the pathogen and its drug susceptibility pattern is essential for timely initiation of optimal treatment, and, ultimately, control of the disease. We have developed a molecular method for detection of first- and second-line drug resistance in M. tuberculosis by Pyrosequencing(®). The method consists of seven Pyrosequencing assays for the detection of mutations in the genes or promoter regions, which are most commonly responsible for resistance to the drugs rifampicin, isoniazid, ethambutol, amikacin, kanamycin, capreomycin, and fluoroquinolones. The method was validated on clinical isolates and it was shown that the sensitivity and specificity of the method were comparable to those of Sanger sequencing. In the protocol in this chapter we describe the steps necessary for setting up and performing Pyrosequencing for M. tuberculosis. The first part of the protocol describes the assay development and the second part of the protocol describes utilization of the method.

  8. Identification of candidate genes for drought tolerance in coffee by high-throughput sequencing in the shoot apex of different Coffea arabica cultivars.

    PubMed

    Mofatto, Luciana Souto; Carneiro, Fernanda de Araújo; Vieira, Natalia Gomes; Duarte, Karoline Estefani; Vidal, Ramon Oliveira; Alekcevetch, Jean Carlos; Cotta, Michelle Guitton; Verdeil, Jean-Luc; Lapeyre-Montes, Fabienne; Lartaud, Marc; Leroy, Thierry; De Bellis, Fabien; Pot, David; Rodrigues, Gustavo Costa; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães; Andrade, Alan Carvalho; Marraccini, Pierre

    2016-04-19

    Drought is a widespread limiting factor in coffee plants. It affects plant development, fruit production, bean development and consequently beverage quality. Genetic diversity for drought tolerance exists within the coffee genus. However, the molecular mechanisms underlying the adaptation of coffee plants to drought are largely unknown. In this study, we compared the molecular responses to drought in two commercial cultivars (IAPAR59, drought-tolerant and Rubi, drought-susceptible) of Coffea arabica grown in the field under control (irrigation) and drought conditions using the pyrosequencing of RNA extracted from shoot apices and analysing the expression of 38 candidate genes. Pyrosequencing from shoot apices generated a total of 34.7 Mbp and 535,544 reads enabling the identification of 43,087 clusters (41,512 contigs and 1,575 singletons). These data included 17,719 clusters (16,238 contigs and 1,575 singletons) exclusively from 454 sequencing reads, along with 25,368 hybrid clusters assembled with 454 sequences. The comparison of DNA libraries identified new candidate genes (n = 20) presenting differential expression between IAPAR59 and Rubi and/or drought conditions. Their expression was monitored in plagiotropic buds, together with those of other (n = 18) candidates genes. Under drought conditions, up-regulated expression was observed in IAPAR59 but not in Rubi for CaSTK1 (protein kinase), CaSAMT1 (SAM-dependent methyltransferase), CaSLP1 (plant development) and CaMAS1 (ABA biosynthesis). Interestingly, the expression of lipid-transfer protein (nsLTP) genes was also highly up-regulated under drought conditions in IAPAR59. This may have been related to the thicker cuticle observed on the abaxial leaf surface in IAPAR59 compared to Rubi. The full transcriptome assembly of C. arabica, followed by functional annotation, enabled us to identify differentially expressed genes related to drought conditions. Using these data, candidate genes were selected and their differential expression profiles were confirmed by qPCR experiments in plagiotropic buds of IAPAR59 and Rubi under drought conditions. As regards the genes up-regulated under drought conditions, specifically in the drought-tolerant IAPAR59, several corresponded to orphan genes but also to genes coding proteins involved in signal transduction pathways, as well as ABA and lipid metabolism, for example. The identification of these genes should help advance our understanding of the genetic determinism of drought tolerance in coffee.

  9. A Transcriptome Map of Actinobacillus pleuropneumoniae at Single-Nucleotide Resolution Using Deep RNA-Seq

    PubMed Central

    Su, Zhipeng; Zhu, Jiawen; Xu, Zhuofei; Xiao, Ran; Zhou, Rui; Li, Lu; Chen, Huanchun

    2016-01-01

    Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq) has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs), UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp) from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures). The transcriptional units described in this study provide a foundation for future studies concerning the gene functions and the transcriptional regulatory architectures of this pathogen. PMID:27018591

  10. DNA methylation and hydroxymethylation analyses of the active LINE-1 subfamilies in mice.

    PubMed

    Murata, Yui; Bundo, Miki; Ueda, Junko; Kubota-Sakashita, Mie; Kasai, Kiyoto; Kato, Tadafumi; Iwamoto, Kazuya

    2017-10-19

    Retrotransposon long interspersed nuclear element-1 (LINE-1) occupies a large proportion of the mammalian genome, comprising approximately 100,000 genomic copies in mice. Epigenetic status of the 5' untranslated region (5'-UTR) of LINE-1 is critical for its promoter activity. DNA methylation levels in the 5'-UTR of human active LINE-1 subfamily can be measured by well-established methods, such as a pyrosequencing-based assay. However, because of the considerable sequence and structural diversity in LINE-1 among species, methods for such assays should be adapted for the species of interest. Here we developed pyrosequencing-based assays to examine methylcytosine (mC) and hydroxymethylcytosine (hmC) levels of the three active LINE-1 subfamilies in mice (TfI, A, and GfII). Using these assays, we quantified mC and hmC levels in four brain regions and four nonbrain tissues including tail, heart, testis, and ovary. We observed tissue- and subfamily-specific mC and hmC differences. We also found that mC levels were strongly correlated among different brain regions, but mC levels of the testis showed a poor correlation with those of other tissues. Interestingly, mC levels in the A and GfII subfamilies were highly correlated, possibly reflecting their close evolutionary relationship. Our assays will be useful for exploring the epigenetic regulation of the active LINE-1 subfamilies in mice.

  11. An Insight Into the Microbiome of the Amblyomma maculatum (Acari: Ixodidae)

    PubMed Central

    BUDACHETRI, KHEMRAJ; BROWNING, REBECCA E.; ADAMSON, STEVEN W.; DOWD, SCOT E.; CHAO, CHIEN-CHUNG; CHING, WEI-MEI; KARIM, SHAHID

    2014-01-01

    The aim of this study was to survey the bacterial diversity of Amblyomma maculatum Koch, 1844, and characterize its infection with Rickettsia parkeri. Pyrosequencing of the bacterial 16S rRNA was used to determine the total bacterial population in A. maculatum. Pyrosequencing analysis identified Rickettsia in A. maculatum midguts, salivary glands, and saliva, which indicates successful trafficking in the arthropod vector. The identity of Rickettsia spp. was determined based on sequencing the rickettsial outer membrane protein A (rompA) gene. The sequence homology search revealed the presence of R. parkeri, Rickettsia amblyommii, and Rickettsia endosymbiont of A. maculatum in midgut tissues, whereas the only rickettsia detected in salivary glands was R. parkeri, suggesting it is unique in its ability to migrate from midgut to salivary glands, and colonize this tissue before dissemination to the host. Owing to its importance as an emerging infectious disease, the R. parkeri pathogen burden was quantified by a rompB-based quantitative polymerase chain reaction (qPCR) assay and the diagnostic effectiveness of using R. parkeri polyclonal antibodies in tick tissues was tested. Together, these data indicate that field-collected A. maculatum had a R. parkeri infection rate of 12–32%. This study provides an insight into the A. maculatum microbiome and confirms the presence of R. parkeri, which will serve as the basis for future tick and microbiome interaction studies. PMID:24605461

  12. Environmental Barcoding: A Next-Generation Sequencing Approach for Biomonitoring Applications Using River Benthos

    PubMed Central

    Hajibabaei, Mehrdad; Shokralla, Shadi; Zhou, Xin; Singer, Gregory A. C.; Baird, Donald J.

    2011-01-01

    Timely and accurate biodiversity analysis poses an ongoing challenge for the success of biomonitoring programs. Morphology-based identification of bioindicator taxa is time consuming, and rarely supports species-level resolution especially for immature life stages. Much work has been done in the past decade to develop alternative approaches for biodiversity analysis using DNA sequence-based approaches such as molecular phylogenetics and DNA barcoding. On-going assembly of DNA barcode reference libraries will provide the basis for a DNA-based identification system. The use of recently introduced next-generation sequencing (NGS) approaches in biodiversity science has the potential to further extend the application of DNA information for routine biomonitoring applications to an unprecedented scale. Here we demonstrate the feasibility of using 454 massively parallel pyrosequencing for species-level analysis of freshwater benthic macroinvertebrate taxa commonly used for biomonitoring. We designed our experiments in order to directly compare morphology-based, Sanger sequencing DNA barcoding, and next-generation environmental barcoding approaches. Our results show the ability of 454 pyrosequencing of mini-barcodes to accurately identify all species with more than 1% abundance in the pooled mixture. Although the approach failed to identify 6 rare species in the mixture, the presence of sequences from 9 species that were not represented by individuals in the mixture provides evidence that DNA based analysis may yet provide a valuable approach in finding rare species in bulk environmental samples. We further demonstrate the application of the environmental barcoding approach by comparing benthic macroinvertebrates from an urban region to those obtained from a conservation area. Although considerable effort will be required to robustly optimize NGS tools to identify species from bulk environmental samples, our results indicate the potential of an environmental barcoding approach for biomonitoring programs. PMID:21533287

  13. Intra-tumor heterogeneity in breast cancer has limited impact on transcriptomic-based molecular profiling.

    PubMed

    Karthik, Govindasamy-Muralidharan; Rantalainen, Mattias; Stålhammar, Gustav; Lövrot, John; Ullah, Ikram; Alkodsi, Amjad; Ma, Ran; Wedlund, Lena; Lindberg, Johan; Frisell, Jan; Bergh, Jonas; Hartman, Johan

    2017-11-29

    Transcriptomic profiling of breast tumors provides opportunity for subtyping and molecular-based patient stratification. In diagnostic applications the specimen profiled should be representative of the expression profile of the whole tumor and ideally capture properties of the most aggressive part of the tumor. However, breast cancers commonly exhibit intra-tumor heterogeneity at molecular, genomic and in phenotypic level, which can arise during tumor evolution. Currently it is not established to what extent a random sampling approach may influence molecular breast cancer diagnostics. In this study we applied RNA-sequencing to quantify gene expression in 43 pieces (2-5 pieces per tumor) from 12 breast tumors (Cohort 1). We determined molecular subtype and transcriptomic grade for all tumor pieces and analysed to what extent pieces originating from the same tumors are concordant or discordant with each other. Additionally, we validated our finding in an independent cohort consisting of 19 pieces (2-6 pieces per tumor) from 6 breast tumors (Cohort 2) profiled using microarray technique. Exome sequencing was also performed on this cohort, to investigate the extent of intra-tumor genomic heterogeneity versus the intra-tumor molecular subtype classifications. Molecular subtyping was consistent in 11 out of 12 tumors and transcriptomic grade assignments were consistent in 11 out of 12 tumors as well. Molecular subtype predictions revealed consistent subtypes in four out of six patients in this cohort 2. Interestingly, we observed extensive intra-tumor genomic heterogeneity in these tumor pieces but not in their molecular subtype classifications. Our results suggest that macroscopic intra-tumoral transcriptomic heterogeneity is limited and unlikely to have an impact on molecular diagnostics for most patients.

  14. De novo Assembly of Leaf Transcriptome in the Medicinal Plant Andrographis paniculata

    PubMed Central

    Cherukupalli, Neeraja; Divate, Mayur; Mittapelli, Suresh R.; Khareedu, Venkateswara R.; Vudem, Dashavantha R.

    2016-01-01

    Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeq™ 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant non-redundant protein database, gene ontology, and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts—using kyoto encyclopedia of genes and genomes database—revealed 5606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of transcriptome of A. paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analysis besides identification of key enzymes involved in the various pathways of secondary metabolism. PMID:27582746

  15. Detailed transcriptome description of the neglected cestode Taenia multiceps.

    PubMed

    Wu, Xuhang; Fu, Yan; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Hao, Guiying; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a referenced genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echincoccus granulosus and Echincoccus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of the biology of T. multiceps, and helps in the identification of drug targets and parasite-host interaction studies.

  16. Genome and Transcriptome Sequencing of the Ostreid herpesvirus 1 From Tomales Bay, California

    NASA Astrophysics Data System (ADS)

    Burge, C. A.; Langevin, S.; Closek, C. J.; Roberts, S. B.; Friedman, C. S.

    2016-02-01

    Mass mortalities of larval and seed bivalve molluscs attributed to the Ostreid herpesvirus 1 (OsHV-1) occur globally. OsHV-1 was fully sequenced and characterized as a member of the Family Malacoherpesviridae. Multiple strains of OsHV-1 exist and may vary in virulence, i.e. OsHV-1 µvar. For most global variants of OsHV-1, sequence data is limited to PCR-based sequencing of segments, including two recent genomes. In the United States, OsHV-1 is limited to detection in adjacent embayments in California, Tomales and Drakes bays. Limited DNA sequence data of OsHV-1 infecting oysters in Tomales Bay indicates the virus detected in Tomales Bay is similar but not identical to any one global variant of OsHV-1. In order to better understand both strain variation and virulence of OsHV-1 infecting oysters in Tomales Bay, we used genomic and transcriptomic sequencing. Meta-genomic sequencing (Illumina MiSeq) was conducted from infected oysters (n=4 per year) collected in 2003, 2007, and 2014, where full OsHV-1 genome sequences and low overall microbial diversity were achieved from highly infected oysters. Increased microbial diversity was detected in three of four samples sequenced from 2003, where qPCR based genome copy numbers of OsHV-1 were lower. Expression analysis (SOLiD RNA sequencing) of OsHV-1 genes expressed in oyster larvae at 24 hours post exposure revealed a nearly complete transcriptome, with several highly expressed genes, which are similar to recent transcriptomic analyses of other OsHV-1 variants. Taken together, our results indicate that genome and transcriptome sequencing may be powerful tools in understanding both strain variation and virulence of non-culturable marine viruses.

  17. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.

    PubMed

    Li, Xinguo; Wu, Harry X; Southerton, Simon G

    2010-06-21

    Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.

  18. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants

    PubMed Central

    2010-01-01

    Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution. PMID:20565927

  19. Transcriptome analysis of thermophilic methylotrophic Bacillus methanolicus MGA3 using RNA-sequencing provides detailed insights into its previously uncharted transcriptional landscape.

    PubMed

    Irla, Marta; Neshat, Armin; Brautaset, Trygve; Rückert, Christian; Kalinowski, Jörn; Wendisch, Volker F

    2015-02-14

    Bacillus methanolicus MGA3 is a thermophilic, facultative ribulose monophosphate (RuMP) cycle methylotroph. Together with its ability to produce high yields of amino acids, the relevance of this microorganism as a promising candidate for biotechnological applications is evident. The B. methanolicus MGA3 genome consists of a 3,337,035 nucleotides (nt) circular chromosome, the 19,174 nt plasmid pBM19 and the 68,999 nt plasmid pBM69. 3,218 protein-coding regions were annotated on the chromosome, 22 on pBM19 and 82 on pBM69. In the present study, the RNA-seq approach was used to comprehensively investigate the transcriptome of B. methanolicus MGA3 in order to improve the genome annotation, identify novel transcripts, analyze conserved sequence motifs involved in gene expression and reveal operon structures. For this aim, two different cDNA library preparation methods were applied: one which allows characterization of the whole transcriptome and another which includes enrichment of primary transcript 5'-ends. Analysis of the primary transcriptome data enabled the detection of 2,167 putative transcription start sites (TSSs) which were categorized into 1,642 TSSs located in the upstream region (5'-UTR) of known protein-coding genes and 525 TSSs of novel antisense, intragenic, or intergenic transcripts. Firstly, 14 wrongly annotated translation start sites (TLSs) were corrected based on primary transcriptome data. Further investigation of the identified 5'-UTRs resulted in the detailed characterization of their length distribution and the detection of 75 hitherto unknown cis-regulatory RNA elements. Moreover, the exact TSSs positions were utilized to define conserved sequence motifs for translation start sites, ribosome binding sites and promoters in B. methanolicus MGA3. Based on the whole transcriptome data set, novel transcripts, operon structures and mRNA abundances were determined. The analysis of the operon structures revealed that almost half of the genes are transcribed monocistronically (940), whereas 1,164 genes are organized in 381 operons. Several of the genes related to methylotrophy had highly abundant transcripts. The extensive insights into the transcriptional landscape of B. methanolicus MGA3, gained in this study, represent a valuable foundation for further comparative quantitative transcriptome analyses and possibly also for the development of molecular biology tools which at present are very limited for this organism.

  20. ChlamyNET: a Chlamydomonas gene co-expression network reveals global properties of the transcriptome and the early setup of key co-expression patterns in the green lineage.

    PubMed

    Romero-Campero, Francisco J; Perez-Hurtado, Ignacio; Lucas-Reina, Eva; Romero, Jose M; Valverde, Federico

    2016-03-12

    Chlamydomonas reinhardtii is the model organism that serves as a reference for studies in algal genomics and physiology. It is of special interest in the study of the evolution of regulatory pathways from algae to higher plants. Additionally, it has recently gained attention as a potential source for bio-fuel and bio-hydrogen production. The genome of Chlamydomonas is available, facilitating the analysis of its transcriptome by RNA-seq data. This has produced a massive amount of data that remains fragmented making necessary the application of integrative approaches based on molecular systems biology. We constructed a gene co-expression network based on RNA-seq data and developed a web-based tool, ChlamyNET, for the exploration of the Chlamydomonas transcriptome. ChlamyNET exhibits a scale-free and small world topology. Applying clustering techniques, we identified nine gene clusters that capture the structure of the transcriptome under the analyzed conditions. One of the most central clusters was shown to be involved in carbon/nitrogen metabolism and signalling, whereas one of the most peripheral clusters was involved in DNA replication and cell cycle regulation. The transcription factors and regulators in the Chlamydomonas genome have been identified in ChlamyNET. The biological processes potentially regulated by them as well as their putative transcription factor binding sites were determined. The putative light regulated transcription factors and regulators in the Chlamydomonas genome were analyzed in order to provide a case study on the use of ChlamyNET. Finally, we used an independent data set to cross-validate the predictive power of ChlamyNET. The topological properties of ChlamyNET suggest that the Chlamydomonas transcriptome posseses important characteristics related to error tolerance, vulnerability and information propagation. The central part of ChlamyNET constitutes the core of the transcriptome where most authoritative hub genes are located interconnecting key biological processes such as light response with carbon and nitrogen metabolism. Our study reveals that key elements in the regulation of carbon and nitrogen metabolism, light response and cell cycle identified in higher plants were already established in Chlamydomonas. These conserved elements are not only limited to transcription factors, regulators and their targets, but also include the cis-regulatory elements recognized by them.

  1. Yellow lupin (Lupinus luteus L.) transcriptome sequencing: molecular marker development and comparative studies

    PubMed Central

    2012-01-01

    Background Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454-expresed sequence tag (EST) libraries, carried out comparative studies between L. luteus and model legume species, developed a comprehensive set of EST-simple sequence repeat (SSR) markers, and validated their utility on diversity studies and transferability to related species. Results Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for L1 (young leaves, buds and flowers) and L2 (immature seeds) EST- libraries. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession’s origin. Conclusion L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection. PMID:22920992

  2. Yellow lupin (Lupinus luteus L.) transcriptome sequencing: molecular marker development and comparative studies.

    PubMed

    Parra-González, Lorena B; Aravena-Abarzúa, Gabriela A; Navarro-Navarro, Cristell S; Udall, Joshua; Maughan, Jeff; Peterson, Louis M; Salvo-Garrido, Haroldo E; Maureira-Butler, Iván J

    2012-08-24

    Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454-expresed sequence tag (EST) libraries, carried out comparative studies between L. luteus and model legume species, developed a comprehensive set of EST-simple sequence repeat (SSR) markers, and validated their utility on diversity studies and transferability to related species. Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for L1 (young leaves, buds and flowers) and L2 (immature seeds) EST- libraries. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession's origin. L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection.

  3. Searching for resistance genes to Bursaphelenchus xylophilus using high throughput screening.

    PubMed

    Santos, Carla S; Pinheiro, Miguel; Silva, Ana I; Egas, Conceição; Vasconcelos, Marta W

    2012-11-07

    Pine wilt disease (PWD), caused by the pinewood nematode (PWN; Bursaphelenchus xylophilus), damages and kills pine trees and is causing serious economic damage worldwide. Although the ecological mechanism of infestation is well described, the plant's molecular response to the pathogen is not well known. This is due mainly to the lack of genomic information and the complexity of the disease. High throughput sequencing is now an efficient approach for detecting the expression of genes in non-model organisms, thus providing valuable information in spite of the lack of the genome sequence. In an attempt to unravel genes potentially involved in the pine defense against the pathogen, we hereby report the high throughput comparative sequence analysis of infested and non-infested stems of Pinus pinaster (very susceptible to PWN) and Pinus pinea (less susceptible to PWN). Four cDNA libraries from infested and non-infested stems of P. pinaster and P. pinea were sequenced in a full 454 GS FLX run, producing a total of 2,083,698 reads. The putative amino acid sequences encoded by the assembled transcripts were annotated according to Gene Ontology, to assign Pinus contigs into Biological Processes, Cellular Components and Molecular Functions categories. Most of the annotated transcripts corresponded to Picea genes-25.4-39.7%, whereas a smaller percentage, matched Pinus genes, 1.8-12.8%, probably a consequence of more public genomic information available for Picea than for Pinus. The comparative transcriptome analysis showed that when P. pinaster was infested with PWN, the genes malate dehydrogenase, ABA, water deficit stress related genes and PAR1 were highly expressed, while in PWN-infested P. pinea, the highly expressed genes were ricin B-related lectin, and genes belonging to the SNARE and high mobility group families. Quantitative PCR experiments confirmed the differential gene expression between the two pine species. Defense-related genes triggered by nematode infestation were detected in both P. pinaster and P. pinea transcriptomes utilizing 454 pyrosequencing technology. P. pinaster showed higher abundance of genes related to transcriptional regulation, terpenoid secondary metabolism (including some with nematicidal activity) and pathogen attack. P. pinea showed higher abundance of genes related to oxidative stress and higher levels of expression in general of stress responsive genes. This study provides essential information about the molecular defense mechanisms utilized by P. pinaster and P. pinea against PWN infestation and contributes to a better understanding of PWD.

  4. Genetic Inventory Task Final Report. Volume 2

    NASA Technical Reports Server (NTRS)

    Venkateswaran, Kasthuri; LaDuc, Myron T.; Vaishampayan, Parag

    2012-01-01

    Contaminant terrestrial microbiota could profoundly impact the scientific integrity of extraterrestrial life-detection experiments. It is therefore important to know what organisms persist on spacecraft surfaces so that their presence can be eliminated or discriminated from authentic extraterrestrial biosignatures. Although there is a growing understanding of the biodiversity associated with spacecraft and cleanroom surfaces, it remains challenging to assess the risk of these microbes confounding life-detection or sample-return experiments. A key challenge is to provide a comprehensive inventory of microbes present on spacecraft surfaces. To assess the phylogenetic breadth of microorganisms on spacecraft and associated surfaces, the Genetic Inventory team used three technologies: conventional cloning techniques, PhyloChip DNA microarrays, and 454 tag-encoded pyrosequencing, together with a methodology to systematically collect, process, and archive nucleic acids. These three analysis methods yielded considerably different results: Traditional approaches provided the least comprehensive assessment of microbial diversity, while PhyloChip and pyrosequencing illuminated more diverse microbial populations. The overall results stress the importance of selecting sample collection and processing approaches based on the desired target and required level of detection. The DNA archive generated in this study can be made available to future researchers as genetic-inventory-oriented technologies further mature.

  5. Changes in the bacterial community of soybean rhizospheres during growth in the field.

    PubMed

    Sugiyama, Akifumi; Ueda, Yoshikatsu; Zushi, Takahiro; Takase, Hisabumi; Yazaki, Kazufumi

    2014-01-01

    Highly diverse communities of bacteria inhabiting soybean rhizospheres play pivotal roles in plant growth and crop production; however, little is known about the changes that occur in these communities during growth. We used both culture-dependent physiological profiling and culture independent DNA-based approaches to characterize the bacterial communities of the soybean rhizosphere during growth in the field. The physiological properties of the bacterial communities were analyzed by a community-level substrate utilization assay with BioLog Eco plates, and the composition of the communities was assessed by gene pyrosequencing. Higher metabolic capabilities were found in rhizosphere soil than in bulk soil during all stages of the BioLog assay. Pyrosequencing analysis revealed that differences between the bacterial communities of rhizosphere and bulk soils at the phylum level; i.e., Proteobacteria were increased, while Acidobacteria and Firmicutes were decreased in rhizosphere soil during growth. Analysis of operational taxonomic units showed that the bacterial communities of the rhizosphere changed significantly during growth, with a higher abundance of potential plant growth promoting rhizobacteria, including Bacillus, Bradyrhizobium, and Rhizobium, in a stage-specific manner. These findings demonstrated that rhizosphere bacterial communities were changed during soybean growth in the field.

  6. Performance of commercial platforms for rapid genotyping of polymorphisms affecting warfarin dose.

    PubMed

    King, Cristi R; Porche-Sorbet, Rhonda M; Gage, Brian F; Ridker, Paul M; Renaud, Yannick; Phillips, Michael S; Eby, Charles

    2008-06-01

    Initiation of warfarin therapy is associated with bleeding owing to its narrow therapeutic window and unpredictable therapeutic dose. Pharmacogenetic-based dosing algorithms can improve accuracy of initial warfarin dosing but require rapid genotyping for cytochrome P-450 2C9 (CYP2C9) *2 and *3 single nucleotide polymorphisms (SNPs) and a vitamin K epoxide reductase (VKORC1) SNP. We evaluated 4 commercial systems: INFINITI analyzer (AutoGenomics, Carlsbad, CA), Invader assay (Third Wave Technologies, Madison, WI), Tag-It Mutation Detection assay (Luminex Molecular Diagnostics, formerly Tm Bioscience, Toronto, Canada), and Pyrosequencing (Biotage, Uppsala, Sweden). We genotyped 112 DNA samples and resolved any discrepancies with bidirectional sequencing. The INFINITI analyzer was 100% accurate for all SNPs and required 8 hours. Invader and Tag-It were 100% accurate for CYP2C9 SNPs, 99% accurate for VKORC1 -1639/3673 SNP, and required 3 hours and 8 hours, respectively. Pyrosequencing was 99% accurate for CYP2C9 *2, 100% accurate for CYP2C9 *3, and 100% accurate for VKORC1 and required 4 hours. Current commercial platforms provide accurate and rapid genotypes for pharmacogenetic dosing during initiation of warfarin therapy.

  7. Bacterial communities in the gut and reproductive organs of Bactrocera minax (Diptera: Tephritidae) based on 454 pyrosequencing.

    PubMed

    Wang, Ailin; Yao, Zhichao; Zheng, Weiwei; Zhang, Hongyu

    2014-01-01

    The citrus fruit fly Bactrocera minax is associated with diverse bacterial communities. We used a 454 pyrosequencing technology to study in depth the microbial communities associated with gut and reproductive organs of Bactrocera minax. Our dataset consisted of 100,749 reads with an average length of 400 bp. The saturated rarefaction curves and species richness indices indicate that the sampling was comprehensive. We found highly diverse bacterial communities, with individual sample containing approximately 361 microbial operational taxonomic units (OTUs). A total of 17 bacterial phyla were obtained from the flies. A phylogenetic analysis of 16S rDNA revealed that Proteobacteria was dominant in all samples (75%-95%). Actinobacteria and Firmicutes were also commonly found in the total clones. Klebsiella, Citrobacter, Enterobacter, and Serratia were the major genera. However, bacterial diversity (Chao1, Shannon and Simpson indices) and community structure (PCA analysis) varied across samples. Female ovary has the most diverse bacteria, followed by male testis, and the bacteria diversity of reproductive organs is richer than that of the gut. The observed variation can be caused by sex and tissue, possibly to meet the host's physiological demands.

  8. DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.

    PubMed

    Sun, Zhe; Wang, Ting; Deng, Ke; Wang, Xiao-Feng; Lafyatis, Robert; Ding, Ying; Hu, Ming; Chen, Wei

    2018-01-01

    Single cell transcriptome sequencing (scRNA-Seq) has become a revolutionary tool to study cellular and molecular processes at single cell resolution. Among existing technologies, the recently developed droplet-based platform enables efficient parallel processing of thousands of single cells with direct counting of transcript copies using Unique Molecular Identifier (UMI). Despite the technology advances, statistical methods and computational tools are still lacking for analyzing droplet-based scRNA-Seq data. Particularly, model-based approaches for clustering large-scale single cell transcriptomic data are still under-explored. We developed DIMM-SC, a Dirichlet Mixture Model for clustering droplet-based Single Cell transcriptomic data. This approach explicitly models UMI count data from scRNA-Seq experiments and characterizes variations across different cell clusters via a Dirichlet mixture prior. We performed comprehensive simulations to evaluate DIMM-SC and compared it with existing clustering methods such as K-means, CellTree and Seurat. In addition, we analyzed public scRNA-Seq datasets with known cluster labels and in-house scRNA-Seq datasets from a study of systemic sclerosis with prior biological knowledge to benchmark and validate DIMM-SC. Both simulation studies and real data applications demonstrated that overall, DIMM-SC achieves substantially improved clustering accuracy and much lower clustering variability compared to other existing clustering methods. More importantly, as a model-based approach, DIMM-SC is able to quantify the clustering uncertainty for each single cell, facilitating rigorous statistical inference and biological interpretations, which are typically unavailable from existing clustering methods. DIMM-SC has been implemented in a user-friendly R package with a detailed tutorial available on www.pitt.edu/∼wec47/singlecell.html. wei.chen@chp.edu or hum@ccf.org. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  9. Understanding the immune system architecture and transcriptome responses to southern rice black-streaked dwarf virus in Sogatella furcifera.

    PubMed

    Wang, Lin; Tang, Nan; Gao, Xinlei; Guo, Dongyang; Chang, Zhaoxia; Fu, Yating; Akinyemi, Ibukun A; Wu, Qingfa

    2016-11-02

    Sogatella furcifera, the white-backed planthopper (WBPH), has become one of the most destructive pests in rice production owing to its plant sap-sucking behavior and efficient transmission of Southern rice black-streaked dwarf virus (SRBSDV) in a circulative, propagative and persistent manner. The dynamic and complex SRBSDV-WBPH-rice plant interaction is still poorly understood. In this study, based on a homology-based genome-wide analysis, 348 immune-related genes belonging to 28 families were identified in WBPH. A transcriptome analysis of non-viruliferous (NVF) and viruliferous groups with high viral titers (HVT) and median viral titers (MVT) revealed that feeding on SRBSDV-infected rice plants has a significant impact on gene expression, regardless of viral titers in insects. We identified 278 up-regulated and 406 down-regulated genes shared among the NVF, MVT, and HVT groups and detected significant down-regulation of primary metabolism-related genes and oxidoreductase. In viruliferous WBPH with viral titer-specific transcriptome changes, 1,906 and 1,467 genes exhibited strict monotonically increasing and decreasing expression, respectively. The RNAi pathway was the major antiviral response to increasing viral titers among diverse immune responses. These results clarify the responses of immune genes and the transcriptome of WBPH to SRBSDV and improve our knowledge of the functional relationship between pathogen, vector, and host.

  10. The Urinary Bladder Transcriptome and Proteome Defined by Transcriptomics and Antibody-Based Profiling

    PubMed Central

    Habuka, Masato; Fagerberg, Linn; Hallström, Björn M.; Pontén, Fredrik; Yamamoto, Tadashi; Uhlen, Mathias

    2015-01-01

    To understand functions and diseases of urinary bladder, it is important to define its molecular constituents and their roles in urinary bladder biology. Here, we performed genome-wide deep RNA sequencing analysis of human urinary bladder samples and identified genes up-regulated in the urinary bladder by comparing the transcriptome data to those of all other major human tissue types. 90 protein-coding genes were elevated in the urinary bladder, either with enhanced expression uniquely in the urinary bladder or elevated expression together with at least one other tissue (group enriched). We further examined the localization of these proteins by immunohistochemistry and tissue microarrays and 20 of these 90 proteins were localized to the whole urothelium with a majority not yet described in the context of the urinary bladder. Four additional proteins were found specifically in the umbrella cells (Uroplakin 1a, 2, 3a, and 3b), and three in the intermediate/basal cells (KRT17, PCP4L1 and ATP1A4). 61 of the 90 elevated genes have not been previously described in the context of urinary bladder and the corresponding proteins are interesting targets for more in-depth studies. In summary, an integrated omics approach using transcriptomics and antibody-based profiling has been used to define a comprehensive list of proteins elevated in the urinary bladder. PMID:26694548

  11. Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data.

    PubMed

    Aliper, Alexander; Plis, Sergey; Artemov, Artem; Ulloa, Alvaro; Mamoshina, Polina; Zhavoronkov, Alex

    2016-07-05

    Deep learning is rapidly advancing many areas of science and technology with multiple success stories in image, text, voice and video recognition, robotics, and autonomous driving. In this paper we demonstrate how deep neural networks (DNN) trained on large transcriptional response data sets can classify various drugs to therapeutic categories solely based on their transcriptional profiles. We used the perturbation samples of 678 drugs across A549, MCF-7, and PC-3 cell lines from the LINCS Project and linked those to 12 therapeutic use categories derived from MeSH. To train the DNN, we utilized both gene level transcriptomic data and transcriptomic data processed using a pathway activation scoring algorithm, for a pooled data set of samples perturbed with different concentrations of the drug for 6 and 24 hours. In both pathway and gene level classification, DNN achieved high classification accuracy and convincingly outperformed the support vector machine (SVM) model on every multiclass classification problem, however, models based on pathway level data performed significantly better. For the first time we demonstrate a deep learning neural net trained on transcriptomic data to recognize pharmacological properties of multiple drugs across different biological systems and conditions. We also propose using deep neural net confusion matrices for drug repositioning. This work is a proof of principle for applying deep learning to drug discovery and development.

  12. Deep learning applications for predicting pharmacological properties of drugs and drug repurposing using transcriptomic data

    PubMed Central

    Aliper, Alexander; Plis, Sergey; Artemov, Artem; Ulloa, Alvaro; Mamoshina, Polina; Zhavoronkov, Alex

    2016-01-01

    Deep learning is rapidly advancing many areas of science and technology with multiple success stories in image, text, voice and video recognition, robotics and autonomous driving. In this paper we demonstrate how deep neural networks (DNN) trained on large transcriptional response data sets can classify various drugs to therapeutic categories solely based on their transcriptional profiles. We used the perturbation samples of 678 drugs across A549, MCF‐7 and PC‐3 cell lines from the LINCS project and linked those to 12 therapeutic use categories derived from MeSH. To train the DNN, we utilized both gene level transcriptomic data and transcriptomic data processed using a pathway activation scoring algorithm, for a pooled dataset of samples perturbed with different concentrations of the drug for 6 and 24 hours. In both gene and pathway level classification, DNN convincingly outperformed support vector machine (SVM) model on every multiclass classification problem, however, models based on a pathway level classification perform better. For the first time we demonstrate a deep learning neural net trained on transcriptomic data to recognize pharmacological properties of multiple drugs across different biological systems and conditions. We also propose using deep neural net confusion matrices for drug repositioning. This work is a proof of principle for applying deep learning to drug discovery and development. PMID:27200455

  13. Transcriptome dynamics through alternative polyadenylation in developmental and environmental responses in plants revealed by deep sequencing

    PubMed Central

    Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.

    2011-01-01

    Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626

  14. Microfluidic single-cell whole-transcriptome sequencing.

    PubMed

    Streets, Aaron M; Zhang, Xiannian; Cao, Chen; Pang, Yuhong; Wu, Xinglong; Xiong, Liang; Yang, Lu; Fu, Yusi; Zhao, Liang; Tang, Fuchou; Huang, Yanyi

    2014-05-13

    Single-cell whole-transcriptome analysis is a powerful tool for quantifying gene expression heterogeneity in populations of cells. Many techniques have, thus, been recently developed to perform transcriptome sequencing (RNA-Seq) on individual cells. To probe subtle biological variation between samples with limiting amounts of RNA, more precise and sensitive methods are still required. We adapted a previously developed strategy for single-cell RNA-Seq that has shown promise for superior sensitivity and implemented the chemistry in a microfluidic platform for single-cell whole-transcriptome analysis. In this approach, single cells are captured and lysed in a microfluidic device, where mRNAs with poly(A) tails are reverse-transcribed into cDNA. Double-stranded cDNA is then collected and sequenced using a next generation sequencing platform. We prepared 94 libraries consisting of single mouse embryonic cells and technical replicates of extracted RNA and thoroughly characterized the performance of this technology. Microfluidic implementation increased mRNA detection sensitivity as well as improved measurement precision compared with tube-based protocols. With 0.2 M reads per cell, we were able to reconstruct a majority of the bulk transcriptome with 10 single cells. We also quantified variation between and within different types of mouse embryonic cells and found that enhanced measurement precision, detection sensitivity, and experimental throughput aided the distinction between biological variability and technical noise. With this work, we validated the advantages of an early approach to single-cell RNA-Seq and showed that the benefits of combining microfluidic technology with high-throughput sequencing will be valuable for large-scale efforts in single-cell transcriptome analysis.

  15. Phylogenetic Characterization of Fecal Microbial Communities of Dogs Fed Diets with or without Supplemental Dietary Fiber Using 454 Pyrosequencing

    PubMed Central

    Middelbos, Ingmar S.; Vester Boler, Brittany M.; Qu, Ani; White, Bryan A.; Swanson, Kelly S.; Fahey, George C.

    2010-01-01

    Background Dogs suffer from many of the same maladies as humans that may be affected by the gut microbiome, but knowledge of the canine microbiome is incomplete. This work aimed to use 16S rDNA tag pyrosequencing to phylogenetically characterize hindgut microbiome in dogs and determine how consumption of dietary fiber affects community structure. Principal Findings Six healthy adult dogs were used in a crossover design. A control diet without supplemental fiber and a beet pulp-supplemented (7.5%) diet were fed. Fecal DNA was extracted and the V3 hypervariable region of the microbial 16S rDNA gene amplified using primers suitable for 454-pyrosequencing. Microbial diversity was assessed on random 2000-sequence subsamples of individual and pooled DNA samples by diet. Our dataset comprised 77,771 reads with an average length of 141 nt. Individual samples contained approximately 129 OTU, with Fusobacteria (23 – 40% of reads), Firmicutes (14 – 28% of reads) and Bacteroidetes (31 – 34% of reads) being co-dominant phyla. Feeding dietary fiber generally decreased Fusobacteria and increased Firmicutes, but these changes were not equally apparent in all dogs. UniFrac analysis revealed that structure of the gut microbiome was affected by diet and Firmicutes appeared to play a strong role in by-diet clustering. Conclusions Our data suggest three co-dominant bacterial phyla in the canine hindgut. Furthermore, a relatively small amount of dietary fiber changed the structure of the gut microbiome detectably. Our data are among the first to characterize the healthy canine gut microbiome using pyrosequencing and provide a basis for studies focused on devising dietary interventions for microbiome-associated diseases. PMID:20339542

  16. Clinical Implications of Quantitative JAK2 V617F Analysis using Droplet Digital PCR in Myeloproliferative Neoplasms

    PubMed Central

    Lee, Eunyoung; Lee, Kyoung Joo; Park, Hyein; Chung, Jin Young; Lee, Mi-Na; Chang, Myung Hee; Yoo, Jongha; Lee, Hyewon

    2018-01-01

    Background JAK2 V617F is the most common mutation in myeloproliferative neoplasms (MPNs) and is a major diagnostic criterion. Mutation quantification is useful for classifying patients with MPN into subgroups and for prognostic prediction. Droplet digital PCR (ddPCR) can provide accurate and reproducible quantitative analysis of DNA. This study was designed to verify the correlation of ddPCR with pyrosequencing results in the diagnosis of MPN and to investigate clinical implications of the mutational burden. Methods Peripheral blood or bone marrow samples were obtained from 56 patients newly diagnosed with MPN or previously diagnosed with MPN but not yet indicated for JAK2 inhibitor treatment between 2012 and 2016. The JAK2 V617F mutation was detected by pyrosequencing as a diagnostic work-up. The same samples were used for ddPCR to determine the correlation between assays and establish a detection sensitivity cut-off. Clinical and hematologic aspects were reviewed. Results Forty-two (75%) and 46 (82.1%) patients were positive for JAK2 V617F by pyrosequencing and ddPCR, respectively. The mean mutated allele frequency at diagnosis was 37.5±30.1% and was 40.7±31.2% with ddPCR, representing a strong correlation (r=0.9712, P<0.001). Follow-up samples were available for 12 patients, including eight that were JAK2 V617F-positive. Of these, mutational burden reduction after treatment was observed in six patients (75%), consistent with trends of hematologic improvement. Conclusions Quantitative analysis of the JAK2 V617F mutation using ddPCR was highly correlated with pyrosequencing data and may reflect the clinical response to treatment. PMID:29214759

  17. Bacterial and diazotrophic diversities of endophytes in Dendrobium catenatum determined through barcoded pyrosequencing

    PubMed Central

    Li, Ou; Sun, Lihua; Guan, Chenglin; Kong, Dedong

    2017-01-01

    As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ) to 96.08% (YN), and Delftia (10.39–38.42%), Burkholderia (2.71–15.98%), Escherichia/Shigella (4.90–25.12%), Pseudomonas (2.68–30.72%) and Sphingomonas (1.83–2.05%) dominated in four planting bases. Pseudomonas (17.94–22.06%), Escherichia/Shigella (6.59–11.59%), Delftia (9.65–22.14%) and Burkholderia (3.12–11.05%) dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples. PMID:28931073

  18. Consensus-phenotype integration of transcriptomic and metabolomic data implies a role for metabolism in the chemosensitivity of tumour cells.

    PubMed

    Cavill, Rachel; Kamburov, Atanas; Ellis, James K; Athersuch, Toby J; Blagrove, Marcus S C; Herwig, Ralf; Ebbels, Timothy M D; Keun, Hector C

    2011-03-01

    Using transcriptomic and metabolomic measurements from the NCI60 cell line panel, together with a novel approach to integration of molecular profile data, we show that the biochemical pathways associated with tumour cell chemosensitivity to platinum-based drugs are highly coincident, i.e. they describe a consensus phenotype. Direct integration of metabolome and transcriptome data at the point of pathway analysis improved the detection of consensus pathways by 76%, and revealed associations between platinum sensitivity and several metabolic pathways that were not visible from transcriptome analysis alone. These pathways included the TCA cycle and pyruvate metabolism, lipoprotein uptake and nucleotide synthesis by both salvage and de novo pathways. Extending the approach across a wide panel of chemotherapeutics, we confirmed the specificity of the metabolic pathway associations to platinum sensitivity. We conclude that metabolic phenotyping could play a role in predicting response to platinum chemotherapy and that consensus-phenotype integration of molecular profiling data is a powerful and versatile tool for both biomarker discovery and for exploring the complex relationships between biological pathways and drug response.

  19. Comparative Transcriptomes and EVO-DEVO Studies Depending on Next Generation Sequencing.

    PubMed

    Liu, Tiancheng; Yu, Lin; Liu, Lei; Li, Hong; Li, Yixue

    2015-01-01

    High throughput technology has prompted the progressive omics studies, including genomics and transcriptomics. We have reviewed the improvement of comparative omic studies, which are attributed to the high throughput measurement of next generation sequencing technology. Comparative genomics have been successfully applied to evolution analysis while comparative transcriptomics are adopted in comparison of expression profile from two subjects by differential expression or differential coexpression, which enables their application in evolutionary developmental biology (EVO-DEVO) studies. EVO-DEVO studies focus on the evolutionary pressure affecting the morphogenesis of development and previous works have been conducted to illustrate the most conserved stages during embryonic development. Old measurements of these studies are based on the morphological similarity from macro view and new technology enables the micro detection of similarity in molecular mechanism. Evolutionary model of embryo development, which includes the "funnel-like" model and the "hourglass" model, has been evaluated by combination of these new comparative transcriptomic methods with prior comparative genomic information. Although the technology has promoted the EVO-DEVO studies into a new era, technological and material limitation still exist and further investigations require more subtle study design and procedure.

  20. Transcriptome alterations in zebrafish embryos after exposure to environmental estrogens and anti-androgens can reveal endocrine disruption.

    PubMed

    Schiller, Viktoria; Wichmann, Arne; Kriehuber, Ralf; Schäfers, Christoph; Fischer, Rainer; Fenske, Martina

    2013-12-01

    Exposure to environmental chemicals known as endocrine disruptors (EDs) is in many cases associated with an unpredictable hazard for wildlife and human health. The identification of endocrine disruptive properties of chemicals certain to enter the aquatic environment relies on toxicity tests with fish, assessing adverse effects on reproduction and sexual development. The demand for quick, reliable ED assays favored the use of fish embryos as alternative test organisms. We investigated the application of a transcriptomics-based assay for estrogenic and anti-androgenic chemicals with zebrafish embryos. Two reference compounds, 17α-ethinylestradiol and flutamide, were tested to evaluate the effects on development and the transcriptome after 48h-exposures. Comparison of the transcriptome response with other estrogenic and anti-androgenic compounds (genistein, bisphenol A, methylparaben, linuron, prochloraz, propanil) showed commonalities and differences in regulated pathways, enabling us to classify the estrogenic and anti-androgenic potencies. This demonstrates that different mechanism of ED can be assessed already in fish embryos. Copyright © 2013 Elsevier Inc. All rights reserved.

  1. A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.

    PubMed

    Chen, Shi-Yi; Deng, Feilong; Jia, Xianbo; Li, Cao; Lai, Song-Jia

    2017-08-09

    It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.

  2. A house finch (Haemorhous mexicanus) spleen transcriptome reveals intra- and interspecific patterns of gene expression, alternative splicing and genetic diversity in passerines.

    PubMed

    Zhang, Qu; Hill, Geoffrey E; Edwards, Scott V; Backström, Niclas

    2014-04-24

    With its plumage color dimorphism and unique history in North America, including a recent population expansion and an epizootic of Mycoplasma gallisepticum (MG), the house finch (Haemorhous mexicanus) is a model species for studying sexual selection, plumage coloration and host-parasite interactions. As part of our ongoing efforts to make available genomic resources for this species, here we report a transcriptome assembly derived from genes expressed in spleen. We characterize transcriptomes from two populations with different histories of demography and disease exposure: a recently founded population in the eastern US that has been exposed to MG for over a decade and a native population from the western range that has never been exposed to MG. We utilize this resource to quantify conservation in gene expression in passerine birds over approximately 50 MY by comparing splenic expression profiles for 9,646 house finch transcripts and those from zebra finch and find that less than half of all genes expressed in spleen in either species are expressed in both species. Comparative gene annotations from several vertebrate species suggest that the house finch transcriptomes contain ~15 genes not yet found in previously sequenced vertebrate genomes. The house finch transcriptomes harbour ~85,000 SNPs, ~20,000 of which are non-synonymous. Although not yet validated by biological or technical replication, we identify a set of genes exhibiting differences between populations in gene expression (n = 182; 2% of all transcripts), allele frequencies (76 FST ouliers) and alternative splicing as well as genes with several fixed non-synonymous substitutions; this set includes genes with functions related to double-strand break repair and immune response. The two house finch spleen transcriptome profiles will add to the increasing data on genome and transcriptome sequence information from natural populations. Differences in splenic expression between house finch and zebra finch imply either significant evolutionary turnover of splenic expression patterns or different physiological states of the individuals examined. The transcriptome resource will enhance the potential to annotate an eventual house finch genome, and the set of gene-based high-quality SNPs will help clarify the genetic underpinnings of host-pathogen interactions and sexual selection.

  3. Genome Sequence of Bacillus cereus Strain TG1-6, a Plant-Beneficial Rhizobacterium That Is Highly Salt Tolerant

    PubMed Central

    2018-01-01

    ABSTRACT The complete genome sequence of Bacillus cereus strain TG1-6, which is a highly salt-tolerant rhizobacterium that enhances plant tolerance to drought stress, is reported here. The sequencing process was performed based on a combination of pyrosequencing and single-molecule sequencing. The complete genome is estimated to be approximately 5.42 Mb, containing a total of 5,610 predicted protein-coding DNA sequences (CDSs). PMID:29748401

  4. RNA-seq analysis of broiler liver transcriptome reveals novel responses to high ambient temperature.

    PubMed

    Coble, Derrick J; Fleming, Damarius; Persia, Michael E; Ashwell, Chris M; Rothschild, Max F; Schmidt, Carl J; Lamont, Susan J

    2014-12-10

    In broilers, high ambient temperature can result in reduced feed consumption, digestive inefficiency, impaired metabolism, and even death. The broiler sector of the U.S. poultry industry incurs approximately $52 million in heat-related losses annually. The objective of this study is to characterize the effects of cyclic high ambient temperature on the transcriptome of a metabolically active organ, the liver. This study provides novel insight into the effects of high ambient temperature on metabolism in broilers, because it is the first reported RNA-seq study to characterize the effect of heat on the transcriptome of a metabolic-related tissue. This information provides a platform for future investigations to further elucidate physiologic responses to high ambient temperature and seek methods to ameliorate the negative impacts of heat. Transcriptome sequencing of the livers of 8 broiler males using Illumina HiSeq 2000 technology resulted in 138 million, 100-base pair single end reads, yielding a total of 13.8 gigabases of sequence. Forty genes were differentially expressed at a significance level of P-value < 0.05 and a fold-change ≥ 2 in response to a week of cyclic high ambient temperature with 27 down-regulated and 13 up-regulated genes. Two gene networks were created from the function-based Ingenuity Pathway Analysis (IPA) of the differentially expressed genes: "Cell Signaling" and "Endocrine System Development and Function". The gene expression differences in the liver transcriptome of the heat-exposed broilers reflected physiological responses to decrease internal temperature, reduce hyperthermia-induced apoptosis, and promote tissue repair. Additionally, the differential gene expression revealed a physiological response to regulate the perturbed cellular calcium levels that can result from high ambient temperature exposure. Exposure to cyclic high ambient temperature results in changes at the metabolic, physiologic, and cellular level that can be characterized through RNA-seq analysis of the liver transcriptome of broilers. The findings highlight specific physiologic mechanisms by which broilers reduce the effects of exposure to high ambient temperature. This information provides a foundation for future investigations into the gene networks involved in the broiler stress response and for development of strategies to ameliorate the negative impacts of heat on animal production and welfare.

  5. Identifying potential RNAi targets in grain aphid (Sitobion avenae F.) based on transcriptome profiling of its alimentary canal after feeding on wheat plants.

    PubMed

    Zhang, Min; Zhou, Yuwen; Wang, Hui; Jones, Huw; Gao, Qiang; Wang, Dahai; Ma, Youzhi; Xia, Lanqin

    2013-08-16

    The grain aphid (Sitobion avenae F.) is a major agricultural pest which causes significant yield losses of wheat in China, Europe and North America annually. Transcriptome profiling of the grain aphid alimentary canal after feeding on wheat plants could provide comprehensive gene expression information involved in feeding, ingestion and digestion. Furthermore, selection of aphid-specific RNAi target genes would be essential for utilizing a plant-mediated RNAi strategy to control aphids via a non-toxic mode of action. However, due to the tiny size of the alimentary canal and lack of genomic information on grain aphid as a whole, selection of the RNAi targets is a challenging task that as far as we are aware, has never been documented previously. In this study, we performed de novo transcriptome assembly and gene expression analyses of the alimentary canals of grain aphids before and after feeding on wheat plants using Illumina RNA sequencing. The transcriptome profiling generated 30,427 unigenes with an average length of 664 bp. Furthermore, comparison of the transcriptomes of alimentary canals of pre- and post feeding grain aphids indicated that 5490 unigenes were differentially expressed, among which, diverse genes and/or pathways were identified and annotated. Based on the RPKM values of these unigenes, 16 of them that were significantly up or down-regulated upon feeding were selected for dsRNA artificial feeding assay. Of these, 5 unigenes led to higher mortality and developmental stunting in an artificial feeding assay due to the down-regulation of the target gene expression. Finally, by adding fluorescently labelled dsRNA into the artificial diet, the spread of fluorescence signal in the whole body tissues of grain aphid was observed. Comparison of the transcriptome profiles of the alimentary canals of pre- and post-feeding grain aphids on wheat plants provided comprehensive gene expression information that could facilitate our understanding of the molecular mechanisms underlying feeding, ingestion and digestion. Furthermore, five novel and effective potential RNAi target genes were identified in grain aphid for the first time. This finding would provide a fundamental basis for aphid control in wheat through plant mediated RNAi strategy.

  6. Transcriptome Assembly, Gene Annotation and Tissue Gene Expression Atlas of the Rainbow Trout

    PubMed Central

    Salem, Mohamed; Paneru, Bam; Al-Tobasei, Rafet; Abdouni, Fatima; Thorgaard, Gary H.; Rexroad, Caird E.; Yao, Jianbo

    2015-01-01

    Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, transcriptome reference sequences were reported using data from different sources. Although the previous work added a great wealth of sequences, a complete and well-annotated transcriptome is still needed. In addition, gene expression in different tissues was not completely addressed in the previous studies. In this study, non-normalized cDNA libraries were sequenced from 13 different tissues of a single doubled haploid rainbow trout from the same source used for the rainbow trout genome sequence. A total of ~1.167 billion paired-end reads were de novo assembled using the Trinity RNA-Seq assembler yielding 474,524 contigs > 500 base-pairs. Of them, 287,593 had homologies to the NCBI non-redundant protein database. The longest contig of each cluster was selected as a reference, yielding 44,990 representative contigs. A total of 4,146 contigs (9.2%), including 710 full-length sequences, did not match any mRNA sequences in the current rainbow trout genome reference. Mapping reads to the reference genome identified an additional 11,843 transcripts not annotated in the genome. A digital gene expression atlas revealed 7,678 housekeeping and 4,021 tissue-specific genes. Expression of about 16,000–32,000 genes (35–71% of the identified genes) accounted for basic and specialized functions of each tissue. White muscle and stomach had the least complex transcriptomes, with high percentages of their total mRNA contributed by a small number of genes. Brain, testis and intestine, in contrast, had complex transcriptomes, with a large numbers of genes involved in their expression patterns. This study provides comprehensive de novo transcriptome information that is suitable for functional and comparative genomics studies in rainbow trout, including annotation of the genome. PMID:25793877

  7. Use of homologous and heterologous gene expression profiling tools to characterize transcription dynamics during apple fruit maturation and ripening.

    PubMed

    Costa, Fabrizio; Alba, Rob; Schouten, Henk; Soglio, Valeria; Gianfranceschi, Luca; Serra, Sara; Musacchi, Stefano; Sansavini, Silviero; Costa, Guglielmo; Fei, Zhangjun; Giovannoni, James

    2010-10-25

    Fruit development, maturation and ripening consists of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-methylcyclopropene. To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated.The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the later using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as definition of ethylene-dependent transcriptome changes. Comparison with tomato fruit maturation and ethylene responsive transcriptome activity facilitated identification of putative conserved orthologous ripening-related genes, which serve as an initial set of candidates for assessing conservation of gene activity across genomes of fruit bearing plant species.

  8. De novo transcriptome assembly analysis of weed Apera spica-venti from seven tissues and growth stages.

    PubMed

    Babineau, Marielle; Mahmood, Khalid; Mathiassen, Solvejg K; Kudsk, Per; Kristensen, Michael

    2017-02-06

    Loose silky bentgrass (Apera spica-venti) is an important weed in Europe with a recent increase in herbicide resistance cases. The lack of genetic information about this noxious weed limits its biological understanding such as growth, reproduction, genetic variation, molecular ecology and metabolic herbicide resistance. This study produced a reference transcriptome for A. spica-venti from different tissues (leaf, root, stem) and various growth stages (seed at phenological stages 05, 07, 08, 09). The de novo assembly was performed on individual and combined dataset followed by functional annotations. Individual transcripts and gene families involved in metabolic based herbicide resistance were identified. Eight separate transcriptome assemblies were performed and compared. The combined transcriptome assembly consists of 83,349 contigs with an N50 and average contig length of 762 and 658 bp, respectively. This dataset contains 74,724 transcripts consisting of total 54,846,111 bp. Among them 94% had a homologue to UniProtKB, 73% retrieved a GO mapping, and 50% were functionally annotated. Compared with other grass species, A. spica-venti has 26% proteins in common to Brachypodium distachyon, and 41% to Lolium spp. Glycosyltransferases had the highest number of transcripts in each tissue followed by the cytochrome P450s. The GSTF1 and CYP89A2 transcripts were recovered from the majority of tissues and aligned at a maximum of 66 and 30% to proven herbicide resistant allele from Alopecurus myosuroides and Lolium rigidum, respectively. De novo transcriptome assembly enabled the generation of the first reference transcriptome of A. spica-venti. This can serve as stepping stone for understanding the metabolic herbicide resistance as well as the general biology of this problematic weed. Furthermore, this large-scale sequence data is a valuable scientific resource for comparative transcriptome analysis for Poaceae grasses.

  9. Characterization of Bacterial Communities in Venous Insufficiency Wounds by Use of Conventional Culture and Molecular Diagnostic Methods▿

    PubMed Central

    Tuttle, Marie S.; Mostow, Eliot; Mukherjee, Pranab; Hu, Fen Z.; Melton-Kreft, Rachael; Ehrlich, Garth D.; Dowd, Scot E.; Ghannoum, Mahmoud A.

    2011-01-01

    Microbial infections delay wound healing, but the effect of the composition of the wound microbiome on healing parameters is unknown. To better understand bacterial communities in chronic wounds, we analyzed debridement samples from lower-extremity venous insufficiency ulcers using the following: conventional anaerobic and aerobic bacterial cultures; the Ibis T5000 universal biosensor (Abbott Molecular); and 16S 454 FLX titanium series pyrosequencing (Roche). Wound debridement samples were obtained from 10 patients monitored clinically for at least 6 months, at which point 5 of the 10 sampled wounds had healed. Pyrosequencing data revealed significantly higher bacterial abundance and diversity in wounds that had not healed at 6 months. Additionally, Actinomycetales was increased in wounds that had not healed, and Pseudomonadaceae was increased in wounds that had healed by the 6-month follow-up. Baseline wound surface area, duration, or analysis by Ibis or conventional culture did not reveal significant differences between wounds that healed after 6 months and those that did not. Thus, pyrosequencing identified distinctive baseline characteristics of wounds that did not heal by the 6-month follow-up, furthering our understanding of potentially unique microbiome characteristics of chronic wounds. PMID:21880958

  10. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren

    2011-01-01

    Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g.more » Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.« less

  11. Next-generation sequencing sheds light on the natural history of hepatitis C infection in patients who fail treatment.

    PubMed

    Abdelrahman, Tamer; Hughes, Joseph; Main, Janice; McLauchlan, John; Thursz, Mark; Thomson, Emma

    2015-01-01

    High rates of sexually transmitted infection and reinfection with hepatitis C virus (HCV) have recently been reported in human immunodeficiency virus (HIV)-infected men who have sex with men and reinfection has also been described in monoinfected injecting drug users. The diagnosis of reinfection has traditionally been based on direct Sanger sequencing of samples pre- and posttreatment, but not on more sensitive deep sequencing techniques. We studied viral quasispecies dynamics in patients who failed standard of care therapy in a high-risk HIV-infected cohort of patients with early HCV infection to determine whether treatment failure was associated with reinfection or recrudescence of preexisting infection. Paired sequences (pre- and posttreatment) were analyzed. The HCV E2 hypervariable region-1 was amplified using nested reverse-transcription polymerase chain reaction (RT-PCR) with indexed genotype-specific primers and the same products were sequenced using both Sanger and 454 pyrosequencing approaches. Of 99 HIV-infected patients with acute HCV treated with 24-48 weeks of pegylated interferon alpha and ribavirin, 15 failed to achieve a sustained virological response (six relapsed, six had a null response, and three had a partial response). Using direct sequencing, 10/15 patients (66%) had evidence of a previously undetected strain posttreatment; in many studies, this is interpreted as reinfection. However, pyrosequencing revealed that 15/15 (100%) of patients had evidence of persisting infection; 6/15 (40%) patients had evidence of a previously undetected variant present in the posttreatment sample in addition to a variant that was detected at baseline. This could represent superinfection or a limitation of the sensitivity of pyrosequencing. In this high-risk group, the emergence of new viral strains following treatment failure is most commonly associated with emerging dominance of preexisting minority variants rather than reinfection. Superinfection may occur in this cohort but reinfection is overestimated by Sanger sequencing. © 2014 The Authors. Hepatology published by Wiley on behalf of the American Association for the Study of Liver Diseases.

  12. Temporal Dynamics of Abundance and Composition of Nitrogen-Fixing Communities across Agricultural Soils

    PubMed Central

    Pereira e Silva, Michele C.; Schloter-Hai, Brigitte; Schloter, Michael; van Elsas, Jan Dirk; Salles, Joana Falcão

    2013-01-01

    Background Despite the fact that the fixation of nitrogen is one of the most significant nutrient processes in the terrestrial ecosystem, a thorough study of the spatial and temporal patterns in the abundance and distribution of N-fixing communities has been missing so far. Methodology/Principal Findings In order to understand the dynamics of diazotrophic communities and their resilience to external changes, we quantified the abundance and characterized the bacterial community structures based on the nifH gene, using real-time PCR, PCR-DGGE and 454-pyrosequencing, across four representative Dutch soils during one growing season. In general, higher nifH gene copy numbers were observed in soils with higher pH than in those with lower pH, but lower numbers were related to increased nitrate and ammonium levels. Results from nifH gene pyrosequencing confirmed the observed PCR-DGGE patterns, which indicated that the N fixers are highly dynamic across time, shifting around 60%. Forward selection on CCA analysis identified N availability as the main driver of these variations, as well as of the evenness of the communities, leading to very unequal communities. Moreover, deep sequencing of the nifH gene revealed that sandy soils (B and D) had the lowest percentage of shared OTUs across time, compared with clayey soils (G and K), indicating the presence of a community under constant change. Cosmopolitan nifH species (present throughout the season) were affiliated with Bradyrhizobium , Azospirillum and Methylocistis, whereas other species increased their abundances progressively over time, when appropriate conditions were met, as was notably the case for Paenibacilus and Burkholderia. Conclusions Our study provides the first in-depth pyrosequencing analysis of the N-fixing community at both spatial and temporal scales, providing insights into the cosmopolitan and specific portions of the nitrogen fixing bacterial communities in soil. PMID:24058578

  13. An Analysis of Thaumarchaeota Populations from the Northern Gulf of Mexico

    PubMed Central

    Tolar, Bradley B.; King, Gary M.; Hollibaugh, James T.

    2013-01-01

    We sampled Thaumarchaeota populations in the northern Gulf of Mexico, including shelf waters under the Mississippi River outflow plume that are subject to recurrent hypoxia. Data from this study allowed us to: (1) test the hypothesis that Thaumarchaeota would be abundant in this region; (2) assess phylogenetic composition of these populations for comparison with other regions; (3) compare the efficacy of quantitative PCR (qPCR) based on primers for 16S rRNA genes (rrs) with primers for genes in the ammonia oxidation (amoA) and carbon fixation (accA, hcd) pathways; (4) compare distributions obtained by qPCR with the relative abundance of Thaumarchaeota rrs in pyrosequenced libraries; (5) compare Thaumarchaeota distributions with environmental variables to help us elucidate the factors responsible for the distributions; (6) compare the distribution of Thaumarchaeota with Nitrite-Oxidizing Bacteria (NOB) to gain insight into the coupling between ammonia and nitrite oxidation. We found up to 108 copies L−1 of Thaumarchaeota rrs in our samples (up to 40% of prokaryotes) by qPCR, with maximum abundance in slope waters at 200–800 m. Thaumarchaeota rrs were also abundant in pyrosequenced libraries and their relative abundance correlated well with values determined by qPCR (r2 = 0.82). Thaumarchaeota populations were strongly stratified by depth. Canonical correspondence analysis using a suite of environmental variables explained 92% of the variance in qPCR-estimated gene abundances. Thaumarchaeota rrs abundance was correlated with salinity and depth, while accA abundance correlated with fluorescence and pH. Correlations of Archaeal amoA abundance with environmental variables were primer-dependent, suggesting differential responses of sub-populations to environmental variables. Bacterial amoA was at the limit of qPCR detection in most samples. NOB and Euryarchaeota rrs were found in the pyrosequenced libraries; NOB distribution was correlated with that of Thaumarchaeota (r2 = 0.49). PMID:23577005

  14. Pyrosequencing-based comparative genome analysis of the nosocomial pathogen Enterococcus faecium and identification of a large transferable pathogenicity island

    PubMed Central

    2010-01-01

    Background The Gram-positive bacterium Enterococcus faecium is an important cause of nosocomial infections in immunocompromized patients. Results We present a pyrosequencing-based comparative genome analysis of seven E. faecium strains that were isolated from various sources. In the genomes of clinical isolates several antibiotic resistance genes were identified, including the vanA transposon that confers resistance to vancomycin in two strains. A functional comparison between E. faecium and the related opportunistic pathogen E. faecalis based on differences in the presence of protein families, revealed divergence in plant carbohydrate metabolic pathways and oxidative stress defense mechanisms. The E. faecium pan-genome was estimated to be essentially unlimited in size, indicating that E. faecium can efficiently acquire and incorporate exogenous DNA in its gene pool. One of the most prominent sources of genomic diversity consists of bacteriophages that have integrated in the genome. The CRISPR-Cas system, which contributes to immunity against bacteriophage infection in prokaryotes, is not present in the sequenced strains. Three sequenced isolates carry the esp gene, which is involved in urinary tract infections and biofilm formation. The esp gene is located on a large pathogenicity island (PAI), which is between 64 and 104 kb in size. Conjugation experiments showed that the entire esp PAI can be transferred horizontally and inserts in a site-specific manner. Conclusions Genes involved in environmental persistence, colonization and virulence can easily be aquired by E. faecium. This will make the development of successful treatment strategies targeted against this organism a challenge for years to come. PMID:20398277

  15. Comparative transcriptomics between Synechococcus PCC 7942 and Synechocystis PCC 6803 provide insights into mechanisms of adaptation to stress.

    DOE PAGES

    Konstantinos, Billis; Billini, Maria; Tripp, Harry J.; ...

    2014-09-23

    Background: Synechococcus sp. PCC 7942 and Synechocystis sp. PCC 6803 are model cyanobacteria from which the metabolism and adaptive responses of other cyanobacteria are inferred. Here we report the gene expression response of these two strains to a variety of nutrient and environmental stresses of varying duration, using transcriptomics. Our data comprise both stranded and 5' enriched libraries in order to elucidate many aspects of the transcriptome. Results: Both organisms were exposed to stress conditions due to nutrient deficiency (inorganic carbon) or change of environmental conditions (salinity, temperature, pH, light) sampled at 1 and 24 hours after the application ofmore » stress. The transcriptome profile of each strain revealed similarities and differences in gene expression for photosynthetic and respiratory electron transport chains and carbon fixation. Transcriptome profiles also helped us improve the structural annotation of the genome and identify possible missed genes (including anti-sense) and determine transcriptional units (operons). Finally, we predicted association of proteins of unknown function biochemical pathways by associating them to well-characterized ones based on their transcript levels correlation. Conclusions: Overall, this study results an informative annotation of those species and the comparative analysis of the response of the two organisms revealed similarities but also significant changes in the way they respond to external stress and the duration of the response« less

  16. De Novo Assembly, Annotation, and Characterization of Root Transcriptomes of Three Caladium Cultivars with a Focus on Necrotrophic Pathogen Resistance/Defense-Related Genes

    PubMed Central

    Cao, Zhe; Deng, Zhanao

    2017-01-01

    Roots are vital to plant survival and crop yield, yet few efforts have been made to characterize the expressed genes in the roots of non-model plants (root transcriptomes). This study was conducted to sequence, assemble, annotate, and characterize the root transcriptomes of three caladium cultivars (Caladium × hortulanum) using RNA-Seq. The caladium cultivars used in this study have different levels of resistance to Pythium myriotylum, the most damaging necrotrophic pathogen to caladium roots. Forty-six to 61 million clean reads were obtained for each caladium root transcriptome. De novo assembly of the reads resulted in approximately 130,000 unigenes. Based on bioinformatic analysis, 71,825 (52.3%) caladium unigenes were annotated for putative functions, 48,417 (67.4%) and 31,417 (72.7%) were assigned to Gene Ontology (GO) and Clusters of Orthologous Groups (COG), respectively, and 46,406 (64.6%) unigenes were assigned to 128 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. A total of 4518 distinct unigenes were observed only in Pythium-resistant “Candidum” roots, of which 98 seemed to be involved in disease resistance and defense responses. In addition, 28,837 simple sequence repeat sites and 44,628 single nucleotide polymorphism sites were identified among the three caladium cultivars. These root transcriptome data will be valuable for further genetic improvement of caladium and related aroids. PMID:28346370

  17. Comparative transcriptomics between Synechococcus PCC 7942 and Synechocystis PCC 6803 provide insights into mechanisms of adaptation to stress.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Konstantinos, Billis; Billini, Maria; Tripp, Harry J.

    Background: Synechococcus sp. PCC 7942 and Synechocystis sp. PCC 6803 are model cyanobacteria from which the metabolism and adaptive responses of other cyanobacteria are inferred. Here we report the gene expression response of these two strains to a variety of nutrient and environmental stresses of varying duration, using transcriptomics. Our data comprise both stranded and 5' enriched libraries in order to elucidate many aspects of the transcriptome. Results: Both organisms were exposed to stress conditions due to nutrient deficiency (inorganic carbon) or change of environmental conditions (salinity, temperature, pH, light) sampled at 1 and 24 hours after the application ofmore » stress. The transcriptome profile of each strain revealed similarities and differences in gene expression for photosynthetic and respiratory electron transport chains and carbon fixation. Transcriptome profiles also helped us improve the structural annotation of the genome and identify possible missed genes (including anti-sense) and determine transcriptional units (operons). Finally, we predicted association of proteins of unknown function biochemical pathways by associating them to well-characterized ones based on their transcript levels correlation. Conclusions: Overall, this study results an informative annotation of those species and the comparative analysis of the response of the two organisms revealed similarities but also significant changes in the way they respond to external stress and the duration of the response« less

  18. Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria.

    PubMed

    Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R; Voß, Björn

    2015-04-22

    In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.

  19. Transcriptome Profiling of Shewanella oneidensis Gene Expression following Exposure to Acidic and Alkaline pH†

    PubMed Central

    Leaphart, Adam B.; Thompson, Dorothea K.; Huang, Katherine; Alm, Eric; Wan, Xiu-Feng; Arkin, Adam; Brown, Steven D.; Wu, Liyou; Yan, Tingfen; Liu, Xueduan; Wickham, Gene S.; Zhou, Jizhong

    2006-01-01

    The molecular response of Shewanella oneidensis MR-1 to variations in extracellular pH was investigated based on genomewide gene expression profiling. Microarray analysis revealed that cells elicited both general and specific transcriptome responses when challenged with environmental acid (pH 4) or base (pH 10) conditions over a 60-min period. Global responses included the differential expression of genes functionally linked to amino acid metabolism, transcriptional regulation and signal transduction, transport, cell membrane structure, and oxidative stress protection. Response to acid stress included the elevated expression of genes encoding glycogen biosynthetic enzymes, phosphate transporters, and the RNA polymerase sigma-38 factor (rpoS), whereas the molecular response to alkaline pH was characterized by upregulation of nhaA and nhaR, which are predicted to encode an Na+/H+ antiporter and transcriptional activator, respectively, as well as sulfate transport and sulfur metabolism genes. Collectively, these results suggest that S. oneidensis modulates multiple transporters, cell envelope components, and pathways of amino acid consumption and central intermediary metabolism as part of its transcriptome response to changing external pH conditions. PMID:16452448

  20. [Progress in porky genes and transcriptome and discussion of relative issues].

    PubMed

    Zhu, Meng-Jin; Liu, Bang; Li, Kui

    2005-01-01

    To date, research on molecular base of porky molecular development was mainly involved in muscle growth and meat quality. Some functional genes including Hal gene and RN gene and some QTLs controlling or associated with porky growth and quality were detected through candidate gene approach and genome-wide scanning. Genic transcriptome pertinent to porcine muscle and adipose also came into study. At the same time, these researches have befallen some shortcomings to some extent. Research from molecular quantitative genetics showed shortcomings that single gene was devilishly emphasized and co-expression pattern of multi-genes was ignored. Research applying transcriptome analysis tool also met two of limitations, one was the singleness of type of molecular experimental techniques, and another was that genes of muscle and adipose were artificially divided into unattached two parts. Thus, porky genes were explored by parallel genetics based on systemic views and techniques to specially reveal the interactional mechanism of porky genes respectively controlling muscle and adipose, which would be important issues of genes and genome researches on porky development in the near future.

  1. Massively Parallel RNA Sequencing Identifies a Complex Immune Gene Repertoire in the lophotrochozoan Mytilus edulis

    PubMed Central

    Philipp, Eva E. R.; Kraemer, Lars; Melzner, Frank; Poustka, Albert J.; Thieme, Sebastian; Findeisen, Ulrike; Schreiber, Stefan; Rosenstiel, Philip

    2012-01-01

    The marine mussel Mytilus edulis and its closely related sister species are distributed world-wide and play an important role in coastal ecology and economy. The diversification in different species and their hybrids, broad ecological distribution, as well as the filter feeding mode of life has made this genus an attractive model to investigate physiological and molecular adaptations and responses to various biotic and abiotic environmental factors. In the present study we investigated the immune system of Mytilus, which may contribute to the ecological plasticity of this species. We generated a large Mytilus transcriptome database from different tissues of immune challenged and stress treated individuals from the Baltic Sea using 454 pyrosequencing. Phylogenetic comparison of orthologous groups of 23 species demonstrated the basal position of lophotrochozoans within protostomes. The investigation of immune related transcripts revealed a complex repertoire of innate recognition receptors and downstream pathway members including transcripts for 27 toll-like receptors and 524 C1q domain containing transcripts. NOD-like receptors on the other hand were absent. We also found evidence for sophisticated TNF, autophagy and apoptosis systems as well as for cytokines. Gill tissue and hemocytes showed highest expression of putative immune related contigs and are promising tissues for further functional studies. Our results partly contrast with findings of a less complex immune repertoire in ecdysozoan and other lophotrochozoan protostomes. We show that bivalves are interesting candidates to investigate the evolution of the immune system from basal metazoans to deuterostomes and protostomes and provide a basis for future molecular work directed to immune system functioning in Mytilus. PMID:22448234

  2. Regulatory Mechanisms Underlying Oil Palm Fruit Mesocarp Maturation, Ripening, and Functional Specialization in Lipid and Carotenoid Metabolism1[W][OA

    PubMed Central

    Tranbarger, Timothy J.; Dussert, Stéphane; Joët, Thierry; Argout, Xavier; Summo, Marilyne; Champion, Antony; Cros, David; Omore, Alphonse; Nouy, Bruno; Morcillo, Fabienne

    2011-01-01

    Fruit provide essential nutrients and vitamins for the human diet. Not only is the lipid-rich fleshy mesocarp tissue of the oil palm (Elaeis guineensis) fruit the main source of edible oil for the world, but it is also the richest dietary source of provitamin A. This study examines the transcriptional basis of these two outstanding metabolic characters in the oil palm mesocarp. Morphological, cellular, biochemical, and hormonal features defined key phases of mesocarp development. A 454 pyrosequencing-derived transcriptome was then assembled for the developmental phases preceding and during maturation and ripening, when high rates of lipid and carotenoid biosynthesis occur. A total of 2,629 contigs with differential representation revealed coordination of metabolic and regulatory components. Further analysis focused on the fatty acid and triacylglycerol assembly pathways and during carotenogenesis. Notably, a contig similar to the Arabidopsis (Arabidopsis thaliana) seed oil transcription factor WRINKLED1 was identified with a transcript profile coordinated with those of several fatty acid biosynthetic genes and the high rates of lipid accumulation, suggesting some common regulatory features between seeds and fruits. We also focused on transcriptional regulatory networks of the fruit, in particular those related to ethylene transcriptional and GLOBOSA/PISTILLATA-like proteins in the mesocarp and a central role for ethylene-coordinated transcriptional regulation of type VII ethylene response factors during ripening. Our results suggest that divergence has occurred in the regulatory components in this monocot fruit compared with those identified in the dicot tomato (Solanum lycopersicum) fleshy fruit model. PMID:21487046

  3. Chicken innate immune response to oral infection with Salmonella enterica serovar Enteritidis

    PubMed Central

    2013-01-01

    The characterization of the immune response of chickens to Salmonella infection is usually limited to the quantification of expression of genes coding for cytokines, chemokines or antimicrobial peptides. However, processes occurring in the cecum of infected chickens are likely to be much more diverse. In this study we have therefore characterized the transcriptome and proteome in the chicken cecum after infection with Salmonella Enteritidis. Using a combination of 454 pyrosequencing, protein mass spectrometry and quantitative real-time PCR, we identified 48 down- and 56 up-regulated chicken genes after Salmonella Enteritidis infection. The most inducible gene was that coding for MMP7, exhibiting a 5952 fold induction 9 days post-infection. An induction of greater than 100 fold was observed for IgG, IRG1, SAA, ExFABP, IL-22, TRAP6, MRP126, IFNγ, iNOS, ES1, IL-1β, LYG2, IFIT5, IL-17, AVD, AH221 and SERPIN B. Since prostaglandin D2 synthase was upregulated and degrading hydroxyprostaglandin dehydrogenase was downregulated after the infection, prostaglandin must accumulate in the cecum of chickens infected with Salmonella Enteritidis. Finally, above mentioned signaling was dependent on the presence of a SPI1-encoded type III secretion system in Salmonella Enteritidis. The inflammation lasted for 2 weeks after which time the expression of the “inflammatory” genes returned back to basal levels and, instead, the expression of IgA and IgG increased. This points to an important role for immunoglobulins in the restoration of homeostasis in the cecum after infection. PMID:23687968

  4. De novo assembly and annotation of the Antarctic copepod (Tigriopus kingsejongensis) transcriptome.

    PubMed

    Kim, Hui-Su; Lee, Bo-Young; Han, Jeonghoon; Lee, Young Hwan; Min, Gi-Sik; Kim, Sanghee; Lee, Jae-Seong

    2016-08-01

    The whole transcriptome of the Antarctic copepod (Tigriopus kingsejongensis) was sequenced using Illumina RNA-seq. De novo assembly was performed with 64,785,098 raw reads using Trinity, which assembled into 81,653 contigs. TransDecoder found 38,250 candidate coding contigs which showed homology to other species by BLAST analysis. Functional gene annotation was performed by Gene Ontology (GO), InterProScan, and KEGG pathway analyses. Finally, we identified a number of expressed gene catalog for T. kingsejongensis that is a useful model animal for gene information-based polar research to uncover molecular mechanisms of environmental adaptation on harsh environments. In particular, we observed highly developing lipid metabolism in T. kingsejongensis directly compared to those of the Far East Pacific coast copepod Tigriopus japonicus at the transcriptome level. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.

    PubMed

    Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong

    2014-05-01

    We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.

  6. ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species.

    PubMed

    Zeng, Victor; Extavour, Cassandra G

    2012-01-01

    The increased throughput and decreased cost of next-generation sequencing (NGS) have shifted the bottleneck genomic research from sequencing to annotation, analysis and accessibility. This is particularly challenging for research communities working on organisms that lack the basic infrastructure of a sequenced genome, or an efficient way to utilize whatever sequence data may be available. Here we present a new database, the Assembled Searchable Giant Arthropod Read Database (ASGARD). This database is a repository and search engine for transcriptomic data from arthropods that are of high interest to multiple research communities but currently lack sequenced genomes. We demonstrate the functionality and utility of ASGARD using de novo assembled transcriptomes from the milkweed bug Oncopeltus fasciatus, the cricket Gryllus bimaculatus and the amphipod crustacean Parhyale hawaiensis. We have annotated these transcriptomes to assign putative orthology, coding region determination, protein domain identification and Gene Ontology (GO) term annotation to all possible assembly products. ASGARD allows users to search all assemblies by orthology annotation, GO term annotation or Basic Local Alignment Search Tool. User-friendly features of ASGARD include search term auto-completion suggestions based on database content, the ability to download assembly product sequences in FASTA format, direct links to NCBI data for predicted orthologs and graphical representation of the location of protein domains and matches to similar sequences from the NCBI non-redundant database. ASGARD will be a useful repository for transcriptome data from future NGS studies on these and other emerging model arthropods, regardless of sequencing platform, assembly or annotation status. This database thus provides easy, one-stop access to multi-species annotated transcriptome information. We anticipate that this database will be useful for members of multiple research communities, including developmental biology, physiology, evolutionary biology, ecology, comparative genomics and phylogenomics. Database URL: asgard.rc.fas.harvard.edu.

  7. ReprOlive: a database with linked data for the olive tree (Olea europaea L.) reproductive transcriptome

    PubMed Central

    Carmona, Rosario; Zafra, Adoración; Seoane, Pedro; Castro, Antonio J.; Guerrero-Fernández, Darío; Castillo-Castillo, Trinidad; Medina-García, Ana; Cánovas, Francisco M.; Aldana-Montes, José F.; Navas-Delgado, Ismael; Alché, Juan de Dios; Claros, M. Gonzalo

    2015-01-01

    Plant reproductive transcriptomes have been analyzed in different species due to the agronomical and biotechnological importance of plant reproduction. Here we presented an olive tree reproductive transcriptome database with samples from pollen and pistil at different developmental stages, and leaf and root as control vegetative tissues http://reprolive.eez.csic.es). It was developed from 2,077,309 raw reads to 1,549 Sanger sequences. Using a pre-defined workflow based on open-source tools, sequences were pre-processed, assembled, mapped, and annotated with expression data, descriptions, GO terms, InterPro signatures, EC numbers, KEGG pathways, ORFs, and SSRs. Tentative transcripts (TTs) were also annotated with the corresponding orthologs in Arabidopsis thaliana from TAIR and RefSeq databases to enable Linked Data integration. It results in a reproductive transcriptome comprising 72,846 contigs with average length of 686 bp, of which 63,965 (87.8%) included at least one functional annotation, and 55,356 (75.9%) had an ortholog. A minimum of 23,568 different TTs was identified and 5,835 of them contain a complete ORF. The representative reproductive transcriptome can be reduced to 28,972 TTs for further gene expression studies. Partial transcriptomes from pollen, pistil, and vegetative tissues as control were also constructed. ReprOlive provides free access and download capability to these results. Retrieval mechanisms for sequences and transcript annotations are provided. Graphical localization of annotated enzymes into KEGG pathways is also possible. Finally, ReprOlive has included a semantic conceptualisation by means of a Resource Description Framework (RDF) allowing a Linked Data search for extracting the most updated information related to enzymes, interactions, allergens, structures, and reactive oxygen species. PMID:26322066

  8. De novo transcriptome assemblies of four xylem sap-feeding insects.

    PubMed

    Tassone, Erica E; Cowden, Charles C; Castle, S J

    2017-03-01

    Spittle bugs and sharpshooters are well-known xylem sap-feeding insects and vectors of the phytopathogenic bacterium Xylella fastidiosa (Wells), a causal agent of Pierce's disease of grapevines and other crop diseases. Specialized feeding on nutrient-deficient xylem sap is relatively rare among insect herbivores, and only limited genomic and transcriptomic information has been generated for xylem-sap feeders. To develop a more comprehensive understanding of biochemical adaptations and symbiotic relationships that support survival on a nutritionally austere dietary source, transcriptome assemblies for three sharpshooter species and one spittlebug species were produced. Trinity-based de novo transcriptome assemblies were generated for all four xylem-sap feeders using raw sequencing data originating from whole-insect preps. Total transcripts for each species ranged from 91 384 for Cuerna arida to 106 998 for Homalodisca liturata with transcript totals for Graphocephala atropunctata and the spittlebug Clastoptera arizonana falling in between. The percentage of transcripts comprising complete open reading frames ranged from 60% for H. liturata to 82% for C. arizonana. Bench-marking universal single-copy orthologs analyses for each dataset indicated quality assemblies and a high degree of completeness for all four species. These four transcriptomes represent a significant expansion of data for insect herbivores that feed exclusively on xylem sap, a nutritionally deficient dietary source relative to other plant tissues and fluids. Comparison of transcriptome data with insect herbivores that utilize other dietary sources may illuminate fundamental differences in the biochemistry of dietary specialization. Published by Oxford University Press on behalf of GIGSCI 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  9. Next-Generation Transcriptome Profiling of the Salmon Louse Caligus rogercresseyi Exposed to Deltamethrin (AlphaMax™): Discovery of Relevant Genes and Sex-Related Differences.

    PubMed

    Chávez-Mardones, Jacqueline; Gallardo-Escárate, Cristian

    2015-12-01

    Sea lice are one of the main parasites affecting the salmon aquaculture industry, causing significant economic losses worldwide. Increased resistance to traditional chemical treatments has created the need to find alternative control methods. Therefore, the objective of this study was to identify the transcriptome response of the salmon louse Caligus rogercresseyi to the delousing drug deltamethrin (AlphaMax™). Through bioassays with different concentrations of deltamethrin, adult salmon lice transcriptomes were sequenced from cDNA libraries in the MiSeq Illumina platform. A total of 78 million reads for females and males were assembled in 30,212 and 38,536 contigs, respectively. De novo assembly yielded 86,878 high-quality contigs and, based on published data, it was possible to annotate and identify relevant genes involved in several biological processes. RNA-seq analysis in conjunction with heatmap hierarchical clustering evidenced that pyrethroids modify the ectoparasitic transcriptome in adults, affecting molecular processes associated with the nervous system, cuticle formation, oxidative stress, reproduction, and metabolism, among others. Furthermore, sex-related transcriptome differences were evidenced. Specifically, 534 and 1033 exclusive transcripts were identified for males and females, respectively, and 154 were shared between sexes. For males, estradiol 17-beta-dehydrogenase, sphingolipid delta4-desaturase DES1, ketosamine-3-kinase, and arylsulfatase A, among others, were discovered, while for females, vitellogenin 1, glycoprotein G, transaldolase, and nitric oxide synthase were among those identified. The shared transcripts included annotations for tropomyosin, γ-crystallin A, glutamate receptor-metabotropic, glutathione S-transferase, and carboxipeptidase B. The present study reveals that deltamethrin generates a complex transcriptome response in C. rogercresseyi, thus providing valuable genomic information for developing new delousing drugs.

  10. Mining genes involved in insecticide resistance of Liposcelis bostrychophila Badonnel by transcriptome and expression profile analysis.

    PubMed

    Dou, Wei; Shen, Guang-Mao; Niu, Jin-Zhi; Ding, Tian-Bo; Wei, Dan-Dan; Wang, Jin-Jun

    2013-01-01

    Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina). In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp) by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr) protein database, gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin) exposure using the tag-based digital gene expression (DGE) method. The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids.

  11. Mining Genes Involved in Insecticide Resistance of Liposcelis bostrychophila Badonnel by Transcriptome and Expression Profile Analysis

    PubMed Central

    Dou, Wei; Shen, Guang-Mao; Niu, Jin-Zhi; Ding, Tian-Bo; Wei, Dan-Dan; Wang, Jin-Jun

    2013-01-01

    Background Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. Methodology and Principal Findings In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina). In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp) by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr) protein database, gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin) exposure using the tag-based digital gene expression (DGE) method. Conclusion The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids. PMID:24278202

  12. Diurnal Transcriptome and Gene Network Represented through Sparse Modeling in Brachypodium distachyon.

    PubMed

    Koda, Satoru; Onda, Yoshihiko; Matsui, Hidetoshi; Takahagi, Kotaro; Yamaguchi-Uehara, Yukiko; Shimizu, Minami; Inoue, Komaki; Yoshida, Takuhiro; Sakurai, Tetsuya; Honda, Hiroshi; Eguchi, Shinto; Nishii, Ryuei; Mochida, Keiichi

    2017-01-01

    We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX) model with a group smoothly clipped absolute deviation (SCAD) method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon . To reveal the diurnal changes in the transcriptome in B. distachyon , we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon . On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon , aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.

  13. Comparison of a teratogenic transcriptome-based predictive test based on human embryonic versus inducible pluripotent stem cells.

    PubMed

    Shinde, Vaibhav; Perumal Srinivasan, Sureshkumar; Henry, Margit; Rotshteyn, Tamara; Hescheler, Jürgen; Rahnenführer, Jörg; Grinberg, Marianna; Meisig, Johannes; Blüthgen, Nils; Waldmann, Tanja; Leist, Marcel; Hengstler, Jan Georg; Sachinidis, Agapios

    2016-12-30

    Human embryonic stem cells (hESCs) partially recapitulate early embryonic three germ layer development, allowing testing of potential teratogenic hazards. Because use of hESCs is ethically debated, we investigated the potential for human induced pluripotent stem cells (hiPSCs) to replace hESCs in such tests. Three cell lines, comprising hiPSCs (foreskin and IMR90) and hESCs (H9) were differentiated for 14 days. Their transcriptome profiles were obtained on day 0 and day 14 and analyzed by comprehensive bioinformatics tools. The transcriptomes on day 14 showed that more than 70% of the "developmental genes" (regulated genes with > 2-fold change on day 14 compared to day 0) exhibited variability among cell lines. The developmental genes belonging to all three cell lines captured biological processes and KEGG pathways related to all three germ layer embryonic development. In addition, transcriptome profiles were obtained after 14 days of exposure to teratogenic valproic acid (VPA) during differentiation. Although the differentially regulated genes between treated and untreated samples showed more than 90% variability among cell lines, VPA clearly antagonized the expression of developmental genes in all cell lines: suppressing upregulated developmental genes, while inducing downregulated ones. To quantify VPA-disturbed development based on developmental genes, we estimated the "developmental potency" (D p ) and "developmental index" (D i ). Despite differences in genes deregulated by VPA, uniform D i values were obtained for all three cell lines. Given that the D i values for VPA were similar for hESCs and hiPSCs, D i can be used for robust hazard identification, irrespective of whether hESCs or hiPSCs are used in the test systems.

  14. Transcriptome analysis of Jatropha curcas L. flower buds responded to the paclobutrazol treatment.

    PubMed

    Seesangboon, Anupharb; Gruneck, Lucsame; Pokawattana, Tittinat; Eungwanichayapant, Prapassorn Damrongkool; Tovaranonte, Jantrararuk; Popluechai, Siam

    2018-06-01

    Jatropha seeds can be used to produce high-quality biodiesel due to their high oil content. However, Jatropha produces low numbers of female flowers, which limits seed yield. Paclobutrazol (PCB), a plant growth retardant, can increase number of Jatropha female flowers and seed yield. However, the underlying mechanisms of flower development after PCB treatment are not well understood. To identify the critical genes associated with flower development, the transcriptome of flower buds following PCB treatment was analyzed. Scanning Electron Microscope (SEM) analysis revealed that the flower developmental stage between PCB-treated and control flower buds was similar. Based on the presence of sex organs, flower buds at 0, 4, and 24 h after treatment were chosen for global transcriptome analysis. In total, 100,597 unigenes were obtained, 174 of which were deemed as interesting based on their response to PCB treatment. Our analysis showed that the JcCKX5 and JcTSO1 genes were up-regulated at 4 h, suggesting roles in promoting organogenic capacity and ovule primordia formation in Jatropha. The JcNPGR2, JcMGP2-3, and JcHUA1 genes were down-regulated indicating that they may contribute to increased number of female flowers and amount of seed yield. Expression of cell division and cellulose biosynthesis-related genes, including JcGASA3, JcCycB3;1, JcCycP2;1, JcKNAT7, and JcCSLG3 was decreased, which might have caused the compacted inflorescences. This study represents the first report combining SEM-based morphology, qRT-PCR and transcriptome analysis of PCB-treated Jatropha flower buds at different stages of flower development. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  15. Metabolic engineering of Escherichia coli for the production of l-valine based on transcriptome analysis and in silico gene knockout simulation

    PubMed Central

    Park, Jin Hwan; Lee, Kwang Ho; Kim, Tae Yong; Lee, Sang Yup

    2007-01-01

    The l-valine production strain of Escherichia coli was constructed by rational metabolic engineering and stepwise improvement based on transcriptome analysis and gene knockout simulation of the in silico genome-scale metabolic network. Feedback inhibition of acetohydroxy acid synthase isoenzyme III by l-valine was removed by site-directed mutagenesis, and the native promoter containing the transcriptional attenuator leader regions of the ilvGMEDA and ilvBN operon was replaced with the tac promoter. The ilvA, leuA, and panB genes were deleted to make more precursors available for l-valine biosynthesis. This engineered Val strain harboring a plasmid overexpressing the ilvBN genes produced 1.31 g/liter l-valine. Comparative transcriptome profiling was performed during batch fermentation of the engineered and control strains. Among the down-regulated genes, the lrp and ygaZH genes, which encode a global regulator Lrp and l-valine exporter, respectively, were overexpressed. Amplification of the lrp, ygaZH, and lrp-ygaZH genes led to the enhanced production of l-valine by 21.6%, 47.1%, and 113%, respectively. Further improvement was achieved by using in silico gene knockout simulation, which identified the aceF, mdh, and pfkA genes as knockout targets. The VAMF strain (Val ΔaceF Δmdh ΔpfkA) overexpressing the ilvBN, ilvCED, ygaZH, and lrp genes was able to produce 7.55 g/liter l-valine from 20 g/liter glucose in batch culture, resulting in a high yield of 0.378 g of l-valine per gram of glucose. These results suggest that an industrially competitive strain can be efficiently developed by metabolic engineering based on combined rational modification, transcriptome profiling, and systems-level in silico analysis. PMID:17463081

  16. Bacterial communities associated with four ctenophore genera from the German Bight (North Sea).

    PubMed

    Hao, Wenjin; Gerdts, Gunnar; Peplies, Jörg; Wichels, Antje

    2015-01-01

    Intense research has been conducted on jellyfish and ctenophores in recent years. They are increasingly recognized as key elements in the marine ecosystem that serve as critical indicators and drivers of ecosystem performance and change. However, the bacterial community associated with ctenophores is still poorly investigated. Based on automated ribosomal intergenic spacer analysis (ARISA) and 16S ribosomal RNA gene amplicon pyrosequencing, we investigated bacterial communities associated with the frequently occurring ctenophore species Mnemiopsis leidyi, Beroe sp., Bolinopsis infundibulum and Pleurobrachia pileus at Helgoland Roads in the German Bight (North Sea). We observed significant differences between the associated bacterial communities of the different ctenophore species based on ARISA patterns. With respect to bacterial taxa, all ctenophore species were dominated by Proteobacteria as revealed by pyrosequencing. Mnemiopsis leidyi and P. pileus mainly harboured Gammaproteobacteria, with Marinomonas as the dominant phylotype of M. leidyi. By contrast, Pseudoalteromonas and Psychrobacter were the most abundant Gammaproteobacteria in P. pileus. Beroe sp. was mainly dominated by Alphaproteobacteria, particularly by the genus Thalassospira. For B. infundibulum, the bacterial community was composed of Alphaproteobacteria and Gammaproteobacteria in equal parts, which consisted of the genera Thalassospira and Marinomonas. In addition, the bacterial communities associated with M. leidyi display a clear variation over time that needs further investigation. Our results indicate that the bacterial communities associated with ctenophores are highly species- specific. © FEMS 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  17. A Pyrosequencing-Based Assay for the Rapid Detection of the 22q11.2 Deletion in DNA from Buccal and Dried Blood Spot Samples

    PubMed Central

    Koontz, Deborah; Baecher, Kirsten; Kobrynski, Lisa; Nikolova, Stanimila; Gallagher, Margaret

    2015-01-01

    The 22q11.2 deletion syndrome is one of the most common deletion syndromes in newborns. Some affected newborns may be diagnosed shortly after birth because of the presence of heart defects, palatal defects, or severe immune deficiencies. However, diagnosis is often delayed in patients presenting with other associated conditions that would benefit from early recognition and treatment, such as speech delays, learning difficulties, and schizophrenia. Fluorescence in situ hybridization (FISH) is the gold standard for deletion detection, but it is costly and time consuming and requires a whole blood specimen. Our goal was to develop a suitable assay for population-based screening of easily collectible specimens, such as buccal swabs and dried blood spots (DBS). We designed a pyrosequencing assay and validated it using DNA from FISH–confirmed 22q11 deletion syndrome patients and normal controls. We tested DBS from nine patients and paired buccal cell and venous blood specimens from 20 patients. Results were 100% concordant with FISH assay results. DNA samples from normal controls (n = 180 cell lines, n = 15 DBS, and n = 88 buccal specimens) were negative for the deletion. Limiting dilution experiments demonstrated that accurate results could be obtained from as little as 1 ng of DNA. This method represents a reliable and low-cost alternative for detection of the common 22q11.2 microdeletions and can be adapted to high-throughput population screening. PMID:24973633

  18. Transcriptome instability as a molecular pan-cancer characteristic of carcinomas.

    PubMed

    Sveen, Anita; Johannessen, Bjarne; Teixeira, Manuel R; Lothe, Ragnhild A; Skotheim, Rolf I

    2014-08-10

    We have previously proposed transcriptome instability as a genome-wide, pre-mRNA splicing-related characteristic of colorectal cancer. Here, we explore the hypothesis of transcriptome instability being a general characteristic of cancer. Exon-level microarray expression data from ten cancer datasets were analyzed, including breast cancer, cervical cancer, colorectal cancer, gastric cancer, lung cancer, neuroblastoma, and prostate cancer (555 samples), as well as paired normal tissue samples from the colon, lung, prostate, and stomach (93 samples). Based on alternative splicing scores across the genomes, we calculated sample-wise relative amounts of aberrant exon skipping and inclusion. Strong and non-random (P < 0.001) correlations between these estimates and the expression levels of splicing factor genes (n = 280) were found in most cancer types analyzed (breast-, cervical-, colorectal-, lung- and prostate cancer). This suggests a biological explanation for the splicing variation. Surprisingly, these associations prevailed in pan-cancer analyses. This is in contrast to the tissue and cancer specific patterns observed in comparisons across healthy tissue samples from the colon, lung, prostate, and stomach, and between paired cancer-normal samples from the same four tissue types. Based on exon-level expression profiling and computational analyses of alternative splicing, we propose transcriptome instability as a molecular pan-cancer characteristic. The affected cancers show strong and non-random associations between low expression levels of splicing factor genes, and high amounts of aberrant exon skipping and inclusion, and vice versa, on a genome-wide scale.

  19. An integrative 'omics' solution to the detection of recombinant human erythropoietin and blood doping.

    PubMed

    Pitsiladis, Yannis P; Durussel, Jérôme; Rabin, Olivier

    2014-05-01

    Administration of recombinant human erythropoietin (rHumanEPO) improves sporting performance and hence is frequently subject to abuse by athletes, although rHumanEPO is prohibited by the WADA. Approaches to detect rHumanEPO doping have improved significantly in recent years but remain imperfect. A new transcriptomic-based longitudinal screening approach is being developed that has the potential to improve the analytical performance of current detection methods. In particular, studies are being funded by WADA to identify a 'molecular signature' of rHumanEPO doping and preliminary results are promising. In the first systematic study to be conducted, the expression of hundreds of genes were found to be altered by rHumanEPO with numerous gene transcripts being differentially expressed after the first injection and further transcripts profoundly upregulated during and subsequently downregulated up to 4 weeks postadministration of the drug; with the same transcriptomic pattern observed in all participants. The identification of a blood 'molecular signature' of rHumanEPO administration is the strongest evidence to date that gene biomarkers have the potential to substantially improve the analytical performance of current antidoping methods such as the Athlete Biological Passport for rHumanEPO detection. Given the early promise of transcriptomics, research using an 'omics'-based approach involving genomics, transcriptomics, proteomics and metabolomics should be intensified in order to achieve improved detection of rHumanEPO and other doping substances and methods difficult to detect such a recombinant human growth hormone and blood transfusions.

  20. De novo characterization of the pine aphid Cinara pinitabulaeformis Zhang et Zhang transcriptome and analysis of genes relevant to pesticides

    PubMed Central

    Rebeca, Carballar-Lejarazú; Zhu, Xiaoli; Guo, Yajie; Lin, Qiannan; Hu, Xia; Wang, Rong; Liang, Guanghong; Guan, Xiong

    2017-01-01

    The pine aphid Cinara pinitabulaeformis Zhang et Zhang is the main pine pest in China, it causes pine needles to produce dense dew (honeydew) which can lead to sooty mold (black filamentous saprophytic ascomycetes). Although common chemical and physical strategies are used to prevent the disease caused by C. pinitabulaeformis Zhang et Zhang, new strategies based on biological and/or genetic approaches are promising to control and eradicate the disease. However, there is no information about genomics, proteomics or transcriptomics to allow the design of new control strategies for this pine aphid. We used next generation sequencing technology to sequence the transcriptome of C. pinitabulaeformis Zhang et Zhang and built a transcriptome database. We identified 80,259 unigenes assigned for Gene Ontology (GO) terms and information for a total of 11,609 classified unigenes was obtained in the Clusters of Orthologous Groups (COGs). A total of 10,806 annotated unigenes were analyzed to identify the represented biological pathways, among them 8,845 unigenes matched with 228 KEGG pathways. In addition, our data describe propagative viruses, nutrition-related genes, detoxification related molecules, olfactory related receptors, stressed-related protein, putative insecticide resistance genes and possible insecticide targets. Moreover, this study provides valuable information about putative insecticide resistance related genes and for the design of new genetic/biological based strategies to manage and control C. pinitabulaeformis Zhang et Zhang populations. PMID:28570707

  1. Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution.

    PubMed

    van Iterson, Maarten; van Zwet, Erik W; Heijmans, Bastiaan T

    2017-01-27

    We show that epigenome- and transcriptome-wide association studies (EWAS and TWAS) are prone to significant inflation and bias of test statistics, an unrecognized phenomenon introducing spurious findings if left unaddressed. Neither GWAS-based methodology nor state-of-the-art confounder adjustment methods completely remove bias and inflation. We propose a Bayesian method to control bias and inflation in EWAS and TWAS based on estimation of the empirical null distribution. Using simulations and real data, we demonstrate that our method maximizes power while properly controlling the false positive rate. We illustrate the utility of our method in large-scale EWAS and TWAS meta-analyses of age and smoking.

  2. Playing hide and seek with repeats in local and global de novo transcriptome assembly of short RNA-seq reads.

    PubMed

    Lima, Leandro; Sinaimeri, Blerina; Sacomoto, Gustavo; Lopez-Maestre, Helene; Marchet, Camille; Miele, Vincent; Sagot, Marie-France; Lacroix, Vincent

    2017-01-01

    The main challenge in de novo genome assembly of DNA-seq data is certainly to deal with repeats that are longer than the reads. In de novo transcriptome assembly of RNA-seq reads, on the other hand, this problem has been underestimated so far. Even though we have fewer and shorter repeated sequences in transcriptomics, they do create ambiguities and confuse assemblers if not addressed properly. Most transcriptome assemblers of short reads are based on de Bruijn graphs (DBG) and have no clear and explicit model for repeats in RNA-seq data, relying instead on heuristics to deal with them. The results of this work are threefold. First, we introduce a formal model for representing high copy-number and low-divergence repeats in RNA-seq data and exploit its properties to infer a combinatorial characteristic of repeat-associated subgraphs. We show that the problem of identifying such subgraphs in a DBG is NP-complete. Second, we show that in the specific case of local assembly of alternative splicing (AS) events, we can implicitly avoid such subgraphs, and we present an efficient algorithm to enumerate AS events that are not included in repeats. Using simulated data, we show that this strategy is significantly more sensitive and precise than the previous version of KisSplice (Sacomoto et al. in WABI, pp 99-111, 1), Trinity (Grabherr et al. in Nat Biotechnol 29(7):644-652, 2), and Oases (Schulz et al. in Bioinformatics 28(8):1086-1092, 3), for the specific task of calling AS events. Third, we turn our focus to full-length transcriptome assembly, and we show that exploring the topology of DBGs can improve de novo transcriptome evaluation methods. Based on the observation that repeats create complicated regions in a DBG, and when assemblers try to traverse these regions, they can infer erroneous transcripts, we propose a measure to flag transcripts traversing such troublesome regions, thereby giving a confidence level for each transcript. The originality of our work when compared to other transcriptome evaluation methods is that we use only the topology of the DBG, and not read nor coverage information. We show that our simple method gives better results than Rsem-Eval (Li et al. in Genome Biol 15(12):553, 4) and TransRate (Smith-Unna et al. in Genome Res 26(8):1134-1144, 5) on both real and simulated datasets for detecting chimeras, and therefore is able to capture assembly errors missed by these methods.

  3. Tn5Prime, a Tn5 based 5' capture method for single cell RNA-seq.

    PubMed

    Cole, Charles; Byrne, Ashley; Beaudin, Anna E; Forsberg, E Camilla; Vollmers, Christopher

    2018-06-01

    RNA-sequencing (RNA-seq) is a powerful technique to investigate and quantify entire transcriptomes. Recent advances in the field have made it possible to explore the transcriptomes of single cells. However, most widely used RNA-seq protocols fail to provide crucial information regarding transcription start sites. Here we present a protocol, Tn5Prime, that takes advantage of the Tn5 transposase-based Smart-seq2 protocol to create RNA-seq libraries that capture the 5' end of transcripts. The Tn5Prime method dramatically streamlines the 5' capture process and is both cost effective and reliable. By applying Tn5Prime to bulk RNA and single cell samples, we were able to define transcription start sites as well as quantify transcriptomes at high accuracy and reproducibility. Additionally, similar to 3' end-based high-throughput methods like Drop-seq and 10× Genomics Chromium, the 5' capture Tn5Prime method allows the introduction of cellular identifiers during reverse transcription, simplifying the analysis of large numbers of single cells. In contrast to 3' end-based methods, Tn5Prime also enables the assembly of the variable 5' ends of the antibody sequences present in single B-cell data. Therefore, Tn5Prime presents a robust tool for both basic and applied research into the adaptive immune system and beyond.

  4. De novo assembly and transcriptome characterization of the freshwater prawn Palaemonetes argentinus: Implications for a detoxification response.

    PubMed

    García, C Fernando; Pedrini, Nicolas; Sánchez-Paz, Arturo; Reyna-Blanco, Carlos S; Lavarias, Sabrina; Muhlia-Almazán, Adriana; Fernández-Giménez, Analía; Laino, Aldana; de-la-Re-Vega, Enrique; Lukaszewicz, German; López-Zavala, Alonso A; Brieba, Luis G; Criscitello, Michael F; Carrasco-Miranda, Jesús S; García-Orozco, Karina D; Ochoa-Leyva, Adrian; Rudiño-Piñera, Enrique; Sanchez-Flores, Alejandro; Sotelo-Mundo, Rogerio R

    2018-02-01

    Palaemonetes argentinus, an abundant freshwater prawn species in the northern and central region of Argentina, has been used as a bioindicator of environmental pollutants as it displays a very high sensitivity to pollutants exposure. Despite their extraordinary ecological relevance, a lack of genomic information has hindered a more thorough understanding of the molecular mechanisms potentially involved in detoxification processes of this species. Thus, transcriptomic profiling studies represent a promising approach to overcome the limitations imposed by the lack of extensive genomic resources for P. argentinus, and may improve the understanding of its physiological and molecular response triggered by pollutants. This work represents the first comprehensive transcriptome-based characterization of the non-model species P. argentinus to generate functional genomic annotations and provides valuable resources for future genetic studies. Trinity de novo assembly consisted of 24,738 transcripts with high representation of detoxification (phase I and II), anti-oxidation, osmoregulation pathways and DNA replication and bioenergetics. This crustacean transcriptome provides valuable molecular information about detoxification and biochemical processes that could be applied as biomarkers in further ecotoxicology studies. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Generation of a foveomacular transcriptome

    PubMed Central

    Bernstein, Steven; Wong, Paul W.

    2014-01-01

    Purpose Organizing molecular biologic data is a growing challenge since the rate of data accumulation is steadily increasing. Information relevant to a particular biologic query can be difficult to extract from the comprehensive databases currently available. We present a data collection and organization model designed to ameliorate these problems and applied it to generate an expressed sequence tag (EST)–based foveomacular transcriptome. Methods Using Perl, MySQL, EST libraries, screening, and human foveomacular gene expression as a model system, we generated a foveomacular transcriptome database enriched for molecularly relevant data. Results Using foveomacula as a gene expression model tissue, we identified and organized 6,056 genes expressed in that tissue. Of those identified genes, 3,480 had not been previously described as expressed in the foveomacula. Internal experimental controls as well as comparison of our data set to published data sets suggest we do not yet have a complete description of the foveomacula transcriptome. Conclusions We present an organizational method designed to amplify the utility of data pertinent to a specific research interest. Our method is generic enough to be applicable to a variety of conditions yet focused enough to allow for specialized study. PMID:24991187

  6. RNAseq-based transcriptome comparison of Saccharomyces cerevisiae strains isolated from diverse fermentative environments.

    PubMed

    Ibáñez, Clara; Pérez-Torrado, Roberto; Morard, Miguel; Toft, Christina; Barrio, Eladio; Querol, Amparo

    2017-09-18

    Transcriptome analyses play a central role in unraveling the complexity of gene expression regulation in Saccharomyces cerevisiae. This species, one of the most important microorganisms for humans given its industrial applications, shows an astonishing degree of genetic and phenotypic variability among different strains adapted to specific environments. In order to gain novel insights into the Saccharomyces cerevisiae biology of strains adapted to different fermentative environments, we analyzed the whole transcriptome of three strains isolated from wine, flor wine or mezcal fermentations. An RNA-seq transcriptome comparison of the different yeasts in the samples obtained during synthetic must fermentation highlighted the differences observed in the genes that encode mannoproteins, and in those involved in aroma, sugar transport, glycerol and alcohol metabolism, which are important under alcoholic fermentation conditions. These differences were also observed in the physiology of the strains after mannoprotein and aroma determinations. This study offers an essential foundation for understanding how gene expression variations contribute to the fermentation differences of the strains adapted to unequal fermentative environments. Such knowledge is crucial to make improvements in fermentation processes and to define targets for the genetic improvement or selection of wine yeasts. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. A Pipeline for High-Throughput Concentration Response Modeling of Gene Expression for Toxicogenomics

    PubMed Central

    House, John S.; Grimm, Fabian A.; Jima, Dereje D.; Zhou, Yi-Hui; Rusyn, Ivan; Wright, Fred A.

    2017-01-01

    Cell-based assays are an attractive option to measure gene expression response to exposure, but the cost of whole-transcriptome RNA sequencing has been a barrier to the use of gene expression profiling for in vitro toxicity screening. In addition, standard RNA sequencing adds variability due to variable transcript length and amplification. Targeted probe-sequencing technologies such as TempO-Seq, with transcriptomic representation that can vary from hundreds of genes to the entire transcriptome, may reduce some components of variation. Analyses of high-throughput toxicogenomics data require renewed attention to read-calling algorithms and simplified dose–response modeling for datasets with relatively few samples. Using data from induced pluripotent stem cell-derived cardiomyocytes treated with chemicals at varying concentrations, we describe here and make available a pipeline for handling expression data generated by TempO-Seq to align reads, clean and normalize raw count data, identify differentially expressed genes, and calculate transcriptomic concentration–response points of departure. The methods are extensible to other forms of concentration–response gene-expression data, and we discuss the utility of the methods for assessing variation in susceptibility and the diseased cellular state. PMID:29163636

  8. Assessment of bacterial diversity in the cattle tick Rhipicephalus (Boophilus) microplus through tag-encoded pyrosequencing

    PubMed Central

    2011-01-01

    Background Ticks are regarded as the most relevant vectors of disease-causing pathogens in domestic and wild animals. The cattle tick, Rhipicephalus (Boophilus) microplus, hinders livestock production in tropical and subtropical parts of the world where it is endemic. Tick microbiomes remain largely unexplored. The objective of this study was to explore the R. microplus microbiome by applying the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP) technique to characterize its bacterial diversity. Pyrosequencing was performed on adult males and females, eggs, and gut and ovary tissues from adult females derived from samples of R. microplus collected during outbreaks in southern Texas. Results Raw data from bTEFAP were screened and trimmed based upon quality scores and binned into individual sample collections. Bacteria identified to the species level include Staphylococcus aureus, Staphylococcus chromogenes, Streptococcus dysgalactiae, Staphylococcus sciuri, Serratia marcescens, Corynebacterium glutamicum, and Finegoldia magna. One hundred twenty-one bacterial genera were detected in all the life stages and tissues sampled. The total number of genera identified by tick sample comprised: 53 in adult males, 61 in adult females, 11 in gut tissue, 7 in ovarian tissue, and 54 in the eggs. Notable genera detected in the cattle tick include Wolbachia, Coxiella, and Borrelia. The molecular approach applied in this study allowed us to assess the relative abundance of the microbiota associated with R. microplus. Conclusions This report represents the first survey of the bacteriome in the cattle tick using non-culture based molecular approaches. Comparisons of our results with previous bacterial surveys provide an indication of geographic variation in the assemblages of bacteria associated with R. microplus. Additional reports on the identification of new bacterial species maintained in nature by R. microplus that may be pathogenic to its vertebrate hosts are expected as our understanding of its microbiota expands. Increased awareness of the role R. microplus can play in the transmission of pathogenic bacteria will enhance our ability to mitigate its economic impact on animal agriculture globally. This recognition should be included as part of analyses to assess the risk for re-invasion of areas like the United States of America where R. microplus was eradicated. PMID:21211038

  9. Analysis of the Human Prostate-Specific Proteome Defined by Transcriptomics and Antibody-Based Profiling Identifies TMEM79 and ACOXL as Two Putative, Diagnostic Markers in Prostate Cancer

    PubMed Central

    O'Hurley, Gillian; Busch, Christer; Fagerberg, Linn; Hallström, Björn M.; Stadler, Charlotte; Tolf, Anna; Lundberg, Emma; Schwenk, Jochen M.; Jirström, Karin; Bjartell, Anders; Gallagher, William M.; Uhlén, Mathias; Pontén, Fredrik

    2015-01-01

    To better understand prostate function and disease, it is important to define and explore the molecular constituents that signify the prostate gland. The aim of this study was to define the prostate specific transcriptome and proteome, in comparison to 26 other human tissues. Deep sequencing of mRNA (RNA-seq) and immunohistochemistry-based protein profiling were combined to identify prostate specific gene expression patterns and to explore tissue biomarkers for potential clinical use in prostate cancer diagnostics. We identified 203 genes with elevated expression in the prostate, 22 of which showed more than five-fold higher expression levels compared to all other tissue types. In addition to previously well-known proteins we identified two poorly characterized proteins, TMEM79 and ACOXL, with potential to differentiate between benign and cancerous prostatic glands in tissue biopsies. In conclusion, we have applied a genome-wide analysis to identify the prostate specific proteome using transcriptomics and antibody-based protein profiling to identify genes with elevated expression in the prostate. Our data provides a starting point for further functional studies to explore the molecular repertoire of normal and diseased prostate including potential prostate cancer markers such as TMEM79 and ACOXL. PMID:26237329

  10. Multimodal RNA-seq using single-strand, double-strand, and CircLigase-based capture yields a refined and extended description of the C. elegans transcriptome.

    PubMed

    Lamm, Ayelet T; Stadler, Michael R; Zhang, Huibin; Gent, Jonathan I; Fire, Andrew Z

    2011-02-01

    We have used a combination of three high-throughput RNA capture and sequencing methods to refine and augment the transcriptome map of a well-studied genetic model, Caenorhabditis elegans. The three methods include a standard (non-directional) library preparation protocol relying on cDNA priming and foldback that has been used in several previous studies for transcriptome characterization in this species, and two directional protocols, one involving direct capture of single-stranded RNA fragments and one involving circular-template PCR (CircLigase). We find that each RNA-seq approach shows specific limitations and biases, with the application of multiple methods providing a more complete map than was obtained from any single method. Of particular note in the analysis were substantial advantages of CircLigase-based and ssRNA-based capture for defining sequences and structures of the precise 5' ends (which were lost using the double-strand cDNA capture method). Of the three methods, ssRNA capture was most effective in defining sequences to the poly(A) junction. Using data sets from a spectrum of C. elegans strains and stages and the UCSC Genome Browser, we provide a series of tools, which facilitate rapid visualization and assignment of gene structures.

  11. Transcriptome landscape of a bacterial pathogen under plant immunity.

    PubMed

    Nobori, Tatsuya; Velásquez, André C; Wu, Jingni; Kvitko, Brian H; Kremer, James M; Wang, Yiming; He, Sheng Yang; Tsuda, Kenichi

    2018-03-27

    Plant pathogens can cause serious diseases that impact global agriculture. The plant innate immunity, when fully activated, can halt pathogen growth in plants. Despite extensive studies into the molecular and genetic bases of plant immunity against pathogens, the influence of plant immunity in global pathogen metabolism to restrict pathogen growth is poorly understood. Here, we developed RNA sequencing pipelines for analyzing bacterial transcriptomes in planta and determined high-resolution transcriptome patterns of the foliar bacterial pathogen Pseudomonas syringae in Arabidopsis thaliana with a total of 27 combinations of plant immunity mutants and bacterial strains. Bacterial transcriptomes were analyzed at 6 h post infection to capture early effects of plant immunity on bacterial processes and to avoid secondary effects caused by different bacterial population densities in planta We identified specific "immune-responsive" bacterial genes and processes, including those that are activated in susceptible plants and suppressed by plant immune activation. Expression patterns of immune-responsive bacterial genes at the early time point were tightly linked to later bacterial growth levels in different host genotypes. Moreover, we found that a bacterial iron acquisition pathway is commonly suppressed by multiple plant immune-signaling pathways. Overexpression of a P. syringae sigma factor gene involved in iron regulation and other processes partially countered bacterial growth restriction during the plant immune response triggered by AvrRpt2. Collectively, this study defines the effects of plant immunity on the transcriptome of a bacterial pathogen and sheds light on the enigmatic mechanisms of bacterial growth inhibition during the plant immune response.

  12. Allele Identification for Transcriptome-Based Population Genomics in the Invasive Plant Centaurea solstitialis

    PubMed Central

    Dlugosch, Katrina M.; Lai, Zhao; Bonin, Aurélie; Hierro, José; Rieseberg, Loren H.

    2013-01-01

    Transcriptome sequences are becoming more broadly available for multiple individuals of the same species, providing opportunities to derive population genomic information from these datasets. Using the 454 Life Science Genome Sequencer FLX and FLX-Titanium next-generation platforms, we generated 11−430 Mbp of sequence for normalized cDNA for 40 wild genotypes of the invasive plant Centaurea solstitialis, yellow starthistle, from across its worldwide distribution. We examined the impact of sequencing effort on transcriptome recovery and overlap among individuals. To do this, we developed two novel publicly available software pipelines: SnoWhite for read cleaning before assembly, and AllelePipe for clustering of loci and allele identification in assembled datasets with or without a reference genome. AllelePipe is designed specifically for cases in which read depth information is not appropriate or available to assist with disentangling closely related paralogs from allelic variation, as in transcriptome or previously assembled libraries. We find that modest applications of sequencing effort recover most of the novel sequences present in the transcriptome of this species, including single-copy loci and a representative distribution of functional groups. In contrast, the coverage of variable sites, observation of heterozygosity, and overlap among different libraries are all highly dependent on sequencing effort. Nevertheless, the information gained from overlapping regions was informative regarding coarse population structure and variation across our small number of population samples, providing the first genetic evidence in support of hypothesized invasion scenarios. PMID:23390612

  13. De Novo Assembly of the Donkey White Blood Cell Transcriptome and a Comparative Analysis of Phenotype-Associated Genes between Donkeys and Horses

    PubMed Central

    Xie, Feng-Yun; Feng, Yu-Long; Wang, Hong-Hui; Ma, Yun-Feng; Yang, Yang; Wang, Yin-Chao; Shen, Wei; Pan, Qing-Jie; Yin, Shen; Sun, Yu-Jiang; Ma, Jun-Yu

    2015-01-01

    Prior to the mechanization of agriculture and labor-intensive tasks, humans used donkeys (Equus africanus asinus) for farm work and packing. However, as mechanization increased, donkeys have been increasingly raised for meat, milk, and fur in China. To maintain the development of the donkey industry, breeding programs should focus on traits related to these new uses. Compared to conventional marker-assisted breeding plans, genome- and transcriptome-based selection methods are more efficient and effective. To analyze the coding genes of the donkey genome, we assembled the transcriptome of donkey white blood cells de novo. Using transcriptomic deep-sequencing data, we identified 264,714 distinct donkey unigenes and predicted 38,949 protein fragments. We annotated the donkey unigenes by BLAST searches against the non-redundant (NR) protein database. We also compared the donkey protein sequences with those of the horse (E. caballus) and wild horse (E. przewalskii), and linked the donkey protein fragments with mammalian phenotypes. As the outer ear size of donkeys and horses are obviously different, we compared the outer ear size-associated proteins in donkeys and horses. We identified three ear size-associated proteins, HIC1, PRKRA, and KMT2A, with sequence differences among the donkey, horse, and wild horse loci. Since the donkey genome sequence has not been released, the de novo assembled donkey transcriptome is helpful for preliminary investigations of donkey cultivars and for genetic improvement. PMID:26208029

  14. De Novo Assembly of the Donkey White Blood Cell Transcriptome and a Comparative Analysis of Phenotype-Associated Genes between Donkeys and Horses.

    PubMed

    Xie, Feng-Yun; Feng, Yu-Long; Wang, Hong-Hui; Ma, Yun-Feng; Yang, Yang; Wang, Yin-Chao; Shen, Wei; Pan, Qing-Jie; Yin, Shen; Sun, Yu-Jiang; Ma, Jun-Yu

    2015-01-01

    Prior to the mechanization of agriculture and labor-intensive tasks, humans used donkeys (Equus africanus asinus) for farm work and packing. However, as mechanization increased, donkeys have been increasingly raised for meat, milk, and fur in China. To maintain the development of the donkey industry, breeding programs should focus on traits related to these new uses. Compared to conventional marker-assisted breeding plans, genome- and transcriptome-based selection methods are more efficient and effective. To analyze the coding genes of the donkey genome, we assembled the transcriptome of donkey white blood cells de novo. Using transcriptomic deep-sequencing data, we identified 264,714 distinct donkey unigenes and predicted 38,949 protein fragments. We annotated the donkey unigenes by BLAST searches against the non-redundant (NR) protein database. We also compared the donkey protein sequences with those of the horse (E. caballus) and wild horse (E. przewalskii), and linked the donkey protein fragments with mammalian phenotypes. As the outer ear size of donkeys and horses are obviously different, we compared the outer ear size-associated proteins in donkeys and horses. We identified three ear size-associated proteins, HIC1, PRKRA, and KMT2A, with sequence differences among the donkey, horse, and wild horse loci. Since the donkey genome sequence has not been released, the de novo assembled donkey transcriptome is helpful for preliminary investigations of donkey cultivars and for genetic improvement.

  15. Transcriptome sequencing reveals high isoform diversity in the ant Formica exsecta

    PubMed Central

    Paviala, Jenni; Morandin, Claire; Wheat, Christopher; Sundström, Liselotte; Helanterä, Heikki

    2017-01-01

    Transcriptome resources for social insects have the potential to provide new insight into polyphenism, i.e., how divergent phenotypes arise from the same genome. Here we present a transcriptome based on paired-end RNA sequencing data for the ant Formica exsecta (Formicidae, Hymenoptera). The RNA sequencing libraries were constructed from samples of several life stages of both sexes and female castes of queens and workers, in order to maximize representation of expressed genes. We first compare the performance of common assembly and scaffolding software (Trinity, Velvet-Oases, and SOAPdenovo-trans), in producing de novo assemblies. Second, we annotate the resulting expressed contigs to the currently published genomes of ants, and other insects, including the honeybee, to filter genes that have annotation evidence of being true genes. Our pipeline resulted in a final assembly of altogether 39,262 mRNA transcripts, with an average coverage of >300X, belonging to 17,496 unique genes with annotation in the related ant species. From these genes, 536 genes were unique to one caste or sex only, highlighting the importance of comprehensive sampling. Our final assembly also showed expression of several splice variants in 6,975 genes, and we show that accounting for splice variants affects the outcome of downstream analyses such as gene ontologies. Our transcriptome provides an outstanding resource for future genetic studies on F. exsecta and other ant species, and the presented transcriptome assembly can be adapted to any non-model species that has genomic resources available from a related taxon. PMID:29177112

  16. Discovery and Annotation of Plant Endogenous Target Mimicry Sequences from Public Transcriptome Libraries: A Case Study of Prunus persica.

    PubMed

    Karakülah, Gökhan

    2017-06-28

    Novel transcript discovery through RNA sequencing has substantially improved our understanding of the transcriptome dynamics of biological systems. Endogenous target mimicry (eTM) transcripts, a novel class of regulatory molecules, bind to their target microRNAs (miRNAs) by base pairing and block their biological activity. The objective of this study was to provide a computational analysis framework for the prediction of putative eTM sequences in plants, and as an example, to discover previously un-annotated eTMs in Prunus persica (peach) transcriptome. Therefore, two public peach transcriptome libraries downloaded from Sequence Read Archive (SRA) and a previously published set of long non-coding RNAs (lncRNAs) were investigated with multi-step analysis pipeline, and 44 putative eTMs were found. Additionally, an eTM-miRNA-mRNA regulatory network module associated with peach fruit organ development was built via integration of the miRNA target information and predicted eTM-miRNA interactions. My findings suggest that one of the most widely expressed miRNA families among diverse plant species, miR156, might be potentially sponged by seven putative eTMs. Besides, the study indicates eTMs potentially play roles in the regulation of development processes in peach fruit via targeting specific miRNAs. In conclusion, by following the step-by step instructions provided in this study, novel eTMs can be identified and annotated effectively in public plant transcriptome libraries.

  17. Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria

    PubMed Central

    Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R.; Voß, Björn

    2015-01-01

    In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5′UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5′UTR. Such an sRNA/mRNA structure, which we name ‘actuaton’, represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation. PMID:25902393

  18. Detailed Transcriptome Description of the Neglected Cestode Taenia multiceps

    PubMed Central

    Wu, Xuhang; Fu, Yan; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Hao, Guiying; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    Background The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. Methodology/Principal Findings We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a referenced genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echincoccus granulosus and Echincoccus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. Conclusions/Significance This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of the biology of T. multiceps, and helps in the identification of drug targets and parasite-host interaction studies. PMID:23049872

  19. Exploring Triacylglycerol Biosynthetic Pathway in Developing Seeds of Chia (Salvia hispanica L.): A Transcriptomic Approach

    PubMed Central

    Rupwate, Sunny D.; Rajasekharan, Ram; Srinivasan, Malathi

    2015-01-01

    Chia (Salvia hispanica L.), a member of the mint family (Lamiaceae), is a rediscovered crop with great importance in health and nutrition and is also the highest known terrestrial plant source of heart-healthy omega-3 fatty acid, alpha linolenic acid (ALA). At present, there is no public genomic information or database available for this crop, hindering research on its genetic improvement through genomics-assisted breeding programs. The first comprehensive analysis of the global transcriptome profile of developing Salvia hispanica L. seeds, with special reference to lipid biosynthesis is presented in this study. RNA from five different stages of seed development was extracted and sequenced separately using the Illumina GAIIx platform. De novo assembly of processed reads in the pooled transcriptome using Trinity yielded 76,014 transcripts. The total transcript length was 66,944,462 bases (66.9 Mb), with an average length of approximately 880 bases. In the molecular functions category of Gene Ontology (GO) terms, ATP binding and nucleotide binding were found to be the most abundant and in the biological processes category, the metabolic process and the regulation of transcription-DNA-dependent and oxidation-reduction process were abundant. From the EuKaryotic Orthologous Groups of proteins (KOG) classification, the major category was “Metabolism” (31.97%), of which the most prominent class was ‘carbohydrate metabolism and transport’ (5.81% of total KOG classifications) followed by ‘secondary metabolite biosynthesis transport and catabolism’ (5.34%) and ‘lipid metabolism’ (4.57%). A majority of the candidate genes involved in lipid biosynthesis and oil accumulation were identified. Furthermore, 5596 simple sequence repeats (SSRs) were identified. The transcriptome data was further validated through confirmative PCR and qRT-PCR for select lipid genes. Our study provides insight into the complex transcriptome and will contribute to further genome-wide research and understanding of chia. The identified novel UniGenes will facilitate gene discovery and creation of genomic resource for this crop. PMID:25875809

  20. RNA-Seq effectively monitors gene expression in Eutrema salsugineum plants growing in an extreme natural habitat and in controlled growth cabinet conditions

    PubMed Central

    2013-01-01

    Background The investigation of extremophile plant species growing in their natural environment offers certain advantages, chiefly that plants adapted to severe habitats have a repertoire of stress tolerance genes that are regulated to maximize plant performance under physiologically challenging conditions. Accordingly, transcriptome sequencing offers a powerful approach to address questions concerning the influence of natural habitat on the physiology of an organism. We used RNA sequencing of Eutrema salsugineum, an extremophile relative of Arabidopsis thaliana, to investigate the extent to which genetic variation and controlled versus natural environments contribute to differences between transcript profiles. Results Using 10 million cDNA reads, we compared transcriptomes from two natural Eutrema accessions (originating from Yukon Territory, Canada and Shandong Province, China) grown under controlled conditions in cabinets and those from Yukon plants collected at a Yukon field site. We assessed the genetic heterogeneity between individuals using single-nucleotide polymorphisms (SNPs) and the expression patterns of 27,016 genes. Over 39,000 SNPs distinguish the Yukon from the Shandong accessions but only 4,475 SNPs differentiated transcriptomes of Yukon field plants from an inbred Yukon line. We found 2,989 genes that were differentially expressed between the three sample groups and multivariate statistical analyses showed that transcriptomes of individual plants from a Yukon field site were as reproducible as those from inbred plants grown under controlled conditions. Predicted functions based upon gene ontology classifications show that the transcriptomes of field plants were enriched by the differential expression of light- and stress-related genes, an observation consistent with the habitat where the plants were found. Conclusion Our expectation that comparative RNA-Seq analysis of transcriptomes from plants originating in natural habitats would be confounded by uncontrolled genetic and environmental factors was not borne out. Moreover, the transcriptome data shows little genetic variation between laboratory Yukon Eutrema plants and those found at a field site. Transcriptomes were reproducible and biological associations meaningful whether plants were grown in cabinets or found in the field. Thus RNA-Seq is a valuable approach to study native plants in natural environments and this technology can be exploited to discover new gene targets for improved crop performance under adverse conditions. PMID:23984645

Top