Sun, Yepeng; Wang, Fawei; Wang, Nan; Dong, Yuanyuan; Liu, Qi; Zhao, Lei; Chen, Huan; Liu, Weican; Yin, Hailong; Zhang, Xiaomei; Yuan, Yanxi; Li, Haiyan
2013-01-01
Background Leymus chinensis (Trin.) Tzvel. is a highly saline-alkaline-tolerant forage grass of the family Gramineae that also plays an important role in protecting the natural environment. To date, little is known about the saline-alkaline tolerance of L. chinensis at the molecular level. To better understand the molecular mechanism of saline-alkaline tolerance in L. chinensis, 454 pyrosequencing was used for the transcriptome study. Results We used Roche-454 massively parallel pyrosequencing technology to sequence two cDNA libraries built from a control sample and a sample under saline-alkaline treatment (optimal stress concentration: Hoagland solution with 100 mM NaCl and 200 mM NaHCO3). A total of 363,734 reads in the control group and 526,267 reads in the treatment group, with average lengths of 489 bp and 493 bp, respectively, were obtained. The reads were assembled into 104,105 unigenes with the MIRA sequence assembly software, of which 73,665 unigenes were in the control group, 88,016 in the treatment group and 57,576 in both groups. According to the comparative expression analysis between the two groups with the threshold of “log2 Ratio ≥1”, 36,497 up-regulated unigenes and 18,218 down-regulated unigenes were predicted to be differentially expressed genes. After gene annotation and pathway enrichment analysis, most of them were found to be involved in stress and tolerance functions, signal transduction, energy production and conversion, and inorganic ion transport. Furthermore, 16 of these differentially expressed genes were selected for real-time PCR validation, and the results were consistent with those of 454 pyrosequencing. Conclusions This work is the first to study the transcriptome of L. chinensis under saline-alkaline treatment based on the 454-FLX massively parallel DNA sequencing platform. It also deepens the study of the molecular mechanisms of saline-alkaline tolerance in L. chinensis and constitutes a database for future studies. PMID:23365637
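The “log2 Ratio ≥1” threshold mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not say how expression values were normalized or whether a pseudocount was used, so the function name, the pseudocount and the symmetric treatment of down-regulation (log2 ratio ≤ -1) are all assumptions made for illustration, not the authors' pipeline.

```python
import math

def classify_by_log2_ratio(control, treatment, threshold=1.0, pseudocount=1.0):
    """Label each unigene 'up', 'down' or 'unchanged' from two expression values.

    control / treatment: dicts mapping unigene id -> normalized expression.
    The pseudocount (an assumption) avoids division by zero for absent unigenes.
    """
    labels = {}
    for gene in control.keys() & treatment.keys():
        ratio = math.log2((treatment[gene] + pseudocount) / (control[gene] + pseudocount))
        if ratio >= threshold:
            labels[gene] = "up"
        elif ratio <= -threshold:
            labels[gene] = "down"
        else:
            labels[gene] = "unchanged"
    return labels

# Hypothetical expression values, not data from the study.
control = {"u1": 9.0, "u2": 39.0, "u3": 10.0}
treatment = {"u1": 39.0, "u2": 9.0, "u3": 12.0}
labels = classify_by_log2_ratio(control, treatment)
```

With these toy values, a fourfold increase gives a log2 ratio of 2 and a fourfold decrease gives -2, so both pass the threshold, while a small change does not.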
De Novo Transcriptome of the Hemimetabolous German Cockroach (Blattella germanica)
Zhou, Xiaojie; Qian, Kun; Tong, Ying; Zhu, Junwei Jerry; Qiu, Xinghui; Zeng, Xiaopeng
2014-01-01
Background The German cockroach, Blattella germanica, is an important insect pest that transmits various pathogens mechanically and causes severe allergic diseases. This insect has long served as a model system for studies of insect biology, physiology and ecology. However, the lack of genome or transcriptome information heavily hinders further understanding of the German cockroach at the molecular level and on a genome-wide scale. To explore the transcriptome and identify unique sequences of interest, we subjected the B. germanica transcriptome to massively parallel pyrosequencing and generated the first reference transcriptome for B. germanica. Methodology/Principal Findings A total of 1,365,609 raw reads with an average length of 529 bp were generated by pyrosequencing a mixed cDNA library from different life stages of the German cockroach, including maturing oothecae, nymphs, adult females and males. The raw reads were de novo assembled into 48,800 contigs and 3,961 singletons with high-quality unique sequences. These sequences were annotated and classified functionally using BLAST, GO and KEGG, and genes putatively coding for detoxification enzyme systems, insecticide targets, key components in systematic RNA interference, and immunity and chemoreception pathways were identified. A total of 3,601 SSR (simple sequence repeat) loci were also predicted. Conclusions/Significance The whole-transcriptome pyrosequencing data from this study provide a usable genetic resource for future identification of potential functional genes involved in various biological processes. PMID:25265537
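As an illustration of what predicting SSR loci involves, the sketch below scans a sequence for perfect tandem repeats with a regular expression. The abstract does not name the SSR tool or its criteria, so the function names and the minimum repeat counts (6 for dinucleotides, 5 for trinucleotides, 4 for tetranucleotides) are assumptions chosen only for the example.

```python
import re

def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. 'ACAC' or 'AA')."""
    n = len(motif)
    return not any(n % d == 0 and motif == motif[:d] * (n // d) for d in range(1, n))

def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) for perfect tandem repeats in seq."""
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4}  # assumed thresholds, not from the study
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if not is_primitive(motif):
                continue  # skip homopolymers and repeats of shorter units
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return hits

# Hypothetical sequence: an (AC)6 repeat and a (GAT)5 repeat.
example = "GG" + "AC" * 6 + "CC" + "GAT" * 5 + "C"
ssrs = find_ssrs(example)
```

Dedicated SSR finders also handle imperfect and compound repeats; this sketch covers only perfect repeats.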
USDA-ARS's Scientific Manuscript database
Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptom...
Transcriptome assembly and digital gene expression atlas of the rainbow trout
USDA-ARS's Scientific Manuscript database
Background: Transcriptome analysis is a preferred method for gene discovery, marker development and gene expression profiling in non-model organisms. Previously, we sequenced a transcriptome reference using Sanger-based and 454-pyrosequencing, however, a transcriptome assembly is still incomplete an...
Global characterization of Artemisia annua glandular trichome transcriptome using 454 pyrosequencing
Wang, Wei; Wang, Yejun; Zhang, Qing; Qi, Yan; Guo, Dianjing
2009-01-01
Background Glandular trichomes produce a wide variety of commercially important secondary metabolites in many plant species. The most prominent anti-malarial drug artemisinin, a sesquiterpene lactone, is produced in glandular trichomes of Artemisia annua. However, only limited genomic information is currently available in this non-model plant species. Results We present a global characterization of A. annua glandular trichome transcriptome using 454 pyrosequencing. Sequencing runs using two normalized cDNA collections from glandular trichomes yielded 406,044 expressed sequence tags (average length = 210 nucleotides), which assembled into 42,678 contigs and 147,699 singletons. Performing a second sequencing run only increased the number of genes identified by ~30%, indicating that massively parallel pyrosequencing provides deep coverage of the A. annua trichome transcriptome. By BLAST search against the NCBI non-redundant protein database, putative functions were assigned to over 28,573 unigenes, including previously undescribed enzymes likely involved in sesquiterpene biosynthesis. Comparison with ESTs derived from trichome collections of other plant species revealed expressed genes in common functional categories across different plant species. RT-PCR analysis confirmed the expression of selected unigenes and novel transcripts in A. annua glandular trichomes. Conclusion The presence of contigs corresponding to enzymes for terpenoids and flavonoids biosynthesis suggests important metabolic activity in A. annua glandular trichomes. Our comprehensive survey of genes expressed in glandular trichome will facilitate new gene discovery and shed light on the regulatory mechanism of artemisinin metabolism and trichome function in A. annua. PMID:19818120
Hahn, Daniel A; Ragland, Gregory J; Shoemaker, D DeWayne; Denlinger, David L
2009-01-01
Background Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptomic data are available for members of this genus. We used massively parallel pyrosequencing on the Roche 454-FLX platform to produce a substantial EST dataset for the flesh fly Sarcophaga crassipalpis. To maximize sequence diversity, we pooled RNA extracted from whole bodies of all life stages and normalized the cDNA pool after reverse transcription. Results We obtained 207,110 ESTs with an average read length of 241 bp. These reads assembled into 20,995 contigs and 31,056 singletons. Using BLAST searches of the NR and NT databases we were able to identify 11,757 unique gene elements (E<0.0001) representing approximately 9,000 independent transcripts. Comparison of the distribution of S. crassipalpis unigenes among GO Biological Process functional groups with that of the Drosophila melanogaster transcriptome suggests that our ESTs are broadly representative of the flesh fly transcriptome. Insertion and deletion errors in 454 sequencing present a serious hurdle to comparative transcriptome analysis. Aided by a new approach to correcting for these errors, we performed a comparative analysis of genetic divergence across GO categories among S. crassipalpis, D. melanogaster, and Anopheles gambiae. The results suggest that non-synonymous substitutions occur at similar rates across categories, although genes related to response to stimuli may evolve slightly faster. In addition, we identified over 500 potential microsatellite loci and more than 12,000 SNPs among our ESTs. Conclusion Our data provide the first large-scale EST project for flesh flies, a much-needed resource for exploring this model species. In addition, we identified a large number of potential microsatellite and SNP markers that could be used in population and systematic studies of S. crassipalpis and other flesh flies. PMID:19454017
Wu, Qing-jun; Wang, Shao-li; Yang, Xin; Yang, Ni-na; Li, Ru-mei; Jiao, Xiao-guo; Pan, Hui-peng; Liu, Bai-ming; Su, Qi; Xu, Bao-yun; Hu, Song-nian; Zhou, Xu-guo; Zhang, You-jun
2012-01-01
Background Bemisia tabaci (Gennadius) is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. Methodology and Principal Findings Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes, including 30,980 from insects, 17,881 from bacteria, and 129,808 with no hits. A total of 50,835 (28.45%) unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10^-5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis identified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. Conclusions This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the B. tabaci complex. Moreover, the current pyrosequencing effort greatly enriches the existing whitefly EST database and makes RNAseq a viable option for future genomic analysis. PMID:22558125
Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids
2011-01-01
Background Orchids are one of the most diversified angiosperms, but few genomic resources are available for these non-model plants. In addition to the ecological significance, Phalaenopsis has been considered as an economically important floriculture industry worldwide. We aimed to use massively parallel 454 pyrosequencing for a global characterization of the Phalaenopsis transcriptome. Results To maximize sequence diversity, we pooled RNA from 10 samples of different tissues, various developmental stages, and biotic- or abiotic-stressed plants. We obtained 206,960 expressed sequence tags (ESTs) with an average read length of 228 bp. These reads were assembled into 8,233 contigs and 34,630 singletons. The unigenes were searched against the NCBI non-redundant (NR) protein database. Based on sequence similarity with known proteins, these analyses identified 22,234 different genes (E-value cutoff, e-7). Assembled sequences were annotated with Gene Ontology, Gene Family and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Among these annotations, over 780 unigenes encoding putative transcription factors were identified. Conclusion Pyrosequencing was effective in identifying a large set of unigenes from Phalaenopsis. The informative EST dataset we developed constitutes a much-needed resource for discovery of genes involved in various biological processes in Phalaenopsis and other orchid species. These transcribed sequences will narrow the gap between study of model organisms with many genomic resources and species that are important for ecological and evolutionary studies. PMID:21749684
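The annotation step described above (similarity search against the NR protein database with an E-value cutoff) is commonly post-processed from BLAST tabular output. The sketch below assumes the standard 12-column tabular format (`-outfmt 6`: qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore) and reads the abstract's "e-7" as 1e-7. The function name and the sample rows are hypothetical, and this is not the authors' actual pipeline.

```python
import csv
import io

def best_hits(tabular_text, max_evalue=1e-7):
    """Keep, for each query, the hit with the lowest E-value at or below max_evalue."""
    best = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        query, subject, evalue = row[0], row[1], float(row[10])
        if evalue > max_evalue:
            continue
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return best

# Hypothetical rows in 12-column BLAST tabular layout.
rows = [
    ["u1", "P1", "90.0", "100", "10", "0", "1", "100", "1", "100", "1e-30", "200"],
    ["u1", "P2", "80.0", "100", "20", "0", "1", "100", "1", "100", "1e-10", "120"],
    ["u2", "P3", "40.0", "50", "30", "0", "1", "50", "1", "50", "0.001", "35"],
    ["u3", "P1", "95.0", "120", "6", "0", "1", "120", "1", "120", "2e-45", "260"],
]
sample = "\n".join("\t".join(r) for r in rows)
hits = best_hits(sample)
distinct_proteins = {subject for subject, _ in hits.values()}
```

Counting distinct best-hit subjects is one way to estimate how many different known proteins a unigene set matches; several unigenes may map to the same protein, as `u1` and `u3` do here.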
Zagrobelny, Mika; Scheibye-Alsing, Karsten; Jensen, Niels Bjerg; Møller, Birger Lindberg; Gorodkin, Jan; Bak, Søren
2009-12-02
An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up new niches with fewer competitors. To study the molecular mechanism behind the co-adaptation in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well-known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases) were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates supports the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being convergent between plants and insects. Pyrosequencing is an attractive approach to gain access to genes in the biosynthesis of bioactive natural products from insects and other organisms for which the genome sequence is not known. Based on analysis of the Z. filipendulae transcriptome, promising gene candidates for biosynthesis of cyanogenic glucosides were identified, and the suitability of Z. filipendulae as a model system for cyanogenesis in insects is evident.
Valles, Steven M.; Oi, David H.; Yu, Fahong; Tan, Xin-Xing; Buss, Eileen A.
2012-01-01
Background Nylanderia pubens (Forel) is an invasive ant species that in recent years has developed into a serious nuisance problem in the Caribbean and United States. A rapidly expanding range, explosive localized population growth, and control difficulties have elevated this ant to pest status. Professional entomologists and the pest control industry in the United States are urgently trying to understand its biology and develop effective control methods. Currently, no known biological-based control agents are available for use in controlling N. pubens. Methodology and Principal Findings Metagenomics and pyrosequencing techniques were employed to examine the transcriptome of field-collected N. pubens colonies in an effort to identify virus infections with potential to serve as control agents against this pest ant. Pyrosequencing (454-platform) of a non-normalized N. pubens expression library generated 1,306,177 raw sequence reads comprising 450 Mbp. Assembly resulted in generation of 59,017 non-redundant sequences, including 27,348 contigs and 31,669 singlets. BLAST analysis of these non-redundant sequences identified 51 of potential viral origin. Additional analyses winnowed this list of potential viruses to three that appear to replicate in N. pubens. Conclusions Pyrosequencing the transcriptome of field-collected samples of N. pubens has identified at least three sequences that are likely of viral origin and, in which, N. pubens serves as host. In addition, the N. pubens transcriptome provides a genetic resource for the scientific community which is especially important at this early stage of developing a knowledgebase for this new pest. PMID:22384082
Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout
USDA-ARS's Scientific Manuscript database
Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, we reported a transcriptome reference sequence using a 19X coverage of Sanger and 454-pyrosequencing dat...
New in-depth rainbow trout transcriptome reference and digital atlas of gene expression
USDA-ARS's Scientific Manuscript database
Sequencing the rainbow trout genome is underway and a transcriptome reference sequence is required to help in genome assembly and gene discovery. Previously, we reported a transcriptome reference sequence using a 19X coverage of 454-pyrosequencing data. Although this work added a great wealth of ann...
Pauchet, Y; Wilkinson, P; Vogel, H; Nelson, D R; Reynolds, S E; Heckel, D G; ffrench-Constant, R H
2010-02-01
The tobacco hornworm Manduca sexta is an important model for insect physiology but genomic and transcriptomic data are currently lacking. Following a recent pyrosequencing study generating immune-related expressed sequence tags (ESTs), here we use this new technology to define the M. sexta larval midgut transcriptome. We generated over 387,000 midgut ESTs, using a combination of Sanger and 454 sequencing, and classified predicted proteins into those involved in digestion, detoxification and immunity. In many cases the depth of 454 pyrosequencing coverage allowed us to define the entire cDNA sequence of a particular gene. Many new M. sexta genes are described including up to 36 new cytochrome P450s, some of which have been implicated in the metabolism of host plant-derived nicotine. New lepidopteran gene families such as the beta-fructofuranosidases, previously thought to be restricted to Bombyx mori, are also described. An unexpectedly high number of ESTs were involved in immunity, for example 39 contigs encoding serpins, and the increasingly appreciated role of the midgut in insect immunity is discussed. Similar studies of other tissues will allow for a tissue-by-tissue description of the M. sexta transcriptome and will form an essential complementary step on the road to genome sequencing and annotation.
Urbarova, Ilona; Karlsen, Bård Ove; Okkenhaug, Siri; Seternes, Ole Morten; Johansen, Steinar D.; Emblem, Åse
2012-01-01
Marine bioprospecting is the search for new marine bioactive compounds and large-scale screening in extracts represents the traditional approach. Here, we report an alternative complementary protocol, called digital marine bioprospecting, based on deep sequencing of transcriptomes. We sequenced the transcriptomes from the adult polyp stage of two cold-water sea anemones, Bolocera tuediae and Hormathia digitata. We generated approximately 1.1 million quality-filtered sequencing reads by 454 pyrosequencing, which were assembled into approximately 120,000 contigs and 220,000 single reads. Based on annotation and gene ontology analysis we profiled the expressed mRNA transcripts according to known biological processes. As a proof-of-concept we identified polypeptide toxins with a potential blocking activity on sodium and potassium voltage-gated channels from digital transcriptome libraries. PMID:23170083
Valencia, Arnubio; Wang, Haichuan; Soto, Alberto; Aristizabal, Manuel; Arboleda, Jorge W.; Eyun, Seong-il; Noriega, Daniel D.; Siegfried, Blair
2016-01-01
The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest. PMID:26949943
2010-01-01
Background The small brown planthopper (Laodelphax striatellus) is an important agricultural pest that not only damages rice plants by sap-sucking, but also acts as a vector that transmits rice stripe virus (RSV), which can cause even more serious yield loss. Although it is a model organism for studying entomology, population biology, plant protection, and molecular interactions among plants, viruses and insects, only a few genomic sequences are available for this species. To investigate its transcriptome and determine the differences between viruliferous and naïve L. striatellus, we employed 454-FLX high-throughput pyrosequencing to generate EST databases of this insect. Results We obtained 201,281 and 218,681 high-quality reads from viruliferous and naïve L. striatellus, respectively, with an average read length of 230 bp. These reads were assembled into contigs and two EST databases were generated. When all reads were combined, 16,885 contigs and 24,607 singletons (a total of 41,492 unigenes) were obtained, which represents a transcriptome of the insect. BlastX search against the NCBI-NR database revealed that only 6,873 (16.6%) of these unigenes have significant matches. Comparison of the distribution of GO classification among viruliferous, naïve, and combined EST databases indicated that these libraries are broadly representative of the L. striatellus transcriptomes. Functionally diverse transcripts from RSV, endosymbiotic bacteria Wolbachia and yeast-like symbiotes were identified, which reflects the possible lifestyles of these microbial symbionts that live in the cells of the host insect. Comparative genomic analysis revealed that L. striatellus encodes innate immunity regulatory systems similar to those of other insects, such as RNA interference, JAK/STAT and partial Imd cascades, which might be involved in defense against viral infection. In addition, we determined the differences in gene expression between vector and naïve samples, which generated a list of candidate genes that are potentially involved in the symbiosis of L. striatellus and RSV. Conclusions To our knowledge, the present study is the first description of a genomic project for L. striatellus. The identification of transcripts from RSV, Wolbachia, yeast-like symbiotes and genes abundantly expressed in viruliferous insects provides a starting point for investigating the molecular basis of symbiosis among these organisms. PMID:20462456
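The unigene figures reported in this abstract can be cross-checked with simple arithmetic. The short sketch below only recomputes the stated totals (contigs plus singletons, and the share of unigenes with significant BlastX matches); the variable names are illustrative.

```python
# Figures quoted in the abstract above.
contigs, singletons = 16_885, 24_607
matched = 6_873

unigenes = contigs + singletons                      # contigs + singletons
percent_matched = round(100 * matched / unigenes, 1)  # share with NR matches

print(unigenes, percent_matched)
```

The recomputed values agree with the abstract's 41,492 unigenes and 16.6% matched.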
Garcia-Reyero, Natàlia; Griffitt, Robert J.; Liu, Li; Kroll, Kevin J.; Farmerie, William G.; Barber, David S.; Denslow, Nancy D.
2009-01-01
A novel custom microarray for largemouth bass (Micropterus salmoides) was designed with sequences obtained from a normalized cDNA library using the 454 Life Sciences GS-20 pyrosequencer. This approach yielded in excess of 58 million bases of high-quality sequence. The sequence information was combined with 2,616 reads obtained by traditional suppressive subtractive hybridizations to derive a total of 31,391 unique sequences. Annotation and coding sequences were predicted for these transcripts where possible. 16,350 annotated transcripts were selected as target sequences for the design of the custom largemouth bass oligonucleotide microarray. The microarray was validated by examining the transcriptomic response in male largemouth bass exposed to 17β-œstradiol. Transcriptomic responses were assessed in liver and gonad, and indicated gene expression profiles typical of exposure to œstradiol. The results demonstrate the potential to rapidly create the tools necessary to assess large scale transcriptional responses in non-model species, paving the way for expanded impact of toxicogenomics in ecotoxicology. PMID:19936325
Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development
Alagna, Fiammetta; D'Agostino, Nunzio; Torchia, Laura; Servili, Maurizio; Rao, Rosa; Pietrella, Marco; Giuliano, Giovanni; Chiusano, Maria Luisa; Baldoni, Luciana; Perrotta, Gaetano
2009-01-01
Background Despite its primary economic importance, genomic information on olive tree is still lacking. 454 pyrosequencing was used to enrich the very few sequence data currently available for the Olea europaea species and to identify genes involved in expression of fruit quality traits. Results Fruits of Coratina, a widely cultivated variety characterized by a very high phenolic content, and Tendellone, an oleuropein-lacking natural variant, were used as starting material for monitoring the transcriptome. Four different cDNA libraries were sequenced, respectively at the beginning and at the end of drupe development. A total of 261,485 reads were obtained, for an output of about 58 Mb. Raw sequence data were processed using a four step pipeline procedure and data were stored in a relational database with a web interface. Conclusion Massively parallel sequencing of different fruit cDNA collections has provided large scale information about the structure and putative function of gene transcripts accumulated during fruit development. Comparative transcript profiling allowed the identification of differentially expressed genes with potential relevance in regulating the fruit metabolism and phenolic content during ripening. PMID:19709400
Lee, Jungeun; Noh, Eun Kyeung; Choi, Hyung-Seok; Shin, Seung Chul; Park, Hyun; Lee, Hyoungseok
2013-03-01
Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been studied as an extremophile that has successfully adapted to marginal land with the harshest environment for terrestrial plants. However, limited genetic research has focused on this species due to the lack of genomic resources. Here, we present the first de novo assembly of its transcriptome by massive parallel sequencing and its expression profile using D. antarctica grown under various stress conditions. Total sequence reads generated by pyrosequencing were assembled into 60,765 unigenes (28,177 contigs and 32,588 singletons). A total of 29,173 unique protein-coding genes were identified based on sequence similarities to known proteins. The combined results from all three stress conditions indicated differential expression of 3,110 genes. Quantitative reverse transcription polymerase chain reaction showed that several well-known stress-responsive genes encoding late embryogenesis abundant protein, dehydrin 1, and ice recrystallization inhibition protein were induced dramatically and that genes encoding U-box-domain-containing protein, electron transfer flavoprotein-ubiquinone, and F-box-containing protein were induced by abiotic stressors in a manner conserved with other plant species. We identified more than 2,000 simple sequence repeats that can be developed as functional molecular markers. This dataset is the most comprehensive transcriptome resource currently available for D. antarctica and is therefore expected to be an important foundation for future genetic studies of grasses and extremophiles.
USDA-ARS's Scientific Manuscript database
The woody resurrection plant Myrothamnus flabellifolia has remarkable tolerance to desiccation. Pyro-sequencing technology permitted us to analyze the transcriptome of M. flabellifolia during both dehydration and rehydration. We identified a total of 8287 and 8542 differentially transcribed genes du...
Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing
2011-01-01
Background A long-term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of a genome sequence, transcriptomes also represent valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis) of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5,165,220 nucleotides (8.27%) were masked by RepeatMasker, the vast majority of which corresponded to class I (retroelements) and class II (DNA transposons) mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large divergence between A. mexicanus and A. picadoi, and a closer kinship between A. mexicanus and C. godmani. Conclusions Our comparative next-generation sequencing (NGS) analysis reveals taxon-specific trends governing the formulation of the venom arsenal. Knowledge of the venom proteome provides hints on the translation efficiency of toxin-coding transcripts, contributing thereby to a more accurate interpretation of the transcriptome. The application of NGS to the analysis of snake venom transcriptomes may represent the tool for opening the door to systems venomics. PMID:21605378
Comparing de novo assemblers for 454 transcriptome data
2010-01-01
Background Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcriptome assembly projects use only one program for assembling 454 pyrosequencing reads, but there is no evidence that the programs used to date are optimal. We have carried out a systematic comparison of five assemblers (CAP3, MIRA, Newbler, SeqMan and CLC) to establish best practices for transcriptome assemblies, using a new dataset from the parasitic nematode Litomosoides sigmodontis. Results Although no single assembler performed best on all our criteria, Newbler 2.5 gave longer contigs, better alignments to some reference sequences, and was fast and easy to use. SeqMan assemblies performed best on the criterion of recapitulating known transcripts, and had more novel sequence than the other assemblers, but generated an excess of small, redundant contigs. The remaining assemblers all performed almost as well, with the exception of Newbler 2.3 (the version currently used by most assembly projects), which generated assemblies that had significantly lower total length. As different assemblers use different underlying algorithms to generate contigs, we also explored merging of assemblies and found that the merged datasets not only aligned better to reference sequences than individual assemblies, but were also more consistent in the number and size of contigs. Conclusions Transcriptome assemblies are smaller than genome assemblies and thus should be more computationally tractable, but are often harder because individual contigs can have highly variable read coverage. Comparing single assemblers, Newbler 2.5 performed best on our trial data set, but other assemblers were closely comparable. 
Combining differently optimal assemblies from different programs, however, gave a more credible final product, and this strategy is recommended. PMID:20950480
Pereiro, Patricia; Balseiro, Pablo; Romero, Alejandro; Dios, Sonia; Forn-Cuni, Gabriel; Fuste, Berta; Planas, Josep V.; Beltran, Sergi; Novoa, Beatriz; Figueras, Antonio
2012-01-01
Background Turbot (Scophthalmus maximus L.) is an important aquacultural resource both in Europe and Asia. However, there is little information on gene sequences available in public databases. Currently, one of the main problems affecting the culture of this flatfish is mortality due to several pathogens, especially viral diseases which are not treatable. In order to identify new genes involved in immune defense, we conducted 454-pyrosequencing of the turbot transcriptome after different immune stimulations. Methodology/Principal Findings Turbot were injected with viral stimuli to increase the expression level of immune-related genes. High-throughput deep sequencing using 454-pyrosequencing technology yielded 915,256 high-quality reads. These sequences were assembled into 55,404 contigs that were subjected to annotation steps. Intriguingly, 55.16% of the deduced proteins were not significantly similar to any sequences in the databases used for the annotation and only 0.85% of the BLASTx top-hits matched S. maximus protein sequences. This relatively low level of annotation is possibly due to the limited information for this species and other flatfish in the database. These results suggest the identification of a large number of new genes in turbot and in fish in general. A more detailed analysis showed the presence of putative members of several innate and specific immune pathways. Conclusions/Significance To our knowledge, this study is the first transcriptome analysis using 454-pyrosequencing for turbot. Previously, there were only 12,471 ESTs and fewer than 1,500 nucleotide sequences for S. maximus in the NCBI database. Our results provide a rich source of data (55,404 contigs and 181,845 singletons) for discovering and identifying new genes, which will serve as a basis for microarray construction, gene expression characterization and for identification of genetic markers to be used in several applications. 
Immune stimulation in turbot was very effective, obtaining an enormous variety of sequences belonging to genes involved in the defense mechanisms. PMID:22629298
2009-01-01
Background The full power of modern genetics has been applied to the study of speciation in only a small handful of genetic model species - all of which speciated allopatrically. Here we report the first large expressed sequence tag (EST) study of a candidate for ecological sympatric speciation, the apple maggot Rhagoletis pomonella, using massively parallel pyrosequencing on the Roche 454-FLX platform. To maximize transcript diversity we created and sequenced separate libraries from larvae, pupae, adult heads, and headless adult bodies. Results We obtained 239,531 sequences which assembled into 24,373 contigs. A total of 6810 unique protein coding genes were identified among the contigs and long singletons, corresponding to 48% of all known Drosophila melanogaster protein-coding genes. Their distribution across GO classes suggests that we have obtained a representative sample of the transcriptome. Among these sequences are many candidates for potential R. pomonella "speciation genes" (or "barrier genes") such as those controlling chemosensory and life-history timing processes. Furthermore, we identified important marker loci including more than 40,000 single nucleotide polymorphisms (SNPs) and over 100 microsatellites. An initial search for SNPs at which the apple and hawthorn host races differ suggested at least 75 loci warranting further work. We also determined that developmental expression differences remained even after normalization; transcripts expected to show different expression levels between larvae and pupae in D. melanogaster also did so in R. pomonella. Preliminary comparative analysis of transcript presences and absences revealed evidence of gene loss in Drosophila and gain in the higher dipteran clade Schizophora. Conclusions These data provide a much needed resource for exploring mechanisms of divergence in this important model for sympatric ecological speciation. Our description of ESTs from a substantial portion of the R. 
pomonella transcriptome will facilitate future functional studies of candidate genes for olfaction and diapause-related life history timing, and will enable large scale expression studies. Similarly, the identification of new SNP and microsatellite markers will facilitate future population and quantitative genetic studies of divergence between the apple and hawthorn-infesting host races. PMID:20035631
Microbial metatranscriptomics in a permanent marine oxygen minimum zone.
Stewart, Frank J; Ulloa, Osvaldo; DeLong, Edward F
2012-01-01
Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycline and into the upper OMZ. Shotgun pyrosequencing of cDNA yielded 180,000 to 550,000 transcript sequences per depth. Based on functional gene representation, transcriptome samples clustered apart from corresponding metagenome samples from the same depth, highlighting the discrepancies between metabolic potential and actual transcription. BLAST-based characterizations of non-ribosomal RNA sequences revealed a dominance of genes involved with both oxidative (nitrification) and reductive (anammox, denitrification) components of the marine nitrogen cycle. Using annotations of protein-coding genes as proxies for taxonomic affiliation, we observed depth-specific changes in gene expression by key functional taxonomic groups. Notably, transcripts most closely matching the genome of the ammonia-oxidizing archaeon Nitrosopumilus maritimus dominated the transcriptome in the upper three depths, representing one in five protein-coding transcripts at 85 m. In contrast, transcripts matching the anammox bacterium Kuenenia stuttgartiensis dominated at the core of the OMZ (200 m; 1 in 12 protein-coding transcripts). The distribution of N. maritimus-like transcripts paralleled that of transcripts matching ammonia monooxygenase genes, which, despite being represented by both bacterial and archaeal sequences in the community DNA, were dominated (> 99%) by archaeal sequences in the RNA, suggesting a substantial role for archaeal nitrification in the upper OMZ. 
These data, as well as those describing other key OMZ metabolic processes (e.g. sulfur oxidation), highlight gene-specific expression patterns in the context of the entire community transcriptome, as well as identify key functional groups for taxon-specific genomic profiling. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Characterization of the rainbow trout transcriptome using Sanger and 454-Pyrosequencing approaches
USDA-ARS's Scientific Manuscript database
BACKGROUND: Rainbow trout is an important fish species for aquaculture and a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence to facilitate the development...
Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches
USDA-ARS's Scientific Manuscript database
Background: Rainbow trout is an important fish for aquaculture and recreational fisheries and serves as a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and the evolutionary biology. However, to date there is no genome reference sequence...
Sato, Shin; Feltus, F Alex; Iyer, Prashanti; Tien, Ming
2009-06-01
As part of an effort to determine all the gene products involved in wood degradation, we have performed massively parallel pyrosequencing on an expression library from the white rot fungus Phanerochaete chrysosporium grown in shallow stationary cultures with red oak as the carbon source. Approximately 48,000 high quality sequence tags (246 bp average length) were generated. 53% of the sequence tags aligned to 4,262 P. chrysosporium gene models, and an additional 18.5% of the tags reliably aligned to the P. chrysosporium genome, providing evidence for 961 putative novel fragmented gene models. Because of their role in lignocellulose degradation, we focused on the secreted proteins. Our results show that the four enzymes required for cellulose degradation (endocellulase, exocellulase CBHI, exocellulase CBHII, and beta-glucosidase) are all produced. For hemicellulose degradation, not all known enzymes were produced, but endoxylanases, acetyl xylan esterases and mannosidases were detected. For lignin degradation, the role of peroxidases has been questioned; however, our results show that lignin peroxidase is highly expressed along with the H(2)O(2) generating enzyme, alcohol oxidase. The transcriptome snapshot reveals that H(2)O(2) generation and utilization are central in wood degradation. Our results also reveal new transcripts that encode extracellular proteins with no known function.
USDA-ARS's Scientific Manuscript database
Genomes from fifteen porcine reproductive and respiratory syndrome virus (PRRSV) isolates were derived simultaneously using 454 pyrosequencing technology. The viral isolates sequenced were from a recent swine study, in which engineered Type 2 prototype PRRSV strain VR-2332 mutants, with 87, 184, 200...
Zeng, Victor; Ewen-Campen, Ben; Horch, Hadley W.; Roth, Siegfried; Mito, Taro; Extavour, Cassandra G.
2013-01-01
Most genomic resources available for insects represent the Holometabola, which are insects that undergo complete metamorphosis like beetles and flies. In contrast, the Hemimetabola (direct developing insects), representing the basal branches of the insect tree, have very few genomic resources. We have therefore created a large and publicly available transcriptome for the hemimetabolous insect Gryllus bimaculatus (cricket), a well-developed laboratory model organism whose potential for functional genetic experiments is currently limited by the absence of genomic resources. cDNA was prepared using mRNA obtained from adult ovaries containing all stages of oogenesis, and from embryo samples on each day of embryogenesis. Using 454 Titanium pyrosequencing, we sequenced over four million raw reads, and assembled them into 21,512 isotigs (predicted transcripts) and 120,805 singletons with an average coverage per base pair of 51.3. We annotated the transcriptome manually for over 400 conserved genes involved in embryonic patterning, gametogenesis, and signaling pathways. BLAST comparison of the transcriptome against the NCBI non-redundant protein database (nr) identified significant similarity to nr sequences for 55.5% of transcriptome sequences, and suggested that the transcriptome may contain 19,874 unique transcripts. For predicted transcripts without significant similarity to known sequences, we assessed their similarity to other orthopteran sequences, and determined that these transcripts contain recognizable protein domains, largely of unknown function. We created a searchable, web-based database to allow public access to all raw, assembled and annotated data. This database is to our knowledge the largest de novo assembled and annotated transcriptome resource available for any hemimetabolous insect. We therefore anticipate that these data will contribute significantly to more effective and higher-throughput deployment of molecular analysis tools in Gryllus. 
PMID:23671567
Assessment of replicate bias in 454 pyrosequencing and a multi-purpose read-filtering tool.
Jérôme, Mariette; Noirot, Céline; Klopp, Christophe
2011-05-26
The Roche 454 pyrosequencing platform is often considered the most versatile of the Next Generation Sequencing technology platforms, permitting the sequencing of large genomes, the analysis of variations or the study of transcriptomes. A recently reported bias leads to the production of multiple reads for a unique DNA fragment in a random manner within a run. This bias has a direct impact on the quality of the measurement of the representation of the fragments using the reads. Other cleaning steps are usually performed on the reads before assembly or alignment. PyroCleaner is a software module intended to clean 454 pyrosequencing reads in order to ease the assembly process. This program is free software and is distributed under the terms of the GNU General Public License as published by the Free Software Foundation. It implements several filters using criteria such as read duplication, length, complexity, base-pair quality and number of undetermined bases. It can also clean flowgram files (.sff) of paired-end sequences, generating a file of validated paired ends on the one hand and a file of single reads on the other. Read cleaning has always been an important step in sequence analysis. The pyrocleaner Python module is a Swiss Army knife dedicated to cleaning 454 reads. It includes commonly used filters as well as specialised ones such as duplicated read removal and paired-end read verification.
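The filter criteria listed in this abstract (duplication, length, complexity, base quality, undetermined bases) can be illustrated with a minimal Python sketch. It is not PyroCleaner's code or interface: the function names, the thresholds, the exact-sequence definition of a duplicate and the single-base definition of low complexity are all assumptions chosen for illustration.

```python
from collections import Counter


def low_complexity(seq, max_fraction=0.8):
    """Return True if one base makes up more than max_fraction of the read.

    Hypothetical complexity measure, chosen only for illustration.
    """
    if not seq:
        return True
    most_common_count = Counter(seq).most_common(1)[0][1]
    return most_common_count / len(seq) > max_fraction


def filter_reads(reads, min_len=50, max_len=600,
                 max_n_fraction=0.04, min_mean_quality=20):
    """Keep reads that pass every filter.

    reads: iterable of (name, sequence, qualities), where qualities is a
    list of per-base quality scores with the same length as the sequence.
    All thresholds are illustrative defaults, not values from the source.
    """
    seen = set()   # sequences already kept, used for duplicate removal
    kept = []
    for name, seq, quals in reads:
        seq = seq.upper()
        if not (min_len <= len(seq) <= max_len):
            continue                      # length filter
        if seq.count("N") / len(seq) > max_n_fraction:
            continue                      # too many undetermined bases
        if sum(quals) / len(quals) < min_mean_quality:
            continue                      # mean base quality too low
        if low_complexity(seq):
            continue                      # low-complexity read
        if seq in seen:
            continue                      # exact duplicate (simplification)
        seen.add(seq)
        kept.append((name, seq, quals))
    return kept
```

Treating only identical sequences as duplicates keeps the sketch short; a real tool would likely need a tolerance for sequencing errors when deciding that two reads come from the same fragment.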
2011-01-01
Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. 
Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms. PMID:21791039
2011-01-01
Background Biodiesel or ethanol derived from lipids or starch produced by microalgae may overcome many of the sustainability challenges previously ascribed to petroleum-based fuels and first generation plant-based biofuels. The paucity of microalgae genome sequences, however, limits gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for the non-model microalgae species, Dunaliella tertiolecta, and identify pathways and genes of importance related to biofuel production. Results Next generation DNA pyrosequencing technology applied to D. tertiolecta transcripts produced 1,363,336 high quality reads with an average length of 400 bases. Following quality and size trimming, ~45% of the high quality reads were assembled into 33,307 isotigs with a 31-fold coverage and 376,482 singletons. Assembled sequences and singletons were subjected to BLAST similarity searches and annotated with Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology (KO) identifiers. These analyses identified the majority of lipid and starch biosynthesis and catabolism pathways in D. tertiolecta. Conclusions The construction of metabolic pathways involved in the biosynthesis and catabolism of fatty acids, triacylglycerols, and starch in D. tertiolecta as well as the assembled transcriptome provide a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock. PMID:21401935
Shen, Di; Wang, Haiping; Wu, Qingjun; Lu, Peng; Qiu, Yang; Song, Jiangping; Zhang, Youjun; Li, Xixiang
2013-01-01
Background The diamondback moth (DBM, Plutella xylostella) is a crucifer-specific pest that causes significant crop losses worldwide. Barbarea vulgaris (Brassicaceae) can resist DBM and other herbivorous insects by producing feeding-deterrent triterpenoid saponins. Plant breeders have long aimed to transfer this insect resistance to other crops. However, a lack of knowledge on the biosynthetic pathways and regulatory networks of these insecticidal saponins has hindered their practical application. A pyrosequencing-based transcriptome analysis of B. vulgaris during DBM larval feeding was performed to identify genes and gene networks responsible for saponin biosynthesis and its regulation at the genome level. Principal Findings Approximately 1.22, 1.19, 1.16, 1.23, 1.16, 1.20, and 2.39 giga base pairs of clean nucleotides were generated from B. vulgaris transcriptomes sampled 1, 4, 8, 12, 24, and 48 h after onset of P. xylostella feeding and from non-inoculated controls, respectively. De novo assembly using all data of the seven transcriptomes generated 39,531 unigenes. A total of 37,780 (95.57%) unigenes were annotated, 14,399 of which were assigned to one or more gene ontology terms and 19,620 of which were assigned to 126 known pathways. Expression profiles revealed 2,016–4,685 up-regulated and 557–5,188 down-regulated transcripts. Secondary metabolic pathways, such as those of terpenoids, glucosinolates, and phenylpropanoids, and their related regulators were elevated. Candidate genes for the triterpene saponin pathway were found in the transcriptome. Orthological analysis of the transcriptome with four other crucifer transcriptomes identified 592 B. vulgaris-specific gene families with a P-value cutoff of 1e−5. Conclusion This study presents the first comprehensive transcriptome analysis of B. vulgaris subjected to a series of DBM feedings. 
The biosynthetic and regulatory pathways of triterpenoid saponins and other DBM deterrent metabolites in this plant were classified. The results of this study will provide useful data for future investigations on pest-resistance phytochemistry and plant breeding. PMID:23696897
Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.
Ning, Juan; Wang, Minxiao; Li, Chaolun; Sun, Song
2013-01-01
Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs) that provide a resource for gene function studies. Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide a foundation for future genetic and genomic studies of this and related species.
Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing.
Vega-Arreguín, Julio C; Ibarra-Laclette, Enrique; Jiménez-Moraila, Beatriz; Martínez, Octavio; Vielle-Calzada, Jean Philippe; Herrera-Estrella, Luis; Herrera-Estrella, Alfredo
2009-07-06
In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage. To explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20-454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20-454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads (30.09 Mb) from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands (MAGIs) provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% (86,069 sequences) do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20-454 sequences and corresponding levels of gene expression. 
A protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, and that it can also be used to estimate transcript abundance of specific genes. These data suggest that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.
Fluorogenic DNA Sequencing in PDMS Microreactors
Sims, Peter A.; Greenleaf, William J.; Duan, Haifeng; Xie, X. Sunney
2012-01-01
We have developed a multiplex sequencing-by-synthesis method combining terminal-phosphate labeled fluorogenic nucleotides (TPLFNs) and resealable microreactors. In the presence of phosphatase, the incorporation of a non-fluorescent TPLFN into a DNA primer by DNA polymerase results in a fluorophore. We immobilize DNA templates within polydimethylsiloxane (PDMS) microreactors, sequentially introduce one of the four identically labeled TPLFNs, seal the microreactors, allow template-directed TPLFN incorporation, and measure the signal from the fluorophores trapped in the microreactors. This workflow allows sequencing in a manner akin to pyrosequencing but without constant monitoring of each microreactor. With cycle times of <10 minutes, we demonstrate 30 base reads with ∼99% raw accuracy. “Fluorogenic pyrosequencing” combines benefits of pyrosequencing, such as rapid turn-around, native DNA generation, and single-color detection, with benefits of fluorescence-based approaches, such as highly sensitive detection and simple parallelization. PMID:21666670
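The cyclic workflow in this abstract (introduce one nucleotide, let it incorporate, measure the signal, repeat with the next nucleotide) can be sketched as an idealized simulation in Python. This is a minimal sketch under stated assumptions, not the authors' analysis code: the function names and the flow order "TACG" are hypothetical, and the signal of each flow is assumed to equal exactly the number of bases incorporated.

```python
def simulate_flows(read, flow_order="TACG"):
    """Return the idealized signal for each nucleotide flow.

    read: the sequence of the strand being synthesized (A, C, G, T only).
    Each flow offers one nucleotide; the signal is the number of
    consecutive matching bases incorporated at the current position.
    """
    unknown = set(read) - set(flow_order)
    if unknown:
        raise ValueError(f"bases not covered by the flow order: {sorted(unknown)}")

    signals = []
    pos = 0
    while pos < len(read):
        for base in flow_order:
            run = 0
            # A homopolymer run is incorporated within a single flow.
            while pos < len(read) and read[pos] == base:
                run += 1
                pos += 1
            signals.append((base, run))
            if pos == len(read):
                break
    return signals


def decode_flows(signals):
    """Rebuild the read from (nucleotide, incorporation count) pairs."""
    return "".join(base * run for base, run in signals)
```

For the read "TTACGGA" the first T flow incorporates two bases and the first G flow incorporates two, so homopolymer length has to be read from signal intensity. Real signals are noisy, which this sketch ignores.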
Transcriptomics of the Bed Bug (Cimex lectularius)
Rajarapu, Swapna P.; Jones, Susan C.; Mittapalli, Omprakash
2011-01-01
Background Bed bugs (Cimex lectularius) are blood-feeding insects poised to become one of the major pests in households throughout the United States. Resistance of C. lectularius to insecticides/pesticides is one factor thought to be involved in its sudden resurgence. Despite its high-impact status, scant knowledge exists at the genomic level for C. lectularius. Hence, we subjected the C. lectularius transcriptome to 454 pyrosequencing in order to identify potential genes involved in pesticide resistance. Methodology and Principal Findings Using 454 pyrosequencing, we obtained a total of 216,419 reads with 79,596,412 bp, which were assembled into 35,646 expressed sequence tags (3902 contigs and 31744 singletons). Nearly 85.9% of the C. lectularius sequences showed similarity to insect sequences, but 44.8% of the deduced proteins of C. lectularius did not show similarity with sequences in the GenBank non-redundant database. KEGG analysis revealed putative members of several detoxification pathways involved in pesticide resistance. Lamprin domains, Protein Kinase domains, Protein Tyrosine Kinase domains and cytochrome P450 domains were among the top Pfam domains predicted for the C. lectularius sequences. An initial assessment of putative defense genes, including a cytochrome P450 and a glutathione-S-transferase (GST), revealed high transcript levels for the cytochrome P450 (CYP9) in pesticide-exposed versus pesticide-susceptible C. lectularius populations. A significant number of single nucleotide polymorphisms (296) and microsatellite loci (370) were predicted in the C. lectularius sequences. Furthermore, 59 putative sequences of Wolbachia were retrieved from the database. Conclusions To our knowledge this is the first study to elucidate the genetic makeup of C. lectularius. This pyrosequencing effort provides clues to the identification of potential detoxification genes involved in pesticide resistance of C. 
lectularius and lays the foundation for future functional genomics studies. PMID:21283830
Moreira, Rebeca; Balseiro, Pablo; Planas, Josep V.; Fuste, Berta; Beltran, Sergi; Novoa, Beatriz; Figueras, Antonio
2012-01-01
Background The Manila clam (Ruditapes philippinarum) is a worldwide cultured bivalve species with important commercial value. Diseases affecting this species can result in large economic losses. Because knowledge of the molecular mechanisms of the immune response in bivalves, especially clams, is scarce and fragmentary, we sequenced RNA from immune-stimulated R. philippinarum hemocytes by 454-pyrosequencing to identify genes involved in their immune defense against infectious diseases. Methodology and Principal Findings High-throughput deep sequencing of R. philippinarum using 454 pyrosequencing technology yielded 974,976 high-quality reads with an average read length of 250 bp. The reads were assembled into 51,265 contigs, and 44.7% of the translated nucleotide sequences were annotated successfully. The 35 most frequently found contigs included a large number of immune-related genes, and a more detailed analysis showed the presence of putative members of several immune pathways and processes such as apoptosis, the Toll-like signaling pathway and the complement cascade. We found sequences from molecules never described in bivalves before, especially in the complement pathway, where almost all the components are present. Conclusions This study represents the first transcriptome analysis using 454-pyrosequencing conducted on R. philippinarum focused on its immune system. Our results will provide a rich source of data to discover and identify new genes, which will serve as a basis for microarray construction and the study of gene expression as well as for the identification of genetic markers. The discovery of new immune sequences was very productive and resulted in a large variety of contigs that may play a role in the defense mechanisms of Ruditapes philippinarum. PMID:22536348
2011-01-01
Background Amaranthus hypochondriacus, a grain amaranth, is a C4 plant noted for its ability to tolerate stressful conditions and produce highly nutritious seeds. These possess an optimal amino acid balance and constitute a rich source of health-promoting peptides. Although several recent studies, mostly involving subtractive hybridization strategies, have helped to increase the relatively low number of grain amaranth expressed sequence tags (ESTs), transcriptomic information for this species remains limited, particularly regarding tissue-specific and biotic stress-related genes. Thus, a large scale transcriptome analysis was performed to generate stem- and (a)biotic stress-responsive gene expression profiles in grain amaranth. Results A total of 2,700,168 raw reads were obtained from six 454 pyrosequencing runs, which were assembled into 21,207 high quality sequences (20,408 isotigs + 799 contigs). The average sequence length was 1,064 bp and 930 bp for isotigs and contigs, respectively. Only 5,113 singletons were recovered after quality control. Contigs/isotigs were further incorporated into 15,667 isogroups. All unique sequences were queried against the nr, TAIR, UniRef100, UniRef50 and Amaranthaceae EST databases for annotation. Functional GO annotation was performed with all contigs/isotigs that produced significant hits with the TAIR database. Only 8,260 sequences were found to be homologous when the transcriptomes of A. tuberculatus and A. hypochondriacus were compared, most of which were associated with basic house-keeping processes. Digital expression analysis identified 1,971 differentially expressed genes in response to at least one of four stress treatments tested. These included several multiple-stress-inducible genes that could represent potential candidates for use in the engineering of stress-resistant plants. 
The transcriptomic data generated from pigmented stems shared similarity with findings reported in developing stems of Arabidopsis and black cottonwood (Populus trichocarpa). Conclusions This study represents the first large-scale transcriptomic analysis of A. hypochondriacus, considered to be a highly nutritious and stress-tolerant crop. Numerous genes were found to be induced in response to (a)biotic stress, many of which could further the understanding of the mechanisms that contribute to multiple stress-resistance in plants, a trait that has potential biotechnological applications in agriculture. PMID:21752295
Rurangwa, Eugene; Sipkema, Detmer; Kals, Jeroen; ter Veld, Menno; Forlenza, Maria; Bacanu, Gianina M.; Smidt, Hauke; Palstra, Arjan P.
2015-01-01
Larval zebrafish was subjected to a methodological exploration of the gastrointestinal microbiota and transcriptome. Assessed was the impact of two dietary inclusion levels of a novel protein meal (NPM) of animal origin (ragworm Nereis virens) on the gastrointestinal tract (GIT). Microbial development was assessed over the first 21 days post egg fertilization (dpf) through 16S rRNA gene-based microbial composition profiling by pyrosequencing. Differentially expressed genes in the GIT were demonstrated at 21 dpf by whole transcriptome sequencing (mRNAseq). Larval zebrafish showed rapid temporal changes in microbial colonization but domination occurred by one to three bacterial species generally belonging to Proteobacteria and Firmicutes. The high iron content of NPM may have led to an increased relative abundance of bacteria that were related to potential pathogens and bacteria with an increased iron metabolism. Functional classification of the 328 differentially expressed genes indicated that the GIT of larvae fed at higher NPM level was more active in transmembrane ion transport and protein synthesis. mRNAseq analysis did not reveal a major activation of genes involved in the immune response or indicating differences in iron uptake and homeostasis in zebrafish fed at the high inclusion level of NPM. PMID:25983694
A Bioluminometric Method of DNA Sequencing
NASA Technical Reports Server (NTRS)
Ronaghi, Mostafa; Pourmand, Nader; Stolc, Viktor; Arnold, Jim (Technical Monitor)
2001-01-01
Pyrosequencing is a bioluminometric single-tube DNA sequencing method that takes advantage of co-operativity between four enzymes to monitor DNA synthesis. In this sequencing-by-synthesis method, a cascade of enzymatic reactions yields detectable light, which is proportional to incorporated nucleotides. Pyrosequencing has the advantages of accuracy, flexibility and parallel processing. It can be easily automated. Furthermore, the technique dispenses with the need for labeled primers, labeled nucleotides and gel-electrophoresis. In this chapter, the use of this technique for different applications is discussed.
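The proportionality between light output and incorporated nucleotides described above can be illustrated with a small simulation. This is only a minimal sketch under stated assumptions: the function names `simulate_flows` and `decode_flows` are hypothetical, the dispensation order "TACG" is an assumed cyclic order, and the signal is idealized as an exact integer count with no noise or enzyme kinetics.

```python
from itertools import cycle

def simulate_flows(read, flow_order="TACG"):
    """Idealized flowgram for the strand being synthesized (`read`).

    Nucleotides are dispensed cyclically in `flow_order` (an assumed order).
    Each flow yields a signal equal to the number of bases incorporated,
    so a homopolymer run of length n gives a signal of n and a
    non-matching nucleotide gives 0.
    """
    if not set(read) <= set(flow_order):
        raise ValueError("read contains bases missing from flow_order")
    signals = []
    pos = 0
    flows = cycle(flow_order)
    while pos < len(read):
        nucleotide = next(flows)
        run = 0
        # Incorporate every consecutive base that matches the dispensed nucleotide.
        while pos < len(read) and read[pos] == nucleotide:
            run += 1
            pos += 1
        signals.append((nucleotide, run))
    return signals

def decode_flows(signals):
    """Rebuild the sequence from (nucleotide, signal) pairs."""
    return "".join(nucleotide * run for nucleotide, run in signals)

if __name__ == "__main__":
    flows = simulate_flows("TTAGGGC")
    print(flows)
    print(decode_flows(flows))
```

In real instruments the signal is a noisy real number, which is why long homopolymer runs are harder to call than this idealized model suggests.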
2013-01-01
Background Prosopis alba (Fabaceae) is an important native tree adapted to arid and semiarid regions of north-western Argentina which is of great value as a multipurpose species. Despite its importance, the genomic resources currently available for the entire Prosopis genus are still limited. Here we describe the development of a leaf transcriptome and the identification of new molecular markers that could support functional genetic studies in natural and domesticated populations of this genus. Results Next generation DNA pyrosequencing technology applied to P. alba transcripts produced a total of 1,103,231 raw reads with an average length of 421 bp. De novo assembly generated a set of 15,814 isotigs and 71,101 non-assembled sequences (singletons) with an average of 991 bp and 288 bp respectively. A total of 39,000 unique singletons were identified after clustering natural and artificial duplicates from pyrosequencing reads. Regarding the non-redundant sequences or unigenes, 22,095 out of 54,814 were successfully annotated with Gene Ontology terms. Moreover, simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 5,992 and 6,236 markers, respectively, throughout the genome. For the validation of the predicted SSR markers, a subset of 87 SSRs selected through functional annotation evidence was successfully amplified from six DNA samples of seedlings. From this analysis, 11 of these 87 SSRs were identified as polymorphic. Additionally, another set of 123 nuclear polymorphic SSRs were determined in silico, of which 50% have the probability of being effectively polymorphic. Conclusions This study generated a successful global analysis of the P. alba leaf transcriptome after bioinformatic and wet laboratory validations of RNA-Seq data. The limited set of molecular markers currently available will be significantly increased with the thousands of new markers that were identified in this study. 
This information will strongly contribute to genomics resources for P. alba functional analysis and genetics. Finally, it will also potentially contribute to the development of population-based genome studies in the genus. PMID:24125525
Rombauts, Stephane; Chrisargiris, Antonis; Van Leeuwen, Thomas; Vontas, John
2013-01-01
The olive fruit fly Bactrocera oleae has a unique ability to cope with olive flesh, and is the most destructive pest of olives worldwide. Its control has been largely based on the use of chemical insecticides; however, resistance to several insecticides has evolved under selection. The study of detoxification mechanisms, which allow the olive fruit fly to defend against insecticides and/or phytotoxins possibly present in the mesocarp, has been hampered by the lack of genomic information in this species. In the NCBI database fewer than 1,000 nucleotide sequences have been deposited, with fewer than 10 detoxification gene homologues in total. We used 454 pyrosequencing to produce, for the first time, a large transcriptome dataset for B. oleae. A total of 482,790 reads were assembled into 14,204 contigs. More than 60% of those contigs (8,630) were larger than 500 base pairs, and almost half of them matched genes from the order Diptera. Analysis of the Gene Ontology (GO) distribution of unique contigs suggests that, compared to other insects, the assembly is broadly representative of the B. oleae transcriptome. Furthermore, the transcriptome was found to contain 55 P450, 43 GST-, 15 CCE- and 18 ABC transporter-genes. Several of those detoxification genes may putatively be involved in the ability of the olive fruit fly to deal with xenobiotics, such as plant phytotoxins and insecticides. In summary, our study has generated new data and genomic resources, which will substantially facilitate molecular studies in B. oleae, including elucidation of detoxification mechanisms of xenobiotics, as well as other important aspects of olive fruit fly biology. PMID:23824998
Poretsky, Rachel S; Hewson, Ian; Sun, Shulei; Allen, Andrew E; Zehr, Jonathan P; Moran, Mary Ann
2009-06-01
Metatranscriptomic analyses of microbial assemblages (< 5 μm) from surface water at the Hawaiian Ocean Time-Series (HOT) revealed community-wide metabolic activities and day/night patterns of differential gene expression. Pyrosequencing produced 75 558 putative mRNA reads from a day transcriptome and 75 946 from a night transcriptome. Taxonomic binning of annotated mRNAs indicated that Cyanobacteria contributed a greater percentage of the transcripts (54% of annotated sequences) than expected based on abundance (35% of cell counts and 21% of 16S rRNA libraries), and may represent the most actively transcribing cells in this surface ocean community in both the day and night. Major heterotrophic taxa contributing to the community transcriptome included alpha-Proteobacteria (19% of annotated sequences, most of which were SAR11-related) and gamma-Proteobacteria (4%). The composition of transcript pools was consistent with models of prokaryotic gene expression, including operon-based transcription patterns and an abundance of genes predicted to be highly expressed. Metabolic activities that are shared by many microbial taxa (e.g. glycolysis, citric acid cycle, amino acid biosynthesis and transcription and translation machinery) were well represented among the community transcripts. There was an overabundance of transcripts for photosynthesis, C1 metabolism and oxidative phosphorylation in the day compared with night, and evidence that energy acquisition is coordinated with solar radiation levels for both autotrophic and heterotrophic microbes. In contrast, housekeeping activities such as amino acid biosynthesis, membrane synthesis and repair, and vitamin biosynthesis were overrepresented in the night transcriptome. Direct sequencing of these environmental transcripts has provided detailed information on metabolic and biogeochemical responses of a microbial community to solar forcing.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren
2011-01-01
Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g. Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.
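The recommendation to equalize read numbers before comparing richness can be sketched in a few lines. The example below is a hedged illustration, not the tools referred to in the abstract: `chao1` implements the commonly used bias-corrected Chao1 formula, `rarefy` is a hypothetical helper that subsamples reads without replacement, and the two count vectors are invented toy data.

```python
import random
from collections import Counter

def chao1(counts):
    """Bias-corrected Chao1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1))."""
    counts = [c for c in counts if c > 0]
    s_obs = len(counts)                      # observed taxa
    f1 = sum(1 for c in counts if c == 1)    # taxa seen exactly once
    f2 = sum(1 for c in counts if c == 2)    # taxa seen exactly twice
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))

def rarefy(counts, depth, seed=0):
    """Subsample `depth` reads without replacement; return per-taxon counts."""
    reads = [taxon for taxon, c in enumerate(counts) for _ in range(c)]
    if depth > len(reads):
        raise ValueError("depth exceeds library size")
    rng = random.Random(seed)
    sampled = Counter(rng.sample(reads, depth))
    return [sampled.get(taxon, 0) for taxon in range(len(counts))]

if __name__ == "__main__":
    # Toy per-taxon read counts for two libraries of unequal size (invented data).
    small_library = [40, 25, 10, 3, 1, 1, 0, 0]
    large_library = [400, 250, 100, 30, 10, 5, 2, 1]
    depth = min(sum(small_library), sum(large_library))
    for name, library in [("small", small_library), ("large", large_library)]:
        print(name, "raw:", chao1(library), "rarefied:", chao1(rarefy(library, depth)))
```

Comparing the raw and rarefied estimates for the larger library gives a feel for how much of an apparent richness difference can come from sequencing depth alone.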
Gonzalez, Sergio; Clavijo, Bernardo; Rivarola, Máximo; Moreno, Patricio; Fernandez, Paula; Dopazo, Joaquín; Paniego, Norma
2017-02-22
In the last years, applications based on massively parallelized RNA sequencing (RNA-seq) have become valuable approaches for studying non-model species, e.g., without a fully sequenced genome. RNA-seq is a useful tool for detecting novel transcripts and genetic variations and for evaluating differential gene expression by digital measurements. The large and complex datasets resulting from functional genomic experiments represent a challenge in data processing, management, and analysis. This problem is especially significant for small research groups working with non-model species. We developed a web-based application, called ATGC transcriptomics, with a flexible and adaptable interface that allows users to work with new generation sequencing (NGS) transcriptomic analysis results using an ontology-driven database. This new application simplifies data exploration, visualization, and integration for a better comprehension of the results. ATGC transcriptomics provides access to non-expert computer users and small research groups to a scalable storage option and simple data integration, including database administration and management. The software is freely available under the terms of GNU public license at http://atgcinta.sourceforge.net .
PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics
2012-01-01
Background The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to a unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has limited genomic resources. Description With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, transcriptomic data for peanut are rapidly accumulating in both public databases and the private sector. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors. 
Conclusion As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop. PMID:22712730
Kim, Bong-Soo; Kim, Jong Nam; Yoon, Seok-Hwan; Chun, Jongsik; Cerniglia, Carl E
2012-06-01
The indigenous human intestinal microbiota could be disrupted by residues of antibiotics in foods as well as therapeutically administered antibiotics to humans. These disruptions may lead to adverse health outcomes. To observe the possible impact of residues of antibiotics at concentrations below therapeutic levels on human intestinal microbiota, we performed studies using in vitro cultures of fecal suspensions from three individuals with 10 different concentrations (0, 0.1, 0.5, 1, 5, 10, 15, 25, 50 and 150 μg/ml) of the fluoroquinolone, enrofloxacin. The bacterial communities of the control and enrofloxacin dosed fecal samples were analyzed by denaturing gradient gel electrophoresis (DGGE) and pyrosequencing. In addition, changes of functional gene expression were analyzed by a pyrosequencing-based random whole-community mRNA sequencing method. Although each individual had a unique microbial composition, the communities of all individuals were affected by enrofloxacin. The proportions of two phyla, namely, Bacteroidetes and Proteobacteria, were significantly reduced with increasing concentrations of enrofloxacin exposure, while the proportion of Firmicutes increased. Principal Coordinate Analysis (PCoA) using the Fast UniFrac indicated that the community structures of intestinal microbiota were shifted by enrofloxacin. Most of the mRNA transcripts and the anti-microbial drug resistance genes increased with increasing concentrations of enrofloxacin. 16S rRNA gene pyrosequencing of control and enrofloxacin treated fecal suspensions provided valuable information of affected bacterial taxa down to the species level, and the community transcriptomic analyses using mRNA revealed the functional gene expression responses of the changed bacterial communities by enrofloxacin. Published by Elsevier Ltd.
Dilly, G F; Gaitán-Espitia, J D; Hofmann, G E
2015-03-01
This is the first de novo transcriptome and complete mitochondrial genome of an Antarctic sea urchin species sequenced to date. Sterechinus neumayeri is an Antarctic sea urchin and a model species for ecology, development, physiology and global change biology. To identify transcripts important to ocean acidification (OA) and thermal stress, this transcriptome was created by pooling 13 larval samples representing developmental stages on day 11 (late gastrula), 19 (early pluteus) and 30 (mid pluteus) maintained at three CO2 levels (421, 652, and 1071 μatm) as well as four additional heat-shocked samples. The normalized cDNA pool was sequenced using emulsion PCR (pyrosequencing) resulting in 1.34M reads with an average read length of 492 base pairs. 40,994 isotigs were identified, averaging 1188 bp with a median coverage of 11×. Additional primer design and gap sequencing were required to complete the mitochondrial genome. The mitogenome of S. neumayeri is a circular DNA molecule with a length of 15 684 bp that contains all 37 genes normally found in metazoans. We detail the main features of the transcriptome and the mitogenome architecture and investigate the phylogenetic relationships of S. neumayeri within Echinoidea. In addition, we provide comparative analyses of S. neumayeri with its closest relative, Strongylocentrotus purpuratus, including a list of potential OA gene targets. The resources described here will support a variety of quantitative (genomic, proteomic, multistress and comparative) studies to interrogate physiological responses to OA and other stressors in this important Antarctic calcifier. © 2014 John Wiley & Sons Ltd.
Transcript expression profiling for adventitious roots of Panax ginseng Meyer.
Subramaniyam, Sathiyamoorthy; Mathiyalagan, Ramya; Natarajan, Sathishkumar; Kim, Yu-Jin; Jang, Moon-Gi; Park, Jun-Hyung; Yang, Deok Chun
2014-08-01
Panax ginseng Meyer, a member of the Araliaceae family, is one of the major medicinal plants in oriental countries and the primary source of ginsenosides. However, very few genes of the ginsenoside pathway have been characterized, owing to the limited genome information. In this study, we obtained a comprehensive transcriptome from adventitious roots that were treated with methyl jasmonic acid for different durations (control, 2 h, 6 h, 12 h, and 24 h) and sequenced by 454 pyrosequencing technology. A reference transcriptome of 39,304,529 bases (0.04 GB) was obtained from 5,724,987,880 bases (5.7 GB) across 22 libraries by de novo assembly, and 35,266 (58.5%) transcripts were annotated with biological schemas (GO and KEGG). Digital gene expression patterns were obtained by mapping sequences from in vitro grown adventitious roots to the reference; of these, 3813 (6.3%) unique transcripts showed ≥2-fold up- or downregulation. Finally, candidates for ginsenoside pathway genes were predicted from the observed expression patterns. Among them, 30 transcription factors, 20 cytochromes, and 11 glycosyl transferases were predicted as ginsenoside candidates. These data can remarkably expand the existing transcriptome resources of Panax, especially to predict the existence of gene networks in P. ginseng. The data as a whole provide a valuable platform to reveal more about secondary metabolism and abiotic stresses from P. ginseng in vitro grown adventitious roots. Copyright © 2014 Elsevier B.V. All rights reserved.
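A fold-change threshold of the kind mentioned above is easy to express once read counts are normalized for library size. The sketch below is illustrative only and is not the authors' pipeline: the reads-per-million normalization, the pseudocount of 1, the function names and the toy counts are all assumptions introduced here.

```python
import math

def log2_fold_change(treated, control, total_treated, total_control, pseudocount=1.0):
    """log2 ratio of reads-per-million values; the pseudocount avoids division by zero."""
    rpm_treated = (treated + pseudocount) / total_treated * 1e6
    rpm_control = (control + pseudocount) / total_control * 1e6
    return math.log2(rpm_treated / rpm_control)

def is_differential(log2fc, min_fold=2.0):
    """True when the change is at least `min_fold` in either direction."""
    return abs(log2fc) >= math.log2(min_fold)

if __name__ == "__main__":
    # Hypothetical transcript counts: (treated reads, control reads).
    transcripts = {"t1": (39, 9), "t2": (12, 11), "t3": (4, 49)}
    for name, (treated, control) in transcripts.items():
        lfc = log2_fold_change(treated, control, 1_000_000, 1_000_000)
        print(name, round(lfc, 2), is_differential(lfc))
```

A ratio threshold alone ignores sampling noise at low counts, so published analyses usually pair it with a statistical test.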
Hwang, Hwan-Su; Lee, Hyoshin; Choi, Yong Eui
2015-03-14
Eleutherococcus senticosus, Siberian ginseng, is a highly valued woody medicinal plant belonging to the family Araliaceae. E. senticosus produces a rich variety of saponins such as oleanane-type, noroleanane-type, 29-hydroxyoleanan-type, and lupane-type saponins. Genomic or transcriptomic approaches have not been used to investigate the saponin biosynthetic pathway in this plant. In this study, de novo sequencing was performed to select candidate genes involved in the saponin biosynthetic pathway. A half-plate 454 pyrosequencing run produced 627,923 high-quality reads with an average sequence length of 422 bases. De novo assembly generated 72,811 unique sequences, including 15,217 contigs and 57,594 singletons. Approximately 48,300 (66.3%) unique sequences were annotated using BLAST similarity searches. All of the mevalonate pathway genes for saponin biosynthesis starting from acetyl-CoA were isolated. Moreover, 206 reads of cytochrome P450 (CYP) and 145 reads of uridine diphosphate glycosyltransferase (UGT) sequences were isolated. Based on methyl jasmonate (MeJA) treatment and real-time PCR (qPCR) analysis, 3 CYPs and 3 UGTs were finally selected as candidate genes involved in the saponin biosynthetic pathway. The identified sequences associated with saponin biosynthesis will facilitate the study of the functional genomics of saponin biosynthesis and genetic engineering of E. senticosus.
Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.
Taheri, Sima; Lee Abdullah, Thohirah; Yusop, Mohd Rafii; Hanafi, Mohamed Musa; Sahebi, Mahbod; Azizi, Parisa; Shamshiri, Redmond Ramin
2018-02-13
Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq) and related tools for mining and development of microsatellites in plants.
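Mining SSRs from assembled transcripts, as discussed above, comes down to scanning sequences for short motifs repeated in tandem. The following is a minimal sketch rather than any specific tool from the review: `find_ssrs` is a hypothetical function, only perfect repeats are detected, and the minimum repeat numbers per motif length are assumed defaults that should be adjusted to the study.

```python
import re

# Assumed thresholds: motif length -> minimum number of tandem repeats.
DEFAULT_MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}

def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) for perfect SSRs in `seq`."""
    if min_repeats is None:
        min_repeats = DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for motif_len, min_count in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least (min_count - 1) copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_count - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. AA, ATAT).
            if motif in (motif * 2)[1:-1]:
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits

if __name__ == "__main__":
    contig = "GGC" + "AT" * 7 + "CCGT" + "GAA" * 5 + "TC"
    for start, motif, count in find_ssrs(contig):
        print(start, motif, count)
```

Positions are 0-based offsets into the input sequence; primer design and polymorphism screening would follow as separate steps.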
Li, Jitao; Li, Jian; Chen, Ping; Liu, Ping; He, Yuying
2015-01-01
The ridgetail white prawn Exopalaemon carinicauda is one of the major economic mariculture species in eastern China. The deficiency of genomic and transcriptomic data is becoming the bottleneck for further research on its favorable traits. In the present study, 454 pyrosequencing was undertaken to investigate the transcriptome profiles of E. carinicauda. A collection of 1,028,710 sequence reads (459.59 Mb) obtained from cDNA prepared from eyestalk and hemocytes was assembled into 162,056 expressed sequence tags (ESTs). Of these, 29.88 % of 48,428 contigs and 70.12 % of 113,628 singlets possessed high similarities to sequences in the GenBank non-redundant database, with the most significant (E value < 1e-10) unigene matches occurring with crustacean and insect sequences. KEGG analysis of unigenes identified putative members of biological pathways related to growth and immunity. In addition, we obtained a total of 125,112 putative SNPs and 13,467 microsatellites. These results will contribute to the understanding of the genome makeup and provide useful information for future functional genomic research in E. carinicauda.
Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Cho, Young-Il; Lee, Hye-Eun; Kim, Do-Sun; Woo, Jong-Gyu; Cho, Myeong-Cheoul
2014-01-10
Next generation sequencing technologies have proven to be a rapid and cost-effective means to assemble and characterize gene content and identify molecular markers in various organisms. Pepper (Capsicum annuum L., Solanaceae) is a major staple vegetable crop, which is economically important and has worldwide distribution. High-throughput transcriptome profiling of two pepper cultivars, Mandarin and Blackcluster, using 454 GS-FLX pyrosequencing yielded 279,221 and 316,357 sequenced reads with a total 120.44 and 142.54Mb of sequence data (average read length of 431 and 450 nucleotides). These reads resulted from 17,525 and 16,341 'isogroups' and were assembled into 19,388 and 18,057 isotigs, and 22,217 and 13,153 singletons for both the cultivars, respectively. Assembled sequences were annotated functionally based on homology to genes in multiple public databases. Detailed sequence variant analysis identified a total of 9701 and 12,741 potential SNPs which eventually resulted in 1025 and 1059 genotype specific SNPs, for both the varieties, respectively, after examining SNP frequency distribution for each mapped unigenes. These markers for pepper will be highly valuable for marker-assisted breeding and other genetic studies. © 2013 Elsevier B.V. All rights reserved.
2011-01-01
Background Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. Results From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. Conclusion The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. 
Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of jatropha to increase oil content, and to modify oil composition. PMID:21492485
Natarajan, Purushothaman; Parani, Madasamy
2011-04-15
Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. 
Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of jatropha to increase oil content, and to modify oil composition.
Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.
Fonseca, Fernando Campos de Assis; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima
2015-01-01
Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran with a long life cycle that has proven difficult to control with pesticides. Despite its economic relevance, only a few DNA sequences are available for this species in GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality scoring and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number compared to the B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.
Perdiguero, Pedro; Venturas, Martin; Cervera, María Teresa; Gil, Luis; Collada, Carmen
2015-01-01
Elms, especially Ulmus minor and U. americana, are fighting a hard battle against Dutch elm disease (DED). This vascular wilt disease, caused by Ophiostoma ulmi and O. novo-ulmi, appeared in the twentieth century and killed millions of elms across North America and Europe. Elm breeding and conservation programmes have identified a small number of DED-tolerant genotypes. In this study, three U. minor genotypes with contrasting levels of tolerance to DED were exposed to several biotic and abiotic stresses in order to (i) obtain a de novo assembled transcriptome of U. minor using 454 pyrosequencing, (ii) perform a functional annotation of the assembled transcriptome, (iii) identify genes potentially involved in the molecular response to environmental stress, and (iv) develop gene-based markers to support breeding programmes. A total of 58,429 putative unigenes were identified after assembly and filtering of the transcriptome. Of these, 32,152 unigenes showed homology with proteins identified in the genomes of the most common plant model species. Well-known protein families and transcription factors involved in abiotic, biotic or both stresses were identified after functional annotation. A total of 30,693 polymorphisms were identified in 7,125 isotigs, a large number of them corresponding to single nucleotide polymorphisms (SNPs; 27,359). In a subset randomly selected for validation, 87% of the SNPs were confirmed. The material generated may be valuable for future Ulmus gene expression, population genomics and association genetics studies, especially taking into account the scarce molecular information available for this genus and the great impact that DED has on elm populations. PMID:26257751
2012-01-01
Background We present a comprehensive transcriptome analysis of the fungus Ascosphaera apis, an economically important pathogen of the Western honey bee (Apis mellifera) that causes chalkbrood disease. Our goals were to further annotate the A. apis reference genome and to identify genes that are candidates for being differentially expressed during host infection versus axenic culture. Results We compared A. apis transcriptome sequence from mycelia grown on liquid or solid media with that dissected from host-infected tissue. 454 pyrosequencing provided 252 Mb of filtered sequence reads from both culture types that were assembled into 10,087 contigs. Transcript contigs, protein sequences from multiple fungal species, and ab initio gene predictions were included as evidence sources in the Maker gene prediction pipeline, resulting in 6,992 consensus gene models. A phylogeny based on 12 of these protein-coding loci further supported the taxonomic placement of Ascosphaera as sister to the core Onygenales. Several common protein domains were less abundant in A. apis compared with related ascomycete genomes, particularly cytochrome p450 and protein kinase domains. A novel gene family was identified that has expanded in some ascomycete lineages, but not others. We manually annotated genes with homologs in other fungal genomes that have known relevance to fungal virulence and life history. Functional categories of interest included genes involved in mating-type specification, intracellular signal transduction, and stress response. Computational and manual annotations have been made publicly available on the Bee Pests and Pathogens website. Conclusions This comprehensive transcriptome analysis substantially enhances our understanding of the A. apis genome and its expression during infection of honey bee larvae. It also provides resources for future molecular studies of chalkbrood disease and ultimately improved disease management. PMID:22747707
Cabrera, Ana R; Donohue, Kevin V; Khalil, Sayed M S; Scholl, Elizabeth; Opperman, Charles; Sonenshine, Daniel E; Roe, R Michael
2011-01-01
Many species of mites and ticks are of agricultural and medical importance. Much can be learned from the study of transcriptomes of acarines, which can generate DNA-sequence information of potential target genes for the control of acarine pests. High-throughput transcriptome sequencing can also yield sequences of genes critical during physiological processes poorly understood in acarines, e.g., the regulation of female reproduction in mites. The predatory mite, Phytoseiulus persimilis, was selected to conduct a transcriptome analysis using 454 pyrosequencing. The objective of this project was to obtain DNA-sequence information of expressed genes from P. persimilis with special interest in sequences corresponding to vitellogenin (Vg) and the vitellogenin receptor (VgR). These genes are critical to the understanding of vitellogenesis, and they will facilitate the study of the regulation of mite female reproduction. A total of 12,556 contiguous sequences (contigs) were assembled with an average size of 935 bp. From these sequences, the putative translated peptides of 11 contigs were similar in amino acid sequence to other arthropod Vgs, while 6 were similar to VgRs. We selected some of these sequences to conduct stage-specific expression studies to further determine their function. 2010 Elsevier Ltd. All rights reserved.
2013-01-01
Background The transition from the vegetative mycelium to the primordium during fruiting body development is the most complex and critical developmental event in the life cycle of many basidiomycete fungi. Understanding the molecular mechanisms underlying this process has long been a goal of research on basidiomycetes. Large-scale assessment of the expressed transcriptomes of these developmental stages will facilitate the generation of a more comprehensive picture of the mushroom fruiting process. In this study, we coupled 5'-Serial Analysis of Gene Expression (5'-SAGE) to high-throughput pyrosequencing from 454 Life Sciences to analyze the transcriptomes and identify up-regulated genes between vegetative mycelium (Myc) and stage 1 primordium (S1-Pri) of Coprinopsis cinerea during fruiting body development. Results We evaluated the expression of >3,000 genes in the two respective growth stages and discovered that almost one-third of these genes were preferentially expressed in either stage. This identified a significant turnover of the transcriptome during the course of fruiting body development. Additionally, we annotated more than 79,000 transcription start sites (TSSs) based on the transcriptomes of the mycelium and stage 1 primordium stages. Patterns of enrichment based on gene annotations from the GO and KEGG databases indicated that various structural and functional protein families were uniquely employed in either stage and that during primordial growth, cellular metabolism is highly up-regulated. Various signaling pathways such as the cAMP-PKA, MAPK and TOR pathways were also identified as up-regulated, consistent with the model that sensing of nutrient levels and the environment is important in this developmental transition. More than 100 up-regulated genes were also found to be unique to mushroom-forming basidiomycetes, highlighting the novelty of fruiting body development in the fungal kingdom.
Conclusions We implicated a wealth of new candidate genes important to early stages of mushroom fruiting development, though their precise molecular functions and biological roles are not yet fully known. This study serves to advance our understanding of the molecular mechanisms of fruiting body development in the model mushroom C. cinerea. PMID:23514374
A physiologically-oriented transcriptomic analysis of the midgut of Tenebrio molitor.
Moreira, Nathalia R; Cardoso, Christiane; Dias, Renata O; Ferreira, Clelia; Terra, Walter R
2017-05-01
Physiological data showed that T. molitor midgut is buffered at pH 5.6 at the two anterior thirds and at 7.9 at the posterior third. Furthermore, water is absorbed and secreted at the anterior and posterior midgut, respectively, driving a midgut counter flux of fluid. To look for the molecular mechanisms underlying these phenomena and nutrient absorption as well, a transcriptomic approach was used. For this, 11 types of transporters were chosen from the midgut transcriptome obtained by pyrosequencing (Roche 454). After annotation with the aid of databanks and manual curation, the sequences were validated by RT-PCR. The expression level of each gene at anterior, middle and posterior midgut and carcass (larva less midgut) was evaluated by RNA-seq taking into account reference sequences based on 454 contigs and reads obtained by Illumina sequencing. The data showed that sugar and amino acid uniporters and symporters are expressed along the whole midgut. In the anterior midgut are found transporters for NH3 and NH4+ that with a chloride channel may be responsible for acidifying the lumen. At the posterior midgut, bicarbonate-Cl- antiporter with bicarbonate supplied by carbonic anhydrase may alkalinize the lumen. Water absorption caused mainly by an anterior Na+-K+-2Cl- symporter and water secretion caused by a posterior K+-Cl- may drive the midgut counter flux. Transporters that complement the action of those described were also found. Copyright © 2017 Elsevier Ltd. All rights reserved.
Large-scale enrichment and discovery of gene-associated SNPs
USDA-ARS's Scientific Manuscript database
With the recent advent of massively parallel pyrosequencing by 454 Life Sciences it has become feasible to cost-effectively identify numerous single nucleotide polymorphisms (SNPs) within the recombinogenic regions of the maize (Zea mays L.) genome. We developed a modified version of hypomethylated...
Bailey, Jeffrey A; Mvalo, Tisungane; Aragam, Nagesh; Weiser, Matthew; Congdon, Seth; Kamwendo, Debbie; Martinson, Francis; Hoffman, Irving; Meshnick, Steven R; Juliano, Jonathan J
2012-08-15
The development of an effective malaria vaccine has been hampered by the genetic diversity of commonly used target antigens. This diversity has led to concerns about allele-specific immunity limiting the effectiveness of vaccines. Despite extensive genetic diversity of circumsporozoite protein (CS), the most successful malaria vaccine is RTS/S, a monovalent CS vaccine. By use of massively parallel pyrosequencing, we evaluated the diversity of CS haplotypes across the T-cell epitopes in parasites from Lilongwe, Malawi. We identified 57 unique parasite haplotypes from 100 participants. By use of ecological and molecular indexes of diversity, we saw no difference in the diversity of CS haplotypes between adults and children. We saw evidence of weak variant-specific selection within this region of CS, suggesting naturally acquired immunity does induce variant-specific selection on CS. Therefore, the impact of CS vaccines on variant frequencies with widespread implementation of vaccination requires further study.
Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq.
Macaulay, Iain C; Teng, Mabel J; Haerty, Wilfried; Kumar, Parveen; Ponting, Chris P; Voet, Thierry
2016-11-01
Parallel sequencing of a single cell's genome and transcriptome provides a powerful tool for dissecting genetic variation and its relationship with gene expression. Here we present a detailed protocol for G&T-seq, a method for separation and parallel sequencing of genomic DNA and full-length polyA(+) mRNA from single cells. We provide step-by-step instructions for the isolation and lysis of single cells; the physical separation of polyA(+) mRNA from genomic DNA using a modified oligo-dT bead capture and the respective whole-transcriptome and whole-genome amplifications; and library preparation and sequence analyses of these amplification products. The method allows the detection of thousands of transcripts in parallel with the genetic variants captured by the DNA-seq data from the same single cell. G&T-seq differs from other currently available methods for parallel DNA and RNA sequencing from single cells, as it involves physical separation of the DNA and RNA and does not require bespoke microfluidics platforms. The process can be implemented manually or through automation. When performed manually, paired genome and transcriptome sequencing libraries from eight single cells can be produced in ∼3 d by researchers experienced in molecular laboratory work. For users with experience in the programming and operation of liquid-handling robots, paired DNA and RNA libraries from 96 single cells can be produced in the same time frame. Sequence analysis and integration of single-cell G&T-seq DNA and RNA data requires a high level of bioinformatics expertise and familiarity with a wide range of informatics tools.
Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).
Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie
2016-01-01
The mechanisms underlying sexual reproduction and sex ratio determination remain unclear in turbot, a flatfish of great commercial value, and there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, two gonad libraries representing sex differences in Scophthalmus maximus were sequenced by 454 pyrosequencing and yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.
Preliminary characterization of the oral microbiota of Chinese adults with and without gingivitis
2011-01-01
Background Microbial communities inhabiting the human mouth are associated with oral health and disease. Previous studies have indicated the general prevalence of adult gingivitis in China to be high. The aim of this study was to characterize in depth the oral microbiota of Chinese adults with or without gingivitis, by defining the microbial phylogenetic diversity and community structure using highly parallel pyrosequencing. Methods Six non-smoking Chinese adults, three with and three without gingivitis (age range 21-39 years, 4 females and 2 males), were enrolled in the present cross-sectional study. Gingival parameters of inflammation and bleeding on probing were characterized by a clinician using the Mazza Gingival Index (MGI). Plaque (sampled separately from four different oral sites) and salivary samples were obtained from each subject. Sequences and relative abundance of the bacterial 16S rDNA PCR amplicons were determined via pyrosequencing that produced 400 bp-long reads. The sequence data were analyzed via a computational pipeline customized for human oral microbiome analyses. Furthermore, the relative abundances of selected microbial groups were validated using quantitative PCR. Results The oral microbiomes from gingivitis and healthy subjects could be distinguished based on the distinct community structures of plaque microbiomes, but not the salivary microbiomes. Contributions of community members to community structure divergence were statistically assessed at the phylum, genus and species-like levels. Eight predominant taxa were found associated with gingivitis: TM7, Leptotrichia, Selenomonas, Streptococcus, Veillonella, Prevotella, Lautropia, and Haemophilus. Furthermore, 98 species-level OTUs were identified to be gingivitis-associated, which provided microbial features of gingivitis at a species resolution.
Finally, for the two selected genera Streptococcus and Fusobacterium, real-time PCR-based quantification of relative bacterial abundance validated the pyrosequencing-based results. Conclusions This methods study suggests that oral samples from this population of patients with gingivitis can be characterized via the plaque microbiome by pyrosequencing the 16S rDNA genes. Further studies that characterize serial samples from subjects (longitudinal study design) with a larger population size may provide insight into the temporal and ecological features of oral microbial communities in clinically defined states of gingivitis. PMID:22152152
Chao, Shiou-Huei; Huang, Hui-Yu; Chang, Chuan-Hsiung; Yang, Chih-Hsien; Cheng, Wei-Shen; Kang, Ya-Huei; Watanabe, Koichi; Tsai, Ying-Chieh
2013-01-01
In Taiwanese alternative medicine Lu-doh-huang (also called Pracparatum mungo), mung beans are mixed with various herbal medicines and undergo a 4-stage process of anaerobic fermentation. Here we used high-throughput sequencing of the 16S rRNA gene to profile the bacterial community structure of Lu-doh-huang samples. Pyrosequencing of samples obtained at 7 points during fermentation revealed 9 phyla, 264 genera, and 586 species of bacteria. While mung beans were inside bamboo sections (stages 1 and 2 of the fermentation process), family Lactobacillaceae and genus Lactobacillus emerged in highest abundance; Lactobacillus plantarum was broadly distributed among these samples. During stage 3, the bacterial distribution shifted to family Porphyromonadaceae, and Butyricimonas virosa became the predominant microbial component. Thereafter, bacterial counts decreased dramatically, and organisms were too few to be detected during stage 4. In addition, the microbial compositions of the liquids used for soaking bamboo sections were dramatically different: Exiguobacterium mexicanum predominated in the fermented soybean solution whereas B. virosa was predominant in running spring water. Furthermore, our results from pyrosequencing paralleled those we obtained by using the traditional culture method, which targets lactic acid bacteria. In conclusion, the microbial communities during Lu-doh-huang fermentation were markedly diverse, and pyrosequencing revealed a complete picture of the microbial consortium. PMID:23700436
Quiroz Velasquez, Paula F.; Abiff, Sumayyah K.; Fins, Katrina C.; Conway, Quincy B.; Salazar, Norma C.; Delgado, Ana Paula; Dawes, Jhanelle K.; Douma, Lauren G.
2014-01-01
A combination of 454 pyrosequencing and Sanger sequencing was used to sample and characterize the transcriptome of the entomopathogenic oomycete Lagenidium giganteum. More than 50,000 high-throughput reads were annotated through homology searches. Several selected reads served as seeds for the amplification and sequencing of full-length transcripts. Phylogenetic analyses inferred from full-length cellulose synthase alignments revealed that L. giganteum is nested within the peronosporalean galaxy and as such appears to have evolved from a phytopathogenic ancestor. In agreement with the phylogeny reconstructions, full-length L. giganteum oomycete effector orthologs, corresponding to the cellulose-binding elicitor lectin (CBEL), crinkler (CRN), and elicitin proteins, were characterized by domain organizations similar to those of pathogenicity factors of plant-pathogenic oomycetes. Importantly, the L. giganteum effectors provide a basis for detailing the roles of canonical CRN, CBEL, and elicitin proteins in the infectious process of an oomycete known principally as an animal pathogen. Finally, phylogenetic analyses and genome mining identified members of glycoside hydrolase family 5 subfamily 27 (GH5_27) as putative virulence factors active on the host insect cuticle, based in part on the fact that GH5_27 genes are shared by entomopathogenic oomycetes and fungi but are underrepresented in nonentomopathogenic genomes. The genomic resources gathered from the L. giganteum transcriptome analysis strongly suggest that filamentous entomopathogens (oomycetes and fungi) exhibit convergent evolution: they have evolved independently from plant-associated microbes, have retained genes indicative of plant associations, and may share similar cores of virulence factors, such as GH5_27 enzymes, that are absent from the genomes of their plant-pathogenic relatives. PMID:25107973
Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei
2013-10-01
The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of cDNA library using TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. The Pk has been found in a wild plum (Prunus domestica) in a new region of Russia indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.
Mannino, M Constanza; Rivarola, Máximo; Scannapieco, Alejandra C; González, Sergio; Farber, Marisa; Cladera, Jorge L; Lanzavecchia, Silvia B
2016-10-12
Diachasmimorpha longicaudata (Hymenoptera: Braconidae) is a solitary parasitoid of Tephritidae (Diptera) fruit flies of economic importance currently being mass-reared in bio-factories and successfully used worldwide. A peculiar biological aspect of Hymenoptera is its haplo-diploid life cycle, where females (diploid) develop from fertilized eggs and males (haploid) from unfertilized eggs. Diploid males were described in many species and recently evidenced in D. longicaudata by means of inbreeding studies. Sex determination in this parasitoid is based on the Complementary Sex Determination (CSD) system, with alleles from at least one locus involved in early steps of this pathway. Since limited information is available about the genetics of this parasitoid species, a deeper analysis of D. longicaudata's genomics is required to provide molecular tools for achieving more cost-effective production under artificial rearing conditions. We report here the first transcriptome analysis of male larvae, adult females and adult males of D. longicaudata using 454 pyrosequencing. A total of 469766 reads were analyzed and 8483 high-quality isotigs were assembled. After functional annotation, a total of 51686 unigenes were produced, of which 7021 isotigs and 20227 singletons had at least one BLAST hit against the NCBI non-redundant protein database. A preliminary comparison of adult females and males showed that 98 transcripts had differential expression profiles, with at least a 10-fold difference. Among the functionally annotated transcripts we detected four sequences potentially involved in sex determination and three homologues to two known genes involved in the sex determination cascade. Finally, a total of 4674 Simple Sequence Repeats (SSRs) were identified and characterized in silico. The information obtained here will significantly contribute to the development of D. longicaudata functional genomics, genetics and population-based genome studies.
Thousands of new microsatellite markers were identified as toolkits for population genetics analysis. The transcriptome characterized here is the starting point to elucidate the molecular bases of the sex determination mechanism in this species.
Firmino, Alexandre Augusto Pereira; Fonseca, Fernando Campos de Assis; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; Antonino de Souza, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima
2013-01-01
Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by RNA interference (RNAi) as techniques for insect control are promising strategies, which have been applied in the last few years. For this insect, little molecular information is available in databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500,000 reads and a high-quality data set of 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against the NCBI non-redundant protein database and 65.7% were similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in the RNAi mechanism were found. PAZ domain sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. Microinjection of dsRNA targeting a chitin synthase gene into A. grandis female adults resulted in normal oviposition of unviable eggs and malformed live larvae that were unable to develop on artificial diet. This is the first study that characterizes the transcriptome of the coleopteran A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of the RNAi mechanism in insects. PMID:24386449
Guan, Yunyan; He, Maoxian; Wu, Houbo
2017-06-01
To explore the molecular mechanism of the triploidy effect in the pearl oyster Pinctada fucata, two RNA-seq libraries were constructed from the mantle tissue of diploids and triploids by Roche-454 massively parallel pyrosequencing. The identification of differentially expressed genes (DEGs) between diploids and triploids may reveal the molecular mechanism of the triploidy effect. In this study, 230 down-regulated and 259 up-regulated DEGs were obtained by comparison between diploid and triploid libraries. The gene ontology and KEGG pathway analysis revealed more functional activation in triploids, which may be due to duplicated gene expression at the transcriptional level during whole genome duplication (WGD). To confirm the sequencing data, a set of 11 up-regulated genes related to growth and development control and regulation were analyzed by RT-qPCR in an independent experiment. According to the validation and annotation of these genes, it is hypothesized that the set of up-regulated genes had correlated expression patterns involved in shell building or other probable interactive functions during triploidization. The up-regulation of growth-related genes may support the classic hypothesis of 'energy redistribution' from early research. The results provide valuable resources to understand the molecular mechanism of the triploidy effect in both shell building and producing high-quality seawater pearls. Copyright © 2017 Elsevier B.V. All rights reserved.
Santos, Patricia; Plaszczyca, Marian; Pawlowski, Katharina
2013-01-01
Actinorhizal root nodule symbioses are very diverse, and the symbiosis of Datisca glomerata has previously been shown to have many unusual aspects. In order to gain molecular information on the infection mechanism, nodule development and nodule metabolism, we compared the transcriptomes of D. glomerata roots and nodules. Root and nodule libraries representing the 3′-ends of cDNAs were subjected to high-throughput parallel 454 sequencing. To identify the corresponding genes and to improve the assembly, Illumina sequencing of the nodule transcriptome was performed as well. The evaluation revealed 406 differentially regulated genes, 295 of which (72.7%) could be assigned a function based on homology. Analysis of the nodule transcriptome showed that genes encoding components of the common symbiosis signaling pathway were present in nodules of D. glomerata, which in combination with the previously established function of SymRK in D. glomerata nodulation suggests that this pathway is also active in actinorhizal Cucurbitales. Furthermore, comparison of the D. glomerata nodule transcriptome with nodule transcriptomes from actinorhizal Fagales revealed a new subgroup of nodule-specific defensins that might play a role specific to actinorhizal symbioses. The D. glomerata members of this defensin subgroup contain an acidic C-terminal domain that was never found in plant defensins before. PMID:24009681
Molnár, István; Lopez, David; Wisecaver, Jennifer H; Devarenne, Timothy P; Weiss, Taylor L; Pellegrini, Matteo; Hackett, Jeremiah D
2012-10-30
Background Microalgae hold promise for yielding a biofuel feedstock that is sustainable, carbon-neutral, distributed, and only minimally disruptive for the production of food and feed by traditional agriculture. Amongst oleaginous eukaryotic algae, the B race of Botryococcus braunii is unique in that it produces large amounts of liquid hydrocarbons of terpenoid origin. These are comparable to fossil crude oil, and are sequestered outside the cells in a communal extracellular polymeric matrix material. Biosynthetic engineering of terpenoid bio-crude production requires identification of genes and reconstruction of metabolic pathways responsible for production of both hydrocarbons and other metabolites of the alga that compete for photosynthetic carbon and energy. Results A de novo assembly of 1,334,609 next-generation pyrosequencing reads from the Showa strain of the B race of B. braunii yielded a transcriptomic database of 46,422 contigs with an average length of 756 bp. Contigs were annotated with pathway, ontology, and protein domain identifiers. Manual curation allowed the reconstruction of pathways that produce terpenoid liquid hydrocarbons from primary metabolites, and pathways that divert photosynthetic carbon into tetraterpenoid carotenoids, diterpenoids, and the prenyl chains of meroterpenoid quinones and chlorophyll. Inventories of machine-assembled contigs are also presented for reconstructed pathways for the biosynthesis of competing storage compounds including triacylglycerol and starch. Regeneration of S-adenosylmethionine, and the extracellular localization of the hydrocarbon oils by active transport and possibly autophagy are also investigated. Conclusions The construction of an annotated transcriptomic database, publicly available in a web-based data depository and annotation tool, provides a foundation for metabolic pathway and network reconstruction, and facilitates further omics studies in the absence of a genome sequence for the Showa strain of B. braunii, race B. Further, the transcriptome database empowers future biosynthetic engineering approaches for strain improvement and the transfer of desirable traits to heterologous hosts. PMID:23110428
Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce.
Decap, Dries; Reumers, Joke; Herzeel, Charlotte; Costanza, Pascal; Fostier, Jan
2017-01-01
Given the current cost-effectiveness of next-generation sequencing, the amount of DNA-seq and RNA-seq data generated is ever increasing. One of the primary objectives of NGS experiments is calling genetic variants. While highly accurate, most variant calling pipelines are not optimized to run efficiently on large data sets. However, as variant calling in genomic data has become common practice, several methods have been proposed to reduce runtime for DNA-seq analysis through the use of parallel computing. Determining the effectively expressed variants from transcriptomics (RNA-seq) data has only recently become possible, and as such does not yet benefit from efficiently parallelized workflows. We introduce Halvade-RNA, a parallel, multi-node RNA-seq variant calling pipeline based on the GATK Best Practices recommendations. Halvade-RNA makes use of the MapReduce programming model to create and manage parallel data streams on which multiple instances of existing tools such as STAR and GATK operate concurrently. Whereas the single-threaded processing of a typical RNA-seq sample requires ∼28h, Halvade-RNA reduces this runtime to ∼2h using a small cluster with two 20-core machines. Even on a single, multi-core workstation, Halvade-RNA can significantly reduce runtime compared to using multi-threading, thus providing for a more cost-effective processing of RNA-seq data. Halvade-RNA is written in Java and uses the Hadoop MapReduce 2.0 API. It supports a wide range of distributions of Hadoop, including Cloudera and Amazon EMR.
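The MapReduce idea described in the abstract above can be illustrated with a toy sketch. This is not Halvade-RNA's code: the read representation, the fixed-size region key, and the read-counting reducer are all assumptions made for illustration, with the reducer standing in for a per-region tool invocation.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical toy read: (chromosome, alignment start position).
Read = Tuple[str, int]
RegionKey = Tuple[str, int]


def map_phase(reads: Iterable[Read], region_size: int) -> Iterator[Tuple[RegionKey, Read]]:
    """Emit (region_key, read) pairs; the key decides which reducer sees the read."""
    for chrom, pos in reads:
        yield (chrom, pos // region_size), (chrom, pos)


def shuffle(pairs: Iterable[Tuple[RegionKey, Read]]) -> Dict[RegionKey, List[Read]]:
    """Group reads by key, as the framework would do between map and reduce."""
    grouped: Dict[RegionKey, List[Read]] = defaultdict(list)
    for key, read in pairs:
        grouped[key].append(read)
    return dict(grouped)


def reduce_phase(grouped: Dict[RegionKey, List[Read]]) -> Dict[RegionKey, int]:
    """Process each region independently; counting reads stands in for variant calling."""
    return {key: len(reads) for key, reads in grouped.items()}


def run_pipeline(reads: Iterable[Read], region_size: int = 1000) -> Dict[RegionKey, int]:
    return reduce_phase(shuffle(map_phase(reads, region_size)))
```

Because each region is reduced independently, the reduce step is the part that a cluster could run concurrently; the sketch runs it serially for clarity.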
Massively parallel digital transcriptional profiling of single cells
Zheng, Grace X. Y.; Terry, Jessica M.; Belgrader, Phillip; Ryvkin, Paul; Bent, Zachary W.; Wilson, Ryan; Ziraldo, Solongo B.; Wheeler, Tobias D.; McDermott, Geoff P.; Zhu, Junjie; Gregory, Mark T.; Shuga, Joe; Montesclaros, Luz; Underwood, Jason G.; Masquelier, Donald A.; Nishimura, Stefanie Y.; Schnall-Levin, Michael; Wyatt, Paul W.; Hindson, Christopher M.; Bharadwaj, Rajiv; Wong, Alexander; Ness, Kevin D.; Beppu, Lan W.; Deeg, H. Joachim; McFarland, Christopher; Loeb, Keith R.; Valente, William J.; Ericson, Nolan G.; Stevens, Emily A.; Radich, Jerald P.; Mikkelsen, Tarjei S.; Hindson, Benjamin J.; Bielas, Jason H.
2017-01-01
Characterizing the transcriptome of individual cells is fundamental to understanding complex biological systems. We describe a droplet-based system that enables 3′ mRNA counting of tens of thousands of single cells per sample. Cell encapsulation, of up to 8 samples at a time, takes place in ∼6 min, with ∼50% cell capture efficiency. To demonstrate the system's technical performance, we collected transcriptome data from ∼250k single cells across 29 samples. We validated the sensitivity of the system and its ability to detect rare populations using cell lines and synthetic RNAs. We profiled 68k peripheral blood mononuclear cells to demonstrate the system's ability to characterize large immune populations. Finally, we used sequence variation in the transcriptome data to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients. PMID:28091601
454 Pyrosequencing of Olive (Olea europaea L.) Transcriptome in Response to Salinity
Bazakos, Christos; Manioudaki, Maria E.; Sarropoulou, Elena; Spano, Thodhoraq; Kalaitzis, Panagiotis
2015-01-01
Olive (Olea europaea L.) is one of the most important crops in the Mediterranean region. The expansion of cultivation in areas irrigated with low-quality and saline water has negative effects on growth and productivity; however, the investigation of the molecular basis of salt tolerance in olive trees has only recently been initiated. To this end, we investigated the molecular response of cultivar Kalamon to salinity stress using next-generation sequencing technology to explore the transcriptome profile of olive leaves and roots and identify differentially expressed genes that are related to the salt tolerance response. Out of 291,958 obtained trimmed reads, 28,270 unique transcripts were identified, of which 35% are annotated, a percentage that is comparable to similar reports on non-model plants. Among the 1,624 clusters in roots that comprise more than one read, 24 were differentially expressed, comprising 9 down- and 15 up-regulated genes. Likewise, in leaves, among the 2,642 clusters, 70 were identified as differentially expressed, with 14 down- and 56 up-regulated genes. Using next-generation sequencing technology, we were able to identify salt-response-related transcripts. Furthermore, we provide an annotated transcriptome of olive as well as expression data, which are both significant tools for further molecular studies in olive. PMID:26576008
The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE.
Molina, Carlos; Zaman-Allah, Mainassara; Khan, Faheema; Fatnassi, Nadia; Horres, Ralf; Rotter, Björn; Steinhauer, Diana; Amenc, Laurie; Drevon, Jean-Jacques; Winter, Peter; Kahl, Günter
2011-02-14
Background The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop, chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example of such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas at the transcriptome level. Results We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, indicating a differential organ-specific response to stress. Profiting from recent pioneering work on massive cDNA sequencing in chickpea, more than 9,400 UniTags could be linked to UniProt entries. Additionally, gene ontology (GO) category over-representation analysis made it possible to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, we focus here, by way of example, on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundred transcripts of potential interest is now available. Conclusions This report demonstrates that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross-validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcript isoforms. As an example of the high resolution of the employed technology, which we term deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules as early as 2 hours after the onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts is proposed as potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE. PMID:21320317
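The ">3.0-fold" criterion for up- or down-regulated UniTags mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the tags-per-million normalization, the pseudocount for tags absent from one library, and all function names are assumptions.

```python
from typing import Dict


def tags_per_million(count: float, library_size: int) -> float:
    """Scale a raw tag count to a library of one million tags."""
    return count * 1_000_000 / library_size


def classify_tags(
    control: Dict[str, int],
    treated: Dict[str, int],
    threshold: float = 3.0,    # fold-change cutoff quoted in the abstract
    pseudocount: float = 1.0,  # assumption: avoids division by zero for unseen tags
) -> Dict[str, str]:
    """Label each tag 'up', 'down' or 'unchanged' by treated/control fold change."""
    control_total = sum(control.values())
    treated_total = sum(treated.values())
    labels: Dict[str, str] = {}
    for tag in sorted(set(control) | set(treated)):
        c = tags_per_million(control.get(tag, 0) + pseudocount, control_total)
        t = tags_per_million(treated.get(tag, 0) + pseudocount, treated_total)
        fold = t / c
        if fold > threshold:
            labels[tag] = "up"
        elif fold < 1.0 / threshold:
            labels[tag] = "down"
        else:
            labels[tag] = "unchanged"
    return labels
```

Normalizing by library size matters here because the root and nodule libraries in the abstract differ in depth (86,919 versus 57,281 transcripts), so raw counts are not directly comparable.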
Oh, Yejin; Song, Ik-Chan; Kim, Jimyung; Kwon, Gye Cheol; Koo, Sun Hoe; Kim, Seon Young
2018-05-01
We developed a pyrosequencing-based method for the quantification of CALR mutations and compared the results using Sanger sequencing, fragment length analysis (FLA), digital-droplet PCR (ddPCR), and next-generation sequencing (NGS). Method validation studies were performed using cloned plasmid controls. Samples from 24 patients with myeloproliferative neoplasms were evaluated. Among the 24 patients, 15 had CALR mutations (7 type 1, 2 type 2, and 6 other mutations). The type 1 or type 2 mutation-positive results from pyrosequencing exhibited 100% concordance with the Sanger sequencing results. One novel CALR mutation was not detected by pyrosequencing. The CALR mutation allele burdens measured by pyrosequencing were slightly lower than those measured by FLA but slightly higher than the results obtained using ddPCR. Pyrosequencing exhibited high correlations with both methods. The mutation allele burdens estimated by NGS were significantly lower than those measured by pyrosequencing. An increased CALR mutation allele burden was associated with overt primary myelofibrosis. Patients with >70% mutation allele burdens in myeloid cells had a significantly longer time from diagnosis (P = 0.007), more bone marrow fibrosis (P = 0.010), and lower hemoglobin (P = 0.007). Pyrosequencing was a useful rapid sequencing method to determine the burden of CALR mutations. Copyright © 2018 Elsevier B.V. All rights reserved.
2011-01-01
Background Transcriptome sequencing data have become an integral component of modern genetics, genomics and evolutionary biology. However, despite advances in DNA sequencing technologies, such data are lacking for many groups of living organisms, in particular many plant taxa. We present here the results of transcriptome sequencing for two closely related plant species. These species, Fagopyrum esculentum and F. tataricum, belong to the order Caryophyllales, a large group of flowering plants with uncertain evolutionary relationships. F. esculentum (common buckwheat) is also an important food crop. Despite these practical and evolutionary considerations, Fagopyrum species have not been the subject of large-scale sequencing projects. Results Normalized cDNA corresponding to genes expressed in flowers and inflorescences of F. esculentum and F. tataricum was sequenced using 454 pyrosequencing technology. This resulted in 267 thousand (F. esculentum) and 229 thousand (F. tataricum) reads with an average length of 341-349 nucleotides. De novo assembly of the reads produced about 25 thousand contigs for each species, with 7.5-8.2× coverage. Comparative analysis of the two transcriptomes demonstrated their overall similarity but also revealed genes that are presumably differentially expressed. Among them are retrotransposon genes and genes involved in sugar biosynthesis and metabolism. Thirteen single-copy genes were used for phylogenetic analysis; the resulting trees are largely consistent with those inferred from multigenic plastid datasets. The sister relationship of the Caryophyllales and asterids has now gained high support from nuclear gene sequences. Conclusions 454 transcriptome sequencing and de novo assembly were performed for two congeneric flowering plant species, F. esculentum and F. tataricum. As a result, a large set of cDNA sequences that represent orthologs of known plant genes as well as potential new genes was generated. PMID:21232141
Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M
2013-01-01
Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455
Comparative transcriptome analysis of the Asteraceae halophyte Karelinia caspica under salt stress.
Zhang, Xia; Liao, Maoseng; Chang, Dan; Zhang, Fuchun
2014-12-17
Much attention has been given to the potential of halophytes as sources of tolerance traits for introduction into cereals. However, a great deal remains unknown about the diverse mechanisms employed by halophytes to cope with salinity. To characterize salt tolerance mechanisms underlying Karelinia caspica, an Asteraceae halophyte, we performed large-scale transcriptomic analysis using a high-throughput Illumina sequencing platform. Comparative gene expression analysis was performed to correlate the effects of salt stress and ABA regulation at the molecular level. Total sequence reads generated by pyrosequencing were assembled into 287,185 non-redundant transcripts with an average length of 652 bp. Using the BLAST function in the Swiss-Prot, NCBI nr, GO, KEGG, and KOG databases, a total of 216,416 coding sequences associated with known proteins were annotated. Among these, 35,533 unigenes were classified into 69 gene ontology categories, and 18,378 unigenes were classified into 202 known pathways. Based on the fold changes observed when comparing the salt stress and control samples, 60,127 unigenes were differentially expressed, with 38,122 and 22,005 up- and down-regulated, respectively. Several of the differentially expressed genes are known to be involved in the signaling pathway of the plant hormone ABA, including ABA metabolism, transport, and sensing as well as the ABA signaling cascade. Transcriptome profiling of K. caspica contributes to a comprehensive understanding of this species at the molecular level. Moreover, the global survey of differentially expressed genes in this species under salt stress and analyses of the effects of salt stress and ABA regulation will contribute to the identification and characterization of genes and molecular mechanisms underlying salt stress responses in Asteraceae plants.
Brousseau, Louise; Tinaut, Alexandra; Duret, Caroline; Lang, Tiange; Garnier-Gere, Pauline; Scotti, Ivan
2014-03-27
The Amazonian rainforest is predicted to suffer from ongoing environmental changes. Despite the need to evaluate the impact of such changes on tree genetic diversity, we almost entirely lack genomic resources. In this study, we analysed the transcriptome of four tropical tree species (Carapa guianensis, Eperua falcata, Symphonia globulifera and Virola michelii) with contrasting ecological features, belonging to four widespread botanical families (respectively Meliaceae, Fabaceae, Clusiaceae and Myristicaceae). We sequenced cDNA libraries from three organs (leaves, stems, and roots) using 454 pyrosequencing. We have developed an R- and BioPerl-based bioinformatic procedure for de novo assembly, gene functional annotation and marker discovery. Mismatch identification takes into account single-base quality values as well as the likelihood of false variants as a function of contig depth and number of sequenced chromosomes. Between 17103 (for Symphonia globulifera) and 23390 (for Eperua falcata) contigs were assembled. Organs varied in the number of unigenes they apparently express, with a higher number in roots. Patterns of gene expression were similar across species, with metabolism of aromatic compounds standing out as an overrepresented gene function. Transcripts corresponding to several gene functions were found to be over- or underrepresented in each organ. We identified between 4434 (for Symphonia globulifera) and 9076 (for Virola surinamensis) well-supported mismatches. The resulting overall mismatch density ranged from 0.89 (S. globulifera) to 1.05 (V. surinamensis) mismatches/100 bp in variation-containing contigs. The relative representation of gene functions in the four transcriptomes suggests that secondary metabolism may be particularly important in tropical trees. 
The differential representation of transcripts among tissues suggests differential gene expression, which opens the way to functional studies in these non-model, ecologically important species. We found substantial amounts of mismatches in the four species. These newly identified putative variants are a first step towards acquiring much needed genomic resources for tropical tree species.
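The "mismatches/100 bp in variation-containing contigs" figure quoted in the abstract above is a simple ratio, sketched below. The contig representation and function name are hypothetical, and the example numbers in the test are invented for illustration; they are not data from the study.

```python
from typing import Iterable, Tuple

# Hypothetical layout: (contig length in bp, number of well-supported mismatches).
Contig = Tuple[int, int]


def mismatch_density_per_100bp(contigs: Iterable[Contig]) -> float:
    """Mismatches per 100 bp, computed only over contigs that contain variation."""
    total_bp = 0
    total_mismatches = 0
    for length_bp, n_mismatches in contigs:
        if n_mismatches > 0:  # skip contigs without any mismatch
            total_bp += length_bp
            total_mismatches += n_mismatches
    if total_bp == 0:
        return 0.0  # no variation-containing contigs at all
    return 100.0 * total_mismatches / total_bp
```

Restricting the denominator to variation-containing contigs, as the abstract states, gives a higher density than dividing by the length of the whole assembly would.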
Yamamoto, Satoshi; Sato, Hirotoshi; Tanabe, Akifumi S.; Hidaka, Amane; Kadowaki, Kohmei; Toju, Hirokazu
2014-01-01
Diverse clades of mycorrhizal and endophytic fungi are potentially involved in competitive or facilitative interactions within host-plant roots. We investigated the potential consequences of these ecological interactions on the assembly process of root-associated fungi by examining the co-occurrence of pairs of fungi in host-plant individuals. Based on massively-parallel pyrosequencing, we analyzed the root-associated fungal community composition for each of the 249 Quercus serrata and 188 Quercus glauca seedlings sampled in a warm-temperate secondary forest in Japan. Pairs of fungi that co-occurred more or less often than expected by chance were identified based on randomization tests. The pyrosequencing analysis revealed that not only ectomycorrhizal fungi but also endophytic fungi were common in the root-associated fungal community. Intriguingly, specific pairs of these ectomycorrhizal and endophytic fungi showed spatially aggregated patterns, suggesting the existence of facilitative interactions between fungi in different functional groups. Due to the large number of fungal pairs examined, many of the observed aggregated/segregated patterns with very low P values (e.g., < 0.005) turned non-significant after the application of a multiple comparison method. However, our overall results imply that the community structures of ectomycorrhizal and endophytic fungi could influence each other through interspecific competitive/facilitative interactions in root. To test the potential of host-plants' control of fungus–fungus ecological interactions in roots, we further examined whether the aggregated/segregated patterns could vary depending on the identity of host plant species. Potentially due to the physiological properties shared between the congeneric host plant species, the sign of hosts' control was not detected in the present study. 
The pyrosequencing-based randomization analyses shown in this study provide a platform for high-throughput investigation of fungus–fungus interactions in plant root systems. PMID:24801150
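The randomization test for pairwise co-occurrence described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: it assumes one presence/absence flag per seedling for each fungus, builds the null distribution by shuffling one of the two vectors, and uses a Bonferroni correction as a stand-in for the multiple comparison method, which the abstract does not name. All function and variable names are hypothetical.

```python
import random

def cooccurrence_p_value(a, b, n_perm=2000, seed=0):
    """Two-sided permutation P value for the co-occurrence of two fungi.

    a, b: lists of 0/1 flags, one entry per host seedling (assumed input format).
    Returns (observed number of shared hosts, P value).
    """
    rng = random.Random(seed)
    observed = sum(x * y for x, y in zip(a, b))
    shuffled = list(b)
    null_counts = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any association between the two fungi
        null_counts.append(sum(x * y for x, y in zip(a, shuffled)))
    mean_null = sum(null_counts) / n_perm
    # Count permutations at least as far from the null mean as the observation,
    # so both aggregated and segregated patterns are detected.
    extreme = sum(
        abs(c - mean_null) >= abs(observed - mean_null) for c in null_counts
    )
    return observed, (extreme + 1) / (n_perm + 1)

def bonferroni(p_values):
    """Adjust P values for the number of fungal pairs tested (assumed method)."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]
```

With many fungal pairs, the multiplication by the number of tests is what turns nominally small P values non-significant, as the abstract reports.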
Predictable transcriptome evolution in the convergent and complex bioluminescent organs of squid
Pankey, M. Sabrina; Minin, Vladimir N.; Imholte, Greg C.; Suchard, Marc A.; Oakley, Todd H.
2014-01-01
Despite contingency in life’s history, the similarity of evolutionarily convergent traits may represent predictable solutions to common conditions. However, the extent to which overall gene expression levels (transcriptomes) underlying convergent traits are themselves convergent remains largely unexplored. Here, we show strong statistical support for convergent evolutionary origins and massively parallel evolution of the entire transcriptomes in symbiotic bioluminescent organs (bacterial photophores) from two divergent squid species. The gene expression similarities are so strong that regression models of one species’ photophore can predict organ identity of a distantly related photophore from gene expression levels alone. Our results point to widespread parallel changes in gene expression evolution associated with convergent origins of complex organs. Therefore, predictable solutions may drive not only the evolution of novel, complex organs but also the evolution of overall gene expression levels that underlie them. PMID:25336755
[Progress in porky genes and transcriptome and discussion of relative issues].
Zhu, Meng-Jin; Liu, Bang; Li, Kui
2005-01-01
To date, research on the molecular basis of pork development has focused mainly on muscle growth and meat quality. Functional genes, including the Hal gene and the RN gene, and several QTLs controlling or associated with pork growth and quality have been detected through candidate gene approaches and genome-wide scanning. The transcriptomes of porcine muscle and adipose tissue have also come under study. These lines of research nevertheless have shortcomings. Work in molecular quantitative genetics has overemphasized single genes and ignored the co-expression patterns of multiple genes. Work applying transcriptome analysis has met two limitations: it has relied on a single type of molecular experimental technique, and it has artificially treated the genes of muscle and adipose tissue as two unconnected parts. Exploring pork genes by parallel genetics, based on systemic views and techniques, to reveal the interaction mechanisms of the genes that respectively control muscle and adipose tissue will therefore be an important issue for gene and genome research on pork development in the near future.
Meyer, B; Martini, P; Biscontin, A; De Pittà, C; Romualdi, C; Teschke, M; Frickenhaus, S; Harms, L; Freier, U; Jarman, S; Kawaguchi, S
2015-01-01
The Antarctic krill, Euphausia superba, has a key position in the Southern Ocean food web by serving as direct link between primary producers and apex predators. The south-west Atlantic sector of the Southern Ocean, where the majority of the krill population is located, is experiencing one of the most profound environmental changes worldwide. Up to now, we have only cursory information about krill’s genomic plasticity to cope with the ongoing environmental changes induced by anthropogenic CO2 emission. The genome of krill is not yet available due to its large size (about 48 Gbp). Here, we present two cDNA normalized libraries from whole krill and krill heads sampled in different seasons that were combined with two data sets of krill transcriptome projects, already published, to produce the first knowledgebase krill ‘master’ transcriptome. The new library produced 25% more E. superba transcripts and now includes nearly all the enzymes involved in the primary oxidative metabolism (Glycolysis, Krebs cycle and oxidative phosphorylation) as well as all genes involved in glycogenesis, glycogen breakdown, gluconeogenesis, fatty acid synthesis and fatty acids β-oxidation. With these features, the ‘master’ transcriptome provides the most complete picture of metabolic pathways in Antarctic krill and will provide a major resource for future physiological and molecular studies. This will be particularly valuable for characterizing the molecular networks that respond to stressors caused by the anthropogenic CO2 emissions and krill’s capacity to cope with the ongoing environmental changes in the Atlantic sector of the Southern Ocean. PMID:25818178
Transcriptome Analysis of Sarracenia, an Insectivorous Plant
Srivastava, Anuj; Rogers, Willie L.; Breton, Catherine M.; Cai, Liming; Malmberg, Russell L.
2011-01-01
Sarracenia species (pitcher plants) are carnivorous plants which obtain a portion of their nutrients from insects captured in the pitchers. To investigate these plants, we sequenced the transcriptome of two species, Sarracenia psittacina and Sarracenia purpurea, using Roche 454 pyrosequencing technology. We obtained 46 275 and 36 681 contigs by de novo assembly methods for S. psittacina and S. purpurea, respectively, and further identified 16 163 orthologous contigs between them. Estimation of synonymous substitution rates between orthologous and paralogous contigs indicates the events of genome duplication and speciation within the Sarracenia genus both occurred ∼2 million years ago. The ratios of synonymous and non-synonymous substitution rates indicated that 491 contigs have been under positive selection (Ka/Ks > 1). Significant proportions of these contigs were involved in functions related to binding activity. We also found that the greatest sequence similarity for both of these species was to Vitis vinifera, which is most consistent with a non-current classification of the order Ericales as an asterid. This study has provided new insights into pitcher plants and will contribute greatly to future research on this genus and its distinctive ecological adaptations. PMID:21676972
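The positive-selection screen mentioned in this abstract comes down to computing the Ka/Ks ratio for each orthologous contig pair and keeping the pairs above 1. The sketch below shows only that filtering step, assuming Ka and Ks have already been estimated elsewhere; the identifiers and rate values are invented for illustration and are not data from the study.

```python
def positively_selected(rates, threshold=1.0):
    """Return IDs of contig pairs whose Ka/Ks ratio exceeds the threshold.

    rates: dict mapping a contig-pair ID to a (Ka, Ks) tuple, where Ka is the
    non-synonymous and Ks the synonymous substitution rate.
    Pairs with Ks == 0 are skipped because the ratio is undefined.
    """
    selected = []
    for contig_id, (ka, ks) in rates.items():
        if ks > 0 and ka / ks > threshold:
            selected.append(contig_id)
    return selected

# Hypothetical values, for illustration only.
example_rates = {
    "pair_1": (0.030, 0.020),  # Ka/Ks = 1.5, candidate for positive selection
    "pair_2": (0.010, 0.050),  # Ka/Ks = 0.2, purifying selection
    "pair_3": (0.020, 0.0),    # undefined ratio, skipped
}
```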
Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus
2015-03-01
The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
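The sensitivity, specificity, and accuracy figures quoted in this abstract follow the standard definitions against a reference method. As a small sketch of those formulas, the function below computes them from a confusion table; the counts in the test are invented for illustration and are not taken from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Compute standard diagnostic metrics against a reference method.

    tp: mutated by both methods      fp: mutated here, wild-type by reference
    tn: wild-type by both methods    fn: wild-type here, mutated by reference
    """
    return {
        "sensitivity": tp / (tp + fn),                # share of true mutations found
        "specificity": tn / (tn + fp),                # share of wild-types confirmed
        "accuracy": (tp + tn) / (tp + fp + tn + fn),  # share of all calls correct
    }
```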
Ma, Chao; Wang, Hong; Macnish, Andrew J; Estrada-Melo, Alejandro C; Lin, Jing; Chang, Youhong; Reid, Michael S; Jiang, Cai-Zhong
2015-01-01
The woody resurrection plant Myrothamnus flabellifolia has remarkable tolerance to desiccation. Pyro-sequencing technology permitted us to analyze the transcriptome of M. flabellifolia during both dehydration and rehydration. We identified a total of 8287 and 8542 differentially transcribed genes during dehydration and rehydration treatments respectively. Approximately 295 transcription factors (TFs) and 484 protein kinases (PKs) were up- or down-regulated in response to desiccation stress. Among these, the transcript levels of 53 TFs and 91 PKs increased rapidly and peaked early during dehydration. These regulators transduce signal cascades of molecular pathways, including the up-regulation of ABA-dependent and independent drought stress pathways and the activation of protective mechanisms for coping with oxidative damage. Antioxidant systems are up-regulated, and the photosynthetic system is modified to reduce ROS generation. Secondary metabolism may participate in the desiccation tolerance of M. flabellifolia as indicated by increases in transcript abundance of genes involved in isopentenyl diphosphate biosynthesis. Up-regulation of genes encoding late embryogenesis abundant proteins and sucrose phosphate synthase is also associated with increased tolerance to desiccation. During rehydration, the transcriptome is also enriched in transcripts of genes encoding TFs and PKs, as well as genes involved in photosynthesis, and protein synthesis. The data reported here contribute comprehensive insights into the molecular mechanisms of desiccation tolerance in M. flabellifolia. PMID:26504577
Multiplex pyrosequencing of InDel markers for forensic DNA analysis.
Bus, Magdalena M; Karas, Ognjen; Allen, Marie
2016-12-01
The capillary electrophoresis (CE) technology is commonly used for fragment length separation of markers in forensic DNA analysis. In this study, pyrosequencing technology was used as an alternative and rapid tool for the analysis of biallelic InDel (insertion/deletion) markers for individual identification. The DNA typing is based on a subset of the InDel markers that are included in the Investigator® DIPplex Kit, which are sequenced in a multiplex pyrosequencing analysis. To facilitate the analysis of degraded DNA, the polymerase chain reaction (PCR) fragments were kept short in the primer design. Samples from individuals of Swedish origin were genotyped using the pyrosequencing strategy and analysis of the Investigator® DIPplex markers with CE. A comparison between the pyrosequencing and CE data revealed concordant results demonstrating a robust and correct genotyping by pyrosequencing. Using optimal marker combination and a directed dispensation strategy, five markers could be multiplexed and analyzed simultaneously. In this proof-of-principle study, we demonstrate that multiplex InDel pyrosequencing analysis is possible. However, further studies on degraded samples, lower DNA quantities, and mixtures will be required to fully optimize InDel analysis by pyrosequencing for forensic applications. Overall, although CE analysis is implemented in most forensic laboratories, multiplex InDel pyrosequencing offers a cost-effective alternative for some applications. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genomic Resources Notes Accepted 1 June 2015-31 July 2015.
Álvarez, P; Arthofer, Wolfgang; Coelho, Maria M; Conklin, D; Estonba, A; Grosso, Ana R; Helyar, S J; Langa, J; Machado, Miguel P; Montes, I; Pinho, Joana; Rief, Alexander; Schartl, Manfred; Schlick-Steiner, Birgit C; Seeber, Julia; Steiner, Florian M; Vilas, C
2015-11-01
This article documents the public availability of (i) microbiomes in diet and gut of larvae from the dipteran Dilophus febrilis using massive parallel sequencing, (ii) SNP and SSR discovery and characterization in the transcriptome of the Atlantic mackerel (Scomber scombrus, L) and (iii) assembled transcriptome for an endangered, endemic Iberian cyprinid fish (Squalius pyrenaicus). © 2015 John Wiley & Sons Ltd.
2011-01-01
Background Bupleurum chinense DC. is a widely used traditional Chinese medicinal plant. Saikosaponins are the major bioactive constituents of B. chinense, but relatively little is known about saikosaponin biosynthesis. The 454 pyrosequencing technology provides a promising opportunity for finding novel genes that participate in plant metabolism. Consequently, this technology may help to identify the candidate genes involved in the saikosaponin biosynthetic pathway. Results One-quarter of the 454 pyrosequencing runs produced a total of 195,088 high-quality reads, with an average read length of 356 bases (NCBI SRA accession SRA039388). A de novo assembly generated 24,037 unique sequences (22,748 contigs and 1,289 singletons), 12,649 (52.6%) of which were annotated against three public protein databases using a basic local alignment search tool (E-value ≤1e-10). All unique sequences were compared with NCBI expressed sequence tags (ESTs) (237) and encoding sequences (44) from the Bupleurum genus, and with a Sanger-sequenced EST dataset (3,111). The 23,173 (96.4%) unique sequences obtained in the present study represent novel Bupleurum genes. The ESTs of genes related to saikosaponin biosynthesis were found to encode known enzymes that catalyze the formation of the saikosaponin backbone; 246 cytochrome P450 (P450s) and 102 glycosyltransferases (GTs) unique sequences were also found in the 454 dataset. Full length cDNAs of 7 P450s and 7 uridine diphosphate GTs (UGTs) were verified by reverse transcriptase polymerase chain reaction or by cloning using 5' and/or 3' rapid amplification of cDNA ends. Two P450s and three UGTs were identified as the most likely candidates involved in saikosaponin biosynthesis. This finding was based on the coordinate up-regulation of their expression with β-AS in methyl jasmonate-treated adventitious roots and on their similar expression patterns with β-AS in various B. chinense tissues. 
Conclusions A collection of high-quality ESTs for B. chinense obtained by 454 pyrosequencing is provided here for the first time. These data should aid further research on the functional genomics of B. chinense and other Bupleurum species. The candidate genes for enzymes involved in saikosaponin biosynthesis, especially the P450s and UGTs, that were revealed provide a substantial foundation for follow-up research on the metabolism and regulation of the saikosaponins. PMID:22047182
Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino
2016-12-01
The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. 
edulis haemocyte transcriptome in response to B. ostreae through massive sequencing and has helped to improve our knowledge of the immune mechanisms of flat oyster. The validated oligo-microarray and the establishment of a reference transcriptome will be useful for large-scale gene expression studies in this species. Copyright © 2016 Elsevier Ltd. All rights reserved.
Elizaquível, Patricia; Pérez-Cataluña, Alba; Yépez, Alba; Aristimuño, Cecilia; Jiménez, Eugenia; Cocconcelli, Pier Sandro; Vignolo, Graciela; Aznar, Rosa
2015-04-02
The diversity of lactic acid bacteria (LAB) associated with chicha, a traditional maize-based fermented alcoholic beverage from Northwestern Argentina, was analyzed using culture-dependent and culture-independent approaches. Samples corresponding to 10 production steps were obtained from two local producers at Maimará (chicha M) and Tumbaya (chicha T). Whereas only a few species (Lactobacillus plantarum and Weissella viridescens in chicha M, and Enterococcus faecium and Leuconostoc mesenteroides in chicha T) were identified by the culture-dependent approach, a higher quantitative distribution of taxa was found in both beverages by pyrosequencing. The relative abundance of OTUs was higher in chicha M than in chicha T; six LAB genera were common to chicha M and T: Enterococcus, Lactococcus, Streptococcus, Weissella, Leuconostoc and Lactobacillus, while Pediococcus was detected only in chicha M. Among the 46 identified LAB species, those of Lactobacillus were dominant in both chicha samples, exhibiting the highest diversity, whereas Enterococcus and Leuconostoc were recorded as the second dominant genera in chicha T and M, respectively. Identification at species level showed the predominance of Lb. plantarum, Lactobacillus rossiae, Leuconostoc lactis and W. viridescens in chicha M while Enterococcus hirae, E. faecium, Lc. mesenteroides and Weissella confusa predominated in chicha T samples. In parallel, when presumptive LAB isolates (chicha M: 146; chicha T: 246) recovered from the same samples were identified by ISR-PCR and RAPD-PCR profiles, species-specific PCR and 16S rRNA gene sequencing, most of them were assigned to the Leuconostoc genus (Lc. mesenteroides and Lc. lactis) in chicha M, with Lactobacillus, Weissella and Enterococcus also present. In contrast, chicha T exhibited the presence of Enterococcus and Leuconostoc, E. faecium being the most representative species. 
A massive sequencing approach was applied for the first time to study the diversity and evolution of microbial communities during chicha manufacture. Although differences in the LAB species profile between the two geographically different chicha productions were observed by culturing, a larger number of predominant LAB species, as well as minority species, was revealed by pyrosequencing. The fine molecular inventory achieved by pyrosequencing provided more precise information on LAB population composition than culture-dependent analysis. Copyright © 2014 Elsevier B.V. All rights reserved.
Pyrosequencing for Microbial Identification and Characterization
Cummings, Patrick J.; Ahmed, Ray; Durocher, Jeffrey A.; Jessen, Adam; Vardi, Tamar; Obom, Kristina M.
2013-01-01
Pyrosequencing is a versatile technique that facilitates microbial genome sequencing and can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing of DNA by synthesizing the complementary strand a single base at a time, while determining the specific nucleotide being incorporated during the synthesis reaction. The reaction occurs on immobilized single stranded template DNA where the four deoxyribonucleotides (dNTP) are added sequentially and the unincorporated dNTPs are enzymatically degraded before addition of the next dNTP to the synthesis reaction. Detection of the specific base incorporated into the template is monitored by generation of chemiluminescent signals. The order of dNTPs that produce the chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns. PMID:23995536
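The way the dispensation order and the light signals together determine the sequence can be illustrated with a toy decoder. This is a simplified sketch, not instrument software: it assumes each signal has already been normalized so that 1.0 corresponds to one incorporated nucleotide and that a run of identical bases gives a proportionally larger signal. The function names and the example values are hypothetical.

```python
def decode_pyrogram(dispensation_order, signals):
    """Rebuild the synthesized strand from per-dispensation light signals.

    dispensation_order: string of nucleotides in the order they were added.
    signals: normalized light intensities, one per dispensation
             (assumed: about 1.0 per incorporated base, about 0.0 for none).
    """
    if len(dispensation_order) != len(signals):
        raise ValueError("one signal is required per dispensed nucleotide")
    bases = []
    for nucleotide, signal in zip(dispensation_order, signals):
        count = int(signal + 0.5)  # round to the nearest whole number of bases
        bases.append(nucleotide * count)
    return "".join(bases)

def reverse_complement(seq):
    """The template strand is the reverse complement of the synthesized strand."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]
```

For example, dispensing `ACGTACGT` with signals `[1.0, 0.0, 2.1, 0.1, 0.9, 1.0, 0.0, 1.1]` decodes to the synthesized strand `AGGACT`, whose template is `AGTCCT`.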
Zhu, Genfa; Yang, Fengxi; Shi, Shanshan; Li, Dongmei; Wang, Zhen; Liu, Hailin; Huang, Dan; Wang, Caiyun
2015-01-01
The highly variable leaf color of Cymbidium sinense significantly improves its horticultural and economic value, and makes it highly desirable in the flower markets in China and Southeast Asia. However, little is understood about the molecular mechanism underlying leaf-color variations. In this study, we found the content of photosynthetic pigments, especially chlorophyll degradation metabolite in the leaf-color mutants is distinguished significantly from that in the wild type of Cymbidium sinense 'Dharma'. To further determine the candidate genes controlling leaf-color variations, we first sequenced the global transcriptome using 454 pyrosequencing. More than 0.7 million expressed sequence tags (ESTs) with an average read length of 445.9 bp were generated and assembled into 103,295 isotigs representing 68,460 genes. Of these isotigs, 43,433 were significantly aligned to known proteins in the public database, of which 29,299 could be categorized into 42 functional groups in the gene ontology system, 10,079 classified into 23 functional classifications in the clusters of orthologous groups system, and 23,092 assigned to 139 clusters of specific metabolic pathways in the Kyoto Encyclopedia of Genes and Genomes. Among these annotations, 95 isotigs were designated as involved in chlorophyll metabolism. On this basis, we identified 16 key enzyme-encoding genes in the chlorophyll metabolism pathway, the full length cDNAs and expressions of which were further confirmed. Expression pattern indicated that the key enzyme-encoding genes for chlorophyll degradation were more highly expressed in the leaf color mutants, as was consistent with their lower chlorophyll contents. 
This study is the first to supply an informative 454 EST dataset for Cymbidium sinense 'Dharma' and to identify original leaf color-associated genes, which provide important resources to facilitate gene discovery for molecular breeding, marketable trait discovery, and investigating various biological process in this species. PMID:26042676
Li, Jianjun; Ye, Guangyun; Sun, Duanfang; Sun, Guoping; Zeng, Xiaowei; Xu, Jian; Liang, Shizhong
2012-09-01
Two identical biotrickling filters named BTFa and BTFb were run in parallel to examine their performances in removing hydrogen sulfide. BTFa was filled with ceramic granules, and BTFb was filled with volcanic rocks. The results showed that BTFb was more robust than BTFa under acidic conditions. At empty bed residence times (EBRTs) of 20 and 15 s, the removal efficiency of BTFa was close to 100%. At EBRTs of 10 and 5 s, the removal efficiency of BTFa slightly decreased. The removal efficiencies of BTFa decreased by different degrees at the end of each stage, dropping to 94%, 81%, 60%, and 71%, respectively. However, the H2S removal efficiency in BTFb consistently reached 99% throughout the experiment. Pyrosequencing analyses indicated that members of Thiomonas dominated in both BTFs, but the relative abundance of Acidithiobacillus was higher in BTFb than in BTFa.
RNA-Skim: a rapid method for RNA-Seq quantification at transcript level
Zhang, Zhaojun; Wang, Wei
2014-01-01
Motivation: RNA-Seq technique has been demonstrated as a revolutionary means for exploring transcriptome because it provides deep coverage and base pair-level resolution. RNA-Seq quantification is proven to be an efficient alternative to Microarray technique in gene expression study, and it is a critical component in RNA-Seq differential expression analysis. Most existing RNA-Seq quantification tools require the alignments of fragments to either a genome or a transcriptome, entailing a time-consuming and intricate alignment step. To improve the performance of RNA-Seq quantification, an alignment-free method, Sailfish, has been recently proposed to quantify transcript abundances using all k-mers in the transcriptome, demonstrating the feasibility of designing an efficient alignment-free method for transcriptome quantification. Even though Sailfish is substantially faster than alternative alignment-dependent methods such as Cufflinks, using all k-mers in the transcriptome quantification impedes the scalability of the method. Results: We propose a novel RNA-Seq quantification method, RNA-Skim, which partitions the transcriptome into disjoint transcript clusters based on sequence similarity, and introduces the notion of sig-mers, which are a special type of k-mers uniquely associated with each cluster. We demonstrate that the sig-mer counts within a cluster are sufficient for estimating transcript abundances with accuracy comparable with any state-of-the-art method. This enables RNA-Skim to perform transcript quantification on each cluster independently, reducing a complex optimization problem into smaller optimization tasks that can be run in parallel. As a result, RNA-Skim uses <4% of the k-mers and <10% of the CPU time required by Sailfish. 
It is able to finish transcriptome quantification in <10 min per sample by using just a single thread on a commodity computer, which represents a >100-fold speedup over the state-of-the-art alignment-based methods, while delivering comparable or higher accuracy. Availability and implementation: The software is available at http://www.csbio.unc.edu/rs. Contact: weiwang@cs.ucla.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24931995
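The notion of sig-mers, k-mers that occur in exactly one transcript cluster, can be illustrated with a short sketch. This is not the RNA-Skim implementation: it assumes the clusters are already given, ignores strand and the selection of a sig-mer subset, and simply keeps every k-mer seen in a single cluster. All names and sequences are hypothetical.

```python
from collections import defaultdict

def kmers(seq, k):
    """Return the set of all k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def find_sig_mers(clusters, k):
    """Map each cluster ID to the k-mers found in that cluster only.

    clusters: dict mapping a cluster ID to a list of transcript sequences
              (assumed to be precomputed by sequence similarity).
    """
    owners = defaultdict(set)  # k-mer -> IDs of clusters containing it
    for cluster_id, transcripts in clusters.items():
        for seq in transcripts:
            for kmer in kmers(seq, k):
                owners[kmer].add(cluster_id)
    sig_mers = defaultdict(set)
    for kmer, cluster_ids in owners.items():
        if len(cluster_ids) == 1:  # unique to one cluster, so it is a sig-mer
            sig_mers[next(iter(cluster_ids))].add(kmer)
    return dict(sig_mers)
```

Because each sig-mer points to a single cluster, counts of sig-mers in the reads can be assigned to clusters without alignment, and each cluster can then be quantified independently, which is what makes the per-cluster tasks parallelizable.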
Pyrosequencing Analysis of Bench-Scale Nitrifying Biofilters Removing Trihalomethanes
The bacterial biofilm communities in four nitrifying biofilters degrading regulated drinking water trihalomethanes were characterized by 454 pyrosequencing. The three most abundant phylotypes based on total diversity were Nitrosomonas (70%), Nitrobacter (14%), and Chitinophagace...
Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe
2013-01-01
Background Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass have limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. Results The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSR-containing unigenes were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. Conclusions This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species. PMID:23861841
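EST-SSR mining of the kind reported above amounts to scanning unigene sequences for short motifs repeated in tandem. The following is a minimal sketch under stated assumptions: the function name `find_ssrs`, the motif length range (2 to 6 bp) and the minimum of five repeats are illustrative choices, not the criteria used by the authors, who do not state them in the abstract.

```python
import re


def find_ssrs(seq, min_repeats=5, min_motif=2, max_motif=6):
    """Return (motif, repeat_count, start) for tandem repeats in seq.

    The thresholds are hypothetical defaults chosen for illustration.
    """
    # A lazily matched motif followed by at least (min_repeats - 1) further
    # copies of itself; laziness makes the shortest repeating unit win.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_motif, max_motif, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        if len(set(motif)) == 1:
            continue  # skip homopolymer runs such as AAAAAAAAAA
        repeats = len(match.group(0)) // len(motif)
        hits.append((motif, repeats, match.start()))
    return hits
```

For example, a sequence containing six copies of AG and five copies of CTT yields one dinucleotide and one trinucleotide hit, each reported with its zero-based start position.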
Meyer, B; Martini, P; Biscontin, A; De Pittà, C; Romualdi, C; Teschke, M; Frickenhaus, S; Harms, L; Freier, U; Jarman, S; Kawaguchi, S
2015-11-01
The Antarctic krill, Euphausia superba, has a key position in the Southern Ocean food web by serving as a direct link between primary producers and apex predators. The south-west Atlantic sector of the Southern Ocean, where the majority of the krill population is located, is experiencing one of the most profound environmental changes worldwide. Up to now, we have only cursory information about krill's genomic plasticity to cope with the ongoing environmental changes induced by anthropogenic CO2 emissions. The genome of krill is not yet available due to its large size (about 48 Gbp). Here, we present two cDNA normalized libraries from whole krill and krill heads sampled in different seasons that were combined with two data sets of krill transcriptome projects, already published, to produce the first knowledgebase krill 'master' transcriptome. The new library produced 25% more E. superba transcripts and now includes nearly all the enzymes involved in the primary oxidative metabolism (Glycolysis, Krebs cycle and oxidative phosphorylation) as well as all genes involved in glycogenesis, glycogen breakdown, gluconeogenesis, fatty acid synthesis and fatty acid β-oxidation. With these features, the 'master' transcriptome provides the most complete picture of metabolic pathways in Antarctic krill and will provide a major resource for future physiological and molecular studies. This will be particularly valuable for characterizing the molecular networks that respond to stressors caused by the anthropogenic CO2 emissions and krill's capacity to cope with the ongoing environmental changes in the Atlantic sector of the Southern Ocean. © 2015 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.
2012-01-01
Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280
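The dating step described above rests on the standard molecular-clock relation T = Ks / (2r), where Ks is the number of synonymous substitutions per synonymous site between two sequences and r is the substitution rate per site per year; the factor of two reflects that both lineages accumulate changes. A minimal sketch follows. The function name and the rate used in the usage note are assumptions for illustration, since the abstract does not report the rate the authors used.

```python
def divergence_time_mya(ks, rate_per_site_per_year):
    """Estimate divergence (or duplication) time in millions of years.

    Applies T = Ks / (2 * r). The rate must be supplied by the caller;
    no value from the carp study is assumed here.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    years = ks / (2.0 * rate_per_site_per_year)
    return years / 1e6
```

With a hypothetical rate of 1.5e-9 substitutions per site per year, a Ks of 0.3 corresponds to about 100 million years. The same relation applies to paralogous pairs, which is how a peak in the paralog Ks distribution can be converted into an estimated duplication date.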
Janecek, Elisabeth; Streichan, Sabine; Strube, Christina
2012-10-18
Rickettsioses are caused by pathogenic species of the genus Rickettsia and play an important role as emerging diseases. The bacteria are transmitted to mammal hosts including humans by arthropod vectors. Since detection, especially in tick vectors, is usually based on PCR with genus-specific primers to include different occurring Rickettsia species, subsequent species identification is mainly achieved by Sanger sequencing. In the present study a real-time pyrosequencing approach was established with the objective to differentiate between species occurring in German Ixodes ticks, which are R. helvetica, R. monacensis, R. massiliae, and R. felis. Tick material from a quantitative real-time PCR (qPCR) based study on Rickettsia-infections in I. ricinus allowed direct comparison of both sequencing techniques, Sanger and real-time pyrosequencing. A sequence stretch of rickettsial citrate synthase (gltA) gene was identified to contain divergent single nucleotide polymorphism (SNP) sites suitable for Rickettsia species differentiation. Positive control plasmids inserting the respective target sequence of each Rickettsia species of interest were constructed for initial establishment of the real-time pyrosequencing approach using Qiagen's PSQ 96MA Pyrosequencing System operating in a 96-well format. The approach included an initial amplification reaction followed by the actual pyrosequencing, which is traceable by pyrograms in real-time. Afterwards, real-time pyrosequencing was applied to 263 Ixodes tick samples already detected Rickettsia-positive in previous qPCR experiments. Establishment of real-time pyrosequencing using positive control plasmids resulted in accurate detection of all SNPs in all included Rickettsia species. The method was then applied to 263 Rickettsia-positive Ixodes ricinus samples, of which 153 (58.2%) could be identified for their species (151 R. helvetica and 2 R. monacensis) by previous custom Sanger sequencing. 
Real-time pyrosequencing identified all Sanger-determined ticks as well as 35 previously undifferentiated ticks resulting in a total number of 188 (71.5%) identified samples. Pyrosequencing sensitivity was found to be strongly dependent on gltA copy numbers in the reaction setup. Whereas less than 10^1 copies in the initial amplification reaction resulted in identification of 15.1% of the samples only, the percentage increased to 54.2% at 10^1-10^2 copies, to 95.6% at >10^2-10^3 copies and reached 100% samples identified for their Rickettsia species if more than 10^3 copies were present in the template. The established real-time pyrosequencing approach represents a reliable method for detection and differentiation of Rickettsia spp. present in I. ricinus diagnostic material and prevalence studies. Furthermore, the method proved to be faster, more cost-effective as well as more sensitive than custom Sanger sequencing with simultaneous high specificity.
Al-Sadi, A M; Al-Mazroui, S S; Phillips, A J L
2015-08-01
Potting media and organic fertilizers (OFs) are commonly used in agricultural systems. However, there is a lack of studies on the efficiency of culture-based techniques in assessing the level of fungal diversity in these products. A study was conducted to investigate the efficiency of seven culture-based techniques and pyrosequencing for characterizing fungal diversity in potting media and OFs. Fungal diversity was evaluated using serial dilution, direct plating and baiting with carrot slices, potato slices, radish seeds, cucumber seeds and cucumber cotyledons. Identity of all the isolates was confirmed on the basis of the internal transcribed spacer region of the ribosomal RNA (ITS rRNA) sequence data. The direct plating technique was found to be superior over other culture-based techniques in the number of fungal species detected. It was also found to be simple and the least time consuming technique. Comparing the efficiency of direct plating with 454 pyrosequencing revealed that pyrosequencing detected 12 and 15 times more fungal species from potting media and OFs respectively. Analysis revealed that there were differences between potting media and OFs in the dominant phyla, classes, orders, families, genera and species detected. Zygomycota (52%) and Chytridiomycota (60%) were the predominant phyla in potting media and OFs respectively. The superiority of pyrosequencing over cultural methods could be related to the ability to detect obligate fungi, slow growing fungi and fungi that exist at low population densities. The evaluated methods in this study, especially direct plating and pyrosequencing, may be used as tools to help detect and reduce movement of unwanted fungi between countries and regions. © 2015 The Society for Applied Microbiology.
Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.
Li, Xinguo; Wu, Harry X; Southerton, Simon G
2010-06-21
Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.
Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants
2010-01-01
Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution. PMID:20565927
Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.
2011-01-01
Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626
Evidence for trade-offs in detoxification and chemosensation gene signatures in Plutella xylostella.
Bautista, Ma Anita M; Bhandary, Binny; Wijeratne, Asela J; Michel, Andrew P; Hoy, Casey W; Mittapalli, Omprakash
2015-03-01
Detoxification genes have been associated with insecticide adaptation in the diamondback moth, Plutella xylostella. The link between chemosensation genes and adaptation, however, remains unexplored. To gain a better understanding of the involvement of these genes in insecticide adaptation, the authors exposed lines of P. xylostella to either high uniform (HU) or low heterogeneous (LH) concentrations of permethrin, expecting primarily physiological or behavioral selection respectively. Initially, 454 pyrosequencing was applied, followed by an examination of expression profiles of candidate genes that responded to selection [cytochrome P450 (CYP), glutathione S-transferase (GST), carboxylesterase (CarE), chemosensory protein (CSP) and odorant-binding protein (OBP)] by quantitative PCR in the larvae. Toxicity and behavioral assays were also conducted to document the effects of the two forms of exposure. Pyrosequencing of the P. xylostella transcriptome from adult heads and third instars produced 198,753 reads with 52,752,486 bases. Quantitative PCR revealed overexpression of CYP4M14, CYP305B1 and CSP8 in HU larvae. OBP13, however, was highest in LH. Larvae from LH and HU lines had up to five- and 752-fold resistance levels respectively, which could be due to overexpression of P450s. However, the behavioral responses of all lines to a series of permethrin concentrations did not vary significantly in any of the generations examined, in spite of the observed upregulation of CSP8 and OBP13. Expression patterns from the target genes provide insights into behavioral and physiological responses to permethrin and suggest a new avenue of research on the role of chemosensation genes in insect adaptation to toxins. © 2014 Society of Chemical Industry.
Chen, Xiaoping; Zhu, Wei; Azam, Sarwar; Li, Heying; Zhu, Fanghe; Li, Haifen; Hong, Yanbin; Liu, Haiyan; Zhang, Erhua; Wu, Hong; Yu, Shanlin; Zhou, Guiyuan; Li, Shaoxiong; Zhong, Ni; Wen, Shijie; Li, Xingyu; Knapp, Steve J; Ozias-Akins, Peggy; Varshney, Rajeev K; Liang, Xuanqiang
2013-01-01
The failure of peg penetration into the soil leads to seed abortion in peanut. Knowledge of genes involved in these processes is comparatively deficient. Here, we used RNA-seq to gain insights into transcriptomes of aerial and subterranean pods. More than 2 million transcript reads with an average length of 396 bp were generated from one aerial (AP) and two subterranean (SP1 and SP2) pod libraries using pyrosequencing technology. After assembly, sets of 49 632, 49 952 and 50 494 from a total of 74 974 transcript assembly contigs (TACs) were identified in AP, SP1 and SP2, respectively. A clear linear relationship in the gene expression level was observed between these data sets. In brief, 2194 differentially expressed TACs with a 99.0% true-positive rate were identified, among which 859 and 1068 TACs were up-regulated in aerial and subterranean pods, respectively. Functional analysis showed that putative function based on similarity with proteins catalogued in UniProt and gene ontology term classification could be determined for 59 342 (79.2%) and 42 955 (57.3%) TACs, respectively. A total of 2968 TACs were mapped to 174 KEGG pathways, of which 168 were shared by aerial and subterranean transcriptomes. TACs involved in photosynthesis were significantly up-regulated and enriched in the aerial pod. In addition, two senescence-associated genes were identified as significantly up-regulated in the aerial pod, which potentially contribute to embryo abortion in aerial pods, and in turn, to cessation of swelling. The data set generated in this study provides evidence for some functional genes as robust candidates underlying aerial and subterranean pod development and contributes to an elucidation of the evolutionary implications resulting from fruit development under light and dark conditions. © 2012 The Authors Plant Biotechnology Journal © 2012 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud
2011-09-01
The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were obtained. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.
Cruz, Andreia; Rodrigues, Raquel; Pinheiro, Miguel; Mendo, Sónia
2015-01-01
Aeromonas molluscorum Av27 cells were exposed to 0, 5 and 50 μM of TBT and the respective transcriptomes were obtained by pyrosequencing. Gene Ontology revealed that exposure to 5 μM TBT results in a higher number of repressed genes in contrast with 50 μM of TBT, where the number of over-expressed genes is greater. At both TBT concentrations, higher variations in gene expression were found in the functional categories associated with enzymatic activities, transport/binding and oxidation-reduction. A number of proteins are affected by TBT, such as the acriflavin resistance protein, several transcription-related proteins, several Hsps, ABC transporters, CorA and ZntB and other outer membrane efflux proteins, all of these involved in cellular metabolic processes, important to maintain overall cell viability. Using the STRING tool, several proteins with unknown function were related with others involved in degradation processes, such as the pyoverdine chromophore biosynthetic protein, that has been described as playing a role in the Sn–C cleavage of organotins. This approach has allowed a better understanding of the molecular effects of exposure of bacterial cells to TBT. Furthermore, it contributes to the knowledge of the functional genomic aspects of bacteria exposed to this pollutant. In addition, the transcriptomic data gathered, and now publicly available, constitute a valuable resource for comparative genome analysis. PMID:26171931
Cruz, Andreia; Rodrigues, Raquel; Pinheiro, Miguel; Mendo, Sónia
2015-08-01
Aeromonas molluscorum Av27 cells were exposed to 0, 5 and 50 μM of TBT and the respective transcriptomes were obtained by pyrosequencing. Gene Ontology revealed that exposure to 5 μM TBT results in a higher number of repressed genes in contrast with 50 μM of TBT, where the number of over-expressed genes is greater. At both TBT concentrations, higher variations in gene expression were found in the functional categories associated with enzymatic activities, transport/binding and oxidation-reduction. A number of proteins are affected by TBT, such as the acriflavin resistance protein, several transcription-related proteins, several Hsps, ABC transporters, CorA and ZntB and other outer membrane efflux proteins, all of these involved in cellular metabolic processes, important to maintain overall cell viability. Using the STRING tool, several proteins with unknown function were related with others involved in degradation processes, such as the pyoverdine chromophore biosynthetic protein, that has been described as playing a role in the Sn-C cleavage of organotins. This approach has allowed a better understanding of the molecular effects of exposure of bacterial cells to TBT. Furthermore, it contributes to the knowledge of the functional genomic aspects of bacteria exposed to this pollutant. In addition, the transcriptomic data gathered, and now publicly available, constitute a valuable resource for comparative genome analysis. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
Hajibabaei, Mehrdad; Shokralla, Shadi; Zhou, Xin; Singer, Gregory A. C.; Baird, Donald J.
2011-01-01
Timely and accurate biodiversity analysis poses an ongoing challenge for the success of biomonitoring programs. Morphology-based identification of bioindicator taxa is time consuming, and rarely supports species-level resolution especially for immature life stages. Much work has been done in the past decade to develop alternative approaches for biodiversity analysis using DNA sequence-based approaches such as molecular phylogenetics and DNA barcoding. On-going assembly of DNA barcode reference libraries will provide the basis for a DNA-based identification system. The use of recently introduced next-generation sequencing (NGS) approaches in biodiversity science has the potential to further extend the application of DNA information for routine biomonitoring applications to an unprecedented scale. Here we demonstrate the feasibility of using 454 massively parallel pyrosequencing for species-level analysis of freshwater benthic macroinvertebrate taxa commonly used for biomonitoring. We designed our experiments in order to directly compare morphology-based, Sanger sequencing DNA barcoding, and next-generation environmental barcoding approaches. Our results show the ability of 454 pyrosequencing of mini-barcodes to accurately identify all species with more than 1% abundance in the pooled mixture. Although the approach failed to identify 6 rare species in the mixture, the presence of sequences from 9 species that were not represented by individuals in the mixture provides evidence that DNA based analysis may yet provide a valuable approach in finding rare species in bulk environmental samples. We further demonstrate the application of the environmental barcoding approach by comparing benthic macroinvertebrates from an urban region to those obtained from a conservation area. 
Although considerable effort will be required to robustly optimize NGS tools to identify species from bulk environmental samples, our results indicate the potential of an environmental barcoding approach for biomonitoring programs. PMID:21533287
Trumbić, Željka; Bekaert, Michaël; Taggart, John B; Bron, James E; Gharbi, Karim; Mladineo, Ivona
2015-11-25
The largest of the tuna species, Atlantic bluefin tuna (Thunnus thynnus), inhabits the North Atlantic Ocean and the Mediterranean Sea and is considered to be an endangered species, largely a consequence of overfishing. T. thynnus aquaculture, referred to as fattening or farming, is a capture based activity dependent on yearly renewal from the wild. Thus, the development of aquaculture practices independent of wild resources can provide an important contribution towards ensuring security and sustainability of this species in the longer-term. The development of such practices is today greatly assisted by large scale transcriptomic studies. We have used pyrosequencing technology to sequence a mixed-tissue normalised cDNA library, derived from adult T. thynnus. A total of 976,904 raw sequence reads were assembled into 33,105 unique transcripts having a mean length of 893 bases and an N50 of 870. Of these, 33.4% showed similarity to known proteins or gene transcripts and 86.6% of them were matched to the congeneric Pacific bluefin tuna (Thunnus orientalis) genome, compared to 70.3% for the more distantly related Nile tilapia (Oreochromis niloticus) genome. Transcript sequences were used to develop a novel 15 K Agilent oligonucleotide DNA microarray for T. thynnus and comparative tissue gene expression profiles were inferred for gill, heart, liver, ovaries and testes. Functional contrasts were strongest between gills and ovaries. Gills were particularly associated with immune system, signal transduction and cell communication, while ovaries displayed signatures of glycan biosynthesis, nucleotide metabolism, transcription, translation, replication and repair. Sequence data generated from a novel mixed-tissue T. thynnus cDNA library provide an important transcriptomic resource that can be further employed for study of various aspects of T. thynnus ecology and genomics, with strong applications in aquaculture. 
Tissue-specific gene expression profiles inferred through the use of novel oligo-microarray can serve in the design of new and more focused transcriptomic studies for future research of tuna physiology and assessment of the welfare in a production environment.
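The N50 statistic quoted in the abstract above is the length L such that contigs of length L or longer together contain at least half of all assembled bases. A minimal sketch of the usual computation is shown below; the function name `n50` is an illustrative choice and the code is not taken from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or transcript lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be non-empty")
```

For lengths 2, 2, 2, 3, 3, 4, 8 and 8 (32 bases in total), the two longest sequences already cover 16 bases, so the N50 is 8.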
Transcriptome-wide investigation of genomic imprinting in chicken.
Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique
2014-04-01
Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken.
Jueterbock, A; Franssen, S U; Bergmann, N; Gu, J; Coyer, J A; Reusch, T B H; Bornberg-Bauer, E; Olsen, J L
2016-11-01
Populations distributed across a broad thermal cline are instrumental in addressing adaptation to increasing temperatures under global warming. Using a space-for-time substitution design, we tested for parallel adaptation to warm temperatures along two independent thermal clines in Zostera marina, the most widely distributed seagrass in the temperate Northern Hemisphere. A North-South pair of populations was sampled along the European and North American coasts and exposed to a simulated heatwave in a common-garden mesocosm. Transcriptomic responses under control, heat stress and recovery were recorded in 99 RNAseq libraries with ~13 000 uniquely annotated, expressed genes. We corrected for phylogenetic differentiation among populations to discriminate neutral from adaptive differentiation. The two southern populations recovered faster from heat stress and showed parallel transcriptomic differentiation, as compared with northern populations. Among 2389 differentially expressed genes, 21 exceeded neutral expectations and were likely involved in parallel adaptation to warm temperatures. However, the strongest differentiation following phylogenetic correction was between the three Atlantic populations and the Mediterranean population with 128 of 4711 differentially expressed genes exceeding neutral expectations. Although adaptation to warm temperatures is expected to reduce sensitivity to heatwaves, the continued resistance of seagrass to further anthropogenic stresses may be impaired by heat-induced downregulation of genes related to photosynthesis, pathogen defence and stress tolerance. © 2016 John Wiley & Sons Ltd.
Philipp, Eva E. R.; Kraemer, Lars; Melzner, Frank; Poustka, Albert J.; Thieme, Sebastian; Findeisen, Ulrike; Schreiber, Stefan; Rosenstiel, Philip
2012-01-01
The marine mussel Mytilus edulis and its closely related sister species are distributed world-wide and play an important role in coastal ecology and economy. The diversification in different species and their hybrids, broad ecological distribution, as well as the filter feeding mode of life has made this genus an attractive model to investigate physiological and molecular adaptations and responses to various biotic and abiotic environmental factors. In the present study we investigated the immune system of Mytilus, which may contribute to the ecological plasticity of this species. We generated a large Mytilus transcriptome database from different tissues of immune challenged and stress treated individuals from the Baltic Sea using 454 pyrosequencing. Phylogenetic comparison of orthologous groups of 23 species demonstrated the basal position of lophotrochozoans within protostomes. The investigation of immune related transcripts revealed a complex repertoire of innate recognition receptors and downstream pathway members including transcripts for 27 toll-like receptors and 524 C1q domain containing transcripts. NOD-like receptors on the other hand were absent. We also found evidence for sophisticated TNF, autophagy and apoptosis systems as well as for cytokines. Gill tissue and hemocytes showed highest expression of putative immune related contigs and are promising tissues for further functional studies. Our results partly contrast with findings of a less complex immune repertoire in ecdysozoan and other lophotrochozoan protostomes. We show that bivalves are interesting candidates to investigate the evolution of the immune system from basal metazoans to deuterostomes and protostomes and provide a basis for future molecular work directed to immune system functioning in Mytilus. PMID:22448234
De Novo Assembly and Functional Annotation of the Olive (Olea europaea) Transcriptome
Muñoz-Mérida, Antonio; González-Plaza, Juan José; Cañada, Andrés; Blanco, Ana María; García-López, Maria del Carmen; Rodríguez, José Manuel; Pedrola, Laia; Sicardo, M. Dolores; Hernández, M. Luisa; De la Rosa, Raúl; Belaj, Angjelina; Gil-Borja, Mayte; Luque, Francisco; Martínez-Rivas, José Manuel; Pisano, David G.; Trelles, Oswaldo; Valpuesta, Victoriano; Beuzón, Carmen R.
2013-01-01
Olive breeding programmes are focused on selecting for traits such as a short juvenile period, plant architecture suited for mechanical harvest, or oil characteristics, including fatty acid composition, phenolic, and volatile compounds to suit new markets. Understanding the molecular basis of these characteristics and improving the efficiency of such breeding programmes require the development of genomic information and tools. However, despite its economic relevance, genomic information on olive or closely related species is still scarce. We have applied Sanger and 454 pyrosequencing technologies to generate close to 2 million reads from 12 cDNA libraries obtained from the Picual, Arbequina, and Lechin de Sevilla cultivars and seedlings from a segregating progeny of a Picual × Arbequina cross. The libraries include fruit mesocarp and seeds at three relevant developmental stages, young stems and leaves, active juvenile and adult buds as well as dormant buds, and juvenile and adult roots. The reads were assembled by library or tissue and then assembled together into 81 020 unigenes with an average size of 496 bases. Here, we report their assembly and their functional annotation. PMID:23297299
Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier
2008-01-01
Background "Open" transcriptome analysis methods allow the study of gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the most widely used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 bp tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several million tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of the mouse hypothalamus transcriptome while avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 million tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of the transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152
Gupta, Yogesh; Pathak, Ashish K; Singh, Kashmir; Mantri, Shrikant S; Singh, Sudhir P; Tuli, Rakesh
2015-02-14
Annona squamosa L., a popular fruit tree, is the most widely cultivated species of the genus Annona. The lack of transcriptomic and genomic information limits the scope of genome investigations in this important shrub. It bears aggregate fruits with numerous seeds. A few rare accessions with very few seeds have been reported for Annona. A massive pyrosequencing (Roche, 454 GS FLX+) of transcriptome from early stages of fruit development (0, 4, 8 and 12 days after pollination) was performed to produce expression datasets in two genotypes, Sitaphal and NMK-1, that show a contrast in the number of seeds set in fruits. The data reported here is the first source of genome-wide differential transcriptome sequence in two genotypes of A. squamosa, and identifies several candidate genes related to seed development. Approximately 1.9 million high-quality clean reads were obtained in the cDNA library from the developing fruits of both the genotypes, with an average length of about 568 bp. Quality reads were assembled de novo into 2074 to 11004 contigs in the developing fruit samples at different stages of development. The contig sequence data of all the four stages of each genotype were combined into larger units, resulting in 14921 (Sitaphal) and 14178 (NMK-1) unigenes, with a mean size of more than 1 kb. Assembled unigenes were functionally annotated by querying against the protein sequences of five different public databases (NCBI non redundant, Prunus persica, Vitis vinifera, Fragaria vesca, and Amborella trichopoda), with an E-value cut-off of 10^-5. A total of 4588 (Sitaphal) and 2502 (NMK-1) unigenes did not match any known protein in the NR database. These sequences could be genes specific to Annona sp. or belong to untranslated regions. Several of the unigenes representing pathways related to primary and secondary metabolism, and seed and fruit development, were expressed at a higher level in Sitaphal, the densely seeded cultivar, in comparison to the poorly seeded NMK-1. 
A total of 2629 (Sitaphal) and 3445 (NMK-1) Simple Sequence Repeat (SSR) motifs were identified respectively in the two genotypes. These could be potential candidates for transcript based microsatellite analysis in A. squamosa. The present work provides early-stage fruit specific transcriptome sequence resource for A. squamosa. This repository will serve as a useful resource for investigating the molecular mechanisms of fruit development, and improvement of fruit related traits in A. squamosa and related species.
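The SSR identification step mentioned above can be sketched with a regular expression. This is only an illustration under stated assumptions: the abstract does not say which tool or thresholds were used, so the function name `find_ssrs`, the unit sizes (2 to 6 bases) and the minimum of 4 tandem repeats are hypothetical choices, and runs of a single base are skipped.

```python
import re


def find_ssrs(seq, min_unit=2, max_unit=6, min_repeats=4):
    """Return (start, unit, repeat_count) for tandem repeats in seq.

    Thresholds are illustrative assumptions, not the study's settings.
    """
    # Lazy unit match so the shortest repeating unit is reported first.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        if len(set(unit)) == 1:
            continue  # skip homopolymer runs such as AAAAAAAA
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits


# Hypothetical toy sequence containing an (AG)5 repeat.
print(find_ssrs("TTAGAGAGAGAGCC"))
```

Dedicated SSR tools usually apply a different minimum repeat count per unit size; a single threshold is used here only to keep the sketch short.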
Munoz, Sergio; Guerrero, Felix D.; Kellogg, Anastasia; Heekin, Andrew M.
2017-01-01
The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller’s organ, located in the tick’s forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor. PMID:28231302
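The abstract above mentions Python scripts that find open reading frames in each unigene. A minimal sketch of one way to do this is given below. It assumes an ORF is an in-frame stretch from an ATG start codon to the first stop codon, scanned in all six reading frames. The function names, the `min_codons` parameter and the toy sequence are hypothetical, and the authors' own ORF definition is not given in the abstract and may differ.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3):
    """Return (strand, frame, orf_sequence) for ATG...stop ORFs in six frames.

    min_codons counts the start and stop codons; it is an illustrative
    threshold, not a value taken from the study.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, frame, s[start:i + 3]))
                    start = None  # look for the next ORF in this frame
    return orfs


# Hypothetical toy sequence, not a tick unigene.
print(find_orfs("ccATGAAATAGcc"))
```

Each reported ORF could then be translated and passed to downstream predictors such as those named in the abstract.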
Munoz, Sergio; Guerrero, Felix D; Kellogg, Anastasia; Heekin, Andrew M; Leung, Ming-Ying
2017-01-01
The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller's organ, located in the tick's forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor.
Transcriptomic survey of the midgut of Anthonomus grandis (Coleoptera: Curculionidae).
Salvador, Ricardo; Príncipi, Darío; Berretta, Marcelo; Fernández, Paula; Paniego, Norma; Sciocco-Cap, Alicia; Hopp, Esteban
2014-01-01
Anthonomus grandis Boheman is a key pest in cotton crops in the New World. Its larval stage develops within the flower bud using it as food and as protection against its predators. This behavior limits the effectiveness of its control using conventional insecticide applications and biocontrol techniques. In spite of its importance, little is known about its genome sequence and, more important, its specific expression in key organs like the midgut. Total mRNA isolated from larval midguts was used for pyrosequencing. Sequence reads were assembled and annotated to generate a unigene data set. In total, 400,000 reads from A. grandis midgut with an average length of 237 bp were assembled and combined into 20,915 contigs. The assembled reads fell into 6,621 gene models. BlastX search using the NCBI-NR database showed that 3,006 unigenes had significant matches to known sequences. Gene Ontology (GO) mapping analysis showed that A. grandis is able to express transcripts coding for proteins involved in catalytic processing of macromolecules, which allows its adaptation to very different feeding source scenarios. Furthermore, transcripts encoding proteins involved in detoxification mechanisms such as p450 genes, glutathione-S-transferase, and carboxylesterases are also expressed. This is the first report of a transcriptomic study in A. grandis and the largest set of sequence data reported for this species. These data are valuable resources to expand the knowledge of this insect group and could be used in the design of new control strategies based on molecular information. © The Author 2014. Published by Oxford University Press on behalf of the Entomological Society of America.
IgM Repertoire Biodiversity is Reduced in HIV-1 Infection and Systemic Lupus Erythematosus.
Yin, Li; Hou, Wei; Liu, Li; Cai, Yunpeng; Wallet, Mark Andrew; Gardner, Brent Paul; Chang, Kaifen; Lowe, Amanda Catherine; Rodriguez, Carina Adriana; Sriaroon, Panida; Farmerie, William George; Sleasman, John William; Goodenow, Maureen Michels
2013-01-01
HIV-1 infection or systemic lupus erythematosus (SLE) disrupt B cell homeostasis, reduce memory B cells, and impair function of IgG and IgM antibodies. To determine how disturbances in B cell populations producing polyclonal antibodies relate to the IgM repertoire, the IgM transcriptome in health and disease was explored at the complementarity determining region 3 (CDRH3) sequence level. 454-deep pyrosequencing in combination with a novel analysis pipeline was applied to define populations of IGHM CDRH3 sequences based on absence or presence of somatic hypermutations (SHM) in peripheral blood B cells. HIV or SLE subjects have reduced biodiversity within their IGHM transcriptome compared to healthy subjects, mainly due to a significant decrease in the number of unique combinations of alleles, although recombination machinery was intact. While major differences between sequences without or with SHM occurred among all groups, IGHD and IGHJ allele use, CDRH3 length distribution, or generation of SHM were similar among study cohorts. Antiretroviral therapy failed to normalize IGHM biodiversity in HIV-infected individuals. All subjects had a low frequency of allelic combinations within the IGHM repertoire similar to known broadly neutralizing HIV-1 antibodies. Polyclonal expansion would decrease overall IgM biodiversity independent of other mechanisms for development of the B cell repertoire. Applying deep sequencing as a strategy to follow development of the IgM repertoire in health and disease provides a novel molecular assessment of multiple points along the B cell differentiation pathway that is highly sensitive for detecting perturbations within the repertoire at the population level.
Transcriptomic Survey of the Midgut of Anthonomus grandis (Coleoptera: Curculionidae)
Salvador, Ricardo; Príncipi, Darío; Berretta, Marcelo; Fernández, Paula; Paniego, Norma; Sciocco-Cap, Alicia; Hopp, Esteban
2014-01-01
Anthonomus grandis Boheman is a key pest in cotton crops in the New World. Its larval stage develops within the flower bud using it as food and as protection against its predators. This behavior limits the effectiveness of its control using conventional insecticide applications and biocontrol techniques. In spite of its importance, little is known about its genome sequence and, more important, its specific expression in key organs like the midgut. Total mRNA isolated from larval midguts was used for pyrosequencing. Sequence reads were assembled and annotated to generate a unigene data set. In total, 400,000 reads from A. grandis midgut with an average length of 237 bp were assembled and combined into 20,915 contigs. The assembled reads fell into 6,621 gene models. BlastX search using the NCBI-NR database showed that 3,006 unigenes had significant matches to known sequences. Gene Ontology (GO) mapping analysis showed that A. grandis is able to express transcripts coding for proteins involved in catalytic processing of macromolecules, which allows its adaptation to very different feeding source scenarios. Furthermore, transcripts encoding proteins involved in detoxification mechanisms such as p450 genes, glutathione-S-transferase, and carboxylesterases are also expressed. This is the first report of a transcriptomic study in A. grandis and the largest set of sequence data reported for this species. These data are valuable resources to expand the knowledge of this insect group and could be used in the design of new control strategies based on molecular information. PMID:25473064
DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.
Sun, Zhe; Wang, Ting; Deng, Ke; Wang, Xiao-Feng; Lafyatis, Robert; Ding, Ying; Hu, Ming; Chen, Wei
2018-01-01
Single cell transcriptome sequencing (scRNA-Seq) has become a revolutionary tool to study cellular and molecular processes at single cell resolution. Among existing technologies, the recently developed droplet-based platform enables efficient parallel processing of thousands of single cells with direct counting of transcript copies using Unique Molecular Identifier (UMI). Despite the technology advances, statistical methods and computational tools are still lacking for analyzing droplet-based scRNA-Seq data. Particularly, model-based approaches for clustering large-scale single cell transcriptomic data are still under-explored. We developed DIMM-SC, a Dirichlet Mixture Model for clustering droplet-based Single Cell transcriptomic data. This approach explicitly models UMI count data from scRNA-Seq experiments and characterizes variations across different cell clusters via a Dirichlet mixture prior. We performed comprehensive simulations to evaluate DIMM-SC and compared it with existing clustering methods such as K-means, CellTree and Seurat. In addition, we analyzed public scRNA-Seq datasets with known cluster labels and in-house scRNA-Seq datasets from a study of systemic sclerosis with prior biological knowledge to benchmark and validate DIMM-SC. Both simulation studies and real data applications demonstrated that overall, DIMM-SC achieves substantially improved clustering accuracy and much lower clustering variability compared to other existing clustering methods. More importantly, as a model-based approach, DIMM-SC is able to quantify the clustering uncertainty for each single cell, facilitating rigorous statistical inference and biological interpretations, which are typically unavailable from existing clustering methods. DIMM-SC has been implemented in a user-friendly R package with a detailed tutorial available on www.pitt.edu/∼wec47/singlecell.html. Contact: wei.chen@chp.edu or hum@ccf.org. Supplementary data are available at Bioinformatics online.
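The idea of modelling UMI counts with a Dirichlet prior per cluster can be illustrated with a small sketch. It assumes that a cell's counts are multinomial with cluster-specific proportions drawn from a Dirichlet distribution, so that integrating out the proportions gives a Dirichlet-multinomial likelihood. The functions, parameter values and toy counts below are hypothetical. This is not the DIMM-SC package's API, and it omits the mixing proportions and the iterative parameter estimation that a full mixture model would need.

```python
from math import lgamma


def dirichlet_multinomial_loglik(counts, alpha):
    """Log-likelihood of a count vector under a Dirichlet-multinomial.

    The multinomial coefficient is omitted: it depends only on the counts,
    so it is identical for every cluster and does not affect the comparison.
    """
    n = sum(counts)
    a = sum(alpha)
    ll = lgamma(a) - lgamma(a + n)
    for x, al in zip(counts, alpha):
        ll += lgamma(x + al) - lgamma(al)
    return ll


def assign_cluster(counts, alphas):
    """Return (best_cluster_index, scores) given fixed per-cluster alphas."""
    scores = [dirichlet_multinomial_loglik(counts, a) for a in alphas]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Hypothetical example: three genes, two clusters with assumed parameters.
alphas = [[5.0, 1.0, 1.0], [1.0, 1.0, 5.0]]
cell = [9, 1, 0]
print(assign_cluster(cell, alphas))
```

Normalising the exponentiated scores (weighted by cluster proportions) would give per-cell posterior probabilities, which is the kind of clustering uncertainty the abstract refers to.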
Schäpe, Paul; Müller-Hagen, Dirk; Ouedraogo, Jean-Paul; Heiderich, Caroline; Jedamzick, Johanna; van den Hondel, Cees A.; Ram, Arthur F.; Meyer, Vera
2016-01-01
Understanding the genetic, molecular and evolutionary basis of cysteine-stabilized antifungal proteins (AFPs) from fungi is important for understanding whether their function is mainly defensive or associated with fungal growth and development. In the current study, a transcriptome meta-analysis of the Aspergillus niger γ-core protein AnAFP was performed to explore co-expressed genes and pathways, based on independent expression profiling microarrays covering 155 distinct cultivation conditions. This analysis uncovered that anafp displays a highly coordinated temporal and spatial transcriptional profile which is concomitant with key nutritional and developmental processes. Its expression profile coincides with early starvation response and parallels with genes involved in nutrient mobilization and autophagy. Using fluorescence- and luciferase reporter strains we demonstrated that the anafp promoter is active in highly vacuolated compartments and foraging hyphal cells during carbon starvation with CreA and FlbA, but not BrlA, as most likely regulators of anafp. A co-expression network analysis supported by luciferase-based reporter assays uncovered that anafp expression is embedded in several cellular processes including allorecognition, osmotic and oxidative stress survival, development, secondary metabolism and autophagy, and predicted StuA and VelC as additional regulators. The transcriptomic resources available for A. niger provide unparalleled resources to investigate the function of proteins. Our work illustrates how transcriptomic meta-analyses can lead to hypotheses regarding protein function and predict a role for AnAFP during slow growth, allorecognition, asexual development and nutrient recycling of A. niger and propose that it interacts with the autophagic machinery to enable these processes. PMID:27835655
Paege, Norman; Jung, Sascha; Schäpe, Paul; Müller-Hagen, Dirk; Ouedraogo, Jean-Paul; Heiderich, Caroline; Jedamzick, Johanna; Nitsche, Benjamin M; van den Hondel, Cees A; Ram, Arthur F; Meyer, Vera
2016-01-01
Understanding the genetic, molecular and evolutionary basis of cysteine-stabilized antifungal proteins (AFPs) from fungi is important for understanding whether their function is mainly defensive or associated with fungal growth and development. In the current study, a transcriptome meta-analysis of the Aspergillus niger γ-core protein AnAFP was performed to explore co-expressed genes and pathways, based on independent expression profiling microarrays covering 155 distinct cultivation conditions. This analysis uncovered that anafp displays a highly coordinated temporal and spatial transcriptional profile which is concomitant with key nutritional and developmental processes. Its expression profile coincides with early starvation response and parallels with genes involved in nutrient mobilization and autophagy. Using fluorescence- and luciferase reporter strains we demonstrated that the anafp promoter is active in highly vacuolated compartments and foraging hyphal cells during carbon starvation with CreA and FlbA, but not BrlA, as most likely regulators of anafp. A co-expression network analysis supported by luciferase-based reporter assays uncovered that anafp expression is embedded in several cellular processes including allorecognition, osmotic and oxidative stress survival, development, secondary metabolism and autophagy, and predicted StuA and VelC as additional regulators. The transcriptomic resources available for A. niger provide unparalleled resources to investigate the function of proteins. Our work illustrates how transcriptomic meta-analyses can lead to hypotheses regarding protein function and predict a role for AnAFP during slow growth, allorecognition, asexual development and nutrient recycling of A. niger and propose that it interacts with the autophagic machinery to enable these processes.
Small, Clayton M; Harlin-Cognato, April D; Jones, Adam G
2013-01-01
Evolutionary studies have revealed that reproductive proteins in animals and plants often evolve more rapidly than the genome-wide average. The causes of this pattern, which may include relaxed purifying selection, sexual selection, sexual conflict, pathogen resistance, reinforcement, or gene duplication, remain elusive. Investigative expansions to additional taxa and reproductive tissues have the potential to shed new light on this unresolved problem. Here, we embark on such an expansion, in a comparison of the brood-pouch transcriptome between two male-pregnant species of the pipefish genus Syngnathus. Male brooding tissues in syngnathid fishes represent a novel, nonurogenital reproductive trait, heretofore mostly uncharacterized from a molecular perspective. We leveraged next-generation sequencing (Roche 454 pyrosequencing) to compare transcript abundance in the male brooding tissues of pregnant with nonpregnant samples from Gulf (S. scovelli) and dusky (S. floridae) pipefish. A core set of protein-coding genes, including multiple members of astacin metalloprotease and c-type lectin gene families, is consistent between species in both the direction and magnitude of expression bias. As predicted, coding DNA sequence analysis of these putative “male pregnancy proteins” suggests rapid evolution relative to nondifferentially expressed genes and reflects signatures of adaptation similar in magnitude to those reported from Drosophila male accessory gland proteins. Although the precise drivers of male pregnancy protein divergence remain unknown, we argue that the male pregnancy transcriptome in syngnathid fishes, a clade diverse with respect to brooding morphology and mating system, represents a unique and promising object of study for understanding the perplexing evolutionary nature of reproductive molecules. PMID:24324861
Gupta, Parul; Goel, Ridhi; Pathak, Sumya; Srivastava, Apeksha; Singh, Surya Pratap; Sangwan, Rajender Singh; Asif, Mehar Hasan; Trivedi, Prabodh Kumar
2013-01-01
Withania somnifera is one of the most valuable medicinal plants used in Ayurvedic and other indigenous medicine systems due to bioactive molecules known as withanolides. As genomic information regarding this plant is very limited, little information is available about biosynthesis of withanolides. To facilitate the basic understanding of the withanolide biosynthesis pathways, we performed transcriptome sequencing for Withania leaf (101L) and root (101R), which specifically synthesize withaferin A and withanolide A, respectively. Pyrosequencing yielded 834,068 and 721,755 reads, which were assembled into 89,548 and 114,814 unique sequences from 101L and 101R, respectively. A total of 47,885 (101L) and 54,123 (101R) could be annotated using TAIR10, NR, tomato and potato databases. Gene Ontology and KEGG analyses provided a detailed view of all the enzymes involved in withanolide backbone synthesis. Our analysis identified members of cytochrome P450, glycosyltransferase and methyltransferase gene families with unique presence or differential expression in leaf and root that might be involved in synthesis of tissue-specific withanolides. We also detected simple sequence repeats (SSRs) in transcriptome data for use in future genetic studies. The comprehensive sequence resource developed for Withania in this study will help to elucidate the biosynthetic pathway for tissue-specific synthesis of secondary plant products in non-model plant organisms and will be helpful in developing strategies for enhanced biosynthesis of withanolides through biotechnological approaches. PMID:23667511
Validation of two ribosomal RNA removal methods for microbial metatranscriptomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Shaomei; Wurtzel, Omri; Singh, Kanwar
2010-10-01
The predominance of rRNAs in the transcriptome is a major technical challenge in sequence-based analysis of cDNAs from microbial isolates and communities. Several approaches have been applied to deplete rRNAs from (meta)transcriptomes, but no systematic investigation of potential biases introduced by any of these approaches has been reported. Here we validated the effectiveness and fidelity of the two most commonly used approaches, subtractive hybridization and exonuclease digestion, as well as combinations of these treatments, on two synthetic five-microorganism metatranscriptomes using massively parallel sequencing. We found that the effectiveness of rRNA removal was a function of community composition and RNA integrity for these treatments. Subtractive hybridization alone introduced the least bias in relative transcript abundance, whereas exonuclease and in particular combined treatments greatly compromised mRNA abundance fidelity. Illumina sequencing itself also can compromise quantitative data analysis by introducing a G+C bias between runs.
Stephenson, William; Donlin, Laura T; Butler, Andrew; Rozo, Cristina; Bracken, Bernadette; Rashidfarrokhi, Ali; Goodman, Susan M; Ivashkiv, Lionel B; Bykerk, Vivian P; Orange, Dana E; Darnell, Robert B; Swerdlow, Harold P; Satija, Rahul
2018-02-23
Droplet-based single-cell RNA-seq has emerged as a powerful technique for massively parallel cellular profiling. While this approach offers the exciting promise to deconvolute cellular heterogeneity in diseased tissues, the lack of cost-effective and user-friendly instrumentation has hindered widespread adoption of droplet microfluidic techniques. To address this, we developed a 3D-printed, low-cost droplet microfluidic control instrument and deployed it in a clinical environment to perform single-cell transcriptome profiling of disaggregated synovial tissue from five rheumatoid arthritis patients. We sequenced 20,387 single cells, revealing 13 transcriptomically distinct clusters. These encompass an unsupervised draft atlas of the autoimmune infiltrate that contributes to disease biology. Additionally, we identify previously uncharacterized fibroblast subpopulations and discern their spatial location within the synovium. We envision that this instrument will have broad utility in both research and clinical settings, enabling low-cost and routine application of microfluidic techniques.
Blood transcriptomics and metabolomics for personalized medicine.
Li, Shuzhao; Todor, Andrei; Luo, Ruiyan
2016-01-01
Molecular analysis of blood samples is pivotal to clinical diagnosis and has been intensively investigated since the rise of systems biology. Recent developments have opened new opportunities to utilize transcriptomics and metabolomics for personalized and precision medicine. Efforts from human immunology have infused into this area exquisite characterizations of subpopulations of blood cells. It is now possible to infer from blood transcriptomics, with fine accuracy, the contribution of immune activation and of cell subpopulations. In parallel, high-resolution mass spectrometry has brought revolutionary analytical capability, detecting > 10,000 metabolites, together with environmental exposure, dietary intake, microbial activity, and pharmaceutical drugs. Thus, the re-examination of blood chemicals by metabolomics is in order. Transcriptomics and metabolomics can be integrated to provide a more comprehensive understanding of the human biological states. We will review these new data and methods and discuss how they can contribute to personalized medicine.
Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre
2015-01-01
HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment, as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env can detect low levels of CXCR4-using variants that current genotypic approaches miss. However, processing the mass of sequence data and identifying true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing are rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used, but the errors generated during ultra-deep pyrosequencing are sequence-dependent rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences, followed by statistical filters to determine position-specific sensitivity thresholds rather than arbitrary fixed cut-offs. It retains authentic sequences with point mutations at V3 positions of interest and discards artifactual ones with accurate sensitivity thresholds. PMID:26585833
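The contrast between a fixed cut-off and position-specific thresholds can be illustrated with a minimal sketch. This is not the authors' pipeline: the binomial error model, the function name `min_variant_count`, the per-position error rates, the read depth, and the significance level are all assumptions chosen for illustration. For each position, the sketch finds the smallest variant read count that sequencing error alone would reach with probability at most `alpha`.

```python
import math


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p), computed in log space (requires 0 < p < 1)."""
    log_pmf = (
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log1p(-p)
    )
    return math.exp(log_pmf)


def min_variant_count(depth, error_rate, alpha=0.001):
    """Smallest count k such that P(errors >= k) <= alpha at this position."""
    cdf = 0.0
    for k in range(depth + 1):
        tail = 1.0 - cdf  # P(X >= k)
        if tail <= alpha:
            return k
        cdf += binomial_pmf(k, depth, error_rate)
    return depth + 1  # no count within the depth is significant


# Hypothetical per-position error rates (e.g. estimated from a clonal control).
error_rates = {"V3_pos_11": 0.0005, "V3_pos_25": 0.005}
depth = 5000  # hypothetical number of reads covering each position

thresholds = {pos: min_variant_count(depth, rate) for pos, rate in error_rates.items()}
for pos, k in thresholds.items():
    print(f"{pos}: call variants seen in at least {k} reads ({k / depth:.2%})")
```

Under these assumed inputs, the error-prone position requires a higher read count before a variant is retained, which is the behavior a single fixed cut-off cannot provide.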
Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre
2015-11-20
HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment, as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env can detect low levels of CXCR4-using variants that current genotypic approaches miss. However, processing the mass of sequence data and identifying true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing are rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used, but the errors generated during ultra-deep pyrosequencing are sequence-dependent rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences, followed by statistical filters to determine position-specific sensitivity thresholds rather than arbitrary fixed cut-offs. It retains authentic sequences with point mutations at V3 positions of interest and discards artifactual ones with accurate sensitivity thresholds.
Seim, Inge; Ma, Siming; Zhou, Xuming; Gerashchenko, Maxim V.; Lee, Sang-Goo; Suydam, Robert; George, John C.; Bickham, John W.; Gladyshev, Vadim N.
2014-01-01
Mammals vary dramatically in lifespan, by at least two orders of magnitude, but the molecular basis for this difference remains largely unknown. The bowhead whale Balaena mysticetus is the longest-lived mammal known, with an estimated maximal lifespan in excess of two hundred years. It is also one of the two largest animals and the most cold-adapted baleen whale species. Here, we report the first genome-wide gene expression analyses of the bowhead whale, based on the de novo assembly of its transcriptome. Bowhead whale or cetacean-specific changes in gene expression were identified in the liver, kidney and heart, and complemented with analyses of positively selected genes. Changes associated with altered insulin signaling and other gene expression patterns could help explain the remarkable longevity of bowhead whales as well as their adaptation to a lipid-rich diet. The data also reveal parallels in candidate longevity adaptations of the bowhead whale, naked mole rat and Brandt's bat. The bowhead whale transcriptome is a valuable resource for the study of this remarkable animal, including the evolution of longevity and its important correlates such as resistance to cancer and other diseases. PMID:25411232
Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing
Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li
2010-01-01
Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome. PMID:20392818
Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing.
Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li
2010-08-01
Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome.
Assessing the Gene Content of the Megagenome: Sugar Pine (Pinus lambertiana)
Gonzalez-Ibeas, Daniel; Martinez-Garcia, Pedro J.; Famula, Randi A.; Delfino-Mix, Annette; Stevens, Kristian A.; Loopstra, Carol A.; Langley, Charles H.; Neale, David B.; Wegrzyn, Jill L.
2016-01-01
Sugar pine (Pinus lambertiana Douglas) is within the subgenus Strobus with an estimated genome size of 31 Gbp. Transcriptomic resources are of particular interest in conifers due to the challenges presented in their megagenomes for gene identification. In this study, we present the first comprehensive survey of the P. lambertiana transcriptome through deep sequencing of a variety of tissue types to generate more than 2.5 billion short reads. Third generation, long reads generated through PacBio Iso-Seq have been included for the first time in conifers to combat the challenges associated with de novo transcriptome assembly. A technology comparison is provided here to contribute to the otherwise scarce comparisons of second and third generation transcriptome sequencing approaches in plant species. In addition, the transcriptome reference was essential for gene model identification and quality assessment in the parallel project responsible for sequencing and assembly of the entire genome. In this study, the transcriptomic data were also used to address questions surrounding lineage-specific Dicer-like proteins in conifers. These proteins play a role in the control of transposable element proliferation and the related genome expansion in conifers. PMID:27799338
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Shuangyan; Huang, Xin; Yang, Xiaohan
BACKGROUND: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass have limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS: The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSR-containing unigenes were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS: This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.
The transcriptome recipe for the venom cocktail of Tityus bahiensis scorpion.
de Oliveira, Ursula Castro; Candido, Denise Maria; Dorce, Valquíria Abrão Coronado; Junqueira-de-Azevedo, Inácio de Loiola Meirelles
2015-03-01
Scorpion venom is a mixture of peptides, including antimicrobial, bradykinin-potentiating and anionic peptides, and small to medium proteins, such as ion channel toxins, metalloproteinases and phospholipases, that together cause severe clinical manifestations. Tityus bahiensis is the second most medically important scorpion species in Brazil and it is widely distributed in the country with the exception of the North Region. Here we sequenced and analyzed the transcripts from the venom glands of T. bahiensis, aiming at identifying and annotating venom gland expressed genes. A total of 116,027 long reads were generated by pyrosequencing and assembled into 2891 isotigs. An annotation process identified transcripts by similarity to known toxins, revealing that putative venom components represent 7.4% of gene expression. The major toxins identified are potassium and sodium channel toxins, whereas metalloproteinases showed an unexpectedly high abundance. Phylogenetic analysis of deduced metalloproteinases from T. bahiensis and other scorpions revealed a pattern of ancient and intraspecific gene expansions. Other venom molecules identified include antimicrobial, anionic and bradykinin-potentiating peptides, besides several putative new venom components. This report provides the first attempt to massively identify the venom components of this species and constitutes one of the few transcriptomic efforts on the genus Tityus. Copyright © 2015 Elsevier Ltd. All rights reserved.
Transcriptome-wide investigation of genomic imprinting in chicken
Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique
2014-01-01
Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken. PMID:24452801
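The logic of using reciprocal crosses to separate parent-of-origin effects from strain effects can be sketched as follows. This is an illustrative sketch only, not the analysis method of the study: the function names, the read counts, and the `margin` cut-off are hypothetical. An imprinting candidate shows a bias toward the same parent (mother or father) in both crosses, whereas a bias that follows the same line in both crosses points to a strain-specific (cis-regulatory) effect.

```python
def maternal_fraction(maternal_reads, paternal_reads):
    """Fraction of allele-specific reads that carry the maternal allele."""
    total = maternal_reads + paternal_reads
    if total == 0:
        raise ValueError("no informative reads at this SNP")
    return maternal_reads / total


def classify_snp(cross_ab, cross_ba, margin=0.3):
    """Classify one SNP from two reciprocal crosses.

    cross_ab: (maternal_reads, paternal_reads) when line A is the mother.
    cross_ba: (maternal_reads, paternal_reads) when line B is the mother.
    margin:   hypothetical minimum deviation from 0.5 to call a bias.
    """
    f_ab = maternal_fraction(*cross_ab)
    f_ba = maternal_fraction(*cross_ba)
    high, low = 0.5 + margin, 0.5 - margin

    if f_ab >= high and f_ba >= high:
        return "maternally expressed candidate"
    if f_ab <= low and f_ba <= low:
        return "paternally expressed candidate"
    if (f_ab >= high and f_ba <= low) or (f_ab <= low and f_ba >= high):
        # The same line's allele dominates regardless of which parent carried it.
        return "strain-specific bias"
    return "no consistent bias"


print(classify_snp((90, 10), (88, 12)))   # same parent favored in both crosses
print(classify_snp((90, 10), (8, 92)))    # same line favored in both crosses
```

A real screen would also need a statistical test on the read counts and correction for multiple testing, and candidates would still require independent validation.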
Role of APOE Isoforms in the Pathogenesis of TBI induced Alzheimer’s Disease
2016-10-01
deletion, APOE targeted replacement, complex breeding, CCI model optimization, mRNA library generation, high throughput massive parallel sequencing...demonstrate that the lack of Abca1 increases amyloid plaques and decreased APOE protein levels in AD-model mice. In this proposal we will test the hypothesis...injury, inflammatory reaction, transcriptome, high throughput massive parallel sequencing, mRNA-seq., behavioral testing, memory impairment, recovery 3
Arkas: Rapid reproducible RNAseq analysis
Colombo, Anthony R.; J. Triche Jr, Timothy; Ramsingh, Giridharan
2017-01-01
The recently introduced Kallisto pseudoaligner has radically simplified the quantification of transcripts in RNA-sequencing experiments. We offer the cloud-scale RNAseq pipelines Arkas-Quantification and Arkas-Analysis, available within Illumina's BaseSpace cloud application platform, which expedite Kallisto preparatory routines, reliably calculate differential expression, and perform gene-set enrichment of REACTOME pathways. Due to inherent inefficiencies of scale, Illumina's BaseSpace computing platform offers a massively parallel distributive environment improving data management services and data importing. Arkas-Quantification deploys Kallisto for parallel cloud computations and is conveniently integrated downstream from the BaseSpace Sequence Read Archive (SRA) import/conversion application titled SRA Import. Arkas-Analysis annotates the Kallisto results by extracting structured information directly from source FASTA files with per-contig metadata, and performs differential expression and gene-set enrichment analysis on both coding genes and transcripts. The Arkas cloud pipeline supports ENSEMBL transcriptomes and can be used downstream from SRA Import, facilitating raw sequencing importing, SRA FASTQ conversion, RNA quantification and analysis steps. PMID:28868134
Massively parallel nanowell-based single-cell gene expression profiling.
Goldstein, Leonard D; Chen, Ying-Jiun Jasmine; Dunne, Jude; Mir, Alain; Hubschle, Hermann; Guillory, Joseph; Yuan, Wenlin; Zhang, Jingli; Stinson, Jeremy; Jaiswal, Bijay; Pahuja, Kanika Bajaj; Mann, Ishminder; Schaal, Thomas; Chan, Leo; Anandakrishnan, Sangeetha; Lin, Chun-Wah; Espinoza, Patricio; Husain, Syed; Shapiro, Harris; Swaminathan, Karthikeyan; Wei, Sherry; Srinivasan, Maithreyan; Seshagiri, Somasekar; Modrusan, Zora
2017-07-07
Technological advances have enabled transcriptome characterization of cell types at the single-cell level, providing new biological insights. New methods that enable simple yet high-throughput single-cell expression profiling are highly desirable. Here we report a novel nanowell-based single-cell RNA sequencing system, ICELL8, which enables processing of thousands of cells per sample. The system employs a 5,184-nanowell-containing microchip to capture ~1,300 single cells and process them. Each nanowell contains preprinted oligonucleotides encoding poly-d(T), a unique well barcode, and a unique molecular identifier. The ICELL8 system uses imaging software to identify nanowells containing viable single cells, and only wells with single cells are processed into sequencing libraries. Here, we report the performance and utility of ICELL8 using samples of increasing complexity from cultured cells to mouse solid tissue samples. Our assessment of the system's ability to discriminate between mixed human and mouse cells showed that ICELL8 has a low cell multiplet rate (< 3%) and low cross-cell contamination. We characterized single-cell transcriptomes of more than a thousand cultured human and mouse cells as well as 468 mouse pancreatic islet cells. We were able to identify distinct cell types in pancreatic islets, including alpha, beta, delta and gamma cells. Overall, ICELL8 provides efficient and cost-effective single-cell expression profiling of thousands of cells, allowing researchers to decipher single-cell transcriptomes within complex biological samples.
Feliubadaló, Lídia; Lopez-Doriga, Adriana; Castellsagué, Ester; del Valle, Jesús; Menéndez, Mireia; Tornero, Eva; Montes, Eva; Cuesta, Raquel; Gómez, Carolina; Campos, Olga; Pineda, Marta; González, Sara; Moreno, Victor; Brunet, Joan; Blanco, Ignacio; Serra, Eduard; Capellá, Gabriel; Lázaro, Conxi
2013-01-01
Next-generation sequencing (NGS) is changing genetic diagnosis due to its huge sequencing capacity and cost-effectiveness. The aim of this study was to develop an NGS-based workflow for routine diagnostics for hereditary breast and ovarian cancer syndrome (HBOCS), to improve genetic testing for BRCA1 and BRCA2. An NGS-based workflow was designed using BRCA MASTR kit amplicon libraries followed by GS Junior pyrosequencing. Data analysis combined the freely available Variant Identification Pipeline software and ad hoc R scripts, including a cascade of filters to generate coverage and variant calling reports. A BRCA homopolymer assay was performed in parallel. A research scheme was designed in two parts. A Training Set of 28 DNA samples containing 23 unique pathogenic mutations and 213 other variants (33 unique) was used. The workflow was validated in a set of 14 samples from HBOCS families in parallel with the current diagnostic workflow (Validation Set). The NGS-based workflow developed permitted the identification of all pathogenic mutations and genetic variants, including those located in or close to homopolymers. The use of NGS for detecting copy-number alterations was also investigated. The workflow meets the sensitivity and specificity requirements for the genetic diagnosis of HBOCS and improves on the cost-effectiveness of current approaches. PMID:23249957
Picoliter DNA Sequencing Chemistry on an Electrowetting-based Digital Microfluidic Platform
Ferguson Welch, Erin R.; Lin, Yan-You; Madison, Andrew; Fair, R.B.
2011-01-01
The results of investigations into performing DNA sequencing chemistry on a picoliter-scale electrowetting digital microfluidic platform are reported. Pyrosequencing utilizes pyrophosphate produced during nucleotide base addition to initiate a process ending with detection through a chemiluminescence reaction using firefly luciferase. The intensity of light produced during the reaction can be quantified to determine the number of bases added to the DNA strand. The logic-based control and discrete fluid droplets of a digital microfluidic device lend themselves well to the pyrosequencing process. Bead-bound DNA is magnetically held in a single location, and wash or reagent droplets added or split from it to circumvent product dilution. Here we discuss the dispensing, control, and magnetic manipulation of the paramagnetic beads used to hold target DNA. We also demonstrate and characterize the picoliter-scale reaction of luciferase with adenosine triphosphate to represent the detection steps of pyrosequencing and all necessary alterations for working on this scale. PMID:21298802
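The statement that light intensity can be quantified to determine the number of bases added can be illustrated with a minimal base-calling sketch. It assumes an idealized signal in which each incorporated base contributes one unit of light, and a cyclic nucleotide dispensation order of T, A, C, G; the function name, the flow order, and the intensity values are illustrative assumptions, not details from the report.

```python
def decode_flowgram(intensities, flow_order="TACG", unit_signal=1.0):
    """Convert per-flow light intensities into the synthesized base sequence.

    Each flow dispenses one nucleotide species. The normalized signal is
    rounded to the nearest whole number, which is taken as the number of
    bases incorporated in that flow (0 means no incorporation).
    """
    bases = []
    for i, signal in enumerate(intensities):
        nucleotide = flow_order[i % len(flow_order)]
        count = max(int(signal / unit_signal + 0.5), 0)
        bases.append(nucleotide * count)
    return "".join(bases)


# Hypothetical normalized intensities for eight consecutive flows (T, A, C, G, T, A, C, G).
flows = [1.04, 0.02, 2.10, 0.97, 0.05, 1.02, 0.00, 2.95]
sequence = decode_flowgram(flows)
print(sequence)  # TCCGAGGG
```

In practice the signal does not scale perfectly with homopolymer length, which is why long homopolymer runs are a known source of pyrosequencing errors; the rounding step above ignores that complication.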
Agunbiade, Tolulope A.; Sun, Weilin; Coates, Brad S.; Djouaka, Rousseau; Tamò, Manuele; Ba, Malick N.; Binso-Dabire, Clementine; Baoua, Ibrahim; Olds, Brett P.; Pittendrigh, Barry R.
2013-01-01
Cowpea is widely cultivated and is a major nutritional source of protein for many people who live in West Africa. Annual yields and the longevity of grain storage are greatly reduced by feeding damage caused by a complex of insect pests that include the pod sucking bugs, Anoplocnemis curvipes Fabricius (Hemiptera: Coreidae) and Clavigralla tomentosicollis Stål (Hemiptera: Coreidae); as well as phloem-feeding cowpea aphids, Aphis craccivora Koch (Hemiptera: Aphididae) and flower thrips, Megalurothrips sjostedti Trybom (Thysanoptera: Thripidae). Efforts to control these pests remain a challenge and there is a need to understand the structure and movement of these pest populations in order to facilitate the development of integrated pest management (IPM) strategies. Molecular tools have the potential to help facilitate a better understanding of pest populations. Towards this goal, we used 454 pyrosequencing technology to generate 319,126, 176,262, 320,722 and 227,882 raw reads from A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti, respectively. The reads were de novo assembled into 11,687, 7,647, 10,652 and 7,348 transcripts for A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti, respectively. Functional annotation of the resulting transcripts identified genes putatively involved in insecticide resistance, pathogen defense and immunity. Additionally, sequences that matched the primary aphid endosymbiont, Buchnera aphidicola, were identified among A. craccivora transcripts. Furthermore, 742, 97, 607 and 180 single nucleotide polymorphisms (SNPs) were respectively predicted among A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti transcripts, and will likely be valuable tools for future molecular genetic marker development. These results demonstrate that Roche 454-based transcriptome sequencing could be useful for the development of genomic resources for cowpea pest insects in West Africa. PMID:24278221
Zeng, Digang; Chen, Xiuli; Xie, Daxiang; Zhao, Yongzhen; Yang, Chunling; Li, Yongmei; Ma, Ning; Peng, Min; Yang, Qiong; Liao, Zhenping; Wang, Hui; Chen, Xiaohan
2013-01-01
The Pacific white shrimp, Litopenaeus vannamei, is a worldwide cultured crustacean species with important commercial value. Over the last two decades, Taura syndrome virus (TSV) has seriously threatened the shrimp aquaculture industry in the Western Hemisphere. To better understand the interaction between the shrimp immune system and TSV, we performed a transcriptome analysis in the hepatopancreas of L. vannamei challenged with TSV, using the 454 pyrosequencing (Roche) technology. We obtained 126919 and 102181 high-quality reads from TSV-infected and non-infected (control) L. vannamei cDNA libraries, respectively. The overall de novo assembly of cDNA sequence data generated 15004 unigenes, with an average length of 507 bp. Based on BLASTX search (E-value < 10^-5) against NR, Swissprot, GO, COG and KEGG databases, 10425 unigenes (69.50% of all unigenes) were annotated with gene descriptions, gene ontology terms, or metabolic pathways. In addition, we identified 770 microsatellites and designed 497 sets of primers. Comparative genomic analysis revealed that 1311 genes were differentially expressed in the infected shrimp compared to the controls, including 559 up-regulated and 752 down-regulated genes. Among the differentially expressed genes, several are involved in various animal immune functions, such as antiviral, antimicrobial, proteases, protease inhibitors, signal transduction, transcriptional control, cell death and cell adhesion. This study provides valuable information on shrimp gene activities against TSV infection. The results can contribute to the in-depth study of candidate genes in shrimp immunity and improve our current understanding of this host-virus interaction. In addition, the large number of transcripts reported in this study provides a rich source for identification of novel genes in shrimp.
Tumor Heterogeneity, Single-Cell Sequencing, and Drug Resistance.
Schmidt, Felix; Efferth, Thomas
2016-06-16
Tumor heterogeneity has been compared with Darwinian evolution and survival of the fittest. The evolutionary ecosystem of tumors consisting of heterogeneous tumor cell populations represents a considerable challenge to tumor therapy, since all genetically and phenotypically different subpopulations have to be efficiently killed by therapy. Otherwise, even small surviving subpopulations may cause repopulation and refractory tumors. Single-cell sequencing allows for a better understanding of the genomic principles of tumor heterogeneity and represents the basis for more successful tumor treatments. The isolation and sequencing of single tumor cells still represents a considerable technical challenge and consists of three major steps: (1) single cell isolation (e.g., by laser-capture microdissection, fluorescence-activated cell sorting, or micromanipulation), (2) whole genome amplification (e.g., with the help of Phi29 DNA polymerase), and (3) transcriptome-wide next generation sequencing technologies (e.g., 454 pyrosequencing, Illumina sequencing, and other systems). Data demonstrating the feasibility of single-cell sequencing for monitoring the emergence of drug-resistant cell clones in patient samples are discussed herein. It is envisioned that single-cell sequencing will be a valuable asset to assist the design of regimens for personalized tumor therapies based on tumor subpopulation-specific genetic alterations in individual patients.
Transcriptome mining of immune-related genes in the muricid snail Concholepas concholepas.
Détrée, Camille; López-Landavery, Edgar; Gallardo-Escárate, Cristian; Lafarga-De la Cruz, Fabiola
2017-12-01
The population of the Chilean endemic marine gastropod Concholepas concholepas, locally called "loco", has dramatically decreased in the past 50 years as a result of intense activity of local fisheries and high environmental variability observed along the Chilean coast, including episodes of hypoxia, changes in sea surface temperature, ocean acidification and diseases. In this study, we set out to explore the molecular basis of the ability of C. concholepas to cope with biotic stressors such as exposure to the pathogenic bacterium Vibrio anguillarum. Here, 454 pyrosequencing was conducted and 61 transcripts related to the immune response in this muricid species were identified. Among these, the expression of six genes (CcNFκβ, CcIκβ, CcLITAF, CcTLR, CcCas8 and CcCath) involved in the regulation of inflammatory, apoptotic and immune processes upon stimuli was evaluated during the first 33 h post challenge (hpc). The results showed that CcTLR, CcCas8 and CcCath have an initial response at 4 hpc, evidencing an up-regulation from 4 to 24 hpc. Notably, the response of CcNFκβ occurred 2 h later, with a statistically significant up-regulation at 6 hpc and 10 hpc. Furthermore, the challenge with V. anguillarum induced a statistically significant down-regulation of CcIκβ between 2 and 10 hpc as well as a down-regulation of CcLITAF between 2 and 4 hpc, followed in both cases by an up-regulation between 24 and 33 hpc. This work describes the first transcriptomic effort to characterize the immune response of C. concholepas and constitutes a valuable transcriptomic resource for future efforts to develop sustainable aquaculture and conservation tools for this endemic marine snail species. Copyright © 2017 Elsevier Ltd. All rights reserved.
Hubert, Jan; Erban, Tomas; Kopecky, Jan; Sopko, Bruno; Nesvorna, Marta; Lichovnikova, Martina; Schicht, Sabine; Strube, Christina; Sparagano, Olivier
2017-11-01
Blood feeding red poultry mites (RPM) serve as vectors of pathogenic bacteria and viruses among vertebrate hosts including wild birds, poultry hens, mammals, and humans. The microbiome of RPM has not yet been studied by high-throughput sequencing. RPM eggs, larvae, and engorged adult/nymph samples obtained in four poultry houses in Czechia were used for microbiome analyses by Illumina amplicon sequencing of the 16S ribosomal RNA (rRNA) gene V4 region. A laboratory RPM population was used as positive control for transcriptome analysis by pyrosequencing with identification of sequences originating from bacteria. The samples of engorged adult/nymph stages had 100-fold more copies of the 16S rRNA gene than the samples of eggs and larvae. The microbiome composition showed differences among the four poultry houses and among observed developmental stadia. In the adults' microbiome, 10 OTUs comprised 90 to 99% of all sequences. Bartonella-like bacteria covered between 30 and 70% of sequences in the RPM microbiome and 25% of bacterial sequences in the transcriptome. The phylogenetic analyses of 16S rRNA gene sequences revealed two distinct groups of Bartonella-like bacteria forming sister groups: (i) symbionts of ants; (ii) Bartonella genus. Cardinium, Wolbachia, and Rickettsiella sp. were found in the microbiomes of all tested stadia, while Spiroplasma eriocheiris and Wolbachia were identified in the laboratory RPM transcriptome. The microbiomes from eggs, larvae, and engorged adults/nymphs differed. Bartonella-like symbionts were found in all stadia and sampling sites. Bartonella-like bacteria were the most diversified group within the RPM microbiome. The presence of identified putative pathogenic bacteria is relevant with respect to human and animal health issues, while the identification of symbiotic bacteria can lead to new control methods targeting them to destabilize the arthropod host.
Fungal Diversity in Tomato Rhizosphere Soil under Conventional and Desert Farming Systems
Kazerooni, Elham A.; Maharachchikumbura, Sajeewa S. N.; Rethinasamy, Velazhahan; Al-Mahrouqi, Hamed; Al-Sadi, Abdullah M.
2017-01-01
This study examined fungal diversity and composition in conventional (CM) and desert farming (DE) systems in Oman. Fungal diversity in the rhizosphere of tomato was assessed using 454-pyrosequencing and culture-based techniques. Both techniques produced variable results in terms of fungal diversity, with 25% of the fungal classes shared between the two techniques. In addition, pyrosequencing recovered more taxa compared to direct plating. These findings could be attributed to the ability of pyrosequencing to recover taxa that cannot grow or are slow growing on culture media. Both techniques showed that fungal diversity in the conventional farm was comparable to that in the desert farm. However, the composition of fungal classes and taxa in the two farming systems were different. Pyrosequencing revealed that Microsporidetes and Dothideomycetes are the two most common fungal classes in CM and DE, respectively. However, the culture-based technique revealed that Eurotiomycetes was the most abundant class in both farming systems and some classes, such as Microsporidetes, were not detected by the culture-based technique. Although some plant pathogens (e.g., Pythium or Fusarium) were detected in the rhizosphere of tomato, the majority of fungal species in the rhizosphere of tomato were saprophytes. Our study shows that the cultivation system may have an impact on fungal diversity. The factors which affected fungal diversity in both farms are discussed. PMID:28824590
Simon, Matthew J; Murchison, Charles; Iliff, Jeffrey J
2018-02-01
Astrocytes play a critical role in regulating the interface between the cerebral vasculature and the central nervous system. Contributing to this is the astrocytic endfoot domain, a specialized structure that ensheathes the entirety of the vasculature and mediates signaling between endothelial cells, pericytes, and neurons. The astrocytic endfoot has been implicated as a critical element of the glymphatic pathway, and changes in protein expression profiles in this cellular domain are linked to Alzheimer's disease pathology. Despite this, basic physiological properties of this structure remain poorly understood including the developmental timing of its formation, and the protein components that localize there to mediate its functions. Here we use human transcriptome data from male and female subjects across several developmental stages and brain regions to characterize the gene expression profile of the dystrophin-associated complex (DAC), a known structural component of the astrocytic endfoot that supports perivascular localization of the astroglial water channel aquaporin-4. Transcriptomic profiling is also used to define genes exhibiting parallel expression profiles to DAC elements, generating a pool of candidate genes that encode gene products that may contribute to the physiological function of the perivascular astrocytic endfoot domain. We found that several genes encoding transporter proteins are transcriptionally associated with DAC genes. © 2017 Wiley Periodicals, Inc.
Young, Neil D.; Jex, Aaron R.; Cantacessi, Cinzia; Hall, Ross S.; Campbell, Bronwyn E.; Spithill, Terence W.; Tangkawattana, Sirikachorn; Tangkawattana, Prasarn; Laha, Thewarach; Gasser, Robin B.
2011-01-01
Fasciola gigantica (Digenea) is an important foodborne trematode that causes liver fluke disease (fascioliasis) in mammals, including ungulates and humans, mainly in tropical climatic zones of the world. Despite its socioeconomic impact, almost nothing is known about the molecular biology of this parasite, its interplay with its hosts, and the pathogenesis of fascioliasis. Modern genomic technologies now provide unique opportunities to rapidly tackle these exciting areas. The present study reports the first transcriptome representing the adult stage of F. gigantica (of bovid origin), defined using a massively parallel sequencing-coupled bioinformatic approach. From >20 million raw sequence reads, >30,000 contiguous sequences were assembled, of which most were novel. Relative levels of transcription were determined for individual molecules, which were also characterized (at the inferred amino acid level) based on homology, gene ontology, and/or pathway mapping. Comparisons of the transcriptome of F. gigantica with those of other trematodes, including F. hepatica, revealed similarities in transcription for molecules inferred to have key roles in parasite-host interactions. Overall, the present dataset should provide a solid foundation for future fundamental genomic, proteomic, and metabolomic explorations of F. gigantica, as well as a basis for applied outcomes such as the development of novel methods of intervention against this neglected parasite. PMID:21408104
Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng
2015-09-01
Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.
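The microsatellite mining step described above can be illustrated with a minimal sketch. The abstract does not name the software or the repeat thresholds that were used, so the function name `find_ssrs` and the minimum repeat counts below are assumptions chosen only for illustration; dedicated SSR-mining tools apply their own, more complete rules.

```python
import re

def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) for perfect microsatellites in seq.

    min_repeats maps motif length -> minimum number of tandem repeats.
    The defaults are hypothetical thresholds, not those of the study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4}
    seq = seq.upper()
    hits = []
    for motif_len, min_n in sorted(min_repeats.items()):
        # One motif copy followed by at least (min_n - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT").
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return hits


# Toy unigene with an (AG)7 and a (CTT)5 repeat.
example = "GGC" + "AG" * 7 + "TTCA" + "CTT" * 5 + "GA"
ssr_hits = find_ssrs(example)
```

Primers would then be designed in the sequence flanking each hit; that step is outside this sketch.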
Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng
2015-01-01
Premise of the study: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag–simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. Methods and Results: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. Conclusions: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species. PMID:26421250
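For readers unfamiliar with the heterozygosity statistics quoted above, the following sketch shows how observed and expected heterozygosity could be computed for one locus from diploid genotypes. The function name, the toy allele sizes, and the use of the simple uncorrected estimator He = 1 - sum(p_i^2) are assumptions; the abstract does not state which estimator or software the authors used.

```python
from collections import Counter

def heterozygosity(genotypes):
    """Return (observed, expected) heterozygosity for one locus.

    genotypes: list of (allele_a, allele_b) tuples, one per diploid individual.
    Expected heterozygosity is the uncorrected estimate 1 - sum(p_i ** 2).
    """
    n = len(genotypes)
    observed = sum(1 for a, b in genotypes if a != b) / n
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total_alleles = 2 * n
    expected = 1.0 - sum((c / total_alleles) ** 2 for c in counts.values())
    return observed, expected


# Hypothetical fragment sizes (bp) at one locus in four individuals.
ho, he = heterozygosity([(150, 150), (150, 154), (154, 154), (150, 154)])
```

A locus at which every sampled individual is heterozygous gives an observed value of 1.0, the upper end of the range reported in the abstract.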
Genome sequencing in microfabricated high-density picolitre reactors.
Margulies, Marcel; Egholm, Michael; Altman, William E; Attiya, Said; Bader, Joel S; Bemben, Lisa A; Berka, Jan; Braverman, Michael S; Chen, Yi-Ju; Chen, Zhoutao; Dewell, Scott B; Du, Lei; Fierro, Joseph M; Gomes, Xavier V; Godwin, Brian C; He, Wen; Helgesen, Scott; Ho, Chun Heen; Ho, Chun He; Irzyk, Gerard P; Jando, Szilveszter C; Alenquer, Maria L I; Jarvie, Thomas P; Jirage, Kshama B; Kim, Jong-Bum; Knight, James R; Lanza, Janna R; Leamon, John H; Lefkowitz, Steven M; Lei, Ming; Li, Jing; Lohman, Kenton L; Lu, Hong; Makhijani, Vinod B; McDade, Keith E; McKenna, Michael P; Myers, Eugene W; Nickerson, Elizabeth; Nobile, John R; Plant, Ramona; Puc, Bernard P; Ronan, Michael T; Roth, George T; Sarkis, Gary J; Simons, Jan Fredrik; Simpson, John W; Srinivasan, Maithreyan; Tartaro, Karrie R; Tomasz, Alexander; Vogt, Kari A; Volkmer, Greg A; Wang, Shally H; Wang, Yong; Weiner, Michael P; Yu, Pengguang; Begley, Richard F; Rothberg, Jonathan M
2005-09-15
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.
A high-throughput approach to profile RNA structure.
Delli Ponti, Riccardo; Marti, Stefanie; Armaos, Alexandros; Tartaglia, Gian Gaetano
2017-03-17
Here we introduce the Computational Recognition of Secondary Structure (CROSS) method to calculate the structural profile of an RNA sequence (single- or double-stranded state) at single-nucleotide resolution and without sequence length restrictions. We trained CROSS using data from high-throughput experiments such as Selective 2΄-Hydroxyl Acylation analyzed by Primer Extension (SHAPE; Mouse and HIV transcriptomes) and Parallel Analysis of RNA Structure (PARS; Human and Yeast transcriptomes) as well as high-quality NMR/X-ray structures (PDB database). The algorithm uses primary structure information alone to predict experimental structural profiles with >80% accuracy, showing high performances on large RNAs such as Xist (17 900 nucleotides; Area Under the ROC Curve AUC of 0.75 on dimethyl sulfate (DMS) experiments). We integrated CROSS in thermodynamics-based methods to predict secondary structure and observed an increase in their predictive power by up to 30%. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Mitochondrial sequence analysis for forensic identification using pyrosequencing technology.
Andréasson, H; Asp, A; Alderborn, A; Gyllensten, U; Allen, M
2002-01-01
Over recent years, requests for mtDNA analysis in the field of forensic medicine have notably increased, and the results of such analyses have proved to be very useful in forensic cases where nuclear DNA analysis cannot be performed. Traditionally, mtDNA has been analyzed by DNA sequencing of the two hypervariable regions, HVI and HVII, in the D-loop. DNA sequence analysis using the conventional Sanger sequencing is very robust but time consuming and labor intensive. By contrast, mtDNA analysis based on the pyrosequencing technology provides fast and accurate results from the human mtDNA present in many types of evidence materials in forensic casework. The assay has been developed to determine polymorphic sites in the mitochondrial D-loop as well as the coding region to further increase the discrimination power of mtDNA analysis. The pyrosequencing technology for analysis of mtDNA polymorphisms has been tested with regard to sensitivity, reproducibility, and success rate when applied to control samples and actual casework materials. The results show that the method is very accurate and sensitive; the results are easily interpreted and provide a high success rate on casework samples. The panel of pyrosequencing reactions for the mtDNA polymorphisms were chosen to result in an optimal discrimination power in relation to the number of bases determined.
Hou, X-L; Cao, Q-Y; Jia, H-Y; Chen, Z
2008-07-01
Pathogens causing acute diarrhea include a large variety of species from Enterobacteriaceae and Vibrionaceae. A method based on pyrosequencing was used here to differentiate bacteria commonly associated with diarrhea in China; the method is targeted to a partial amplicon of the gyrB gene, which encodes the B subunit of DNA gyrase. Twenty-eight specific polymorphic positions were identified from sequence alignment of a large sequence dataset and targeted using 17 sequencing primers. Of 95 isolates tested, belonging to 13 species within 7 genera, most could be identified to the species level; O157 type could be differentiated from other E. coli types; Salmonella enterica subsp. enterica could be identified at the serotype level; the genus Shigella, except for S. boydii and S. dysenteriae, could also be identified. All these isolates were also subjected to conventional sequencing of a relatively long (approximately 1.2 kb) region of gyrB DNA; these results confirmed those with pyrosequencing. Twenty-two fecal samples were surveyed, the results of which were concordant with culture-based bacterial identification, and the pathogen detection limit with simulated stool specimens was 10^4 CFU/ml. DNA from different pathogens was also mixed to simulate a case of multibacterial infection, and the generated signals correlated well with the mix ratio. In summary, the gyrB-based pyrosequencing approach proved to have significant reliability and discriminatory power for enteropathogenic bacterial identification and provided a fast and effective method for clinical diagnosis.
Grasso, Chiara; Trevisan, Morena; Fiano, Valentina; Tarallo, Valentina; De Marco, Laura; Sacerdote, Carlotta; Richiardi, Lorenzo; Merletti, Franco; Gillio-Tos, Anna
2016-01-01
Pyrosequencing has emerged as an alternative method of nucleic acid sequencing, well suited for many applications which aim to characterize single nucleotide polymorphisms, mutations, microbial types and CpG methylation in the target DNA. The commercially available pyrosequencing systems can run two different software packages, which allow analysis in AQ or CpG mode, respectively; both are widely employed for DNA methylation analysis. The aim of the study was to assess the performance of these two software packages for DNA methylation analysis at CpG sites. Although CpG mode was specifically developed for CpG methylation quantification, many investigations on this topic have been carried out with AQ mode. As proof of equivalent performance of the two software packages for this type of analysis is not available, the focus of this paper was to evaluate whether the two modes currently used for CpG methylation assessment by pyrosequencing give overlapping results. We compared the performance of the two software packages in quantifying DNA methylation in the promoter of selected genes (GSTP1, MGMT, LINE-1) by testing two case series which include DNA from paraffin embedded prostate cancer tissues (PC study, N = 36) and DNA from blood fractions of healthy people (DD study, N = 28), respectively. We found discrepancies between the two software packages in the quality assignment of DNA methylation assays. Compared to the software for analysis in the AQ mode, less permissive criteria are supported by the Pyro Q-CpG software, which enables analysis in CpG mode. CpG mode warns the operators about potential unsatisfactory performance of the assay and ensures a more accurate quantitative evaluation of DNA methylation at CpG sites. The implementation of CpG mode is strongly advisable in order to improve the reliability of the methylation analysis results achievable by pyrosequencing.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.
Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul
2012-11-20
Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, hardly any genetic studies have been performed and genomic resources are lacking. To build genomic resources and develop tools to speed up breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyrosequencing technology. We developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of the estimated lily transcriptome and 39% of the estimated tulip transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice as abundant in UTRs (UnTranslated Regions) as in coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs, lily and tulip (12,017 groups), and among the three monocot species lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSRs. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology.
Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.
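The two SNP marker sets described above differ only in how many neighbouring SNPs are tolerated within 50 bp of the target SNP. A minimal sketch of such a filter follows; the function name `select_snps`, the parameter names, and the toy positions are hypothetical, and the authors' actual pipeline is not described in the abstract.

```python
def select_snps(positions, flank=50, max_flanking=0):
    """Keep SNPs with at most max_flanking other SNPs within flank bp.

    positions: unique SNP coordinates (bp) on a single contig.
    max_flanking=0 mimics the first marker set, max_flanking=1 the second.
    """
    ordered = sorted(positions)
    kept = []
    for p in ordered:
        # Count other SNPs on either side of p within the flanking window.
        neighbours = sum(1 for q in ordered if q != p and abs(q - p) <= flank)
        if neighbours <= max_flanking:
            kept.append(p)
    return kept


# Toy contig with six SNPs.
snps = [100, 130, 400, 700, 720, 745]
strict_set = select_snps(snps, max_flanking=0)
relaxed_set = select_snps(snps, max_flanking=1)
```

On this toy input the relaxed rule retains three SNPs instead of one, the same direction of change as the 2 to 3 fold increase reported for the second set.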
Huth, Troy J; Place, Sean P
2013-11-20
The notothenioids comprise a diverse group of fishes that rapidly radiated after isolation by the Antarctic Circumpolar Current approximately 14-25 million years ago. Given that evolutionary adaptation has led to finely tuned traits with narrow physiological limits in these organisms, this system provides a unique opportunity to examine physiological trade-offs and limits of adaptive responses to environmental perturbation. As such, notothenioids have a rich history with respect to studies attempting to understand the vulnerability of polar ecosystems to the negative impacts associated with global climate change. Unfortunately, despite being a model system for understanding physiological adaptations to extreme environments, we still lack fundamental molecular tools for much of the Nototheniidae family. Specimens of the emerald notothen, Trematomus bernacchii, were acclimated for 28 days in flow-through seawater tanks maintained near ambient seawater temperatures (-1.5°C) or at +4°C. Following acclimation, tissue specific cDNA libraries for liver, gill and brain were created by pooling RNA from n = 5 individuals per temperature treatment. The tissue specific libraries were bar-coded and used for 454 pyrosequencing, which yielded over 700 thousand sequencing reads. A de novo assembly and annotation of these reads produced a functional transcriptome library of T. bernacchii containing 30,107 unigenes, 13,003 of which possessed significant homology to a known protein product. Digital gene expression analysis of these extremely cold adapted fish reinforced the loss of an inducible heat shock response and allowed the preliminary exploration into other elements of the cellular stress response. Preliminary exploration of the transcriptome of T. bernacchii under elevated temperatures enabled a semi-quantitative comparison to prior studies aimed at characterizing the thermal response of this endemic fish whose size, abundance and distribution has established it as a pivotal species in polar research spanning several decades. The comparison of these findings to previous studies demonstrates the efficacy of transcriptomics and digital gene expression analysis as tools in future studies of polar organisms and has greatly increased the available genomic resources for the suborder Notothenioidei, particularly in the Trematominae subfamily.
2013-01-01
Background The notothenioids comprise a diverse group of fishes that rapidly radiated after isolation by the Antarctic Circumpolar Current approximately 14–25 million years ago. Given that evolutionary adaptation has led to finely tuned traits with narrow physiological limits in these organisms, this system provides a unique opportunity to examine physiological trade-offs and limits of adaptive responses to environmental perturbation. As such, notothenioids have a rich history with respect to studies attempting to understand the vulnerability of polar ecosystems to the negative impacts associated with global climate change. Unfortunately, despite being a model system for understanding physiological adaptations to extreme environments, we still lack fundamental molecular tools for much of the Nototheniidae family. Results Specimens of the emerald notothen, Trematomus bernacchii, were acclimated for 28 days in flow-through seawater tanks maintained near ambient seawater temperatures (−1.5°C) or at +4°C. Following acclimation, tissue specific cDNA libraries for liver, gill and brain were created by pooling RNA from n = 5 individuals per temperature treatment. The tissue specific libraries were bar-coded and used for 454 pyrosequencing, which yielded over 700 thousand sequencing reads. A de novo assembly and annotation of these reads produced a functional transcriptome library of T. bernacchii containing 30,107 unigenes, 13,003 of which possessed significant homology to a known protein product. Digital gene expression analysis of these extremely cold adapted fish reinforced the loss of an inducible heat shock response and allowed the preliminary exploration into other elements of the cellular stress response. Conclusions Preliminary exploration of the transcriptome of T. bernacchii under elevated temperatures enabled a semi-quantitative comparison to prior studies aimed at characterizing the thermal response of this endemic fish whose size, abundance and distribution has established it as a pivotal species in polar research spanning several decades. The comparison of these findings to previous studies demonstrates the efficacy of transcriptomics and digital gene expression analysis as tools in future studies of polar organisms and has greatly increased the available genomic resources for the suborder Notothenioidei, particularly in the Trematominae subfamily. PMID:24252228
2010-01-01
Background Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. Results A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. Conclusions The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes.
The establishment of corresponding transcript and protein databases generated by next-generation technologies in a system with a well-defined metabolite profile facilitates an improved linkage between genes, enzymes, and pathway components. The proteome database represents the most relevant alkaloid-producing enzymes, compared with the much deeper and more complete transcriptome library. The transcript database contained full-length mRNAs encoding most alkaloid biosynthetic enzymes, which is a key requirement for the functional characterization of novel gene candidates. PMID:21083930
USDA-ARS's Scientific Manuscript database
Molecular events regulating apple fruit ripening and sensory quality are largely unknown. Such knowledge is essential for genomic-assisted apple breeding and postharvest quality management. In this study, a parallel transcriptome profile analysis, scanning electron microscopic (SEM) examination and...
Li, Qian; Guo, Song; Jiang, Xi; Bryk, Jaroslaw; Naumann, Ronald; Enard, Wolfgang; Tomita, Masaru; Sugimoto, Masahiro; Khaitovich, Philipp; Pääbo, Svante
2016-01-01
Whereas all mammals have one glutamate dehydrogenase gene (GLUD1), humans and apes carry an additional gene (GLUD2), which encodes an enzyme with distinct biochemical properties. We inserted a bacterial artificial chromosome containing the human GLUD2 gene into mice and analyzed the resulting changes in the transcriptome and metabolome during postnatal brain development. Effects were most pronounced early postnatally, and predominantly genes involved in neuronal development were affected. Remarkably, the effects in the transgenic mice partially parallel the transcriptome and metabolome differences seen between humans and macaques analyzed. Notably, the introduction of GLUD2 did not affect glutamate levels in mice, consistent with observations in the primates. Instead, the metabolic effects of GLUD2 center on the tricarboxylic acid cycle, suggesting that GLUD2 affects carbon flux during early brain development, possibly supporting lipid biosynthesis. PMID:27118840
Song, Qinxin; Wei, Guijiang; Zhou, Guohua
2014-07-01
A portable bioluminescence analyser for detecting the DNA sequence of genetically modified organisms (GMOs) was developed by using a photodiode (PD) array. Pyrosequencing on eight genes (zSSIIb, Bt11 and Bt176 gene of genetically modified maize; Lectin, 35S-CTP4, CP4EPSPS, CaMV35S promoter and NOS terminator of the genetically modified Roundup ready soya) was successfully detected with this instrument. The corresponding limit of detection (LOD) was 0.01% with 35 PCR cycles. The maize and soya available from three different provenances in China were detected. The results indicate that pyrosequencing using the small size of the detector is a simple, inexpensive, and reliable way in a farm/field test of GMO analysis. Copyright © 2014 Elsevier Ltd. All rights reserved.
Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae rhamnoides L.) Transcriptome
Ghangal, Rajesh; Chaudhary, Saurabh; Jain, Mukesh; Purty, Ram Singh; Chand Sharma, Prakash
2013-01-01
Seabuckthorn (Hippophae rhamnoides L.) has been known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next generation massive parallel sequencing technology (Illumina platform) and de novo assembly to gain a comprehensive view of the seabuckthorn transcriptome. We assembled 86,253,874 high quality short reads using six assembly tools. In our hands, assembly of non-redundant short reads following a two-step procedure was found to be the best considering various assembly quality parameters. Initially, the ABySS tool was used following an additive k-mer approach. The assembled transcripts were subsequently subjected to the TGICL suite. Finally, de novo short read assembly yielded 88,297 transcripts (> 100 bp), representing about 53 Mb of seabuckthorn transcriptome. The average length of transcripts was 610 bp, the N50 length was 1198 bp, and 91% of the short reads uniquely mapped back to the seabuckthorn transcriptome. A total of 41,340 (46.8%) transcripts showed significant similarity with sequences present in nr protein databases of NCBI (E-value < 1E-06). We also screened the assembled transcripts for the presence of transcription factors and simple sequence repeats. Our strategy involving the use of a short read assembler (ABySS) followed by TGICL will be useful for researchers working with a non-model organism’s transcriptome in terms of saving time and reducing complexity in data management. The seabuckthorn transcriptome data generated here provide a valuable resource for gene discovery and development of functional molecular markers. PMID:23991119
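The N50 statistic quoted above can be illustrated with a short sketch. N50 is taken here in its usual sense: the length L such that transcripts of length L or longer contain at least half of the total assembled bases. The function name and the toy lengths are assumptions for illustration; the final lines only check that the reported transcript count and mean length are consistent with the reported assembly size of about 53 Mb.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


toy_n50 = n50([100, 200, 300, 400])

# Consistency check on figures from the abstract: count * mean length.
approx_total_bp = 88_297 * 610
```

Because N50 weights long transcripts more heavily than the mean does, an N50 of 1198 bp alongside a mean of 610 bp indicates a length distribution with many short transcripts.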
Ambroise, Jérôme; Butoescu, Valentina; Robert, Annie; Tombal, Bertrand; Gala, Jean-Luc
2015-06-25
Single Nucleotide Polymorphisms (SNPs) identified in Genome Wide Association Studies (GWAS) generally have a moderate association with related complex diseases. Accordingly, Multilocus Genetic Risk Scores (MGRSs) have been computed in previous studies in order to assess the cumulative association of multiple SNPs. When several SNPs have to be genotyped for each patient, using successive uniplex pyrosequencing reactions increases analytical reagent expenses and Turnaround Time (TAT). While a set of several pyrosequencing primers could theoretically be used to analyze multiplex amplicons, this would generate overlapping primer-specific pyro-signals that are visually uninterpretable. In the current study, two multiplex assays were developed consisting of a quadruplex (n=4) and a quintuplex (n=5) polymerase chain reaction (PCR), each followed by multiplex pyrosequencing analysis. The aim was to reliably but rapidly genotype a set of prostate cancer-related SNPs (n=9). The nucleotide dispensation order was selected using SENATOR software. Multiplex pyro-signals were analyzed using the new AdvISER-MH-PYRO software based on a sparse representation of the signal. Using uniplex assays as gold standard, the concordance between multiplex and uniplex assays was assessed on DNA extracted from patient blood samples (n = 10). All genotypes (n=90) generated with the quadruplex and the quintuplex pyrosequencing assays were perfectly (100%) concordant with uniplex pyrosequencing. Using the multiplex genotyping approach to analyze a set of 90 patients reduced TAT by approximately 75% (i.e., from 2025 to 470 min) while reducing reagent consumption and cost by approximately 70% (i.e., from ~229 US$/patient to ~64 US$/patient).
This combination of quadruplex and quintuplex pyrosequencing and PCR assays made it possible to reduce the amount of DNA required for multi-SNP analysis, and to lower the global TAT and costs of SNP genotyping while providing results as reliable as uniplex analysis. Using this combined multiplex approach also substantially reduced the production of waste material. These genotyping assays therefore appear to be biologically, economically and ecologically highly relevant, and worth integrating into genetic-based predictive strategies for better selecting patients at risk for prostate cancer. In addition, the same approach could now equally be transposed to other clinical/research applications relying on the computation of MGRS based on multi-SNP genotyping.
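The rounded savings quoted above can be reproduced from the raw figures given in the abstract. The sketch below is only a worked check of that arithmetic; the helper name `percent_reduction` is hypothetical.

```python
def percent_reduction(before, after):
    """Percentage decrease when going from `before` to `after`."""
    return 100.0 * (before - after) / before


# Figures taken from the abstract.
tat_saving = percent_reduction(2025, 470)   # turnaround time, minutes
cost_saving = percent_reduction(229, 64)    # reagent cost, US$ per patient

# 9 SNPs genotyped in each of 10 patient samples.
genotype_count = 9 * 10
```

The exact values are about 76.8% for turnaround time and about 72.1% for cost, in line with the "approximately 75%" and "approximately 70%" stated in the abstract, and 9 SNPs across 10 samples give the 90 genotypes used in the concordance check.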
USDA-ARS's Scientific Manuscript database
Next Generation Sequencing is transforming the way scientists collect and measure an organism’s genetic background and gene dynamics, while bioinformatics and super-computing are merging to facilitate parallel sample computation and interpretation at unprecedented speeds. Analyzing the complete gene...
Maia, Julio; Dekkers, Bas J. W.; Provart, Nicholas J.; Ligterink, Wilco; Hilhorst, Henk W. M.
2011-01-01
The combination of robust physiological models with “omics” studies holds promise for the discovery of genes and pathways linked to how organisms deal with drying. Here we used a transcriptomics approach in combination with an in vivo physiological model of re-establishment of desiccation tolerance (DT) in Arabidopsis thaliana seeds. We show that the incubation of desiccation sensitive (DS) germinated Arabidopsis seeds in a polyethylene glycol (PEG) solution re-induces the mechanisms necessary for expression of DT. Based on a SNP-tile array gene expression profile, our data indicates that the re-establishment of DT, in this system, is related to a programmed reversion from a metabolic active to a quiescent state similar to prior to germination. Our findings show that transcripts of germinated seeds after the PEG-treatment are dominated by those encoding LEA, seed storage and dormancy related proteins. On the other hand, a massive repression of genes belonging to many other classes such as photosynthesis, cell wall modification and energy metabolism occurs in parallel. Furthermore, comparison with a similar system for Medicago truncatula reveals a significant overlap between the two transcriptomes. Such overlap may highlight core mechanisms and key regulators of the trait DT. Taking into account the availability of the many genetic and molecular resources for Arabidopsis, the described system may prove useful for unraveling DT in higher plants. PMID:22195004
Meier, Kristian; Hansen, Michael Møller; Normandeau, Eric; Mensberg, Karen-Lise D.; Frydenberg, Jane; Larsen, Peter Foged; Bekkevold, Dorte; Bernatchez, Louis
2014-01-01
Local adaptation and its underlying molecular basis has long been a key focus in evolutionary biology. There has recently been increased interest in the evolutionary role of plasticity and the molecular mechanisms underlying local adaptation. Using transcriptome analysis, we assessed differences in gene expression profiles for three brown trout (Salmo trutta) populations, one resident and two anadromous, experiencing different temperature regimes in the wild. The study was based on an F2 generation raised in a common garden setting. A previous study of the F1 generation revealed different reaction norms and significantly higher QST than FST among populations for two early life-history traits. In the present study we investigated if genomic reaction norm patterns were also present at the transcriptome level. Eggs from the three populations were incubated at two temperatures (5 and 8 degrees C) representing conditions encountered in the local environments. Global gene expression for fry at the stage of first feeding was analysed using a 32k cDNA microarray. The results revealed differences in gene expression between populations and temperatures and population × temperature interactions, the latter indicating locally adapted reaction norms. Moreover, the reaction norms paralleled those observed previously at early life-history traits. We identified 90 cDNA clones among the genes with an interaction effect that were differently expressed between the ecologically divergent populations. These included genes involved in immune- and stress response. We observed less plasticity in the resident as compared to the anadromous populations, possibly reflecting that the degree of environmental heterogeneity encountered by individuals throughout their life cycle will select for variable level of phenotypic plasticity at the transcriptome level. Our study demonstrates the usefulness of transcriptome approaches to identify genes with different temperature reaction norms. 
The responses observed suggest that populations may vary in their susceptibility to climate change. PMID:24454810
Reading, Benjamin J; Chapman, Robert W; Schaff, Jennifer E; Scholl, Elizabeth H; Opperman, Charles H; Sullivan, Craig V
2012-02-21
The striped bass and its relatives (genus Morone) are important fisheries and aquaculture species native to estuaries and rivers of the Atlantic coast and Gulf of Mexico in North America. To open avenues of gene expression research on reproduction and breeding of striped bass, we generated a collection of expressed sequence tags (ESTs) from a complementary DNA (cDNA) library representative of their ovarian transcriptome. Sequences of a total of 230,151 ESTs (51,259,448 bp) were acquired by Roche 454 pyrosequencing of cDNA pooled from ovarian tissues obtained at all stages of oocyte growth, at ovulation (eggs), and during preovulatory atresia. Quality filtering of ESTs allowed assembly of 11,208 high-quality contigs ≥ 100 bp, including 2,984 contigs 500 bp or longer (average length 895 bp). Blastx comparisons revealed 5,482 gene orthologues (E-value < 10^-3), of which 4,120 (36.7% of total contigs) were annotated with Gene Ontology terms (E-value < 10^-6). There were 5,726 remaining unknown unique sequences (51.1% of total contigs). All of the high-quality EST sequences are available in the National Center for Biotechnology Information (NCBI) Short Read Archive (GenBank: SRX007394). Informative contigs were considered to be abundant if they were assembled from groups of ESTs comprising ≥ 0.15% of the total short read sequences (≥ 345 reads/contig). Approximately 52.5% of these abundant contigs were predicted to have predominant ovary expression through digital differential display in silico comparisons to zebrafish (Danio rerio) UniGene orthologues. Over 1,300 Gene Ontology terms from Biological Process classes of Reproduction, Reproductive process, and Developmental process were assigned to this collection of annotated contigs. This first large reference sequence database available for the ecologically and economically important temperate basses (genus Morone) provides a foundation for gene expression studies in these species. 
The predicted predominance of ovary gene expression and assignment of directly relevant Gene Ontology classes suggests a powerful utility of this dataset for analysis of ovarian gene expression related to fundamental questions of oogenesis. Additionally, a high definition Agilent 60-mer oligo ovary 'UniClone' microarray with 8 × 15,000 probe format has been designed based on this striped bass transcriptome (eArray Group: Striper Group, Design ID: 029004).
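The abundance cut-off and contig fractions in this abstract follow from the quoted totals. The sketch below reproduces them; the variable names are illustrative, and only the numbers come from the abstract.

```python
# Totals quoted in the abstract.
total_reads = 230_151      # ESTs obtained by 454 pyrosequencing
total_contigs = 11_208     # high-quality contigs >= 100 bp
orthologues = 5_482        # contigs with a Blastx orthologue
unknown = 5_726            # contigs without a match

# A contig counts as "abundant" if it holds >= 0.15% of all reads.
abundance_fraction = 0.0015
read_threshold = total_reads * abundance_fraction

# Share of contigs that remained unknown, in percent.
unknown_percent = 100.0 * unknown / total_contigs

print(f"Abundance threshold: {read_threshold:.1f} reads per contig")
print(f"Unknown contigs:     {unknown_percent:.1f}%")
```

The threshold works out to roughly 345 reads per contig, and the orthologue and unknown counts sum to the total number of contigs, as the abstract implies.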
Zhao, Yonggui; Fang, Yang; Jin, Yanling; Huang, Jun; Ma, Xinrong; He, Kaize; He, Zhiming; Wang, Feng; Zhao, Hai
2015-03-01
Carriers were added to a pilot-scale duckweed-based (Lemna japonica 0223) wastewater treatment system to immobilize and enhance microorganisms. This system and another parallel duckweed system without carriers were operated for 1.5 years. The results indicated the addition of the carrier did not significantly affect the growth and composition of duckweed, the recovery of total nitrogen (TN), total phosphorus (TP) and CO2 or the removal of TP. However, it significantly improved the removal efficiency of TN and NH4(+)-N (by 19.97% and 15.02%, respectively). The use of 454 pyrosequencing revealed large differences of the microbial communities between the different components within a system and similarities within the same components between the two systems. The carrier biofilm had the highest bacterial diversity and relative abundance of nitrifying bacteria (3%) and denitrifying bacteria (24% of Rhodocyclaceae), which improved nitrogen removal of the system. An efficient N-removal duckweed system with enhanced microorganisms was established. Copyright © 2014 Elsevier Ltd. All rights reserved.
Sun, Shumei; Zhou, Hao; Zhou, Bin; Hu, Ziyou; Hou, Jinlin; Sun, Jian
2012-05-01
To evaluate the sensitivity and specificity of nested PCR combined with pyrosequencing in the detection of HBV drug-resistance gene. RtM204I (ATT) mutant and rtM204 (ATG) nonmutant plasmids mixed at different ratios were detected for mutations using nested-PCR combined with pyrosequencing, and the results were compared with those by conventional PCR pyrosequencing to analyze the linearity and consistency of the two methods. Clinical specimens with different viral loads were examined for drug-resistant mutations using nested PCR pyrosequencing and nested PCR combined with dideoxy sequencing (Sanger) for comparison of the detection sensitivity and specificity. The fitting curves demonstrated good linearity of both conventional PCR pyrosequencing and nested PCR pyrosequencing (R(2)>0.99, P<0.05). Nested PCR showed a better consistency with the predicted value than conventional PCR, and was superior to conventional PCR for detection of samples containing 90% mutant plasmid. In the detection of clinical specimens, Sanger sequencing had a significantly lower sensitivity than nested PCR pyrosequencing (92% vs 100%, P<0.01). The detection sensitivity of Sanger sequencing varied with the viral loads, especially in samples with low viral copies (HBV DNA ≤3log10 copies/ml), where the sensitivity was 78%, significantly lower than that of pyrosequencing (100%, P<0.01). Neither of the two methods yielded positive results for the negative control samples, suggesting their good specificity. Compared with nested PCR and Sanger sequencing method, nested PCR pyrosequencing has a higher sensitivity especially in clinical specimens with low viral copies, which can be important for early detection of HBV mutant strains and hence more effective clinical management.
Dinzouna-Boutamba, Sylvatrie-Danne; Lee, Sanghyun; Son, Ui-Han; Yun, Hae Soo; Joo, So-Young; Jeong, Sookwan; Rhee, Man Hee; Kwak, Dongmi; Xuan, Xuenan; Hong, Yeonchul; Chung, Dong-Il; Goo, Youn-Kyoung
2017-12-01
Allelic diversity leading to multiple gene polymorphisms of vivax malaria parasites has been shown to greatly contribute to antigenic variation and drug resistance, increasing the potential for multiple-clone infections within the host. Therefore, to identify multiple-clone infections and the predominant haplotype of Plasmodium vivax in a South Korean population, P. vivax merozoite surface protein-1 (PvMSP-1) was analyzed by pyrosequencing. Pyrosequencing of 156 vivax malaria-infected samples yielded 97 (62.18%) output pyrograms showing two main types of peak patterns of the dimorphic allele for threonine and alanine (T1476A). Most of the samples evaluated (88.66%) carried multiple-clone infections (wild- and mutant-types), whereas 11.34% of the same population carried only the mutant-type (1476A). In addition, each allele showed a high frequency of guanine (G) base substitution at both the first and third positions (86.07% and 81.13%, respectively) of the nucleotide combinations. Pyrosequencing of the PvMSP-1 42-kDa fragment revealed a heterogeneous parasite population, with the mutant-type dominant compared to the wild-type. Understanding the genetic diversity and multiple-clone infection rates may lead to improvements in vivax malaria prevention and strategic control plans. Further studies are needed to improve the efficacy of the pyrosequencing assay with large sample sizes and additional nucleotide positions. Copyright © 2017 Elsevier B.V. All rights reserved.
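The percentages in this abstract can be related back to sample counts. In the sketch below, the counts of multiple-clone and mutant-only samples are not stated in the abstract; they are inferred here from the reported percentages of the 97 interpretable pyrograms and should be read as an assumption.

```python
samples_tested = 156       # vivax malaria-infected samples (from the abstract)
pyrograms_obtained = 97    # samples yielding output pyrograms (from the abstract)

# Fraction of samples with usable pyrograms, in percent.
yield_percent = 100.0 * pyrograms_obtained / samples_tested

# Inferred counts (assumption): back-calculated from 88.66% and 11.34% of 97.
multiple_clone = round(0.8866 * pyrograms_obtained)
mutant_only = round(0.1134 * pyrograms_obtained)

print(f"Pyrogram yield: {yield_percent:.2f}%")
print(f"Multiple-clone samples (inferred): {multiple_clone}")
print(f"Mutant-only samples (inferred):    {mutant_only}")
```

Under this assumption the two inferred groups account for all 97 pyrograms.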
Warren, Ian A; Vera, J Cristobal; Johns, Annika; Zinna, Robert; Marden, James H; Emlen, Douglas J; Dworkin, Ian; Lavine, Laura C
2014-01-01
Scarab beetles exhibit an astonishing variety of rigid exo-skeletal outgrowths, known as "horns". These traits are often sexually dimorphic and vary dramatically across species in size, shape, location, and allometry with body size. In many species, the horn exhibits disproportionate growth resulting in an exaggerated allometric relationship with body size, as compared to other traits, such as wings, that grow proportionately with body size. Depending on the species, the smallest males either do not produce a horn at all, or they produce a disproportionately small horn for their body size. While the diversity of horn shapes and their behavioural ecology have been reasonably well studied, we know far less about the proximate mechanisms that regulate horn growth. Thus, using 454 pyrosequencing, we generated transcriptome profiles, during horn growth and development, in two different scarab beetle species: the Asian rhinoceros beetle, Trypoxylus dichotomus, and the dung beetle, Onthophagus nigriventris. We obtained over half a million reads for each species that were assembled into over 6,000 and 16,000 contigs respectively. We combined these data with previously published studies to look for signatures of molecular evolution. We found a small subset of genes with horn-biased expression showing evidence for recent positive selection, as is expected with sexual selection on horn size. We also found evidence of relaxed selection present in genes that demonstrated biased expression between horned and horn-less morphs, consistent with the theory of developmental decoupling of phenotypically plastic traits.
Droplet-based pyrosequencing using digital microfluidics.
Boles, Deborah J; Benton, Jonathan L; Siew, Germaine J; Levy, Miriam H; Thwar, Prasanna K; Sandahl, Melissa A; Rouse, Jeremy L; Perkins, Lisa C; Sudarsan, Arjun P; Jalili, Roxana; Pamula, Vamsee K; Srinivasan, Vijay; Fair, Richard B; Griffin, Peter B; Eckhardt, Allen E; Pollack, Michael G
2011-11-15
The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., "sample-to-sequence" capability) could eventually be achieved using this low-cost platform.
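The relationship between nucleotide additions, homopolymers, and pyrogram levels can be illustrated with an idealized model. The sketch below is not the authors' protocol: it assumes a noise-free signal proportional to the number of bases incorporated per dispensation, and the sequence, dispensation order, and function name are hypothetical.

```python
def simulate_pyrogram(read_sequence, dispensation_order):
    """Return (nucleotide, peak height) pairs for an idealized pyrogram.

    `read_sequence` is the strand being synthesized, 5' to 3'.
    Each dispensed nucleotide is incorporated for as long as it matches
    the next base, so a homopolymer of length n gives a peak of height n.
    """
    position = 0
    peaks = []
    for nucleotide in dispensation_order:
        incorporated = 0
        while (position < len(read_sequence)
               and read_sequence[position] == nucleotide):
            incorporated += 1
            position += 1
        peaks.append((nucleotide, incorporated))
    return peaks

# Hypothetical example: a short read with homopolymers of length 2 and 3,
# sequenced with a cyclic A-C-G-T dispensation (de novo style).
peaks = simulate_pyrogram("GGATTTC", "ACGT" * 3)
heights = [height for _, height in peaks]
print(heights)
```

In a resequencing protocol the dispensation order would instead be chosen from the known reference, so that fewer dispensations produce no incorporation.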
Kim, Hyun Young; Seo, Jiyoung; Kim, Tae-Hun; Shim, Bomi; Cha, Seok Mun; Yu, Seungho
2017-06-01
This study examined the use of microbial community structure as a bio-indicator of decomposition levels. High-throughput pyrosequencing technology was used to assess the shift in microbial community of leachate from animal carcass lysimeter. The leachate samples were collected monthly for one year and a total of 164,639 pyrosequencing reads were obtained and used in the taxonomic classification and operational taxonomy units (OTUs) distribution analysis based on sequence similarity. Our results show considerable changes in the phylum-level bacterial composition, suggesting that the microbial community is a sensitive parameter affected by the burial environment. The phylum classification results showed that Proteobacteria (Pseudomonas) were the most influential taxa in earlier decomposition stage whereas Firmicutes (Clostridium, Sporanaerobacter, and Peptostreptococcus) were dominant in later stage under anaerobic conditions. The result of this study can provide useful information on a time series of leachate profiles of microbial community structures and suggest patterns of microbial diversity in livestock burial sites. In addition, this result can be applicable to predict the decomposition stages under clay loam based soil conditions of animal livestock. Copyright © 2017 Elsevier B.V. All rights reserved.
Droplet-Based Pyrosequencing Using Digital Microfluidics
Boles, Deborah J.; Benton, Jonathan L.; Siew, Germaine J.; Levy, Miriam H.; Thwar, Prasanna K.; Sandahl, Melissa A.; Rouse, Jeremy L.; Perkins, Lisa C.; Sudarsan, Arjun P.; Jalili, Roxana; Pamula, Vamsee K.; Srinivasan, Vijay; Fair, Richard B.; Griffin, Peter B.; Eckhardt, Allen E.; Pollack, Michael G.
2013-01-01
The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., “sample-to-sequence” capability) could eventually be achieved using this low-cost platform. PMID:21932784
Microbial analysis in primary and persistent endodontic infections by using pyrosequencing.
Hong, Bo-Young; Lee, Tae-Kwon; Lim, Sang-Min; Chang, Seok Woo; Park, Joonhong; Han, Seung Hyun; Zhu, Qiang; Safavi, Kamran E; Fouad, Ashraf F; Kum, Kee Yeon
2013-09-01
The aim of this study was to investigate the bacterial community profile of intracanal microbiota in primary and persistent endodontic infections associated with asymptomatic chronic apical periodontitis by using GS-FLX Titanium pyrosequencing. The null hypothesis was that there is no difference in diversity of overall bacterial community profiles between primary and persistent infections. Pyrosequencing analysis from 10 untreated and 8 root-filled samples was conducted. Analysis from 18 samples yielded total of 124,767 16S rRNA gene sequences (with a mean of 6932 reads per sample) that were taxonomically assigned into 803 operational taxonomic units (3% distinction), 148 genera, and 10 phyla including unclassified. Bacteroidetes was the most abundant phylum in both primary and persistent infections. There were no significant differences in bacterial diversity between the 2 infection groups (P > .05). The bacterial community profile that was based on dendrogram showed that bacterial population in both infections was not significantly different in their structure and composition (P > .05). The present pyrosequencing study demonstrates that persistent infections have as diverse bacterial community as primary infections. Copyright © 2013 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Rapid phylogenetic dissection of prokaryotic community structure in tidal flat using pyrosequencing.
Kim, Bong-Soo; Kim, Byung Kwon; Lee, Jae-Hak; Kim, Myungjin; Lim, Young Woon; Chun, Jongsik
2008-08-01
Dissection of prokaryotic community structure is a prerequisite for understanding the ecological roles of its members. Various methods are available for this purpose, among which amplification and sequencing of 16S rRNA genes has gained popularity. However, conventional methods based on the Sanger sequencing technique require a cloning step prior to sequencing, and are expensive and labor-intensive. We investigated prokaryotic community structure in tidal flat sediments in Korea using pyrosequencing and a subsequent automated bioinformatic pipeline for the rapid and accurate taxonomic assignment of each amplicon. The combination of pyrosequencing and bioinformatic analysis showed that bacterial and archaeal communities were more diverse than previously reported in clone library studies. Pyrosequencing analysis revealed 21 bacterial divisions and 37 candidate divisions. Proteobacteria was the most abundant division in the bacterial community, of which Gamma- and Delta-Proteobacteria were the most abundant. Similarly, 4 archaeal divisions were found in tidal flat sediments. Euryarchaeota was the most abundant division in the archaeal sequences, which were further divided into 8 classes and 11 unclassified euryarchaeota groups. The system developed here provides a simple, in-depth and automated way of dissecting a prokaryotic community structure without extensive pretreatment such as cloning.
Preusser, Matthias; Berghoff, Anna S.; Manzl, Claudia; Filipits, Martin; Weinhäusel, Andreas; Pulverer, Walter; Dieckmann, Karin; Widhalm, Georg; Wöhrer, Adelheid; Knosp, Engelbert; Marosi, Christine; Hainfellner, Johannes A.
2014-01-01
Testing of the MGMT promoter methylation status in glioblastoma is relevant for clinical decision making and research applications. Two recent and independent phase III therapy trials confirmed a prognostic and predictive value of the MGMT promoter methylation status in elderly glioblastoma patients. Several methods for MGMT promoter methylation testing have been proposed, but seem to be of limited test reliability. Therefore, and also due to feasibility reasons, translation of MGMT methylation testing into routine use has been protracted so far. Pyrosequencing after prior DNA bisulfite modification has emerged as a reliable, accurate, fast and easy-to-use method for MGMT promoter methylation testing in tumor tissues (including formalin-fixed and paraffin-embedded samples). We performed an intra- and inter-laboratory ring trial which demonstrates a high analytical performance of this technique. Thus, pyrosequencing-based assessment of MGMT promoter methylation status in glioblastoma meets the criteria of high analytical test performance and can be recommended for clinical application, provided that strict quality control is performed. Our article summarizes clinical indications, practical instructions and open issues for MGMT promoter methylation testing in glioblastoma using pyrosequencing. PMID:24359605
Mahmood, Khalid; Højland, Dorte H; Asp, Torben; Kristensen, Michael
2016-01-01
Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome, with a focus on P450 genes. The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad-resistant strain. A total of 3,648 sequences were annotated with an Enzyme Commission (EC) number and were mapped to 124 KEGG pathways, with metabolic processes as the most highly represented pathway. One hundred and twenty contigs were annotated as P450s, covering 44 different housefly P450 genes. Eight differentially expressed P450 genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolism-related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis identified 12 SNPs, eight of which were found in cyp6d1. The location, size and frequency of CpG islands varied, and specific motifs were also identified in these P450s. Moreover, the identified motifs were associated with GO terms and transcription factors using bioinformatic tools. Transcriptome data of a spinosad-resistant strain, together with genome data, provide fundamental support for future research to understand the evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. Taken together, our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s in xenobiotic detoxification.
Liu, Yulin; Huang, Zhedong; Ao, Yan; Li, Wei; Zhang, Zhixiang
2013-01-01
Background Yellow horn (Xanthoceras sorbifolia Bunge) is an oil-rich seed shrub that grows well in cold, barren environments and has great potential for biodiesel production in China. However, the limited genetic data means that little information about the key genes involved in oil biosynthesis is available, which limits further improvement of this species. In this study, we describe sequencing and de novo transcriptome assembly to produce the first comprehensive and integrated genomic resource for yellow horn and identify the pathways and key genes related to oil accumulation. In addition, potential molecular markers were identified and compiled. Methodology/Principal Findings Total RNA was isolated from 30 plants from two regions, including buds, leaves, flowers and seeds. Equal quantities of RNA from these tissues were pooled to construct a cDNA library for 454 pyrosequencing. A total of 1,147,624 high-quality reads with total and average lengths of 530.6 Mb and 462 bp, respectively, were generated. These reads were assembled into 51,867 unigenes, corresponding to a total of 36.1 Mb with a mean length, N50 and median of 696, 928 and 570 bp, respectively. Of the unigenes, 17,541 (33.82%) were unmatched in any public protein databases. We identified 281 unigenes that may be involved in de novo fatty acid (FA) and triacylglycerol (TAG) biosynthesis and metabolism. Furthermore, 6,707 SSRs, 16,925 SNPs and 6,201 InDels with high-confidence were also identified in this study. Conclusions This transcriptome represents a new functional genomics resource and a foundation for further studies on the metabolic engineering of yellow horn to increase oil content and modify oil composition. The potential molecular markers identified in this study provide a basis for polymorphism analysis of Xanthoceras, and even Sapindaceae; they will also accelerate the process of breeding new varieties with better agronomic characteristics. PMID:24040247
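The assembly statistics quoted above (mean length, N50) can be illustrated with a short sketch. The `n50` function below uses the common definition of N50, the toy list of lengths is hypothetical, and only the totals used in the consistency checks come from the abstract.

```python
def n50(lengths):
    """N50: the length L such that sequences of length >= L together
    cover at least half of the total assembled length."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")

# Hypothetical toy assembly: total 20 bp, so half is reached at 6 + 5 = 11.
toy_n50 = n50([2, 3, 4, 5, 6])

# Consistency checks on totals quoted in the abstract.
mean_unigene_bp = 36.1e6 / 51_867       # total unigene length / unigene count
mean_read_bp = 530.6e6 / 1_147_624      # total read length / read count
unmatched_percent = 100.0 * 17_541 / 51_867

print(toy_n50, round(mean_unigene_bp), round(mean_read_bp))
print(f"{unmatched_percent:.2f}%")
```

The recomputed means (about 696 bp per unigene and 462 bp per read) and the unmatched fraction agree with the values reported in the abstract; the reported N50 of 928 bp cannot be recomputed without the individual unigene lengths.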
Han, Hua; Sun, Xiaomei; Xie, Yunhui; Feng, Jian; Zhang, Shougong
2014-11-26
Hybrids of larch (Larix kaempferi × Larix olgensis) are important afforestation species in northeastern China. They are routinely propagated via rooted stem cuttings. Despite the importance of rooting, little is known about the regulation of adventitious root development in larch hybrids. 454 GS FLX Titanium technology represents a new method for characterizing the transcriptomes of non-model species. This method can be used to identify differentially expressed genes, and then two-dimensional difference gel electrophoresis (2D-DIGE) and matrix-assisted laser desorption-ionization time-of-flight mass spectrometry (MALDI-TOF/TOF MS) analyses can be used to analyze their corresponding proteins. In this study, we analyzed semi-lignified cuttings of two clones of L. kaempferi × L. olgensis with different rooting capacities to study the molecular basis of adventitious root development: clone 25-5, with strong rooting capacity, and clone 23-12, with weak rooting capacity. We constructed four cDNA libraries from 25-5 and 23-12 at two developmental stages. Sequencing was conducted using the 454 pyrosequencing platform. A total of 957,832 raw reads were produced; 95.07% were high-quality reads, which were assembled into 45,137 contigs and 61,647 singletons. The functions of the unigenes, as indicated by their Gene Ontology annotation, included diverse roles in the molecular function, biological process, and cellular component categories. We analyzed 75 protein spots (fold change ≥ 2, P ≤ 0.05) by 2D-DIGE, and identified the differentially expressed proteins using MALDI-TOF/TOF MS. A joint analysis of the transcriptome and proteome showed that genes related to two pathways, polyamine synthesis and stress response, might play an important role in adventitious root development. These results provide fundamental and important information for research on the molecular mechanism of adventitious root development. 
We also demonstrated for the first time the combined use of two important technologies as a powerful approach to advance research on non-model, but otherwise important, larch species.
McCarthy, Christina B.; Santini, María Soledad; Pimenta, Paulo F. P.; Diambra, Luis A.
2013-01-01
Leishmaniasis is a vector-borne disease with a complex epidemiology and ecology. Visceral leishmaniasis (VL) is its most severe clinical form as it results in death if not treated. In Latin America VL is caused by the protist parasite Leishmania infantum (syn. chagasi) and transmitted by Lutzomyia longipalpis. This phlebotomine sand fly is only found in the New World, from Mexico to Argentina. However, due to deforestation, migration and urbanisation, among others, VL in Latin America is undergoing an evident geographic expansion as well as dramatic changes in its transmission patterns. In this context, the first VL outbreak was recently reported in Argentina, which has already caused 7 deaths and 83 reported cases. Insect vector transcriptomic analyses enable the identification of molecules involved in the insect's biology and vector-parasite interaction. Previous studies on laboratory reared Lu. longipalpis have provided a descriptive repertoire of gene expression in the whole insect, midgut, salivary gland and male reproductive organs. Nevertheless, the study of wild specimens would contribute a unique insight into the development of novel bioinsecticides. Given the recent VL outbreak in Argentina and the compelling need to develop appropriate control strategies, this study focused on wild male and female Lu. longipalpis from an Argentine endemic (Posadas, Misiones) and a Brazilian non-endemic (Lapinha Cave, Minas Gerais) VL location. In this study, total RNA was extracted from the sand flies, submitted to sequence independent amplification and high-throughput pyrosequencing. This is the first time an unbiased and comprehensive transcriptomic approach has been used to analyse an infectious disease vector in its natural environment. Transcripts identified in the sand flies showed characteristic profiles which correlated with the environment of origin and with taxa previously identified in these same specimens. 
Among these, various genes represented putative targets for vector control via RNA interference (RNAi). PMID:23554910
Antarctic krill 454 pyrosequencing reveals chaperone and stress transcriptome.
Clark, Melody S; Thorne, Michael A S; Toullec, Jean-Yves; Meng, Yan; Guan, Le Luo; Peck, Lloyd S; Moore, Stephen
2011-01-06
The Antarctic krill Euphausia superba is a keystone species in the Antarctic food chain. Not only is it a significant grazer of phytoplankton, but it is also a major food item for charismatic megafauna such as whales and seals and an important Southern Ocean fisheries crop. Ecological data suggest that this species is being affected by climate change and this will have considerable consequences for the balance of the Southern Ocean ecosystem. Hence, understanding how this organism functions is a priority area and will provide fundamental data for life history studies, energy budget calculations and food web models. The assembly of the 454 transcriptome of E. superba resulted in 22,177 contigs with an average size of 492bp (ranging between 137 and 8515bp). In depth analysis of the data revealed an extensive catalogue of the cellular chaperone systems and the major antioxidant proteins. Full length sequences were characterised for the chaperones HSP70, HSP90 and the super-oxide dismutase antioxidants, with the discovery of potentially novel duplications of these genes. The sequence data contained 41,470 microsatellites and 17,776 Single Nucleotide Polymorphisms (SNPs/INDELS), providing a resource for population and also gene function studies. This paper details the first 454 generated data for a pelagic Antarctic species or any pelagic crustacean globally. The classical "stress proteins", such as HSP70, HSP90, ferritin and GST were all highly expressed. These genes were shown to be over expressed in the transcriptomes of Antarctic notothenioid fish and hypothesized as adaptations to living in the cold, with the associated problems of decreased protein folding efficiency and increased vulnerability to damage by reactive oxygen species. Hence, these data will provide a major resource for future physiological work on krill, but in particular a suite of "stress" genes for studies understanding marine ectotherms' capacities to cope with environmental change.
Bongers, Roger S.; van Bokhorst-van de Veen, Hermien; Wiersma, Anne; Overmars, Lex; Marco, Maria L.; Kleerebezem, Michiel
2012-01-01
Lactic acid bacteria (LAB) are utilized widely for the fermentation of foods. In the current post-genomic era, tools have been developed that explore genetic diversity among LAB strains aiming to link these variations to differential phenotypes observed in the strains investigated. However, these genotype-phenotype matching approaches fail to assess the role of conserved genes in the determination of physiological characteristics of cultures by environmental conditions. This manuscript describes a complementary approach in which Lactobacillus plantarum WCFS1 was fermented under a variety of conditions that differ in temperature, pH, as well as NaCl, amino acid, and O2 levels. Samples derived from these fermentations were analyzed by full-genome transcriptomics, paralleled by the assessment of physiological characteristics, e.g., maximum growth rate, yield, and organic acid profiles. A data-storage and -mining suite designated FermDB was constructed and exploited to identify correlations between fermentation conditions and industrially relevant physiological characteristics of L. plantarum, as well as the associated transcriptome signatures. Finally, integration of the specific fermentation variables with the transcriptomes enabled the reconstruction of the gene-regulatory networks involved. The fermentation-genomics platform presented here is a valuable complementary approach to earlier described genotype-phenotype matching strategies which allows the identification of transcriptome signatures underlying physiological variations imposed by different fermentation conditions. PMID:22802930
Velotta, Jonathan P.; Wegrzyn, Jill L.; Ginzburg, Samuel; Kang, Lin; Czesny, Sergiusz J.; O'Neill, Rachel J.; McCormick, Stephen; Michalak, Pawel; Schultz, Eric T.
2017-01-01
Comparative approaches in physiological genomics offer an opportunity to understand the functional importance of genes involved in niche exploitation. We used populations of Alewife (Alosa pseudoharengus) to explore the transcriptional mechanisms that underlie adaptation to fresh water. Ancestrally anadromous Alewives have recently formed multiple, independently derived, landlocked populations, which exhibit reduced tolerance of saltwater and enhanced tolerance of fresh water. Using RNA-seq, we compared transcriptional responses of an anadromous Alewife population to two landlocked populations after acclimation to fresh (0 ppt) and saltwater (35 ppt). Our results suggest that the gill transcriptome has evolved in primarily discordant ways between independent landlocked populations and their anadromous ancestor. By contrast, evolved shifts in the transcription of a small suite of well-characterized osmoregulatory genes exhibited a strong degree of parallelism. In particular, transcription of genes that regulate gill ion exchange has diverged in accordance with functional predictions: freshwater ion-uptake genes (most notably, the ‘freshwater paralog’ of Na+/K+-ATPase α-subunit) were more highly expressed in landlocked forms, whereas genes that regulate saltwater ion secretion (e.g. the ‘saltwater paralog’ of NKAα) exhibited a blunted response to saltwater. Parallel divergence of ion transport gene expression is associated with shifts in salinity tolerance limits among landlocked forms, suggesting that changes to the gill's transcriptional response to salinity facilitate freshwater adaptation.
2010-01-01
Background Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity, we carried out a high-throughput sequence analysis of the freshly collected B. azoricus transcriptome using gill tissue as the primary source of immune transcripts, given its strategic role in filtering the surrounding waterborne, potentially infectious microorganisms. Additionally, a substantial EST data set was produced, from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent", the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results A normalized cDNA library from gill tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high-quality reads resulted in 75,407 contigs, of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino acid sequences, of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative Reverse Transcription-Polymerase Chain Reaction (RT-PCR) and their RNA transcription level by quantitative PCR (qPCR) experiments. 
Conclusions We have established the first tissue transcriptional analysis of a deep-sea hydrothermal vent animal and generated a searchable catalog of genes that provides a direct method of identifying and retrieving vast numbers of novel coding sequences which can be applied in gene expression profiling experiments from a non-conventional model organism. This provides the most comprehensive sequence resource for identifying novel genes currently available for a deep-sea vent organism, in particular, genes putatively involved in immune and inflammatory reactions in vent mussels. The characterization of the B. azoricus transcriptome will facilitate research into biological processes underlying physiological adaptations to hydrothermal vent environments and will provide a basis for expanding our understanding of genes putatively involved in adaptation processes during post-capture long-term acclimatization experiments, at "sea-level" conditions, using B. azoricus as a model organism. PMID:20937131
Development of an ELA-DRA gene typing method based on pyrosequencing technology.
Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G
2008-11-01
The polymorphism of the equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed the identification of 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.
RNA-Seq reveals genotype-specific molecular responses to water deficit in eucalyptus
2011-01-01
Background In a context of climate change, phenotypic plasticity provides long-lived species, such as trees, with the means to adapt to environmental variations occurring within a single generation. In eucalyptus plantations, water availability is a key factor limiting productivity. However, the molecular mechanisms underlying the adaptation of eucalyptus to water shortage remain unclear. In this study, we compared the molecular responses of two commercial eucalyptus hybrids during the dry season. The two hybrids differ in productivity when grown under water deficit. Results Pyrosequencing of RNA extracted from shoot apices provided extensive transcriptome coverage - a catalog of 129,993 unigenes (49,748 contigs and 80,245 singletons) was generated from 398 million base pairs, or 1.14 million reads. The pyrosequencing data considerably enriched existing Eucalyptus EST collections, adding 36,985 unigenes not previously represented. Digital analysis of read abundance in 14,460 contigs identified 1,280 that were differentially expressed between the two genotypes, 155 contigs showing differential expression between treatments (irrigated vs. non-irrigated conditions during the dry season), and 274 contigs with significant genotype-by-treatment interaction. The more productive genotype displayed a larger set of genes responding to water stress. Moreover, stress signal transduction seemed to involve different pathways in the two genotypes, suggesting that water shortage induces distinct cellular stress cascades. Similarly, the response of functional proteins also varied widely between genotypes: the more productive genotype decreased expression of genes related to photosystem, transport and secondary metabolism, whereas genes related to primary metabolism and cell organisation were over-expressed. 
Conclusions For the most productive genotype, the ability to express a broader set of genes in response to water availability appears to be a key characteristic in the maintenance of biomass growth during the dry season. Its strategy may involve a decrease of photosynthetic activity during the dry season associated with resources reallocation through major changes in the expression of primary metabolism associated genes. Further efforts will be needed to assess the adaptive nature of the genes highlighted in this study. PMID:22047139
Urease gene-containing Archaea dominate autotrophic ammonia oxidation in two acid soils.
Lu, Lu; Jia, Zhongjun
2013-06-01
The metabolic traits of ammonia-oxidizing archaea (AOA) and bacteria (AOB) interacting with their environment determine the nitrogen cycle at the global scale. Ureolytic metabolism has long been proposed as a mechanism for AOB to cope with substrate paucity in acid soil, but it remains unclear whether urea hydrolysis could afford AOA greater ecological advantages. By combining DNA-based stable isotope probing (SIP) and high-throughput pyrosequencing, here we show that autotrophic ammonia oxidation in two acid soils was predominantly driven by AOA that contain ureC genes encoding the alpha subunit of a putative archaeal urease. In urea-amended SIP microcosms of forest soil (pH 5.40) and tea orchard soil (pH 3.75), nitrification activity was stimulated significantly by urea fertilization when compared with water-amended soils in which nitrification resulted solely from the oxidation of ammonia generated through mineralization of soil organic nitrogen. The stimulated activity was paralleled by changes in abundance and composition of archaeal amoA genes. Time-course incubations indicated that archaeal amoA genes were increasingly labelled by 13CO2 in both microcosms amended with water and urea. Pyrosequencing revealed that archaeal populations were labelled to a much greater extent in soils amended with urea than water. Furthermore, archaeal ureC genes were successfully amplified in the 13C-DNA, and acetylene inhibition suggests that autotrophic growth of urease-containing AOA depended on energy generation through ammonia oxidation. The sequences of AOB were not detected, and active AOA were affiliated with the marine Group 1.1a-associated lineage. The results suggest that ureolytic N metabolism could afford AOA greater advantages for autotrophic ammonia oxidation in acid soil, but the mechanism of how urea activates AOA cells remains unclear. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
Malenke, J R; Milash, B; Miller, A W; Dearing, M D
2013-07-01
Massively parallel sequencing has enabled the creation of novel, in-depth genetic tools for nonmodel, ecologically important organisms. We present the de novo transcriptome sequencing, analysis and microarray development for a vertebrate herbivore, the woodrat (Neotoma spp.). This genus is of ecological and evolutionary interest, especially with respect to ingestion and hepatic metabolism of potentially toxic plant secondary compounds. We generated a liver transcriptome of the desert woodrat (Neotoma lepida) using the Roche 454 platform. The assembled contigs were well annotated using rodent references (99.7% annotation), and biotransformation function was reflected in the gene ontology. The transcriptome was used to develop a custom microarray (eArray, Agilent). We tested the microarray with three experiments: one across species with similar habitat (thus, dietary) niches, one across species with different habitat niches and one across populations within a species. The resulting one-colour arrays had high technical and biological quality. Probes designed from the woodrat transcriptome performed significantly better than functionally similar probes from the Norway rat (Rattus norvegicus). There were a multitude of expression differences across the woodrat treatments, many of which related to biotransformation processes and activities. The pattern and function of the differences indicate shared ecological pressures, and not merely phylogenetic distance, play an important role in shaping gene expression profiles of woodrat species and populations. The quality and functionality of the woodrat transcriptome and custom microarray suggest these tools will be valuable for expanding the scope of herbivore biology, as well as the exploration of conceptual topics in ecology. © 2013 John Wiley & Sons Ltd.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells.
Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang
2018-01-01
Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. © 2018 Han et al.; Published by Cold Spring Harbor Laboratory Press.
PMID:29208629
Cozzi-Lepri, Alessandro; Noguera-Julian, Marc; Di Giallonardo, Francesca; Schuurman, Rob; Däumer, Martin; Aitken, Sue; Ceccherini-Silberstein, Francesca; D'Arminio Monforte, Antonella; Geretti, Anna Maria; Booth, Clare L; Kaiser, Rolf; Michalik, Claudia; Jansen, Klaus; Masquelier, Bernard; Bellecave, Pantxika; Kouyos, Roger D; Castro, Erika; Furrer, Hansjakob; Schultze, Anna; Günthard, Huldrych F; Brun-Vezinet, Francoise; Paredes, Roger; Metzner, Karin J
2015-03-01
It is still debated if pre-existing minority drug-resistant HIV-1 variants (MVs) affect the virological outcomes of first-line NNRTI-containing ART. This Europe-wide case-control study included ART-naive subjects infected with drug-susceptible HIV-1 as revealed by population sequencing, who achieved virological suppression on first-line ART including one NNRTI. Cases experienced virological failure and controls were subjects from the same cohort whose viraemia remained suppressed at a matched time since initiation of ART. Blinded, centralized 454 pyrosequencing with parallel bioinformatic analysis in two laboratories was used to identify MVs in the 1%-25% frequency range. ORs of virological failure according to MV detection were estimated by logistic regression. Two hundred and sixty samples (76 cases and 184 controls), mostly subtype B (73.5%), were used for the analysis. Identical MVs were detected in the two laboratories. 31.6% of cases and 16.8% of controls harboured pre-existing MVs. Detection of at least one MV versus no MVs was associated with an increased risk of virological failure (OR = 2.75, 95% CI = 1.35-5.60, P = 0.005); similar associations were observed for at least one MV versus no NRTI MVs (OR = 2.27, 95% CI = 0.76-6.77, P = 0.140) and at least one MV versus no NNRTI MVs (OR = 2.41, 95% CI = 1.12-5.18, P = 0.024). A dose-effect relationship between virological failure and mutational load was found. Pre-existing MVs more than double the risk of virological failure to first-line NNRTI-based ART. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy.
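The odds ratios quoted above come from the study's logistic regression. As a rough illustration of the underlying quantity, the following minimal sketch computes an unadjusted odds ratio with a Woolf (log-based) confidence interval from a 2x2 table. The function name and the cell counts are assumptions: the counts are back-calculated from the rounded percentages in the abstract (31.6% of 76 cases, 16.8% of 184 controls), so the crude estimate is not expected to reproduce the reported model-based value.

```python
import math


def odds_ratio(exposed_cases, unexposed_cases,
               exposed_controls, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Woolf (log-based) confidence interval."""
    a, b, c, d = (exposed_cases, unexposed_cases,
                  exposed_controls, unexposed_controls)
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    estimate = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(estimate) - z * se)
    upper = math.exp(math.log(estimate) + z * se)
    return estimate, lower, upper


# Hypothetical counts reconstructed from the rounded percentages above;
# the abstract does not state them explicitly.
cases_with_mv, cases_total = 24, 76
controls_with_mv, controls_total = 31, 184

crude = odds_ratio(cases_with_mv, cases_total - cases_with_mv,
                   controls_with_mv, controls_total - controls_with_mv)
print("crude OR = %.2f (95%% CI %.2f-%.2f)" % crude)
```

A crude table ignores the matching and any covariates used in the published analysis, which is one reason the two estimates can differ.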
Boyer, Stephane; Brown, Samuel D. J.; Collins, Rupert A.; Cruickshank, Robert H.; Lefort, Marie-Caroline; Malumbres-Olarte, Jagoba; Wratten, Stephen D.
2012-01-01
DNA barcoding remains a challenge when applied to diet analyses, ancient DNA studies, environmental DNA samples and, more generally, in any case where DNA samples have not been adequately preserved. Because the size of the commonly used barcoding marker (COI) is over 600 base pairs (bp), amplification fails when the DNA molecule is degraded into smaller fragments. However, relevant information for specimen identification may not be evenly distributed along the barcoding region, and a shorter target can be sufficient for identification purposes. This study proposes a new, widely applicable method to compare the performance of all potential ‘mini-barcodes’ for a given molecular marker and to objectively select the shortest and most informative one. Our method is based on a sliding window analysis implemented in the new R package SPIDER (Species IDentity and Evolution in R). This method is applicable to any taxon and any molecular marker. Here, it was tested on earthworm DNA that had been degraded through digestion by carnivorous landsnails. A 100 bp region of 16S rDNA was selected as the shortest informative fragment (mini-barcode) required for accurate specimen identification. Corresponding primers were designed and used to amplify degraded earthworm (prey) DNA from 46 landsnail (predator) faeces using 454-pyrosequencing. This led to the detection of 18 earthworm species in the diet of the snail. We encourage molecular ecologists to use this method to objectively select the most informative region of the gene they aim to amplify from degraded DNA. The method and tools provided here can be particularly useful (1) when dealing with degraded DNA for which only small fragments can be amplified, (2) for cases where no consensus has yet been reached on the appropriate barcode gene, or (3) to allow direct analysis of short reads derived from massively parallel sequencing without the need for bioinformatic consolidation. PMID:22666489
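The sliding window idea described above can be sketched in a few lines. This is not the SPIDER implementation (SPIDER is an R package and its functions are not reproduced here); it is a simplified Python illustration under stated assumptions: the input is an ungapped alignment of equal-length sequences, the names `score_windows` and `best_window` are hypothetical, and a window is scored by the fraction of sequences whose fragment is not shared with any other species.

```python
def score_windows(alignment, species, width):
    """Score every window of the given width along an alignment.

    alignment: dict mapping sequence name -> aligned sequence (equal lengths)
    species:   dict mapping sequence name -> species label
    Returns a list of (start, score), where score is the fraction of
    sequences whose fragment occurs in one species only.
    """
    length = len(next(iter(alignment.values())))
    scores = []
    for start in range(length - width + 1):
        fragments = {name: seq[start:start + width]
                     for name, seq in alignment.items()}
        # Record which species carry each distinct fragment.
        owners = {}
        for name, frag in fragments.items():
            owners.setdefault(frag, set()).add(species[name])
        unambiguous = sum(1 for frag in fragments.values()
                          if len(owners[frag]) == 1)
        scores.append((start, unambiguous / len(fragments)))
    return scores


def best_window(alignment, species, width):
    """Return the leftmost (start, score) with the highest score."""
    return max(score_windows(alignment, species, width), key=lambda s: s[1])


# Toy alignment (invented for illustration): species differ at two sites.
alignment = {
    "a1": "AAAACGTAAAA",
    "a2": "AAAACGTAAAA",
    "b1": "AAAATGTAAAA",
    "c1": "AAAACGAAAAA",
}
species = {"a1": "A", "a2": "A", "b1": "B", "c1": "C"}

print(best_window(alignment, species, 3))
```

Repeating the scan over several widths and keeping the narrowest window that still reaches the desired score would mirror the goal of selecting the shortest informative fragment.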
Enabling large-scale next-generation sequence assembly with Blacklight
Couger, M. Brian; Pipes, Lenore; Squina, Fabio; Prade, Rolf; Siepel, Adam; Palermo, Robert; Katze, Michael G.; Mason, Christopher E.; Blood, Philip D.
2014-01-01
Summary A variety of extremely challenging biological sequence analyses were conducted on the XSEDE large shared memory resource Blacklight, using current bioinformatics tools and encompassing a wide range of scientific applications. These include genomic sequence assembly, very large metagenomic sequence assembly, transcriptome assembly, and sequencing error correction. The data sets used in these analyses included uncategorized fungal species, reference microbial data, very large soil and human gut microbiome sequence data, and primate transcriptomes, composed of both short-read and long-read sequence data. A new parallel command execution program was developed on the Blacklight resource to handle some of these analyses. These results, initially reported previously at XSEDE13 and expanded here, represent significant advances for their respective scientific communities. The breadth and depth of the results achieved demonstrate the ease of use, versatility, and unique capabilities of the Blacklight XSEDE resource for scientific analysis of genomic and transcriptomic sequence data, and the power of these resources, together with XSEDE support, in meeting the most challenging scientific problems. PMID:25294974
Borman, Andrew M.; Linton, Christopher J.; Oliver, Debra; Palmer, Michael D.; Szekely, Adrien; Johnson, Elizabeth M.
2010-01-01
Rapid identification of yeast species isolates from clinical samples is particularly important given their innately variable antifungal susceptibility profiles. Here, we have evaluated the utility of pyrosequencing analysis of a portion of the internal transcribed spacer 2 region (ITS2) for identification of pathogenic yeasts. A total of 477 clinical isolates encompassing 43 different fungal species were subjected to pyrosequencing analysis in a strictly blinded study. The molecular identifications produced by pyrosequencing were compared with those obtained using conventional biochemical tests (AUXACOLOR2) and following PCR amplification and sequencing of the D1-D2 portion of the nuclear 28S large rRNA gene. More than 98% (469/477) of isolates encompassing 40 of the 43 fungal species tested were correctly identified by pyrosequencing of only 35 bp of ITS2. Moreover, BLAST searches of the public synchronized databases with the ITS2 pyrosequencing signature sequences revealed that there was only minimal sequence redundancy in the ITS2 under analysis. In all cases, the pyrosequencing signature sequences were unique to the yeast species (or species complex) under investigation. Finally, when pyrosequencing was combined with the Whatman FTA paper technology for the rapid extraction of fungal genomic DNA, molecular identification could be accomplished within 6 h from the time of starting from pure cultures. PMID:20702674
Agave: a biofuel feedstock for arid and semi-arid environments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gross, Stephen; Martin, Jeffrey; Simpson, June
2011-05-31
Efficient production of plant-based, lignocellulosic biofuels relies upon continued improvement of existing biofuel feedstock species, as well as the introduction of new feedstocks capable of growing on marginal lands to avoid conflicts with existing food production and minimize use of water and nitrogen resources. To this end, species within the plant genus Agave have recently been proposed as new biofuel feedstocks. Many Agave species are adapted to hot and arid environments generally unsuitable for food production, yet have biomass productivity rates comparable to other second-generation biofuel feedstocks such as switchgrass and Miscanthus. Agaves achieve remarkable heat tolerance and water use efficiency in part through a Crassulacean Acid Metabolism (CAM) mode of photosynthesis, but the genes and regulatory pathways enabling CAM and thermotolerance in agaves remain poorly understood. We seek to accelerate the development of agave as a new biofuel feedstock through genomic approaches using massively-parallel sequencing technologies. First, we plan to sequence the transcriptome of A. tequilana to provide a database of protein-coding genes to the agave research community. Second, we will compare transcriptome-wide gene expression of agaves under different environmental conditions in order to understand genetic pathways controlling CAM, water use efficiency, and thermotolerance. Finally, we aim to compare the transcriptome of A. tequilana with that of other Agave species to gain further insight into molecular mechanisms underlying traits desirable for biofuel feedstocks. These genomic approaches will provide sequence and gene expression information critical to the breeding and domestication of Agave species suitable for biofuel production.
Schiller, Viktoria; Wichmann, Arne; Kriehuber, Ralf; Muth-Köhne, Elke; Giesy, John P; Hecker, Markus; Fenske, Martina
2013-01-01
Assessment of endocrine disruption currently relies on testing strategies involving adult vertebrates. In order to minimize the use of animal tests according to the 3Rs principle of replacement, reduction and refinement, we propose a transcriptomics and fish embryo based approach as an alternative to identify and analyze an estrogenic activity of environmental chemicals. For this purpose, the suitability of 48 h and 7 days post-fertilization zebrafish and medaka embryos to test for estrogenic disruption was evaluated. The embryos were exposed to the phytoestrogen genistein and subsequently analyzed by microarrays and quantitative real-time PCR. The functional analysis showed that the genes affected related to multiple metabolic and signaling pathways in the early fish embryo, which reflect the known components of genistein's mode of actions, like apoptosis, estrogenic response, hox gene expression and steroid hormone synthesis. Moreover, the transcriptomic data also suggested a thyroidal mode of action and disruption of the nervous system development. The parallel testing of two fish species provided complementary data on the effects of genistein at gene expression level and facilitated the separation of common from species-dependent effects. Overall, the study demonstrated that combining fish embryo testing with transcriptomics can deliver abundant information about the mechanistic effects of endocrine disrupting chemicals, rendering this strategy a promising alternative approach to test for endocrine disruption in a whole organism in-vitro scale system. Copyright © 2012 Elsevier Inc. All rights reserved.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa
2012-01-01
Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, hardly any genetic studies have been performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results We developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of the estimated lily transcriptome and 39% of the estimated tulip transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice as abundant in UTRs (UnTranslated Regions) as in coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2- to 3-fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSRs. The lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. 
Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289
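The flanking-SNP rule used to build the two marker sets in this abstract can be illustrated with a minimal sketch. It assumes SNP positions on a single contig are given as unique integers, and it reads "one SNP in the flanking regions" as at most one other SNP across both 50 bp flanks combined; the function name `select_snp_markers` and the example positions are hypothetical and not taken from the study.

```python
from bisect import bisect_left, bisect_right


def select_snp_markers(positions, flank=50, max_flanking=0):
    """Keep SNPs with at most `max_flanking` other SNPs within `flank` bp.

    `positions` are unique SNP coordinates on one contig (an assumption of
    this sketch). max_flanking=0 mimics the strict first set described in
    the abstract, max_flanking=1 the relaxed second set.
    """
    ordered = sorted(positions)
    selected = []
    for pos in ordered:
        # Index range of all SNPs inside [pos - flank, pos + flank].
        lo = bisect_left(ordered, pos - flank)
        hi = bisect_right(ordered, pos + flank)
        neighbours = hi - lo - 1  # subtract the target SNP itself
        if neighbours <= max_flanking:
            selected.append(pos)
    return selected


# Made-up positions for illustration only.
example = [100, 130, 400, 700, 720, 745]
strict_set = select_snp_markers(example, max_flanking=0)
relaxed_set = select_snp_markers(example, max_flanking=1)
```

With these invented positions the relaxed rule keeps more markers than the strict one, which is the direction of the 2- to 3-fold increase reported above; the size of the increase in real data depends on SNP density.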
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.
Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus
2016-12-22
Next-generation sequencing promises de novo genomic and transcriptomic analysis of samples of interest. However, only a few organisms have reference genomic sequences, and even fewer have well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and methods for quantifying expression levels that not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the search results, a set of virtual transcripts is generated and serves as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning of the comparison among transcriptomes, PARRoT further links these virtual transcripts to their potential functions by showing best hits in SwissProt and the NR database and by assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours.
In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms, which offers the opportunity to quantify and compare transcriptome profiles through a homolog-based virtual transcriptome reference. By using the homolog-based reference, our strategy effectively avoids the problems that may arise from inconsistencies among transcriptomes. This strategy will shed light on the field of comparative genomics for non-model organisms. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw .
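For readers unfamiliar with the normalized values named in this abstract, the following minimal sketch computes RPKM and TPM from read counts using their standard textbook definitions. It is not PARRoT's implementation: how PARRoT derives its "estimated" eRPKM and eTPM from virtual transcripts is not described here, and the transcript names, counts and lengths below are invented for illustration.

```python
def rpkm(counts, lengths):
    """Reads per kilobase of transcript per million mapped reads."""
    total_reads = sum(counts.values())
    return {
        t: counts[t] * 1e9 / (lengths[t] * total_reads)
        for t in counts
    }


def tpm(counts, lengths):
    """Transcripts per million: length-normalize first, then scale to 1e6."""
    rate = {t: counts[t] / lengths[t] for t in counts}
    denom = sum(rate.values())
    return {t: r * 1e6 / denom for t, r in rate.items()}


# Hypothetical virtual transcripts: read counts and lengths in bp.
counts = {"vt_a": 100, "vt_b": 300}
lengths = {"vt_a": 1000, "vt_b": 3000}

rpkm_values = rpkm(counts, lengths)
tpm_values = tpm(counts, lengths)
```

Because TPM values always sum to one million within a sample, they are often preferred when comparing the relative abundance of a transcript across datasets quantified against the same reference.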
Rungrassamee, Wanilada; Klanchui, Amornpan; Chaiyapechara, Sage; Maibunkaew, Sawarot; Tangphatsornruang, Sithichoke; Jiravanichpaisal, Pikul; Karoonuthaisiri, Nitsara
2013-01-01
Intestinal bacterial communities in aquaculture have drawn attention due to their potential benefit to their hosts. To identify core intestinal bacteria in the black tiger shrimp (Penaeus monodon), bacterial populations of disease-free shrimp were characterized from intestines of four developmental stages (15-day-old post larvae (PL15), 1- (J1), 2- (J2), and 3-month-old (J3) juveniles) using pyrosequencing, real-time PCR and denaturing gradient gel electrophoresis (DGGE) approaches. A total of 25,121 pyrosequencing reads (read length = 442±24 bases) were obtained, which were categorized by barcode for PL15 (7,045 sequences), J1 (3,055 sequences), J2 (13,130 sequences) and J3 (1,890 sequences). Bacteria in the phyla Bacteroides, Firmicutes and Proteobacteria were found in intestines at all four growth stages. There were 88, 14, 27, and 20 bacterial genera associated with the intestinal tract of PL15, J1, J2 and J3, respectively. Pyrosequencing analysis revealed that Proteobacteria (class Gammaproteobacteria) was the dominant bacterial group, with a relative abundance of 89% for PL15 and 99% for J1, J2 and J3. Real-time PCR assay also confirmed that Gammaproteobacteria had the highest relative abundance in intestines from all growth stages. Intestinal bacterial communities from the three juvenile stages were more similar to each other than to that of the PL shrimp, based on PCA analyses of pyrosequencing results and their DGGE profiles. This study provides a description of the bacterial communities associated with the black tiger shrimp intestines during these growth development stages in rearing facilities. PMID:23577162
Nature and nurture: environmental influences on a genetic rat model of depression.
Mehta-Raghavan, N S; Wert, S L; Morley, C; Graf, E N; Redei, E E
2016-03-29
In this study, we sought to learn whether adverse events such as chronic restraint stress (CRS), or 'nurture' in the form of environmental enrichment (EE), could modify depression-like behavior and blood biomarker transcript levels in a genetic rat model of depression. The Wistar Kyoto More Immobile (WMI) is a genetic model of depression that aided in the identification of blood transcriptomic markers, which successfully distinguished adolescent and adult subjects with major depressive disorders from their matched no-disorder controls. Here, we followed the effects of CRS and EE in adult male WMIs and their genetically similar control strain, the Wistar Kyoto Less Immobile (WLI), that does not show depression-like behavior, by measuring the levels of these transcripts in the blood and hippocampus. In WLIs, increased depression-like behavior and transcriptomic changes were present in response to CRS, but in WMIs no behavioral or additive transcriptomic changes occurred. Environmental enrichment decreased both the inherent depression-like behavior in the WMIs and the behavioral difference between WMIs and WLIs, but did not reverse basal transcript level differences between the strains. The inverse behavioral change induced by CRS and EE in the WLIs did not result in parallel inverse expression changes of the transcriptomic markers, suggesting that these behavioral responses to the environment work via separate molecular pathways. In contrast, 'trait' transcriptomic markers with expression differences inherent and unchanging between the strains regardless of the environment suggest that in our model, environmental and genetic etiologies of depression work through independent molecular mechanisms.
Gluck, Christian; Min, Sangwon; Oyelakin, Akinsola; Smalley, Kirsten; Sinha, Satrajit; Romano, Rose-Anne
2016-11-16
Mouse models have served a valuable role in deciphering various facets of Salivary Gland (SG) biology, from normal developmental programs to diseased states. To facilitate such studies, gene expression profiling maps have been generated for various stages of SG organogenesis. However, these prior studies fall short of capturing the transcriptional complexity due to the limited scope of gene-centric microarray-based technology. Compared to microarray, RNA-sequencing (RNA-seq) offers unbiased detection of novel transcripts, broader dynamic range and high specificity and sensitivity for detection of genes, transcripts, and differential gene expression. Although RNA-seq data, particularly under the auspices of the ENCODE project, have covered a large number of biological specimens, studies on the SG have been lacking. To better appreciate the wide spectrum of gene expression profiles, we isolated RNA from mouse submandibular salivary glands at different embryonic and adult stages. In parallel, we processed RNA-seq data for 24 organs and tissues obtained from the mouse ENCODE consortium and calculated the average gene expression values. To identify molecular players and pathways likely to be relevant for SG biology, we performed functional gene enrichment analysis, network construction and hierarchical clustering of the RNA-seq datasets obtained from different stages of SG development and maturation, and other mouse organs and tissues. Our bioinformatics-based data analysis not only reaffirmed known modulators of SG morphogenesis but also revealed novel transcription factors and signaling pathways unique to mouse SG biology and function. Finally, we demonstrated that the unique SG gene signature obtained from our mouse studies is also well conserved and can demarcate features of the human SG transcriptome that differ from those of other tissues.
Our RNA-seq based Atlas has revealed a high-resolution cartographic view of the dynamic transcriptomic landscape of the mouse SG at various stages. These RNA-seq datasets will complement pre-existing microarray based datasets, including the Salivary Gland Molecular Anatomy Project by offering a broader systems-biology based perspective rather than the classical gene-centric view. Ultimately such resources will be valuable in providing a useful toolkit to better understand how the diverse cell population of the SG are organized and controlled during development and differentiation.
USDA-ARS's Scientific Manuscript database
Pecan nuts and other tree nuts can be a nutrient rich part of a healthy diet full of beneficial fatty acids and antioxidants, but can also cause allergic reactions in people suffering from food allergy to the nuts. We characterized the transcriptome of a developing pecan nut to identify the gene ex...
Dupl'áková, Nikoleta; Renák, David; Hovanec, Patrik; Honysová, Barbora; Twell, David; Honys, David
2007-07-23
Microarray technologies now belong to the standard functional genomics toolbox and have undergone massive development leading to increased genome coverage, accuracy and reliability. The number of experiments exploiting microarray technology has markedly increased in recent years. In parallel with the rapid accumulation of transcriptomic data, on-line analysis tools are being introduced to simplify their use. Global statistical data analysis methods contribute to the development of overall concepts about gene expression patterns and to query and compose working hypotheses. More recently, these applications are being supplemented with more specialized products offering visualization and specific data mining tools. We present a curated gene family-oriented gene expression database, Arabidopsis Gene Family Profiler (aGFP; http://agfp.ueb.cas.cz), which gives the user access to a large collection of normalised Affymetrix ATH1 microarray datasets. The database currently contains NASC Array and AtGenExpress transcriptomic datasets for various tissues at different developmental stages of wild type plants gathered from nearly 350 gene chips. The Arabidopsis GFP database has been designed as an easy-to-use tool for users needing an easily accessible resource for expression data of single genes, pre-defined gene families or custom gene sets, with the further possibility of keyword search. Arabidopsis Gene Family Profiler presents a user-friendly web interface using both graphic and text output. Data are stored at the MySQL server and individual queries are created in PHP script. 
The most distinguishable features of Arabidopsis Gene Family Profiler database are: 1) the presentation of normalized datasets (Affymetrix MAS algorithm and calculation of model-based gene-expression values based on the Perfect Match-only model); 2) the choice between two different normalization algorithms (Affymetrix MAS4 or MAS5 algorithms); 3) an intuitive interface; 4) an interactive "virtual plant" visualizing the spatial and developmental expression profiles of both gene families and individual genes. Arabidopsis GFP gives users the possibility to analyze current Arabidopsis developmental transcriptomic data starting with simple global queries that can be expanded and further refined to visualize comparative and highly selective gene expression profiles.
Warren, Ian A.; Vera, J. Cristobal; Johns, Annika; Zinna, Robert; Marden, James H.; Emlen, Douglas J.; Dworkin, Ian; Lavine, Laura C.
2014-01-01
Scarab beetles exhibit an astonishing variety of rigid exo-skeletal outgrowths, known as “horns”. These traits are often sexually dimorphic and vary dramatically across species in size, shape, location, and allometry with body size. In many species, the horn exhibits disproportionate growth resulting in an exaggerated allometric relationship with body size, as compared to other traits, such as wings, that grow proportionately with body size. Depending on the species, the smallest males either do not produce a horn at all, or they produce a disproportionately small horn for their body size. While the diversity of horn shapes and their behavioural ecology have been reasonably well studied, we know far less about the proximate mechanisms that regulate horn growth. Thus, using 454 pyrosequencing, we generated transcriptome profiles, during horn growth and development, in two different scarab beetle species: the Asian rhinoceros beetle, Trypoxylus dichotomus, and the dung beetle, Onthophagus nigriventris. We obtained over half a million reads for each species that were assembled into over 6,000 and 16,000 contigs respectively. We combined these data with previously published studies to look for signatures of molecular evolution. We found a small subset of genes with horn-biased expression showing evidence for recent positive selection, as is expected with sexual selection on horn size. We also found evidence of relaxed selection present in genes that demonstrated biased expression between horned and horn-less morphs, consistent with the theory of developmental decoupling of phenotypically plastic traits. PMID:24586317
Zhang, Huakun; Zhu, Bo; Qi, Bao; Gou, Xiaowan; Dong, Yuzhu; Xu, Chunming; Zhang, Bangjiao; Huang, Wei; Liu, Chang; Wang, Xutong; Yang, Chunwu; Zhou, Hao; Kashkush, Khalil; Feldman, Moshe; Wendel, Jonathan F.; Liu, Bao
2014-01-01
Subgenome integrity in bread wheat (Triticum aestivum; BBAADD) makes possible the extraction of its BBAA component to restitute a novel plant type. The availability of such a ploidy-reversed wheat (extracted tetraploid wheat [ETW]) provides a unique opportunity to address whether and to what extent the BBAA component of bread wheat has been modified in phenotype, karyotype, and gene expression during its evolutionary history at the allohexaploid level. We report here that ETW was anomalous in multiple phenotypic traits but maintained a stable karyotype. Microarray-based transcriptome profiling identified a large number of differentially expressed genes between ETW and natural tetraploid wheat (Triticum turgidum), and the ETW-downregulated genes were enriched for distinct Gene Ontology categories. Quantitative RT-PCR analysis showed that gene expression differences between ETW and a set of diverse durum wheat (T. turgidum subsp durum) cultivars were distinct from those characterizing tetraploid cultivars per se. Pyrosequencing revealed that the expression alterations may occur to either only one or both of the B and A homoeolog transcripts in ETW. A majority of the genes showed additive expression in a resynthesized allohexaploid wheat. Analysis of a synthetic allohexaploid wheat and diverse bread wheat cultivars revealed the rapid occurrence of expression changes to the BBAA subgenomes subsequent to allohexaploidization and their evolutionary persistence. PMID:24989045
Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data.
Rask, Thomas S; Petersen, Bent; Chen, Donald S; Day, Karen P; Pedersen, Anders Gorm
2016-04-22
Amplicon pyrosequencing targets a known genetic region and thus inherently produces reads highly anticipated to have certain features, such as conserved nucleotide sequence, and in the case of protein coding DNA, an open reading frame. Pyrosequencing errors, consisting mainly of nucleotide insertions and deletions, are on the other hand likely to disrupt open reading frames. Such an inverse relationship between errors and expectation based on prior knowledge can be used advantageously to guide the process known as basecalling, i.e. the inference of nucleotide sequence from raw sequencing data. The new basecalling method described here, named Multipass, implements a probabilistic framework for working with the raw flowgrams obtained by pyrosequencing. For each sequence variant Multipass calculates the likelihood and nucleotide sequence of several most likely sequences given the flowgram data. This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria virulence gene family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon sequencing data, particularly where extensive prior knowledge is available about the obtained sequences, for example in analysis of the immunoglobulin VDJ region where Multipass can be combined with a model for the known recombining germline genes. Multipass is available for Roche 454 data at http://www.cbs.dtu.dk/services/MultiPass-1.0 , and the concept can potentially be implemented for other sequencing technologies as well.
Chen, Ping; Zhang, Limin; Guo, Xiaoxuan; Dai, Xin; Liu, Li; Xi, Lijun; Wang, Jian; Song, Lei; Wang, Yuezhu; Zhu, Yaxin; Huang, Li; Huang, Ying
2016-01-01
The phylum Actinobacteria has been reported to be common or even abundant in deep marine sediments; however, knowledge about the diversity, distribution, and function of actinobacteria is limited. In this study, actinobacterial diversity in the deep sea along the Southwest Indian Ridge (SWIR) was investigated using both 16S rRNA gene pyrosequencing and culture-based methods. The samples were collected at depths of 1662–4000 m below the water surface. Actinobacterial sequences represented 1.2–9.1% of all microbial 16S rRNA gene amplicon sequences in each sample. A total of 5 actinobacterial classes, 17 orders, 28 families, and 52 genera were detected by pyrosequencing, dominated by the classes Acidimicrobiia and Actinobacteria. Differences in actinobacterial community compositions were found among the samples. The community structure showed significant correlations to geochemical factors, notably pH, calcium, total organic carbon, total phosphorus, and total nitrogen, rather than to spatial distance at the scale of the investigation. In addition, 176 strains of the Actinobacteria class, belonging to 9 known orders, 18 families, and 29 genera, were isolated. Among these cultivated taxa, 8 orders, 13 families, and 15 genera were also recovered by pyrosequencing. At a 97% 16S rRNA gene sequence similarity, the pyrosequencing data encompassed 77.3% of the isolates, but the isolates represented only 10.3% of the actinobacterial reads. Phylogenetic analysis of all the representative actinobacterial sequences and isolates indicated that at least four new orders within the phylum Actinobacteria were detected by pyrosequencing. More than half of the isolates, spanning 23 genera and all samples, demonstrated activity in the degradation of refractory organics, including polycyclic aromatic hydrocarbons and polysaccharides, suggesting their potential ecological functions and biotechnological applications for carbon recycling. PMID:27621725
Stamatakis, Alexandros; Ott, Michael
2008-12-27
The continuous accumulation of sequence data, for example, due to novel wet-laboratory techniques such as pyrosequencing, coupled with the increasing popularity of multi-gene phylogenies and emerging multi-core processor architectures that face problems of cache congestion, poses new challenges with respect to the efficient computation of the phylogenetic maximum-likelihood (ML) function. Here, we propose two approaches that can significantly speed up likelihood computations, which typically represent over 95 per cent of the computational effort conducted by current ML or Bayesian inference programs. First, we present a method and an appropriate data structure to efficiently compute the likelihood score on 'gappy' multi-gene alignments. By 'gappy' we denote sampling-induced gaps owing to missing sequences in individual genes (partitions), i.e. not real alignment gaps. A first proof-of-concept implementation in RAxML indicates that this approach can accelerate inferences on large and gappy alignments by approximately one order of magnitude. Moreover, we present insights and initial performance results on multi-core architectures obtained during the transition from an OpenMP-based to a Pthreads-based fine-grained parallelization of the ML function.
Lee, Soo Eon; Nam, Ok Hyung; Lee, Hyo-Seol; Choi, Sung Chul
2016-07-01
Objectives This study was designed to identify the oral microbiota in healthy Korean pre-school children using pyrosequencing. Materials and methods Dental plaque samples were obtained from 10 caries-free pre-school children. The samples were analysed using pyrosequencing. Results The pyrosequencing analysis revealed that, at the phylum level, Proteobacteria, Firmicutes, Bacteroidetes, Actinobacteria and Fusobacteria showed high abundance. In addition, predominant genera such as Streptococcus, Neisseria, Capnocytophaga, Haemophilus and Veillonella were identified as the core microbiome. Conclusions The dental plaque microbiota of healthy Korean pre-school children showed both diversity and homogeneity.
Shared molecular neuropathology across major psychiatric disorders parallels polygenic overlap.
Gandal, Michael J; Haney, Jillian R; Parikshak, Neelroop N; Leppa, Virpi; Ramaswami, Gokul; Hartl, Chris; Schork, Andrew J; Appadurai, Vivek; Buil, Alfonso; Werge, Thomas M; Liu, Chunyu; White, Kevin P; Horvath, Steve; Geschwind, Daniel H
2018-02-09
The predisposition to neuropsychiatric disease involves a complex, polygenic, and pleiotropic genetic architecture. However, little is known about how genetic variants impart brain dysfunction or pathology. We used transcriptomic profiling as a quantitative readout of molecular brain-based phenotypes across five major psychiatric disorders (autism, schizophrenia, bipolar disorder, depression, and alcoholism) compared with matched controls. We identified patterns of shared and distinct gene-expression perturbations across these conditions. The degree of sharing of transcriptional dysregulation is related to polygenic (single-nucleotide polymorphism-based) overlap across disorders, suggesting a substantial causal genetic component. This comprehensive systems-level view of the neurobiological architecture of major neuropsychiatric illness demonstrates pathways of molecular convergence and specificity. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Konradi, Christine; Sillivan, Stephanie E.; Clay, Hayley B.
2011-01-01
Gene expression studies of bipolar disorder (BPD) have shown changes in transcriptome profiles in multiple brain regions. Here we summarize the most consistent findings in the scientific literature, and compare them to data from schizophrenia (SZ) and major depressive disorder (MDD). The transcriptome profiles of all three disorders overlap, making the existence of a BPD-specific profile unlikely. Three groups of functionally related genes are consistently expressed at altered levels in BPD, SZ and MDD. Genes involved in energy metabolism and mitochondrial function are downregulated, genes involved in immune response and inflammation are upregulated, and genes expressed in oligodendrocytes are downregulated. Experimental paradigms for multiple sclerosis demonstrate a tight link between energy metabolism, inflammation and demyelination. These studies also show variabilities in the extent of oligodendrocyte stress, which can vary from a downregulation of oligodendrocyte genes, such as observed in psychiatric disorders, to cell death and brain lesions seen in multiple sclerosis. We conclude that experimental models of multiple sclerosis could be of interest for the research of BPD, SZ and MDD. PMID:21310238
Munger, Steven C.; Raghupathy, Narayanan; Choi, Kwangbom; Simons, Allen K.; Gatti, Daniel M.; Hinerfeld, Douglas A.; Svenson, Karen L.; Keller, Mark P.; Attie, Alan D.; Hibbs, Matthew A.; Graber, Joel H.; Chesler, Elissa J.; Churchill, Gary A.
2014-01-01
Massively parallel RNA sequencing (RNA-seq) has yielded a wealth of new insights into transcriptional regulation. A first step in the analysis of RNA-seq data is the alignment of short sequence reads to a common reference genome or transcriptome. Genetic variants that distinguish individual genomes from the reference sequence can cause reads to be misaligned, resulting in biased estimates of transcript abundance. Fine-tuning of read alignment algorithms does not correct this problem. We have developed Seqnature software to construct individualized diploid genomes and transcriptomes for multiparent populations and have implemented a complete analysis pipeline that incorporates other existing software tools. We demonstrate in simulated and real data sets that alignment to individualized transcriptomes increases read mapping accuracy, improves estimation of transcript abundance, and enables the direct estimation of allele-specific expression. Moreover, when applied to expression QTL mapping we find that our individualized alignment strategy corrects false-positive linkage signals and unmasks hidden associations. We recommend the use of individualized diploid genomes over reference sequence alignment for all applications of high-throughput sequencing technology in genetically diverse populations. PMID:25236449
Validation of the VE1 Immunostain for the BRAF V600E Mutation in Melanoma
Pearlstein, Michelle V.; Zedek, Daniel C.; Ollila, David W.; Treece, Amanda; Gulley, Margaret L.; Groben, Pamela A.; Thomas, Nancy E.
2014-01-01
BACKGROUND BRAF mutation status, and therefore eligibility for BRAF inhibitors, is currently determined by sequencing methods. We assessed the validity of VE1, a monoclonal antibody against the BRAF V600E mutant protein, in the detection of mutant BRAF V600E melanomas as classified by DNA pyrosequencing. METHODS The cases were 76 metastatic melanoma patients with only one known primary melanoma who had had BRAF codon 600 pyrosequencing of either their primary (n=19), metastatic (n=57) melanoma, or both (n=17). All melanomas (n=93) were immunostained with the BRAF VE1 antibody using a red detection system. The staining intensity of these specimens was scored from 0 – 3+ by a dermatopathologist. Scores of 0 and 1+ were considered as negative staining while scores of 2+ and 3+ were considered positive. RESULTS The VE1 antibody demonstrated a sensitivity of 85% and a specificity of 100% as compared to DNA pyrosequencing results. There was 100% concordance between VE1 immunostaining of primary and metastatic melanomas from the same patient. V600K, V600Q, and V600R BRAF melanomas did not positively stain with VE1. CONCLUSIONS This hospital-based study finds high sensitivity and specificity for the BRAF VE1 immunostain in comparison to pyrosequencing in detection of BRAF V600E in melanomas. PMID:24917033
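The scoring rule and the two accuracy measures reported in this abstract can be sketched as follows. The threshold (scores 0 and 1+ negative, 2+ and 3+ positive) comes from the abstract; the function names and the toy data are hypothetical, and the numbers they produce are not the study's results.

```python
def ve1_positive(score):
    """Map a 0-3 staining intensity score to a positive/negative call."""
    return score >= 2  # 0 and 1+ are negative, 2+ and 3+ are positive


def sensitivity_specificity(scores, is_v600e):
    """Compare immunostain calls against pyrosequencing as the reference."""
    tp = fn = tn = fp = 0
    for score, mutant in zip(scores, is_v600e):
        call = ve1_positive(score)
        if mutant and call:
            tp += 1
        elif mutant and not call:
            fn += 1
        elif not mutant and call:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Invented example: eight specimens, staining scores and reference status.
toy_scores = [3, 2, 1, 0, 0, 1, 2, 3]
toy_v600e = [True, True, True, False, False, False, True, True]
sens, spec = sensitivity_specificity(toy_scores, toy_v600e)
```

In this toy set one mutant specimen stains only 1+ and is therefore missed, which lowers sensitivity while leaving specificity untouched; a weakly staining mutant is the kind of case that reduces sensitivity without affecting specificity.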
Banelli, Barbara; Brigati, Claudio; Di Vinci, Angela; Casciano, Ida; Forlani, Alessandra; Borzì, Luana; Allemanni, Giorgio; Romani, Massimo
2012-03-01
Epigenetic alterations are hallmarks of cancer and powerful biomarkers, whose clinical utilization is made difficult by the absence of standardization and of common methods of data interpretation. The coordinate methylation of many loci in cancer is defined as 'CpG island methylator phenotype' (CIMP) and identifies clinically distinct groups of patients. In neuroblastoma (NB), CIMP is defined by a methylation signature, which includes different loci, but its predictive power on outcome is entirely recapitulated by the PCDHB cluster only. We have developed a robust and cost-effective pyrosequencing-based assay that could facilitate the clinical application of CIMP in NB. This assay permits the unbiased simultaneous amplification and sequencing of 17 out of 19 genes of the PCDHB cluster for quantitative methylation analysis, taking into account all the sequence variations. As some of these variations were at CpG doublets, we bypassed the data interpretation conducted by the methylation analysis software to assign the corrected methylation value at these sites. The final result of the assay is the mean methylation level of 17 gene fragments in the protocadherin B (PCDHB) cluster. We have utilized this assay to compare the methylation levels of the PCDHB cluster between high-risk and very low-risk NB patients, confirming the predictive value of CIMP. Our results demonstrate that the pyrosequencing-based assay described here is a powerful instrument for the analysis of this gene cluster that may simplify data comparison between different laboratories and, in the future, could facilitate its clinical application. Furthermore, our results demonstrate that, in principle, pyrosequencing can be efficiently utilized for the methylation analysis of gene clusters with high internal homologies.
Kobayashi, Naomi; Bauer, Thomas W; Togawa, Daisuke; Lieberman, Isador H; Sakai, Hiroshige; Fujishiro, Takaaki; Tuohy, Marion J; Procop, Gary W
2005-06-01
The bacteria associated with orthopaedic infections are usually common gram-positive and gram-negative bacteria. This fundamental grouping of bacteria is a necessary first step in the selection of appropriate antibiotics. Since polymerase chain reaction (PCR) is more rapid and may be more sensitive than culture, we developed a postamplification pyrosequencing method to subcategorize bacteria based on a few nucleotide polymorphisms in the 16S rRNA gene. We validated this method using well-characterized strains of bacteria and applied it to specimens from spinal surgery cases with suspected infections. Lysates of 114 bacteria including 75 species were created following standard cultivation to obtain DNA. The DNA was amplified by a broad-range real-time PCR. The amplicons were evaluated by pyrosequencing and were classified as gram-positive, gram-negative, or acid-fast bacilli based on the first three to five nucleotides sequenced. In addition, clinical cases of suspected infection were obtained from spinal surgery. The results of the "molecular Gram stain" were compared with the results of traditional Gram stain and culture. Of the 114 bacterial lysates tested, 107 (93.9%) were appropriately categorized as gram-positive, gram-negative, or acid-fast bacilli on the basis of this assay. The sensitivity and specificity of this assay were 100% and 97.4% for gram-positive and 88.3% and 100% for gram-negative isolates. All five clinical samples were appropriately categorized as containing gram-positive or gram-negative bacteria with this assay. This study demonstrates that high sensitivity and specificity of a molecular Gram stain may be achieved using broad-range real-time PCR and pyrosequencing.
Jung, Mi-Ja; Nam, Young-Do; Roh, Seong Woon; Bae, Jin-Woo
2012-05-01
Makgeolli is a traditional Korean alcoholic beverage manufactured with a natural starter, called nuruk, and grains. Nuruk is a starchy disk or tablet formed from wheat or grist containing various fungal and bacterial strains from the surrounding environment that are allowed to incorporate naturally into the starter, each of which simultaneously participates in the makgeolli fermentation process. In the current study, changes in microbial dynamics during laboratory-scale fermentation of makgeolli inoculated with six different kinds of nuruk were evaluated by barcoded pyrosequencing using fungal- and bacterial-specific primers targeting the internal transcribed spacer 2 region and hypervariable regions V1 to V3 of the 16S rRNA gene, respectively. A total of 61,571 fungal and 68,513 bacterial sequences were used for the analysis of microbial diversity in ferment samples. During fermentation, the proportion of fungal microorganisms belonging to the family Saccharomycetaceae increased significantly, and the major bacterial phylum of the samples shifted from γ-Proteobacteria to Firmicutes. The results of quantitative PCR indicated that the bacterial content in the final ferments was higher than in commercial rice beers, while total fungi appeared similar. This is the first report of a comparative analysis of bacterial and fungal dynamics in parallel during the fermentation of Korean traditional alcoholic beverage using barcoded pyrosequencing. Copyright © 2011 Elsevier Ltd. All rights reserved.
Understanding microbial ecology can help improve biogas production in AD.
Ferguson, Robert M W; Coulon, Frédéric; Villa, Raffaella
2018-06-16
454-Pyrosequencing and lipid fingerprinting were used to link anaerobic digestion (AD) process parameters (pH, alkalinity, volatile fatty acids (VFAs), biogas production and methane content) with the reactor microbial community structure and composition. AD microbial communities underwent stress conditions after changes in organic loading rate and digestion substrates. 454-Pyrosequencing analysis showed that, irrespective of the substrate digested, methane content and pH were always significantly and positively correlated with community evenness. In AD, microbial communities with more even distributions of diversity are able to use parallel metabolic pathways and have greater functional stability; hence, they are capable of adapting and responding to disturbances. In all reactors, a decrease in methane content to <30% was always correlated with a 50% increase of Firmicutes sequences (particularly in operational taxonomic units (OTUs) related to Ruminococcaceae and Veillonellaceae). In contrast, digesters producing higher methane content (above 60%) contained a high number of sequences related to Synergistetes and unidentified bacterial OTUs. Finally, lipid fingerprinting demonstrated that, under stress, the decrease in archaeal biomass was greater than the decrease in bacterial biomass, and that archaeal phospholipid etherlipid (PLEL) levels were correlated with reactor performance. These results demonstrate that, across a number of parameters such as lipids, alpha and beta diversity, and OTUs, knowledge of the microbial community structure can be used to predict, monitor, or optimise AD performance. Copyright © 2018 Elsevier B.V. All rights reserved.
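The link between methane content and community evenness reported above can be made concrete with a small calculation. The abstract does not say which evenness index was used, so the sketch below assumes Pielou's J' (Shannon diversity divided by its maximum for the observed number of taxa). The OTU counts are hypothetical and are not data from the study.

```python
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over taxa with nonzero counts."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must contain at least one read")
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def pielou_evenness(counts):
    """Pielou's J' = H' / ln(S), where S is the number of observed taxa."""
    observed = sum(1 for c in counts if c > 0)
    if observed < 2:
        # Evenness is undefined for a single taxon; 0.0 is a convention chosen here.
        return 0.0
    return shannon_index(counts) / math.log(observed)


# Hypothetical OTU read counts for two reactors (illustrative only).
even_reactor = [250, 250, 250, 250]
skewed_reactor = [970, 10, 10, 10]

print("even reactor J':", round(pielou_evenness(even_reactor), 3))
print("skewed reactor J':", round(pielou_evenness(skewed_reactor), 3))
```

A community in which reads are spread equally across OTUs scores close to 1, while one dominated by a single OTU scores close to 0. This is the sense in which a reactor overtaken by one group would show reduced evenness.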
Mikkelsen, Martin; Frank-Hansen, Rune; Hansen, Anders J; Morling, Niels
2014-09-01
Results of sequencing of whole mitochondrial genome, HV1 and HV2 DNA with the second generation system (SGS) Roche 454 GS Junior were compared with results of Sanger sequencing and SNP typing with SNaPshot single base extension detected with MALDI-TOF and capillary electrophoresis. We investigated the performance of the software analysis of the data, reproducibility, ability to sequence homopolymeric regions, detection of mixtures and heteroplasmy as well as the implications of the depth of coverage. We found full reproducibility between samples sequenced twice with SGS. We found close to full concordance between the mtDNA sequences of 26 samples obtained with (1) the 454 SGS method using a depth of coverage above 100 and (2) Sanger sequencing and SNP typing. The discrepancies were primarily observed in homopolymeric regions. The 454 SGS method was able to sequence 95% of the reads correctly in homopolymers up to 4 bases, and up to 6 bases could be sequenced with similar success if the results were carefully, visually inspected. The 454 technology was able to detect mixtures or heteroplasmy of approximately 10%. We detected previously unreported heteroplasmy in the GM9947A component of the NIST human mitochondrial DNA SRM-2392 standard reference material. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Expression Profiling Smackdown: Human Transcriptome Array HTA 2.0 vs. RNA-Seq
Palermo, Meghann; Driscoll, Heather; Tighe, Scott; Dragon, Julie; Bond, Jeff; Shukla, Arti; Vangala, Mahesh; Vincent, James; Hunter, Tim
2014-01-01
The advent of both microarray and massively parallel sequencing have revolutionized high-throughput analysis of the human transcriptome. Due to limitations in microarray technology, detecting and quantifying coding transcript isoforms, in addition to non-coding transcripts, has been challenging. As a result, RNA-Seq has been the preferred method for characterizing the full human transcriptome, until now. A new high-resolution array from Affymetrix, GeneChip Human Transcriptome Array 2.0 (HTA 2.0), has been designed to interrogate all transcript isoforms in the human transcriptome with >6 million probes targeting coding transcripts, exon-exon splice junctions, and non-coding transcripts. Here we compare expression results from GeneChip HTA 2.0 and RNA-Seq data using identical RNA extractions from three samples each of healthy human mesothelial cells in culture, LP9-C1, and healthy mesothelial cells treated with asbestos, LP9-A1. For GeneChip HTA 2.0 sample preparation, we chose to compare two target preparation methods, NuGEN Ovation Pico WTA V2 with the Encore Biotin Module versus Affymetrix's GeneChip WT PLUS with the WT Terminal Labeling Kit, on identical RNA extractions from both untreated and treated samples. These same RNA extractions were used for the RNA-Seq library preparation. All analyses were performed in Partek Genomics Suite 6.6. Expression profiles for control and asbestos-treated mesothelial cells prepared with NuGEN versus Affymetrix target preparation methods (GeneChip HTA 2.0) are compared to each other as well as to RNA-Seq results.
Nature and nurture: environmental influences on a genetic rat model of depression
Mehta-Raghavan, N S; Wert, S L; Morley, C; Graf, E N; Redei, E E
2016-01-01
In this study, we sought to learn whether adverse events such as chronic restraint stress (CRS), or ‘nurture' in the form of environmental enrichment (EE), could modify depression-like behavior and blood biomarker transcript levels in a genetic rat model of depression. The Wistar Kyoto More Immobile (WMI) is a genetic model of depression that aided in the identification of blood transcriptomic markers, which successfully distinguished adolescent and adult subjects with major depressive disorders from their matched no-disorder controls. Here, we followed the effects of CRS and EE in adult male WMIs and their genetically similar control strain, the Wistar Kyoto Less Immobile (WLI), that does not show depression-like behavior, by measuring the levels of these transcripts in the blood and hippocampus. In WLIs, increased depression-like behavior and transcriptomic changes were present in response to CRS, but in WMIs no behavioral or additive transcriptomic changes occurred. Environmental enrichment decreased both the inherent depression-like behavior in the WMIs and the behavioral difference between WMIs and WLIs, but did not reverse basal transcript level differences between the strains. The inverse behavioral change induced by CRS and EE in the WLIs did not result in parallel inverse expression changes of the transcriptomic markers, suggesting that these behavioral responses to the environment work via separate molecular pathways. In contrast, ‘trait' transcriptomic markers with expression differences inherent and unchanging between the strains regardless of the environment suggest that in our model, environmental and genetic etiologies of depression work through independent molecular mechanisms. PMID:27023176
USDA-ARS's Scientific Manuscript database
PacBio long-read sequencing technology is increasingly popular in genome sequence assembly and transcriptome cataloguing. Recently, a new-generation pig reference genome was assembled based on long reads from this technology. To finely annotate this genome assembly, transcriptomes of nine tissues fr...
Plouhinec, Jean-Louis; Medina-Ruiz, Sofía; Borday, Caroline; Bernard, Elsa; Vert, Jean-Philippe; Eisen, Michael B; Harland, Richard M; Monsoro-Burq, Anne H
2017-10-01
During vertebrate neurulation, the embryonic ectoderm is patterned into lineage progenitors for neural plate, neural crest, placodes and epidermis. Here, we use Xenopus laevis embryos to analyze the spatial and temporal transcriptome of distinct ectodermal domains in the course of neurulation, during the establishment of cell lineages. In order to define the transcriptome of small groups of cells from a single germ layer and to retain spatial information, dorsal and ventral ectoderm was subdivided along the anterior-posterior and medial-lateral axes by microdissections. Principal component analysis on the transcriptomes of these ectoderm fragments primarily identifies embryonic axes and temporal dynamics. This provides a genetic code to define positional information of any ectoderm sample along the anterior-posterior and dorsal-ventral axes directly from its transcriptome. In parallel, we use nonnegative matrix factorization to predict enhanced gene expression maps onto early and mid-neurula embryos, and specific signatures for each ectoderm area. The clustering of spatial and temporal datasets allowed detection of multiple biologically relevant groups (e.g., Wnt signaling, neural crest development, sensory placode specification, ciliogenesis, germ layer specification). We provide an interactive network interface, EctoMap, for exploring synexpression relationships among genes expressed in the neurula, and suggest several strategies to use this comprehensive dataset to address questions in developmental biology as well as stem cell or cancer research.
Zhu, Li-Ping; Yue, Xin-Jing; Han, Kui; Li, Zhi-Feng; Zheng, Lian-Shuai; Yi, Xiu-Nan; Wang, Hai-Long; Zhang, You-Ming; Li, Yue-Zhong
2015-07-22
Exotic genes, especially clustered multiple-genes for a complex pathway, are normally integrated into chromosome for heterologous expression. The influences of insertion sites on heterologous expression and allotropic expressions of exotic genes on host remain mostly unclear. We compared the integration and expression efficiencies of single and multiple exotic genes that were inserted into Myxococcus xanthus genome by transposition and attB-site-directed recombination. While the site-directed integration had a rather stable chloramphenicol acetyl transferase (CAT) activity, the transposition produced varied CAT enzyme activities. We attempted to integrate the 56-kb gene cluster for the biosynthesis of antitumor polyketides epothilones into M. xanthus genome by site-direction but failed, which was determined to be due to the insertion size limitation at the attB site. The transposition technique produced many recombinants with varied production capabilities of epothilones, which, however, were not paralleled to the transcriptional characteristics of the local sites where the genes were integrated. Comparative transcriptomics analysis demonstrated that the allopatric integrations caused selective changes of host transcriptomes, leading to varied expressions of epothilone genes in different mutants. With the increase of insertion fragment size, transposition is a more practicable integration method for the expression of exotic genes. Allopatric integrations selectively change host transcriptomes, which lead to varied expression efficiencies of exotic genes.
Borday, Caroline; Bernard, Elsa; Vert, Jean-Philippe; Eisen, Michael B.; Harland, Richard M.
2017-01-01
During vertebrate neurulation, the embryonic ectoderm is patterned into lineage progenitors for neural plate, neural crest, placodes and epidermis. Here, we use Xenopus laevis embryos to analyze the spatial and temporal transcriptome of distinct ectodermal domains in the course of neurulation, during the establishment of cell lineages. In order to define the transcriptome of small groups of cells from a single germ layer and to retain spatial information, dorsal and ventral ectoderm was subdivided along the anterior-posterior and medial-lateral axes by microdissections. Principal component analysis on the transcriptomes of these ectoderm fragments primarily identifies embryonic axes and temporal dynamics. This provides a genetic code to define positional information of any ectoderm sample along the anterior-posterior and dorsal-ventral axes directly from its transcriptome. In parallel, we use nonnegative matrix factorization to predict enhanced gene expression maps onto early and mid-neurula embryos, and specific signatures for each ectoderm area. The clustering of spatial and temporal datasets allowed detection of multiple biologically relevant groups (e.g., Wnt signaling, neural crest development, sensory placode specification, ciliogenesis, germ layer specification). We provide an interactive network interface, EctoMap, for exploring synexpression relationships among genes expressed in the neurula, and suggest several strategies to use this comprehensive dataset to address questions in developmental biology as well as stem cell or cancer research. PMID:29049289
Shang, Ying; Xu, Wentao; Wang, Yong; Xu, Yuancong; Huang, Kunlun
2017-12-15
This study described a novel multiplex qualitative detection method using pyrosequencing. Based on the principle of universal primer-multiplex PCR, only one sequencing primer was employed to detect multiple targets. Samples containing three genetically modified (GM) crops in different proportions were used to validate the method. The dNTP dispensing order was designed based on the product sequences. Only 12 rounds (ATCTGATCGACT) of dNTP addition and, under ideal conditions, often as few as three rounds (CAT) were required to detect the GM events qualitatively, with a sensitivity as low as 1% of a mixture. For mixtures, calculating signal values additionally allowed the proportion of each GM event to be estimated. Based on these results, we concluded that our novel method not only achieved qualitative detection but also allowed semi-quantitative detection of individual events. Copyright © 2017. Published by Elsevier Ltd.
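How a fixed dispensation order can separate several targets read from one sequencing primer can be sketched with an idealised, noise-free pyrogram model. The dispensation order is the one quoted in the abstract; the three target sequences, their names, and the mixture proportions are hypothetical placeholders, not the GM event sequences used in the study.

```python
def simulate_pyrogram(read_sequence, dispensation_order):
    """Return the number of bases incorporated at each dispensation.

    read_sequence is the strand being synthesised (5'->3'), so a dispensed
    nucleotide is incorporated whenever it equals the next expected base.
    A homopolymer run is incorporated in a single dispensation.
    """
    position = 0
    signals = []
    for nucleotide in dispensation_order:
        incorporated = 0
        while position < len(read_sequence) and read_sequence[position] == nucleotide:
            incorporated += 1
            position += 1
        signals.append(incorporated)
    return signals


def mixture_pyrogram(components, dispensation_order):
    """Proportion-weighted sum of single-template pyrograms (idealised model)."""
    totals = [0.0] * len(dispensation_order)
    for read_sequence, proportion in components:
        for i, signal in enumerate(simulate_pyrogram(read_sequence, dispensation_order)):
            totals[i] += proportion * signal
    return totals


ORDER = "ATCTGATCGACT"  # 12-round order quoted in the abstract

# Hypothetical target sequences downstream of the shared sequencing primer.
targets = {"event_1": "CATTG", "event_2": "ATGGA", "event_3": "TCAGC"}

for name, sequence in targets.items():
    print(name, simulate_pyrogram(sequence, ORDER))

# Hypothetical 80:20 mixture of two targets.
print(mixture_pyrogram([("CATTG", 0.8), ("ATGGA", 0.2)], ORDER))
```

In this model each target leaves a distinct pattern of peak heights, so presence or absence can be read from a few diagnostic dispensations. In a mixture the peaks scale with template proportion, which is the basis for a semi-quantitative estimate. Real pyrograms also carry noise and incomplete-extension effects that the sketch ignores.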
2011-01-01
Background Avocado (Persea americana) belongs to the Lauraceae family and is an important commercial fruit crop in over 50 countries. The most serious pathogen affecting avocado production is Phytophthora cinnamomi which causes Phytophthora root rot (PRR). Root pathogens such as P. cinnamomi and their interactions with hosts are poorly understood and despite the importance of both the avocado crop and the effect Phytophthora has on its cultivation, there is a lack of molecular knowledge underpinning our understanding of defence strategies against the pathogen. In order to initiate a better understanding of host-specific defence we have generated EST data using 454 pyrosequencing and profiled nine defence-related genes from Pc-infected avocado roots. Results 2.0 Mb of data was generated consisting of ~10,000 reads on a single lane of the GS FLX platform. Using the Newbler assembler 371 contigs were assembled, of which 367 are novel for Persea americana. Genes were classified according to Gene Ontology terms. In addition to identifying root-specific ESTs we were also able to identify and quantify the expression of nine defence-related genes that were differentially regulated in response to P. cinnamomi. Genes such as metallothionein, thaumatin and the pathogenesis related PsemI, mlo and profilin were found to be differentially regulated. Conclusions This is the first study in elucidating the avocado root transcriptome as well as identifying defence responses of avocado roots to the root pathogen P. cinnamomi. Our data is currently the only EST data that has been generated for avocado rootstocks, and the ESTs identified in this study have already been useful in identifying defence-related genes as well as providing gene information for other studies looking at processes such as ROS regulation as well as hypoxia in avocado roots. 
Our EST data will aid in the elucidation of the avocado transcriptome and identification of markers for improved rootstock breeding and screening. The characterization of the avocado transcriptome will furthermore form a basis for functional genomics of basal angiosperms. PMID:22108245
Mahomed, Waheed; Berg, Noëlani van den
2011-11-23
Avocado (Persea americana) belongs to the Lauraceae family and is an important commercial fruit crop in over 50 countries. The most serious pathogen affecting avocado production is Phytophthora cinnamomi which causes Phytophthora root rot (PRR). Root pathogens such as P. cinnamomi and their interactions with hosts are poorly understood and despite the importance of both the avocado crop and the effect Phytophthora has on its cultivation, there is a lack of molecular knowledge underpinning our understanding of defence strategies against the pathogen. In order to initiate a better understanding of host-specific defence we have generated EST data using 454 pyrosequencing and profiled nine defence-related genes from Pc-infected avocado roots. 2.0 Mb of data was generated consisting of ~10,000 reads on a single lane of the GS FLX platform. Using the Newbler assembler 371 contigs were assembled, of which 367 are novel for Persea americana. Genes were classified according to Gene Ontology terms. In addition to identifying root-specific ESTs we were also able to identify and quantify the expression of nine defence-related genes that were differentially regulated in response to P. cinnamomi. Genes such as metallothionein, thaumatin and the pathogenesis related PsemI, mlo and profilin were found to be differentially regulated. This is the first study in elucidating the avocado root transcriptome as well as identifying defence responses of avocado roots to the root pathogen P. cinnamomi. Our data is currently the only EST data that has been generated for avocado rootstocks, and the ESTs identified in this study have already been useful in identifying defence-related genes as well as providing gene information for other studies looking at processes such as ROS regulation as well as hypoxia in avocado roots. Our EST data will aid in the elucidation of the avocado transcriptome and identification of markers for improved rootstock breeding and screening. 
The characterization of the avocado transcriptome will furthermore form a basis for functional genomics of basal angiosperms.
Asp, Torben; Kristensen, Michael
2016-01-01
Background Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome with focus on P450 genes. Results The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad resistant strain. The 3,648 sequences were annotated with an enzyme code EC number and were mapped to 124 KEGG pathways with metabolic processes as most highly represented pathway. One hundred and twenty contigs were annotated as P450s covering 44 different P450 genes of housefly. Eight differentially expressed P450s genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolic related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis resulted in 12 SNPs and eight of them found in cyp6d1. There is variation in location, size and frequency of CpG islands and specific motifs were also identified in these P450s. Moreover, identified motifs were associated to GO terms and transcription factors using bioinformatic tools. Conclusion Transcriptome data of a spinosad resistant strain provide together with genome data fundamental support for future research to understand evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. 
Taken together our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s in xenobiotic detoxification. PMID:27019205
The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.).
Yang, Meng; Zhang, Xiaowei; Liu, Guiming; Yin, Yuxin; Chen, Kaifu; Yun, Quanzheng; Zhao, Duojun; Al-Mssallem, Ibrahim S; Yu, Jun
2010-09-15
Date palm (Phoenix dactylifera L.), a member of Arecaceae family, is one of the three major economically important woody palms--the two other palms being oil palm and coconut tree--and its fruit is a staple food among Middle East and North African nations, as well as many other tropical and subtropical regions. Here we report a complete sequence of the date palm chloroplast (cp) genome based on pyrosequencing. After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes--atpF, trnA-UGC, and rrn23. Unlike most monocots, date palm has a typical cp genome similar to that of tobacco--with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts.
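The region sizes quoted above can be checked against the reported genome length, reading the 27,276 bp figure as the length of each of the two inverted-repeat copies. The function name below is only illustrative.

```python
def quadripartite_length(lsc, ssc, inverted_repeat):
    """Total length of a circular chloroplast genome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * inverted_repeat


# Region lengths in bp, as given in the abstract.
total = quadripartite_length(lsc=86_198, ssc=17_712, inverted_repeat=27_276)
print(total, total == 158_462)  # prints: 158462 True
```

The sum reproduces the stated 158,462 bp, so the four figures in the abstract are mutually consistent under that reading.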
PIVOT: platform for interactive analysis and visualization of transcriptomics data.
Zhu, Qin; Fisher, Stephen A; Dueck, Hannah; Middleton, Sarah; Khaladkar, Mugdha; Kim, Junhyong
2018-01-05
Many R packages have been developed for transcriptome analysis but their use often requires familiarity with R and integrating results of different packages requires scripts to wrangle the datatypes. Furthermore, exploratory data analyses often generate multiple derived datasets such as data subsets or data transformations, which can be difficult to track. Here we present PIVOT, an R-based platform that wraps open source transcriptome analysis packages with a uniform user interface and graphical data management that allows non-programmers to interactively explore transcriptomics data. PIVOT supports more than 40 popular open source packages for transcriptome analysis and provides an extensive set of tools for statistical data manipulations. A graph-based visual interface is used to represent the links between derived datasets, allowing easy tracking of data versions. PIVOT further supports automatic report generation, publication-quality plots, and program/data state saving, such that all analysis can be saved, shared and reproduced. PIVOT will allow researchers with broad background to easily access sophisticated transcriptome analysis tools and interactively explore transcriptome datasets.
Bravo, Lulette Tricia C.; Tuohy, Marion J.; Ang, Concepcion; Destura, Raul V.; Mendoza, Myrna; Procop, Gary W.; Gordon, Steven M.; Hall, Geraldine S.; Shrestha, Nabin K.
2009-01-01
After isoniazid and rifampin (rifampicin), the next pivotal drug class in Mycobacterium tuberculosis treatment is the fluoroquinolone class. Mutations in resistance-determining regions (RDR) of the rpoB, katG, and gyrA genes occur with frequencies of 97%, 50%, and 85% among M. tuberculosis isolates resistant to rifampin, isoniazid, and fluoroquinolones, respectively. Sequences are highly conserved, and certain mutations correlate well with phenotypic resistance. We developed a pyrosequencing assay to determine M. tuberculosis genotypic resistance to rifampin, isoniazid, and fluoroquinolones. We characterized 102 M. tuberculosis clinical isolates from the Philippines for susceptibility to rifampin, isoniazid, and ofloxacin by using the conventional submerged-disk proportion method and validated our pyrosequencing assay using these isolates. DNA was extracted and amplified by using PCR primers directed toward the RDR of the rpoB, katG, and gyrA genes, and pyrosequencing was performed on the extracts. The M. tuberculosis H37Rv strain (ATCC 25618) was used as the reference strain. The sensitivities and specificities of pyrosequencing were 96.7% and 97.3%, 63.8% and 100%, and 70.0% and 100% for the detection of resistance to rifampin, isoniazid, and ofloxacin, respectively. Pyrosequencing is thus a rapid and accurate method for detecting M. tuberculosis resistance to these three drugs. PMID:19846642
An insight into the transcriptome of the digestive tract of the bloodsucking bug, Rhodnius prolixus.
Ribeiro, José M C; Genta, Fernando A; Sorgine, Marcos H F; Logullo, Raquel; Mesquita, Rafael D; Paiva-Silva, Gabriela O; Majerowicz, David; Medeiros, Marcelo; Koerich, Leonardo; Terra, Walter R; Ferreira, Clélia; Pimentel, André C; Bisch, Paulo M; Leite, Daniel C; Diniz, Michelle M P; da S G V Junior, João Lídio; Da Silva, Manuela L; Araujo, Ricardo N; Gandara, Ana Caroline P; Brosson, Sébastien; Salmon, Didier; Bousbata, Sabrina; González-Caballero, Natalia; Silber, Ariel Mariano; Alves-Bezerra, Michele; Gondim, Katia C; Silva-Neto, Mário Alberto C; Atella, Georgia C; Araujo, Helena; Dias, Felipe A; Polycarpo, Carla; Vionette-Amaral, Raquel J; Fampa, Patrícia; Melo, Ana Claudia A; Tanaka, Aparecida S; Balczun, Carsten; Oliveira, José Henrique M; Gonçalves, Renata L S; Lazoski, Cristiano; Rivera-Pomar, Rolando; Diambra, Luis; Schaub, Günter A; Garcia, Elói S; Azambuja, Patrícia; Braz, Glória R C; Oliveira, Pedro L
2014-01-01
The bloodsucking hemipteran Rhodnius prolixus is a vector of Chagas' disease, which affects 7-8 million people today in Latin America. In contrast to other hematophagous insects, the triatomine gut is compartmentalized into three segments that perform different functions during blood digestion. Here we report analysis of transcriptomes for each of the segments using pyrosequencing technology. Comparison of transcript frequency in digestive libraries with a whole-body library was used to evaluate expression levels. All classes of digestive enzymes were highly expressed, with a predominance of cysteine and aspartic proteinases, the latter showing a significant expansion through gene duplication. Although no protein digestion is known to occur in the anterior midgut (AM), protease transcripts were found, suggesting secretion as pro-enzymes, being possibly activated in the posterior midgut (PM). As expected, genes related to cytoskeleton, protein synthesis apparatus, protein traffic, and secretion were abundantly transcribed. Despite the absence of a chitinous peritrophic membrane in hemipterans - which have instead a lipidic perimicrovillar membrane lining over midgut epithelia - several gut-specific peritrophin transcripts were found, suggesting that these proteins perform functions other than being a structural component of the peritrophic membrane. Among immunity-related transcripts, while lysozymes and lectins were the most highly expressed, several genes belonging to the Toll pathway - found at low levels in the gut of most insects - were identified, contrasting with a low abundance of transcripts from IMD and STAT pathways. Analysis of transcripts related to lipid metabolism indicates that lipids play multiple roles, being a major energy source, a substrate for perimicrovillar membrane formation, and a source for hydrocarbons possibly to produce the wax layer of the hindgut. 
Transcripts related to amino acid metabolism showed an unanticipated priority for degradation of tyrosine, phenylalanine, and tryptophan. Analysis of transcripts related to signaling pathways suggested a role for MAP kinases, GTPases, and LKBP1/AMP kinases related to control of cell shape and polarity, possibly in connection with regulation of cell survival, response of pathogens and nutrients. Together, our findings present a new view of the triatomine digestive apparatus and will help us understand trypanosome interaction and allow insights into hemipteran metabolic adaptations to a blood-based diet.
Kim, Suk Kyeong; Kim, Dong-Lim; Han, Hye Seung; Kim, Wan Seop; Kim, Seung Ja; Moon, Won Jin; Oh, Seo Young; Hwang, Tae Sook
2008-06-01
Fine-needle aspiration biopsy (FNAB) is the primary means of distinguishing benign from malignant thyroid nodules and of guiding therapeutic intervention. However, 10% to 30% of cases with indeterminate cytology in FNAB need other diagnostic tools to refine diagnosis. We compared the pyrosequencing method with conventional direct DNA sequencing analysis and investigated the usefulness of preoperative BRAF mutation analysis as an adjunct diagnostic tool with routine FNAB. FNA slides from a total of 103 patients with surgically confirmed diagnoses were collected, and DNA was extracted after atypical cells were scraped from the slides. BRAF mutation was analyzed by pyrosequencing and direct DNA sequencing. Sixty-three (77.8%) of 81 histopathologically diagnosed malignant nodules revealed positive BRAF mutation on pyrosequencing analysis. In detail, 63 (84.0%) of 75 papillary thyroid carcinoma (PTC) samples showed positive BRAF mutation, whereas 3 follicular thyroid carcinomas, 1 anaplastic carcinoma, 1 medullary thyroid carcinoma, and 1 metastatic lung carcinoma did not show BRAF mutation. None of the 22 benign nodules had a BRAF mutation by either pyrosequencing or direct DNA sequencing. Out of 27 thyroid nodules classified as 'indeterminate' on cytologic examination preoperatively, 21 (77.8%) cases turned out to be malignant: 18 PTCs (including 2 follicular variant types) and 3 follicular thyroid carcinomas. Among these, 13 (61.9%) classic PTCs had BRAF mutation. None of the 6 benign nodules, including 3 follicular adenomas and 3 nodular hyperplasias, had BRAF mutation. Among 63 PTCs with positive BRAF mutation detected by pyrosequencing analysis, 3 cases did not show BRAF mutation by direct DNA sequencing. Although the difference was not statistically significant, pyrosequencing detected more BRAF mutations in thyroid nodules than direct DNA sequencing (P=0.25). Detecting BRAF mutation by pyrosequencing is more sensitive, faster, and less expensive than direct DNA sequencing and is proposed as an adjunct diagnostic tool in evaluating thyroid nodules of indeterminate cytology.
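The method comparison in this abstract rests on 3 cases that were positive by pyrosequencing but negative by direct DNA sequencing. The abstract does not name the statistical test used. As a sketch only, assuming a two-sided exact McNemar test on the discordant pairs and assuming no case was positive by direct sequencing alone, the arithmetic below gives a value of 0.25. The function name is illustrative and not taken from the study.

```python
from math import comb


def exact_mcnemar_p(b: int, c: int) -> float:
    """Two-sided exact McNemar p-value from the two discordant pair counts."""
    n = b + c
    if n == 0:
        return 1.0
    k = min(b, c)
    # Binomial(n, 0.5) probability of k or fewer pairs in the smaller cell.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# 3 cases positive by pyrosequencing only; 0 positive by direct sequencing only (assumed).
p_value = exact_mcnemar_p(3, 0)
print(p_value)
```

With so few discordant pairs the test has little power, which is consistent with the abstract reporting a difference in detection that does not reach statistical significance.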
Smith, Philip J.; Levine, Adam P.; Dunne, Jenny; Guilhamon, Paul; Turmaine, Mark; Sewell, Gavin W.; O'Shea, Nuala R.; Vega, Roser; Paterson, Jennifer C.; Oukrif, Dahmane; Beck, Stephan; Bloom, Stuart L.; Novelli, Marco; Rodriguez-Justo, Manuel; Smith, Andrew M.
2014-01-01
Background: Mucosal abnormalities are potentially important in the primary pathogenesis of ulcerative colitis (UC). We investigated the mucosal transcriptomic expression profiles of biopsies from patients with UC and healthy controls, taken from macroscopically noninflamed tissue from the terminal ileum and 3 colonic locations, with the objective of identifying abnormal molecules that might be involved in disease development. Methods: Whole-genome transcriptional analysis was performed on intestinal biopsies taken from 24 patients with UC, 26 healthy controls, and 14 patients with Crohn's disease. Differential gene expression analysis was performed at each tissue location separately, and results were then meta-analyzed. Significantly differentially expressed genes were validated using quantitative polymerase chain reaction. The location of gene expression within the colon was determined using immunohistochemistry, subcellular fractionation, and electron and confocal microscopy. DNA methylation was quantified by pyrosequencing. Results: Only 4 probes were abnormally expressed throughout the colon in patients with UC, with Bone morphogenetic protein/Retinoic acid Inducible Neural-specific 3 (BRINP3) being the most significantly underexpressed. Attenuated expression of BRINP3 in UC was independent of current inflammation, unrelated to phenotype or treatment, and remained low at rebiopsy an average of 22 months later. BRINP3 is localized to the brush border of the colonic epithelium and expression is influenced by DNA methylation within its promoter. Conclusions: Genome-wide expression analysis of noninflamed mucosal biopsies from patients with UC identified BRINP3 as significantly underexpressed throughout the colon in a large subset of patients with UC. Low levels of this gene could predispose or contribute to the maintenance of the characteristic mucosal inflammation seen in this condition. PMID:25171508
Mittapalli, Omprakash; Bai, Xiaodong; Mamidala, Praveen; Rajarapu, Swapna Priya; Bonello, Pierluigi; Herms, Daniel A
2010-10-28
The insect midgut and fat body represent major tissue interfaces that deal with several important physiological functions including digestion, detoxification and immune response. The emerald ash borer (Agrilus planipennis) is an exotic invasive insect pest that has killed millions of ash trees (Fraxinus spp.), primarily in the Midwestern United States and Ontario, Canada. However, despite its high-impact status, little knowledge exists for A. planipennis at the molecular level. Newer-generation Roche-454 pyrosequencing was used to obtain 126,185 reads for the midgut and 240,848 reads for the fat body, which were assembled into 25,173 and 37,661 high quality expressed sequence tags (ESTs) for the midgut and the fat body of A. planipennis larvae, respectively. Among these ESTs, 36% of the midgut and 38% of the fat body sequences showed similarity to proteins in the GenBank nr database. A high number of the midgut sequences contained chitin-binding peritrophin (248) and trypsin (98) domains, while the fat body sequences showed high occurrence of cytochrome P450s (85) and protein kinase (123) domains. Further, the midgut transcriptome of A. planipennis revealed putative microbial transcripts encoding cell-wall degrading enzymes such as polygalacturonases and endoglucanases. A significant number of SNPs (137 in midgut and 347 in fat body) and microsatellite loci (317 in midgut and 571 in fat body) were predicted in the A. planipennis transcripts. An initial assessment of cytochrome P450s belonging to various CYP clades revealed distinct expression patterns at the tissue level. To our knowledge this study is one of the first to illuminate tissue-specific gene expression in an invasive insect of high ecological and economic consequence. These findings will lay the foundation for future gene expression and functional studies in A. planipennis.
Magazani, Edmond K.; Garin, Daniel; Muyembe, Jean-Jacques T.; Bentahir, Mostafa; Gala, Jean-Luc
2014-01-01
Background In outbreaks of rash illness in remote areas, clinically discriminating monkeypox (MPX) from severe forms of chickenpox and from smallpox remains a concern for first responders. Objective The goal of the study was therefore to use MPX and chickenpox outbreaks in the Democratic Republic of Congo (DRC) as a test case for establishing a rapid and specific diagnosis in affected remote areas. Methods In 2008 and 2009, successive outbreaks of presumed MPX skin rash were reported in the Bena Tshiadi, Yangala and Ndesha healthcare districts of the West Kasai province (DRC). Specimens consisting of vesicle fluid dried on filter papers or crusted scabs from healing patients were sampled by first responders. A field analytical facility was deployed nearby in order to carry out a real-time PCR (qPCR) assay using genus consensus primers, consensus orthopoxvirus (OPV) and smallpox-specific probes spanning the 14 kD fusion protein encoding gene. A PCR-restriction fragment length polymorphism assay was used on-site as a backup method to confirm the presence of monkeypox virus (MPXV) in samples. To complete the differential diagnosis of skin rash, chickenpox was tested in parallel using a commercial qPCR assay. In a post-deployment step, MPXV-specific pyrosequencing was carried out on all biotinylated amplicons generated on-site in order to confirm the on-site results. Results Whereas MPXV proved to be the agent causing the rash illness outbreak in the Bena Tshiadi district, VZV was the causative agent of the disease in the Yangala and Ndesha districts. In addition, each on-site result was later confirmed by MPXV-specific pyrosequencing analysis without any discrepancy. Conclusion This experience of rapid on-site dual-use DNA-based differential diagnosis of rash illnesses demonstrates the potential of combining tests specifically identifying bioterrorism agents and agents causing natural outbreaks. This opens the way to rapid on-site DNA-based identification of a broad spectrum of causative agents in remote areas. PMID:24841633
Kim, Jaeyeon; Kim, Nayoung; Jo, Hyun Jin; Park, Ji Hyun; Nam, Ryoung Hee; Seok, Yeong-Jae; Kim, Yeon-Ran; Kim, Joo Sung; Kim, Jung Mogg; Kim, Jung Min; Lee, Dong Ho; Jung, Hyun Chae
2015-10-01
Sequencing of 16S ribosomal RNA (rRNA) gene has improved the characterization of microbial communities. It enabled the detection of low abundance gastric Helicobacter pylori sequences even in subjects that were found to be H. pylori negative with conventional methods. The objective of this study was to obtain a cutoff value for H. pylori colonization in gastric mucosa samples by pyrosequencing method. Gastric mucosal biopsies were taken from 63 subjects whose H. pylori status was determined by a combination of serology, rapid urease test, culture, and histology. Microbial DNA from mucosal samples was amplified by PCR using universal bacterial primers. 16S rDNA amplicons were pyrosequenced. ROC curve analysis was performed to determine the cutoff value for H. pylori colonization by pyrosequencing. In addition, temporal changes in the stomach microbiota were observed in eight initially H. pylori-positive and eight H. pylori-negative subjects at a single time point 1-8 years later. Of the 63 subjects, the presence of H. pylori sequences was detected in all (28/28) conventionally H. pylori-positive samples and in 60% (21/35) of H. pylori-negative samples. The average percent of H. pylori reads in each sample was 0.67 ± 1.09% in the H. pylori-negative group. Cutoff value for clinically positive H. pylori status was approximately 1.22% based on ROC curve analysis (AUC = 0.957; p < .001). Helicobacter pylori was successfully eradicated in five of seven treated H. pylori-positive subjects (71.4%), and the percentage of H. pylori reads in these five subjects dropped from 1.3-95.18% to 0-0.16% after eradication. These results suggest that the cutoff value of H. pylori sequence percentage for H. pylori colonization by pyrosequencing could be set at approximately 1%. It might be helpful to analyze gastric microbiota related to H. pylori sequence status. © 2015 John Wiley & Sons Ltd.
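The abstract above derives a read-percentage cutoff from ROC curve analysis but does not state the criterion used to pick the point on the curve. The following is a minimal sketch, assuming the cutoff is chosen by maximising Youden's J (sensitivity + specificity - 1). The function name and the input values are made up for illustration and are not data from the study.

```python
def best_cutoff(values, labels):
    """Return (cutoff, sensitivity, specificity) maximising Youden's J.

    A sample is called positive when its value is >= cutoff.
    labels holds 1 for conventionally positive samples and 0 for negative ones.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    best = None
    for cutoff in sorted(set(values)):
        tp = sum(1 for v, y in zip(values, labels) if y and v >= cutoff)
        tn = sum(1 for v, y in zip(values, labels) if not y and v < cutoff)
        sens, spec = tp / positives, tn / negatives
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, cutoff, sens, spec)
    return best[1:]


# Made-up percentages of H. pylori reads per sample, NOT data from the study.
reads_pct = [0.0, 0.1, 0.4, 0.9, 1.1, 1.5, 3.0, 12.0, 40.0, 95.0]
status = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
cutoff, sens, spec = best_cutoff(reads_pct, status)
print(cutoff, sens, spec)
```

In real data the two groups overlap, as the abstract's 60% of conventionally negative samples with detectable reads suggests, so the chosen cutoff trades sensitivity against specificity rather than separating the groups perfectly.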
Castro-Carrera, T; Toral, P G; Frutos, P; McEwan, N R; Hervás, G; Abecia, L; Pinloche, E; Girdwood, S E; Belenguer, A
2014-03-01
Developing novel strategies to increase the content of bioactive unsaturated fatty acids (FA) in ruminant-derived products requires a deeper understanding of rumen biohydrogenation and the bacteria involved in this process. Although high-throughput pyrosequencing may allow for a great coverage of bacterial diversity, it has hardly been used to investigate the microbiology of ruminal FA metabolism. In this experiment, 454 pyrosequencing and a molecular fingerprinting technique (terminal restriction fragment length polymorphism; T-RFLP) were used concurrently to assess the effect of diet supplementation with marine algae (MA) on the rumen bacterial community of dairy sheep. Eleven lactating ewes were divided into 2 lots and offered a total mixed ration based on alfalfa hay and concentrate (40:60), supplemented with 0 (control) or 8 (MA) g of MA/kg of dry matter. After 54 d on treatments, animals were slaughtered and samples of rumen content and fluid were collected separately for microbial analysis. Pyrosequencing yielded a greater coverage of bacterial diversity than T-RFLP and allowed the identification of low-abundance populations. Nevertheless, both molecular approaches pointed to similar conclusions and showed that relevant changes due to MA addition were observed within the major ruminal phyla, namely Bacteroidetes, Firmicutes, and Proteobacteria. Decreases in the abundance of unclassified Bacteroidales, Porphyromonadaceae, and Ruminococcaceae, and increases in as-yet uncultured species of the family Succinivibrionaceae, might be related to a potential role of these groups in different pathways of rumen FA metabolism. Diet supplementation with MA, however, had no effect on the relative abundance of Butyrivibrio and Pseudobutyrivibrio genera. In addition, results from both 454 pyrosequencing and T-RFLP indicate that the effect of MA was rather consistent in rumen content or fluid samples, despite inherent differences between these fractions in their bacterial composition. 
Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Llera-Herrera, Raúl; García-Gasca, Alejandra; Abreu-Goodger, Cei; Huvet, Arnaud; Ibarra, Ana M.
2013-01-01
Despite the great advances in sequencing technologies, genomic and transcriptomic information for marine non-model species with ecological, evolutionary, and economical interest is still scarce. In this work we aimed to identify genes expressed during spermatogenesis in the functional hermaphrodite scallop Nodipecten subnodosus (Mollusca: Bivalvia: Pectinidae), with the purpose of obtaining a panel of genes that would allow for the study of differentially transcribed genes between diploid and triploid scallops in the context of meiotic arrest and reproductive sterility. Because our aim was to isolate genes involved in meiosis and other testis maturation-related processes, we generated suppressive subtractive hybridization libraries of testis vs. inactive gonad. We obtained 352 and 177 ESTs by clone sequencing, and using pyrosequencing (454-Roche) we maximized the identified ESTs to 34,276 reads. A total of 1,153 genes from the testis library had a blastx hit and GO annotation, including genes specific for meiosis, spermatogenesis, sex-differentiation, and transposable elements. Some of the identified meiosis genes function in chromosome pairing (scp2, scp3), recombination and DNA repair (dmc1, rad51, ccnb1ip1/hei10), and meiotic checkpoints (rad1, hormad1, dtl/cdt2). Expression analyses of meiosis genes at different gametogenic stages in both sexual regions of the gonad confirmed that the expression was specific to, or increased towards, the maturing testis. Spermatogenesis genes included known testis-specific ones (kelch-10, shippo1, adad1), with some of these known to be associated with sterility. Sex differentiation genes included one of the most conserved genes at the bottom of the sex-determination cascade (dmrt1). Transcripts from transposable elements, reverse transcriptase, and transposases in this library indicated that transposition is an active process during spermatogenesis in N. subnodosus. In relation to the inactive library, we identified 833 transcripts with functional annotation related to activation of the transcription and translation machinery, as well as to germline control and maintenance. PMID:24066034
The immune gene repertoire of an important viral reservoir, the Australian black flying fox.
Papenfuss, Anthony T; Baker, Michelle L; Feng, Zhi-Ping; Tachedjian, Mary; Crameri, Gary; Cowled, Chris; Ng, Justin; Janardhana, Vijaya; Field, Hume E; Wang, Lin-Fa
2012-06-20
Bats are the natural reservoir host for a range of emerging and re-emerging viruses, including SARS-like coronaviruses, Ebola viruses, henipaviruses and Rabies viruses. However, the mechanisms responsible for the control of viral replication in bats are not understood and there is little information available on any aspect of antiviral immunity in bats. Massively parallel sequencing of the bat transcriptome provides the opportunity for rapid gene discovery. Although the genomes of one megabat and one microbat have now been sequenced to low coverage, no transcriptomic datasets have been reported from any bat species. In this study, we describe the immune transcriptome of the Australian flying fox, Pteropus alecto, providing an important resource for identification of genes involved in a range of activities including antiviral immunity. Towards understanding the adaptations that have allowed bats to coexist with viruses, we have de novo assembled transcriptome sequence from immune tissues and stimulated cells from P. alecto. We identified about 18,600 genes involved in a broad range of activities with the most highly expressed genes involved in cell growth and maintenance, enzyme activity, cellular components and metabolism and energy pathways. 3.5% of the bat transcribed genes corresponded to immune genes and a total of about 500 immune genes were identified, providing an overview of both innate and adaptive immunity. A small proportion of transcripts found no match with annotated sequences in any of the public databases and may represent bat-specific transcripts. This study represents the first reported bat transcriptome dataset and provides a survey of expressed bat genes that complement existing bat genomic data. In addition, these data provide insight into genes relevant to the antiviral responses of bats, and form a basis for examining the roles of these molecules in immune response to viral infection.
Transcriptome and ultrastructural changes in dystrophic Epidermolysis bullosa resemble skin aging
Trost, Andrea; Weber, Manuela; Klausegger, Alfred; Gruber, Christina; Bruckner, Daniela; Reitsamer, Herbert A.; Bauer, Johann W.; Breitenbach, Michael
2015-01-01
The aging process of skin has been investigated recently with respect to mitochondrial function and oxidative stress. We have here observed striking phenotypic and clinical similarity between skin aging and recessive dystrophic Epidermolysis bullosa (RDEB), which is caused by recessive mutations in the gene coding for collagen VII, COL7A1. Ultrastructural changes, defects in wound healing, and inflammation markers are in part shared with aged skin. We have here compared the skin transcriptomes of young adults suffering from RDEB with that of sex‐ and age‐matched healthy probands. In parallel we have compared the skin transcriptome of healthy young adults with that of elderly healthy donors. Quite surprisingly, there was a large overlap of the two gene lists that concerned a limited number of functional protein families. Most prominent among the proteins found are a number of proteins of the cornified envelope or proteins mechanistically involved in cornification and other skin proteins. Further, the overlap list contains a large number of genes with a known role in inflammation. We are documenting some of the most prominent ultrastructural and protein changes by immunofluorescence analysis of skin sections from patients, old individuals, and healthy controls. PMID:26143532
Transcriptome and ultrastructural changes in dystrophic Epidermolysis bullosa resemble skin aging.
Breitenbach, Jenny S; Rinnerthaler, Mark; Trost, Andrea; Weber, Manuela; Klausegger, Alfred; Gruber, Christina; Bruckner, Daniela; Reitsamer, Herbert A; Bauer, Johann W; Breitenbach, Michael
2015-06-01
The aging process of skin has been investigated recently with respect to mitochondrial function and oxidative stress. We have here observed striking phenotypic and clinical similarity between skin aging and recessive dystrophic Epidermolysis bullosa (RDEB), which is caused by recessive mutations in the gene coding for collagen VII, COL7A1. Ultrastructural changes, defects in wound healing, and inflammation markers are in part shared with aged skin. We have here compared the skin transcriptomes of young adults suffering from RDEB with that of sex- and age-matched healthy probands. In parallel we have compared the skin transcriptome of healthy young adults with that of elderly healthy donors. Quite surprisingly, there was a large overlap of the two gene lists that concerned a limited number of functional protein families. Most prominent among the proteins found are a number of proteins of the cornified envelope or proteins mechanistically involved in cornification and other skin proteins. Further, the overlap list contains a large number of genes with a known role in inflammation. We are documenting some of the most prominent ultrastructural and protein changes by immunofluorescence analysis of skin sections from patients, old individuals, and healthy controls.
Beck, Rose C; Kohn, Debra J; Tuohy, Marion J; Prayson, Richard A; Yen-Lieberman, Belinda; Procop, Gary W
2004-03-01
We evaluated 2 methods, a LightCycler PCR assay and pyrosequencing for the detection of the JC polyoma virus (JCV) in fixed brain tissue of 10 patients with and 3 control patients without progressive multifocal leukoencephalopathy (PML). Nucleic acid extraction was performed after deparaffinization and proteinase K digestion. The LightCycler assay differentiates the BK virus (BKV), JCV, and SV40 using melt curve analysis. Conventional PCR was used with the same primers to generate products for pyrosequencing. Two sequencing primers were used that differentiate the polyoma viruses. Seven of 11 biopsies (1 patient had 2 biopsies) with PML were positive for JCV by real-time PCR and/or PCR/pyrosequencing. Three of 4 remaining biopsies were positive by real-time PCR but had melting points between JCV and SV40. The 4 specimens that were negative or atypical by LightCycler PCR were positive by traditional PCR, but 1 had an amplicon of lower molecular weight by gel electrophoresis. These were shown to represent JCV by at least 1 of the 2 pyrosequencing primers. The biopsies from patients without PML were PCR negative. Both the LightCycler and pyrosequencing assays are useful for confirming JCV in brain biopsies from patients with PML, but variant JCVs may require supplementary methods to confirm JCV infection.
Cell fixation and preservation for droplet-based single-cell transcriptomics.
Alles, Jonathan; Karaiskos, Nikos; Praktiknjo, Samantha D; Grosswendt, Stefanie; Wahle, Philipp; Ruffault, Pierre-Louis; Ayoub, Salah; Schreyer, Luisa; Boltengagen, Anastasiya; Birchmeier, Carmen; Zinzen, Robert; Kocks, Christine; Rajewsky, Nikolaus
2017-05-19
Recent developments in droplet-based microfluidics allow the transcriptional profiling of thousands of individual cells in a quantitative, highly parallel and cost-effective way. A critical, often limiting step is the preparation of cells in an unperturbed state, not altered by stress or ageing. Other challenges are rare cells that need to be collected over several days or samples prepared at different times or locations. Here, we used chemical fixation to address these problems. Methanol fixation allowed us to stabilise and preserve dissociated cells for weeks without compromising single-cell RNA sequencing data. By using mixtures of fixed, cultured human and mouse cells, we first showed that individual transcriptomes could be confidently assigned to one of the two species. Single-cell gene expression from live and fixed samples correlated well with bulk mRNA-seq data. We then applied methanol fixation to transcriptionally profile primary cells from dissociated, complex tissues. Low RNA content cells from Drosophila embryos, as well as mouse hindbrain and cerebellum cells prepared by fluorescence-activated cell sorting, were successfully analysed after fixation, storage and single-cell droplet RNA-seq. We were able to identify diverse cell populations, including neuronal subtypes. As an additional resource, we provide 'dropbead', an R package for exploratory data analysis, visualization and filtering of Drop-seq data. We expect that the availability of a simple cell fixation method will open up many new opportunities in diverse biological contexts to analyse transcriptional dynamics at single-cell resolution.
Marzorati, Massimo; Maignien, Lois; Verhelst, An; Luta, Gabriela; Sinnott, Robert; Kerckhof, Frederiek Maarten; Boon, Nico; Van de Wiele, Tom; Possemiers, Sam
2013-02-01
The combination of a Simulator of the Human Intestinal Microbial Ecosystem with ad hoc molecular techniques (i.e. pyrosequencing, denaturing gradient gel electrophoresis and quantitative PCR) allowed an evaluation of the extent to which two plant polysaccharide supplements could modify a complex gut microbial community. The presence of Aloe vera gel powder and algae extract in product B as compared to the standard blend (product A) improved its fermentation along the entire simulated colon. The potential extended effect of product B in the simulated distal colon, as compared to product A, was confirmed by: (i) the separate clustering of the samples before and after the treatment in the phylogenetic-based dendrogram and OTU-based PCoA plot only for product B; (ii) a higher richness estimator (+33 vs. -36 % of product A); and (iii) a higher dynamic parameter (21 vs. 13 %). These data show that the combination of well designed in vitro simulators with barcoded pyrosequencing is a powerful tool for characterizing changes occurring in the gut microbiota following a treatment. However, for the quantification of low-abundance species that are of interest because of their relationship to potential positive health effects (i.e. bifidobacteria or lactobacilli), conventional molecular ecological approaches, such as PCR-DGGE and qPCR, still remain a very useful complementary tool.
Aparicio, Ana; North, Brittany; Barske, Lindsey; Wang, Xuemei; Bollati, Valentina; Weisenberger, Daniel; Yoo, Christine; Tannir, Nizar; Horne, Erin; Groshen, Susan; Jones, Peter; Yang, Allen; Issa, Jean-Pierre
2009-04-01
Multiple clinical trials are investigating the use of the DNA methylation inhibitors azacitidine and decitabine for the treatment of solid tumors. Clinical trials in hematological malignancies have shown that optimal activity does not occur at their maximum tolerated doses, but selection of an optimal biological dose and schedule for use in solid tumor patients is hampered by the difficulty of obtaining tumor tissue to measure their activity. Here we investigate the feasibility of using plasma DNA to measure the demethylating activity of the DNA methylation inhibitors in patients with solid tumors. We compared four methods to measure LINE-1 and MAGE-A1 promoter methylation in T24 and HCT116 cancer cells treated with decitabine and selected Pyrosequencing for its greater reproducibility and higher signal-to-noise ratio. We then obtained DNA from plasma, peripheral blood mononuclear cells, buccal mucosa cells and saliva from ten patients with metastatic solid tumors at two different time points, without any intervening treatment. DNA methylation measurements were not significantly different between time point 1 and time point 2 in patient samples. We conclude that measurement of LINE-1 methylation in DNA extracted from the plasma of patients with advanced solid tumors, using Pyrosequencing, is feasible and has low within-patient variability. Ongoing studies will determine whether changes in LINE-1 methylation in plasma DNA occur as a result of treatment with DNA methylation inhibitors and parallel changes in tumor tissue DNA.
Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah
2012-01-01
Background Pineapple (Ananas comosus var. comosus) is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanisms and processes underlying fruit ripening would enable scientists to improve quality traits such as flavor, texture, appearance and fruit sweetness. Although the pineapple is an important fruit, little transcriptomic or genomic information is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. Methodology/Principal Findings To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 million Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against the non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%), which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. Conclusions The unique transcripts derived from this work have rapidly increased the number of pineapple fruit mRNA transcripts now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple. PMID:23091603
Nam, Seungyoon
2017-04-01
Cancer transcriptome analysis is one of the leading areas of Big Data science, biomarker, and pharmaceutical discovery, not to forget personalized medicine. Yet, cancer transcriptomics and postgenomic medicine require innovation in bioinformatics as well as comparison of the performance of available algorithms. In this data analytics context, the value of network generation and algorithms has been widely underscored for addressing the salient questions in cancer pathogenesis. Analysis of the cancer transcriptome often results in complicated networks where identification of network modularity remains critical, for example, in delineating the "druggable" molecular targets. Network clustering is useful, but depends on the network topology in and of itself. Notably, the performance of different network-generating tools for network cluster (NC) identification has been little investigated to date. Hence, using gastric cancer (GC) transcriptomic datasets, we compared two algorithms for generating pathway versus gene regulatory network-based NCs, showing that the pathway-based approach better agrees with a reference set of cancer-functional contexts. Finally, by applying pathway-based NC identification to GC transcriptome datasets, we describe cancer NCs that associate with candidate therapeutic targets and biomarkers in GC. These observations collectively inform future research on cancer transcriptomics, drug discovery, and rational development of new analysis tools for optimal harnessing of omics data.
Van Geel, Maarten; Busschaert, Pieter; Honnay, Olivier; Lievens, Bart
2014-11-01
In the last few years, 454 pyrosequencing-based analysis of arbuscular mycorrhizal fungal (AMF; Glomeromycota) communities has tremendously increased our knowledge of the distribution and diversity of AMF. Nonetheless, comparing results between different studies is difficult, as different target genes (or regions thereof) and primer combinations, with potentially dissimilar specificities and efficacies, are being utilized. In this study we evaluated six primer pairs that have previously been used in AMF studies (NS31-AM1, AMV4.5NF-AMDGR, AML1-AML2, NS31-AML2, FLR3-LSUmBr and Glo454-NDL22) for their use in 454 pyrosequencing based on both an in silico approach and 454 pyrosequencing of AMF communities from apple tree roots. Primers were evaluated in terms of (i) in silico coverage of Glomeromycota fungi, (ii) the number of high-quality sequences obtained, (iii) selectivity for AMF species, (iv) reproducibility and (v) ability to accurately describe AMF communities. We show that primer pairs AMV4.5NF-AMDGR, AML1-AML2 and NS31-AML2 outperformed the other tested primer pairs in terms of number of Glomeromycota reads (AMF specificity and coverage). Additionally, these primer pairs were found to have no or only few mismatches to AMF sequences and were able to consistently describe AMF communities from apple roots. However, whereas most high-quality AMF sequences were obtained for AMV4.5NF-AMDGR, our results also suggest that this primer pair favored amplification of Glomeraceae sequences at the expense of Ambisporaceae, Claroideoglomeraceae and Paraglomeraceae sequences. Furthermore, we demonstrate the complementary specificity of AMV4.5NF-AMDGR with AML1-AML2, and of AMV4.5NF-AMDGR with NS31-AML2, making these primer combinations highly suitable for tandem use in covering the diversity of AMF communities. Copyright © 2014 Elsevier B.V. All rights reserved.
Application of Pyrosequencing® in Food Biodefense.
Amoako, Kingsley Kwaku
2015-01-01
The perpetration of a bioterrorism attack poses a significant risk for public health with potential socioeconomic consequences. It is imperative that we possess reliable assays for the rapid and accurate identification of biothreat agents to make rapid risk-informed decisions on emergency response. The development of advanced methodologies for the detection of biothreat agents has been evolving rapidly since the release of the anthrax spores in the mail in 2001, and recent advances in detection and identification techniques could prove to be an essential component in the defense against biological attacks. Sequence-based approaches such as Pyrosequencing®, which has the capability to determine short DNA stretches in real time using biotinylated PCR amplicons, have potential biodefense applications. Using markers from the virulence plasmids and chromosomal regions, my laboratory has demonstrated the power of this technology in the rapid, specific, and sensitive detection of B. anthracis spores and Yersinia pestis in food. These are the first applications for the detection of the two organisms in food. Furthermore, my lab has developed a rapid assay to characterize the antimicrobial resistance (AMR) gene profiles for Y. pestis using Pyrosequencing. Pyrosequencing is completed in about 60 min (following PCR amplification) and yields accurate and reliable results with an added layer of confidence, thus enabling rapid risk-informed decisions to be made. A typical run yields 40-84 bp reads with 94-100% identity to the expected sequence. It also provides a rapid method for determining the AMR profile as compared to the conventional plate method which takes several days. The method described is proposed as a novel detection system for potential application in food biodefense.
Croville, Guillaume; Soubies, Sébastien Mathieu; Barbieri, Johanna; Klopp, Christophe; Mariette, Jérôme; Bouchez, Olivier; Camus-Bouclainville, Christelle
2012-01-01
Adaptation of avian influenza viruses (AIVs) from waterfowl to domestic poultry with a deletion in the neuraminidase (NA) stalk has already been reported. The way the virus undergoes this evolution, however, is thus far unclear. We address this question using pyrosequencing of duck and turkey low-pathogenicity AIVs. Ducks and turkeys were sampled at the very beginning of an H6N1 outbreak, and turkeys were swabbed again 8 days later. NA stalk deletions were evidenced in turkeys by Sanger sequencing. To further investigate viral evolution, 454 pyrosequencing was performed: for each set of samples, up to 41,500 reads of ca. 400 bp were generated and aligned. Genetic polymorphisms between duck and turkey viruses were tracked on the whole genome. NA deletion was detected in less than 2% of reads in duck feces but in 100% of reads in turkey tracheal specimens collected at the same time. Further variations in length were observed in NA from turkeys 8 days later. Similarly, minority mutants emerged on the hemagglutinin (HA) gene, with substitutions mostly in the receptor binding site on the globular head. These critical changes suggest a strong evolutionary pressure in turkeys. The increasing performances of next-generation sequencing technologies should enable us to monitor the genomic diversity of avian influenza viruses and early emergence of potentially pathogenic variants within bird flocks. The present study, based on 454 pyrosequencing, suggests that NA deletion, an example of AIV adaptation from waterfowl to domestic poultry, occurs by selection rather than de novo emergence of viral mutants. PMID:22718944
Oki, Kaihei; Dugersuren, Jamyan; Demberel, Shirchin; Watanabe, Koichi
2014-01-01
Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare's milk), 5 Khoormog (fermented camel's milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 phyla. The genus Lactobacillus is a core bacterial component of Mongolian fermented milks, and Lactobacillus helveticus, Lactobacillus kefiranofaciens and Lactobacillus delbrueckii were the predominant species of lactic acid bacteria (LAB) in the Airag, Khoormog and Tarag samples, respectively. By using this pyrosequencing approach, we successfully detected most LAB species that have been isolated as well as seven LAB species that have not been found in our previous culture-based study. A subsequent analysis of the principal components of the samples revealed that L. delbrueckii, L. helveticus, L. kefiranofaciens and Streptococcus thermophilus were the main factors influencing the microbial diversity of these Mongolian traditional fermented dairy products and that this diversity correlated with the animal species from which the milk was sourced.
Gilling, Damian H; Luna, Vicki Ann; Pflugradt, Cori
2014-01-01
The etiologic agents for glanders and melioidosis, Burkholderia mallei and Burkholderia pseudomallei respectively, are genetically similar, making identification and differentiation from other Burkholderia species and each other challenging. We used pyrosequencing to determine the presence or absence of an insertion sequence IS407A within the flagellin P (fliP) gene and to exploit the difference in orientation of this gene in the two species. Oligonucleotide primers were designed to selectively target the IS407A-fliP interface in B. mallei and the fliP gene specifically at the insertion point in B. pseudomallei. We then examined DNA from ten B. mallei, ten B. pseudomallei, 14 B. cepacia, eight other Burkholderia spp., and 17 other bacteria. Resultant pyrograms encompassed the target sequence that contained either the fliP gene with the IS407A interruption or the fully intact fliP gene with 100% sensitivity and 100% specificity. These pyrosequencing assays based upon a single gene enable investigators to reliably identify the two species. The information obtained by these assays provides more knowledge of the genomic reduction that created the new species B. mallei from B. pseudomallei and may point to new targets that can be exploited in the future.
Bhatt, Vaibhav D; Dande, Suchitra S; Patil, Nitin V; Joshi, Chaitanya G
2013-04-01
Rumen microorganisms play an important role in ruminant digestion and absorption of nutrients and have great potential applications in the fields of rumen adjusting, food fermentation and biomass utilization. In order to investigate the composition of microorganisms in the rumen of the camel (Camelus dromedarius), this study examines the microbial diversity using a culture-independent approach. It includes a comparison of the rumen samples investigated in the present study with other currently available metagenomes to reveal potential differences in rumen microbial systems. Pyrosequencing-based metagenomics was applied to analyze phylogenetic and metabolic profiles with MG-RAST, a web-based tool. Pyrosequencing of the camel rumen sample yielded 8,979,755 nucleotides assembled to 41,905 sequence reads with an average read length of 214 nucleotides. Taxonomic analysis of metagenomic reads indicated Bacteroidetes (55.5%), Firmicutes (22.7%) and Proteobacteria (9.2%) phyla as the predominant camel rumen taxa. At a finer phylogenetic resolution, Bacteroides species dominated the camel rumen metagenome. Functional analysis revealed that clustering-based subsystems and carbohydrate metabolism were the most abundant SEED subsystems, representing 17 and 13% of the camel metagenome, respectively. A high taxonomic and functional similarity of the camel rumen was found with the cow metagenome, which is not surprising given that both are mammalian herbivores with similar digestive tract structures and functions. The combined pyrosequencing approach and subsystems-based annotations available in the SEED database allowed us to assess the metabolic potential of these microbiomes. Altogether, these data suggest that agricultural and animal husbandry practices can impose significant selective pressures on the rumen microbiota regardless of rumen type.
The present study provides a baseline for understanding the complexity of camel rumen microbial ecology while also highlighting striking similarities and differences when compared to other animal gastrointestinal environments.
Park, Jung-Hun; Choi, Okkyoung; Lee, Tae-Ho; Kim, Hyunook; Sang, Byoung-In
2016-11-01
Wastewaters from swine farms, nitrogen-dealing industries or side-stream processes of a wastewater treatment plant (e.g., anaerobic digesters, sludge thickening processes, etc.) are characterized by low C/N ratios and are not easily treatable. In this study, a hollow fiber-membrane biofilm reactor (HF-MBfR) system consisting of an O2-based HF-MBfR and an H2-based HF-MBfR was applied for treating high-strength wastewater. The reactors were continuously operated with low supply of O2 and H2 and without any supply of organic carbon for 250 d. Under gradually increasing ammonium and nitrate concentrations in the influent, nitrogen removal efficiency remained stable and high, and the maximum ammonium and nitrate removal rates were 0.48 kg NH4+-N m^-3 d^-1 and 0.55 kg NO3--N m^-3 d^-1, respectively. The analysis of the microbial communities using pyrosequencing indicated that Nitrosospira multiformis, an ammonium-oxidizing bacterium, and Nitrobacter winogradskyi and Nitrobacter vulgaris, nitrite-oxidizing bacteria, were highly enriched in the O2-based HF-MBfR. In the H2-based HF-MBfR, hydrogenotrophic denitrifying bacteria belonging to the family of Thiobacillus and Comamonadaceae were initially dominant, but were replaced by heterotrophic denitrifiers belonging to Rhodocyclaceae and Rhodobacteraceae utilizing by-products induced from autotrophic denitrifying bacteria. The pyrosequencing analysis of microbial communities indicates that the autotrophic HF-MBfR system developed autotrophic nitrifying and denitrifying bacteria well within a relatively short period to accomplish almost complete nitrogen removal. Copyright © 2016 Elsevier Ltd. All rights reserved.
Differential resistance of drinking water bacterial populations to monochloramine disinfection.
Chiao, Tzu-Hsin; Clancy, Tara M; Pinto, Ameet; Xi, Chuanwu; Raskin, Lutgarde
2014-04-01
The impact of monochloramine disinfection on the complex bacterial community structure in drinking water systems was investigated using culture-dependent and culture-independent methods. Changes in viable bacterial diversity were monitored using culture-independent methods that distinguish between live and dead cells based on membrane integrity, providing a highly conservative measure of viability. Samples were collected from lab-scale and full-scale drinking water filters exposed to monochloramine for a range of contact times. Culture-independent detection of live cells was based on propidium monoazide (PMA) treatment to selectively remove DNA from membrane-compromised cells. Quantitative PCR (qPCR) and pyrosequencing of 16S rRNA genes were used to quantify the DNA of live bacteria and characterize the bacterial communities, respectively. The inactivation rate determined by the culture-independent PMA-qPCR method (1.5-log removal at 664 mg·min/L) was lower than the inactivation rate measured by the culture-based methods (4-log removal at 66 mg·min/L). Moreover, drastic changes in the live bacterial community structure were detected during monochloramine disinfection using PMA-pyrosequencing, while the community structure appeared to remain stable when pyrosequencing was performed on samples that were not subject to PMA treatment. Genera that increased in relative abundance during monochloramine treatment include Legionella, Escherichia, and Geobacter in the lab-scale system and Mycobacterium, Sphingomonas, and Coxiella in the full-scale system. These results demonstrate that bacterial populations in drinking water exhibit differential resistance to monochloramine, and that the disinfection process selects for resistant bacterial populations.
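The two inactivation figures quoted in this abstract can be put on a common per-CT basis. The following is a minimal sketch only: it assumes log removal scales linearly with CT (a Chick-Watson-style simplification that the abstract does not state), and the function name is hypothetical.

```python
def log_removal_per_ct(log_removal, ct_mg_min_per_l):
    """Log10 removal per unit CT (mg·min/L), assuming a linear dose response."""
    if ct_mg_min_per_l <= 0:
        raise ValueError("CT must be positive")
    return log_removal / ct_mg_min_per_l


# Figures taken from the abstract.
pma_qpcr_rate = log_removal_per_ct(1.5, 664)  # culture-independent PMA-qPCR
culture_rate = log_removal_per_ct(4.0, 66)    # culture-based methods

# How many times faster inactivation appears when measured by culturing.
rate_ratio = culture_rate / pma_qpcr_rate

print(f"PMA-qPCR: {pma_qpcr_rate:.4f} log per mg·min/L")
print(f"Culture:  {culture_rate:.4f} log per mg·min/L")
print(f"Ratio:    {rate_ratio:.1f}x")
```

Under that linearity assumption the culture-based estimate is roughly 27 times higher, which is consistent with the abstract's description of PMA-qPCR as a highly conservative measure of viability.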
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Pyrosequencing for detection of lamivudine-resistant hepatitis B virus.
Lindström, Anna; Odeberg, Jacob; Albert, Jan
2004-10-01
Chronic hepatitis B virus (HBV) infection can cause severe liver disease, including cirrhosis and hepatocellular carcinoma. Lamivudine is a relatively recent alternative to alpha interferon for the treatment of HBV infection, but unfortunately, resistance to lamivudine commonly develops during monotherapy. Lamivudine-resistant HBV mutants display specific mutations in the YMDD (tyrosine, methionine, aspartate, aspartate) motif of the viral polymerase (reverse transcriptase [rt]), which is the catalytic site of the enzyme, i.e., methionine 204 to isoleucine (rtM204I) or valine (rtM204V). The latter mutation is often accompanied by a compensatory leucine-to-methionine change at codon 180 (rtL180M). In the present study, a novel sequencing method, pyrosequencing, was applied to the detection of lamivudine resistance mutations and was compared with direct Sanger sequencing. The new pyrosequencing method had advantages in terms of throughput. Experiments with mixtures of wild-type and resistant viruses indicated that pyrosequencing can detect minor sequence variants in heterogeneous virus populations. The new pyrosequencing method was evaluated with a small number of patient samples, and the results showed that the method could be a useful tool for the detection of lamivudine resistance in the clinical setting.
Migheli, Francesca; Stoccoro, Andrea; Coppedè, Fabio; Wan Omar, Wan Adnan; Failli, Alessandra; Consolini, Rita; Seccia, Massimo; Spisni, Roberto; Miccoli, Paolo; Mathers, John C.; Migliore, Lucia
2013-01-01
There is increasing interest in the development of cost-effective techniques for the quantification of DNA methylation biomarkers. We analyzed 90 samples of surgically resected colorectal cancer tissues for APC and CDKN2A promoter methylation using methylation sensitive-high resolution melting (MS-HRM) and pyrosequencing. MS-HRM is a less expensive technique compared with pyrosequencing but is usually more limited because it gives a range of methylation estimates rather than a single value. Here, we developed a method for deriving single estimates, rather than a range, of methylation using MS-HRM and compared the values obtained in this way with those obtained using the gold standard quantitative method of pyrosequencing. We derived an interpolation curve using standards of known methylated/unmethylated ratio (0%, 12.5%, 25%, 50%, 75%, and 100% of methylation) to obtain the best estimate of the extent of methylation for each of our samples. We observed similar profiles of methylation and a high correlation coefficient between the two techniques. Overall, our new approach allows MS-HRM to be used as a quantitative assay which provides results which are comparable with those obtained by pyrosequencing. PMID:23326336
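The idea of reading a single methylation estimate off a curve built from standards can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the standard methylation levels come from the abstract, but the signal values in `STANDARD_SIGNAL`, the piecewise-linear form of the curve, and the function name are all hypothetical.

```python
from bisect import bisect_right

# Methylation levels of the standards (from the abstract), in percent.
STANDARD_METHYLATION = [0.0, 12.5, 25.0, 50.0, 75.0, 100.0]
# Hypothetical normalized MS-HRM signal measured for each standard.
STANDARD_SIGNAL = [0.00, 0.20, 0.35, 0.60, 0.82, 1.00]


def estimate_methylation(signal):
    """Piecewise-linear interpolation of percent methylation from a signal."""
    if not STANDARD_SIGNAL[0] <= signal <= STANDARD_SIGNAL[-1]:
        raise ValueError("signal outside the calibrated range")
    # Index of the upper standard bracketing the signal.
    i = min(bisect_right(STANDARD_SIGNAL, signal), len(STANDARD_SIGNAL) - 1)
    x0, x1 = STANDARD_SIGNAL[i - 1], STANDARD_SIGNAL[i]
    y0, y1 = STANDARD_METHYLATION[i - 1], STANDARD_METHYLATION[i]
    return y0 + (y1 - y0) * (signal - x0) / (x1 - x0)


sample_estimate = estimate_methylation(0.475)
print(f"Estimated methylation: {sample_estimate:.1f}%")
```

A sample whose signal falls halfway between the 25% and 50% standards is assigned about 37.5%, a single value rather than the 25-50% range that MS-HRM would otherwise report.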
Ovine pedomics: the first study of the ovine foot 16S rRNA-based microbiome
USDA-ARS's Scientific Manuscript database
We report the first study of the bacterial microbiome of ovine interdigital skin based on 16S rRNA by pyrosequencing and conventional cloning with Sanger-sequencing. Ovine foot rot is an infectious, contagious disease of sheep that causes severe lameness and economic loss from decreased flock produc...
Sequencing the Black Aspergilli species complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuo, Alan; Salamov, Asaf; Zhou, Kemin
2011-03-11
The ~15 members of the Aspergillus section Nigri species complex (the "Black Aspergilli") are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as food processing and spoilage agents and agricultural toxigens. Despite their utility and ubiquity, the morphological and metabolic distinctiveness of the complex's members, and thus their taxonomy, is poorly defined. We are using short read pyrosequencing technology (Roche/454 and Illumina/Solexa) to rapidly scale up genomic and transcriptomic analysis of this species complex. To date we predict 11197 genes in Aspergillus niger, 11624 genes in A. carbonarius, and 10845 genes in A. aculeatus. A. aculeatus is our most recent genome, and was assembled primarily from 454-sequenced reads and annotated with the aid of >2 million 454 ESTs and >300 million Solexa ESTs. To most effectively deploy these very large numbers of ESTs we developed 2 novel methods for clustering the ESTs into assemblies. We have also developed a pipeline to propose orthologies and paralogies among genes in the species complex. In the near future we will apply these methods to additional species of Black Aspergilli that are currently in our sequencing pipeline.
Prest, E I; El-Chakhtoura, J; Hammes, F; Saikaly, P E; van Loosdrecht, M C M; Vrouwenvelder, J S
2014-10-15
The combination of flow cytometry (FCM) and 16S rRNA gene pyrosequencing data was investigated for the purpose of monitoring and characterizing microbial changes in drinking water distribution systems. High frequency sampling (5 min intervals for 1 h) was performed at the outlet of a treatment plant and at one location in the full-scale distribution network. In total, 52 bulk water samples were analysed with FCM, pyrosequencing and conventional methods (adenosine-triphosphate, ATP; heterotrophic plate count, HPC). FCM and pyrosequencing results individually showed that changes in the microbial community occurred in the water distribution system, which were not detected with conventional monitoring. FCM data showed an increase in the total bacterial cell concentrations (from 345 ± 15 × 10^3 to 425 ± 35 × 10^3 cells mL^-1) and in the percentage of intact bacterial cells (from 39 ± 3.5% to 53 ± 4.4%) during water distribution. This shift was also observed in the FCM fluorescence fingerprints, which are characteristic of each water sample. A similar shift was detected in the microbial community composition as characterized with pyrosequencing, showing that FCM and genetic fingerprints are congruent. FCM and pyrosequencing data were subsequently combined for the calculation of cell concentration changes for each bacterial phylum. The results revealed an increase in cell concentrations of specific bacterial phyla (e.g., Proteobacteria), along with a decrease in other phyla (e.g., Actinobacteria), which could not be concluded from the two methods individually. The combination of FCM and pyrosequencing methods is a promising approach for future drinking water quality monitoring and for advanced studies on drinking water distribution pipeline ecology. Copyright © 2014 Elsevier Ltd. All rights reserved.
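The combination step described in this abstract, turning relative abundances into per-phylum cell concentrations, can be sketched as a simple product of the FCM total count and the pyrosequencing fraction. The total counts below are the means reported in the abstract; the relative abundances, the dictionary layout, and the function name are hypothetical values chosen only to illustrate the arithmetic.

```python
def phylum_cell_concentrations(total_cells_per_ml, relative_abundance):
    """Scale each phylum's read fraction by the FCM total cell count."""
    return {
        phylum: total_cells_per_ml * fraction
        for phylum, fraction in relative_abundance.items()
    }


# Mean total counts from the abstract (cells per mL).
plant_outlet_total = 345e3
network_total = 425e3

# Hypothetical relative abundances from pyrosequencing (fractions of reads).
plant_outlet_abundance = {"Proteobacteria": 0.40, "Actinobacteria": 0.20}
network_abundance = {"Proteobacteria": 0.50, "Actinobacteria": 0.10}

outlet = phylum_cell_concentrations(plant_outlet_total, plant_outlet_abundance)
network = phylum_cell_concentrations(network_total, network_abundance)

# Absolute change per phylum between plant outlet and network location.
change = {phylum: network[phylum] - outlet[phylum] for phylum in outlet}
for phylum, delta in change.items():
    print(f"{phylum}: {delta:+.0f} cells per mL")
```

With these illustrative fractions, Proteobacteria rise and Actinobacteria fall in absolute terms, a conclusion that needs both the total count and the composition, as the abstract notes.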
Rapid detection of the CYP2A6*12 hybrid allele by Pyrosequencing technology.
Koontz, Deborah A; Huckins, Jacqueline J; Spencer, Antonina; Gallagher, Margaret L
2009-08-24
Identification of CYP2A6 alleles associated with reduced enzyme activity is important in the study of inter-individual differences in drug metabolism. CYP2A6*12 is a hybrid allele that results from unequal crossover between CYP2A6 and CYP2A7 genes. The 5' regulatory region and exons 1-2 are derived from CYP2A7, and exons 3-9 are derived from CYP2A6. Conventional methods for detection of CYP2A6*12 consist of two-step PCR protocols that are laborious and unsuitable for high-throughput genotyping. We developed a rapid and accurate method to detect the CYP2A6*12 allele by Pyrosequencing technology. A single set of PCR primers was designed to specifically amplify both the CYP2A6*1 wild-type allele and the CYP2A6*12 hybrid allele. An internal Pyrosequencing primer was used to generate allele-specific sequence information, which detected homozygous wild-type, heterozygous hybrid, and homozygous hybrid alleles. We first validated the assay on 104 DNA samples that were also genotyped by conventional two-step PCR and by cycle sequencing. CYP2A6*12 allele frequencies were then determined using the Pyrosequencing assay on 181 multi-ethnic DNA samples from subjects of African American, European Caucasian, Pacific Rim, and Hispanic descent. Finally, we streamlined the Pyrosequencing assay by integrating liquid handling robotics into the workflow. Pyrosequencing results demonstrated 100% concordance with conventional two-step PCR and cycle sequencing methods. Allele frequency data showed slightly higher prevalence of the CYP2A6*12 allele in European Caucasians and Hispanics. This Pyrosequencing assay proved to be a simple, rapid, and accurate alternative to conventional methods, which can be easily adapted to the needs of higher-throughput studies.
Kim, Kyoung-Ah; Song, Wan-Geun; Lee, Hae-Mi; Joo, Hyun-Jin; Park, Ji-Young
2014-11-01
Warfarin is an anticoagulant that is difficult to administer because of the wide variation in dose requirements to achieve a therapeutic effect. CYP2C9, VKORC1, and CYP4F2 play important roles in warfarin metabolism, and their genetic polymorphisms are related to the variability in dose determination. In this study we describe a new multiplex pyrosequencing method to identify CYP2C9*3 (rs1057910), VKORC1*2 (rs9923231), and CYP4F2*3 (rs2108661) simultaneously. A multiplex pyrosequencing method to simultaneously detect CYP2C9*3, VKORC1*2, and CYP4F2*3 alleles was designed. We assessed the allele frequencies of the polymorphisms in 250 Korean subjects using the multiplex pyrosequencing method. The results showed 100% concordance between single and multiplex pyrosequencing methods, and the polymorphisms identified by pyrosequencing were also validated with the direct sequencing method. The allele frequencies of these polymorphisms in this population were as follows: 0.040 for CYP2C9*3, 0.918 for VKORC1*2, and 0.416 for CYP4F2*3. Although the allele frequencies of the CYP2C9*3 and VKORC1*2 were comparable to those in Japanese and Chinese populations, their frequencies in this Korean population differed from those in other ethnic groups; the CYP4F2*3 frequency was the highest among other ethnic populations including Chinese and Japanese populations. The pyrosequencing methods developed were rapid and reliable for detecting CYP2C9*3, VKORC1*2, and CYP4F2*3. Large ethnic differences in the frequency of these genetic polymorphisms were noted among ethnic groups. CYP4F2*3 exhibited its highest allele frequency among other ethnic populations compared to that in a Korean population.
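Allele frequencies such as those reported in this abstract are conventionally obtained by counting variant alleles over twice the number of diploid subjects. The sketch below shows that calculation; the genotype counts are hypothetical, chosen only so that the result matches the reported frequencies for 250 subjects, and the function name is an assumption.

```python
def variant_allele_frequency(n_heterozygous, n_homozygous_variant, n_subjects):
    """Variant allele count divided by total alleles in diploid subjects."""
    if n_subjects <= 0:
        raise ValueError("n_subjects must be positive")
    if n_heterozygous + n_homozygous_variant > n_subjects:
        raise ValueError("genotype counts exceed the number of subjects")
    variant_alleles = n_heterozygous + 2 * n_homozygous_variant
    return variant_alleles / (2 * n_subjects)


# Hypothetical genotype counts (not given in the abstract) for 250 subjects.
cyp2c9_3 = variant_allele_frequency(20, 0, 250)    # 20 of 500 alleles
cyp4f2_3 = variant_allele_frequency(120, 44, 250)  # 208 of 500 alleles

print(f"CYP2C9*3: {cyp2c9_3:.3f}")
print(f"CYP4F2*3: {cyp4f2_3:.3f}")
```

Many different genotype splits give the same allele frequency, so the counts above should not be read as the study's data.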
The Sequencing Bead Array (SBA), a Next-Generation Digital Suspension Array
Akhras, Michael S.; Pettersson, Erik; Diamond, Lisa; Unemo, Magnus; Okamoto, Jennifer; Davis, Ronald W.; Pourmand, Nader
2013-01-01
Here we describe the novel Sequencing Bead Array (SBA), a complete assay for molecular diagnostics and typing applications. SBA is a digital suspension array using Next-Generation Sequencing (NGS), to replace conventional optical readout platforms. The technology allows for reducing the number of instruments required in a laboratory setting, where the same NGS instrument could be employed from whole-genome and targeted sequencing to SBA broad-range biomarker detection and genotyping. As proof-of-concept, a model assay was designed that could distinguish ten Human Papillomavirus (HPV) genotypes associated with cervical cancer progression. SBA was used to genotype 20 cervical tumor samples and, when compared with amplicon pyrosequencing, was able to detect two additional co-infections due to increased sensitivity. We also introduce in-house software Sphix, enabling easy accessibility and interpretation of results. The technology offers a multi-parallel, rapid, robust, and scalable system that is readily adaptable for a multitude of microarray diagnostic and typing applications, e.g. genetic signatures, single nucleotide polymorphisms (SNPs), structural variations, and immunoassays. SBA has the potential to dramatically change the way we perform probe-based applications, and allow for a smooth transition towards the technology offered by genomic sequencing. PMID:24116138
Liu, Na; Liu, Lin; Pan, Xinghua
2014-07-01
Cellular heterogeneity within a cell population is a common phenomenon in multicellular organisms, tissues, cultured cells, and even FACS-sorted subpopulations. Important information may be masked if the cells are studied as a mass. The transcriptome has been intensively studied and is relatively easier to profile than protein composition. To understand the basis and importance of heterogeneity and stochastic aspects of the cell function and its mechanisms, it is essential to examine transcriptomes of a panel of single cells. High-throughput technologies, starting from microarrays and now RNA-seq, provide a full view of the expression of transcriptomes but are limited by the amount of RNA for analysis. Recently, several new approaches for amplifying and sequencing the transcriptome of single cells or a small number of cells have been developed and applied. In this review, we summarize these major strategies, such as PCR-based methods, IVT-based methods, phi29-DNA polymerase-based methods, and several other methods, including their principles, characteristics, advantages, and limitations, with representative applications in cancer stem cells, early development, and embryonic stem cells. The prospects for development of future technology and application of transcriptome analysis in a single cell are also discussed.
Investigating bacterial populations in styrene-degrading biofilters by 16S rDNA tag pyrosequencing.
Portune, Kevin J; Pérez, M Carmen; Álvarez-Hornos, F Javier; Gabaldón, Carmen
2015-01-01
Microbial biofilms are essential components in the elimination of pollutants within biofilters, yet still little is known regarding the complex relationships between microbial community structure and biodegradation function within these engineered ecosystems. To further explore this relationship, 16S rDNA tag pyrosequencing was applied to samples taken at four time points from a styrene-degrading biofilter undergoing variable operating conditions. Changes in microbial structure were observed between different stages of biofilter operation, and the level of styrene concentration was revealed to be a critical factor affecting these changes. Bacterial genera Azoarcus and Pseudomonas were among the dominant classified genera in the biofilter. Canonical correspondence analysis (CCA) and correlation analysis revealed that the genera Brevundimonas, Hydrogenophaga, and Achromobacter may play important roles in styrene degradation under increasing styrene concentrations. No significant correlations (P > 0.05) could be detected between biofilter operational/functional parameters and biodiversity measurements, although biological heterogeneity within biofilms and/or technical variability within pyrosequencing may have considerably affected these results. Percentages of selected bacterial taxonomic groups detected by fluorescence in situ hybridization (FISH) were compared to results from pyrosequencing in order to assess the effectiveness and limitations of each method for identifying each microbial taxon. Comparison of results revealed discrepancies between the two methods in the detected percentages of numerous taxonomic groups. Biases and technical limitations of both FISH and pyrosequencing, such as the binding of FISH probes to non-target microbial groups and lack of classification of sequences for defined taxonomic groups from pyrosequencing, may partially explain some differences between the two methods.
Global survey of genomic imprinting by transcriptome sequencing.
Babak, Tomas; Deveale, Brian; Armour, Christopher; Raymond, Christopher; Cleary, Michele A; van der Kooy, Derek; Johnson, Jason M; Lim, Lee P
2008-11-25
Genomic imprinting restricts gene expression to a paternal or maternal allele. To date, approximately 90 imprinted transcripts have been identified in mouse, of which the majority were detected after intense interrogation of clusters of imprinted genes identified by phenotype-driven assays in mice with uniparental disomies [1]. Here we use selective priming and parallel sequencing to measure allelic bias in whole transcriptomes. By distinguishing parent-of-origin bias from strain-specific bias in embryos derived from a reciprocal cross of mice, we constructed a genome-wide map of imprinted transcription. This map was able to objectively locate over 80% of known imprinted loci and allowed the detection and confirmation of six novel imprinted genes. Even in the intensely studied embryonic day 9.5 developmental stage that we analyzed, more than half of all imprinted single-nucleotide polymorphisms did not overlap previously discovered imprinted transcripts; a large fraction of these represent novel noncoding RNAs within known imprinted loci. For example, a previously unnoticed, maternally expressed antisense transcript was mapped within the Grb10 locus. This study demonstrates the feasibility of using transcriptome sequencing for mapping of imprinted gene expression in physiologically normal animals. Such an approach will allow researchers to study imprinting without restricting themselves to individual loci or specific transcripts.
Discovering Functions of Unannotated Genes from a Transcriptome Survey of Wild Fungal Isolates
Ellison, Christopher E.; Kowbel, David; Glass, N. Louise; Taylor, John W.
2014-01-01
ABSTRACT Most fungal genomes are poorly annotated, and many fungal traits of industrial and biomedical relevance are not well suited to classical genetic screens. Assigning genes to phenotypes on a genomic scale thus remains an urgent need in the field. We developed an approach to infer gene function from expression profiles of wild fungal isolates, and we applied our strategy to the filamentous fungus Neurospora crassa. Using transcriptome measurements in 70 strains from two well-defined clades of this microbe, we first identified 2,247 cases in which the expression of an unannotated gene rose and fell across N. crassa strains in parallel with the expression of well-characterized genes. We then used image analysis of hyphal morphologies, quantitative growth assays, and expression profiling to test the functions of four genes predicted from our population analyses. The results revealed two factors that influenced regulation of metabolism of nonpreferred carbon and nitrogen sources, a gene that governed hyphal architecture, and a gene that mediated amino acid starvation resistance. These findings validate the power of our population-transcriptomic approach for inference of novel gene function, and we suggest that this strategy will be of broad utility for genome-scale annotation in many fungal systems. PMID:24692637
Developing High-Throughput HIV Incidence Assay with Pyrosequencing Platform
Park, Sung Yong; Goeken, Nolan; Lee, Hyo Jin; Bolan, Robert; Dubé, Michael P.
2014-01-01
ABSTRACT Human immunodeficiency virus (HIV) incidence is an important measure for monitoring the epidemic and evaluating the efficacy of intervention and prevention trials. This study developed a high-throughput, single-measure incidence assay by implementing a pyrosequencing platform. We devised a signal-masking bioinformatics pipeline, which yielded a process error rate of 5.8 × 10⁻⁴ per base. The pipeline was then applied to analyze 18,434 envelope gene segments (HXB2 7212 to 7601) obtained from 12 incident and 24 chronic patients who had documented HIV-negative and/or -positive tests. The pyrosequencing data were cross-checked by using the single-genome-amplification (SGA) method to independently obtain 302 sequences from 13 patients. Using two genomic biomarkers that probe for the presence of similar sequences, the pyrosequencing platform correctly classified all 12 incident subjects (100% sensitivity) and 23 of 24 chronic subjects (96% specificity). One misclassified subject's chronic infection was correctly classified by conducting the same analysis with SGA data. The biomarkers were statistically associated across the two platforms, suggesting the assay's reproducibility and robustness. Sampling simulations showed that the biomarkers were tolerant of sequencing errors and template resampling, two factors most likely to affect the accuracy of pyrosequencing results. We observed comparable biomarker scores between AIDS and non-AIDS chronic patients (multivariate analysis of variance [MANOVA], P = 0.12), indicating that the stage of HIV disease itself does not affect the classification scheme. The high-throughput genomic HIV incidence assay marks a significant step toward determining incidence from a single measure in cross-sectional surveys. IMPORTANCE Annual HIV incidence, the number of newly infected individuals within a year, is the key measure of monitoring the epidemic's rise and decline.
Developing reliable assays differentiating recent from chronic infections has been a long-standing quest in the HIV community. Over the past 15 years, these assays have traditionally measured various HIV-specific antibodies, but recent technological advancements have expanded the diversity of proposed accurate, user-friendly, and financially viable tools. Here we designed a high-throughput genomic HIV incidence assay based on the signature imprinted in the HIV gene sequence population. By combining next-generation sequencing techniques with bioinformatics analysis, we demonstrated that genomic fingerprints are capable of distinguishing recently infected patients from chronically infected patients with high precision. Our high-throughput platform is expected to allow us to process many patients' samples from a single experiment, permitting the assay to be cost-effective for routine surveillance. PMID:24371062
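The sensitivity and specificity quoted in this abstract follow directly from the reported classification counts, taking incident infection as the positive class. A minimal worked check (the function names are illustrative, not from the study):

```python
def sensitivity(true_pos, false_neg):
    """Share of truly incident subjects classified as incident."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Share of truly chronic subjects classified as chronic."""
    return true_neg / (true_neg + false_pos)


# Counts reported in the abstract: 12 of 12 incident and 23 of 24 chronic
# subjects were classified correctly by the pyrosequencing platform.
sens = sensitivity(true_pos=12, false_neg=0)
spec = specificity(true_neg=23, false_pos=1)
print(f"sensitivity = {sens:.0%}, specificity = {spec:.0%}")
# prints "sensitivity = 100%, specificity = 96%"
```

With only 36 subjects, confidence intervals around both figures would be wide; the calculation above reproduces the point estimates only.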
Toxicogenomics in Environmental Science.
Brinke, Alexandra; Buchinger, Sebastian
This chapter reviews the current knowledge and recent progress in the field of environmental, aquatic ecotoxicogenomics with a focus on transcriptomic methods. In ecotoxicogenomics the omics technologies are applied for the detection and assessment of adverse effects in the environment, and thus are to be distinguished from omics used in human toxicology [Snape et al., Aquat Toxicol 67:143-154, 2004]. Transcriptomic methods in ecotoxicology are applied to gain a mechanistic understanding of toxic effects on organisms or populations, and thus aim to bridge the gap between cause and effect. A worthwhile effect-based interpretation of stressor induced changes on the transcriptome is based on the principle of phenotypic-anchoring [Paules, Environ Health Perspect 111:A338-A339, 2003]. Thereby, changes on the transcriptomic level can only be identified as effects if they are clearly linked to a specific stressor-induced effect on the macroscopic level. By integrating those macroscopic and transcriptomic effects, conclusions on the effect-inducing type of the stressor can be drawn. Stressor-specific effects on the transcriptomic level can be identified as stressor-specific induced pathways, transcriptomic patterns, or stressors-specific genetic biomarkers. In this chapter, examples of the combined application of macroscopic and transcriptional effects for the identification of environmental stressors, such as aquatic pollutants, are given and discussed. By means of these examples, challenges on the way to a standardized application of transcriptomics in ecotoxicology are discussed. This is also done against the background of the application of transcriptomic methods in environmental regulation such as the EU regulation Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH).
Farag, Mohamed A.; Deavours, Bettina E.; de Fátima, Ângelo; Naoumkina, Marina; Dixon, Richard A.; Sumner, Lloyd W.
2009-01-01
Metabolic profiling of elicited barrel medic (Medicago truncatula) cell cultures using high-performance liquid chromatography coupled to photodiode and mass spectrometry detection revealed the accumulation of the aurone hispidol (6-hydroxy-2-[(4-hydroxyphenyl)methylidene]-1-benzofuran-3-one) as a major response to yeast elicitor. Parallel, large-scale transcriptome profiling indicated that three peroxidases, MtPRX1, MtPRX2, and MtPRX3, were coordinately induced with the accumulation of hispidol. MtPRX1 and MtPRX2 exhibited aurone synthase activity based upon in vitro substrate specificity and product profiles of recombinant proteins expressed in Escherichia coli. Hispidol possessed significant antifungal activity relative to other M. truncatula phenylpropanoids tested but has not been reported in this species before and was not found in differentiated roots in which high levels of the peroxidase transcripts accumulated. We propose that hispidol is formed in cell cultures by metabolic spillover when the pool of its precursor, isoliquiritigenin, builds up as a result of an imbalance between the upstream and downstream segments of the phenylpropanoid pathway, reflecting the plasticity of plant secondary metabolism. The results illustrate that integration of metabolomics and transcriptomics in genetically reprogrammed plant cell cultures is a powerful approach for the discovery of novel bioactive secondary metabolites and the mechanisms underlying their generation. PMID:19571306
Li, Dayong; Huang, Zhiyuan; Song, Shuhui; Xin, Yeyun; Mao, Donghai; Lv, Qiming; Zhou, Ming; Tian, Dongmei; Tang, Mingfeng; Wu, Qi; Liu, Xue; Chen, Tingting; Song, Xianwei; Fu, Xiqin; Zhao, Bingran; Liang, Chengzhi; Li, Aihong; Liu, Guozhen; Li, Shigui; Hu, Songnian; Cao, Xiaofeng; Yu, Jun; Yuan, Longping; Chen, Caiyan; Zhu, Lihuang
2016-01-01
Hybrid rice is the dominant form of rice planted in China, and its use has extended worldwide since the 1970s. It offers great yield advantages and has contributed greatly to the world’s food security. However, the molecular mechanisms underlying heterosis have remained a mystery. In this study we integrated genetics and omics analyses to determine the candidate genes for yield heterosis in a model two-line rice hybrid system, Liang-you-pei 9 (LYP9) and its parents. Phenomics study revealed that the better parent heterosis (BPH) of yield in hybrid is not ascribed to BPH of all the yield components but is specific to the BPH of spikelet number per panicle (SPP) and paternal parent heterosis (PPH) of effective panicle number (EPN). Genetic analyses then identified multiple quantitative trait loci (QTLs) for these two components. Moreover, a number of differentially expressed genes and alleles in the hybrid were mapped by transcriptome profiling to the QTL regions as possible candidate genes. In parallel, a major QTL for yield heterosis, rice heterosis 8 (RH8), was found to be the DTH8/Ghd8/LHD1 gene. Based on the shared allelic heterozygosity of RH8 in many hybrid rice cultivars, a common mechanism for yield heterosis in the present commercial hybrid rice is proposed. PMID:27663737
de Steenhuijsen Piters, Wouter A A; Heinonen, Santtu; Hasrat, Raiza; Bunsow, Eleonora; Smith, Bennett; Suarez-Arrabal, Maria-Carmen; Chaussabel, Damien; Cohen, Daniel M; Sanders, Elisabeth A M; Ramilo, Octavio; Bogaert, Debby; Mejias, Asuncion
2016-11-01
Respiratory syncytial virus (RSV) is the leading cause of acute lower respiratory tract infections and hospitalizations in infants worldwide. Known risk factors, however, incompletely explain the variability of RSV disease severity, especially among healthy children. We postulate that the severity of RSV infection is influenced by modulation of the host immune response by the local bacterial ecosystem. We aimed to assess whether specific nasopharyngeal microbiota (clusters) are associated with distinct host transcriptome profiles and disease severity in children less than 2 years of age with RSV infection. We characterized the nasopharyngeal microbiota profiles of young children with mild and severe RSV disease and healthy children by 16S-rRNA sequencing. In parallel, using multivariable models, we analyzed whole-blood transcriptome profiles to study the relationship between microbial community composition, the RSV-induced host transcriptional response, and clinical disease severity. We identified five nasopharyngeal microbiota clusters characterized by enrichment of either Haemophilus influenzae, Streptococcus, Corynebacterium, Moraxella, or Staphylococcus aureus. RSV infection and RSV hospitalization were positively associated with H. influenzae and Streptococcus and negatively associated with S. aureus abundance, independent of age. Children with RSV showed overexpression of IFN-related genes, independent of the microbiota cluster. In addition, transcriptome profiles of children with RSV infection and H. influenzae- and Streptococcus-dominated microbiota were characterized by greater overexpression of genes linked to Toll-like receptor and to neutrophil and macrophage activation and signaling. Our data suggest that interactions between RSV and nasopharyngeal microbiota might modulate the host immune response, potentially affecting clinical disease severity.
Functional characterization of two concrete biofilms using pyrosequencing data
Phylogenetic studies of concrete biofilms using 16SrRNA-based approaches have demonstrated that concrete surfaces harbor a diverse microbial community. These approaches can provide information on the general taxonomical groups present in a sample but cannot shed light on the func...
Janzen, Timothy W; Thomas, Matthew C; Goji, Noriko; Shields, Michael J; Hahn, Kristen R; Amoako, Kingsley K
2015-02-01
Bacillus anthracis, the causative agent of anthrax, has the capacity to form highly resilient spores as part of its life cycle. The potential for the dissemination of these spores using food as a vehicle is a huge public health concern and, hence, requires the development of a foodborne bioterrorism response approach. In this work, we address a critical gap in food biodefense by presenting a novel, combined, sequential method involving the use of real-time PCR and pyrosequencing for the rapid, specific detection of B. anthracis spores in three food matrices: milk, apple juice, and bottled water. The food samples were experimentally inoculated with 40 CFU ml⁻¹, and DNA was extracted from the spores and analyzed after immunomagnetic separation. Applying the combination of multiplex real-time PCR and pyrosequencing, we successfully detected the presence of targets on both of the virulence plasmids and the chromosome. The results showed that DNA amplicons generated from a five-target multiplexed real-time PCR detection using biotin-labeled primers can be used for single-plex pyrosequencing detection. The combined use of multiplexed real-time PCR and pyrosequencing is a novel, rapid detection method for B. anthracis from food and provides a tool for accurate, quantitative identification with potential biodefense applications.
Zhan, Xiangjiang; Pan, Shengkai; Wang, Junyi; Dixon, Andrew; He, Jing; Muller, Margit G; Ni, Peixiang; Hu, Li; Liu, Yuan; Hou, Haolong; Chen, Yuanping; Xia, Jinquan; Luo, Qiong; Xu, Pengwei; Chen, Ying; Liao, Shengguang; Cao, Changchang; Gao, Shukun; Wang, Zhaobao; Yue, Zhen; Li, Guoqing; Yin, Ye; Fox, Nick C; Wang, Jun; Bruford, Michael W
2013-05-01
As top predators, falcons possess unique morphological, physiological and behavioral adaptations that allow them to be successful hunters: for example, the peregrine is renowned as the world's fastest animal. To examine the evolutionary basis of predatory adaptations, we sequenced the genomes of both the peregrine (Falco peregrinus) and saker falcon (Falco cherrug), and we present parallel, genome-wide evidence for evolutionary innovation and selection for a predatory lifestyle. The genomes, assembled using Illumina deep sequencing with greater than 100-fold coverage, are both approximately 1.2 Gb in length, with transcriptome-assisted prediction of approximately 16,200 genes for both species. Analysis of 8,424 orthologs in both falcons, chicken, zebra finch and turkey identified consistent evidence for genome-wide rapid evolution in these raptors. SNP-based inference showed contrasting recent demographic trajectories for the two falcons, and gene-based analysis highlighted falcon-specific evolutionary novelties for beak development and olfaction and specifically for homeostasis-related genes in the arid environment-adapted saker.
Systems analysis of arrestin pathway functions.
Maudsley, Stuart; Siddiqui, Sana; Martin, Bronwen
2013-01-01
To fully appreciate the diversity and specificity of complex cellular signaling events, such as arrestin-mediated signaling from G protein-coupled receptor (GPCR) activation, a complex systems-level investigation currently appears to be the best option. A rational combination of transcriptomics, proteomics, and interactomics, all coherently integrated with applied next-generation bioinformatics, is vital for the future understanding of the development, translation, and expression of GPCR-mediated arrestin signaling events in physiological contexts. Through a more nuanced, systems-level appreciation of arrestin-mediated signaling, the creation of arrestin-specific molecular response "signatures" should be made simple and ultimately amenable to drug discovery processes. Arrestin-based signaling paradigms possess important aspects, such as their specific temporal kinetics and ability to strongly affect transcriptional activity, that make them an ideal test bed for the next generation of drug discovery bioinformatic approaches such as multi-parallel dose-response analysis, data texturization, and latent semantic indexing-based natural language data processing and feature extraction.
The immune gene repertoire of an important viral reservoir, the Australian black flying fox
2012-01-01
Background Bats are the natural reservoir host for a range of emerging and re-emerging viruses, including SARS-like coronaviruses, Ebola viruses, henipaviruses and Rabies viruses. However, the mechanisms responsible for the control of viral replication in bats are not understood and there is little information available on any aspect of antiviral immunity in bats. Massively parallel sequencing of the bat transcriptome provides the opportunity for rapid gene discovery. Although the genomes of one megabat and one microbat have now been sequenced to low coverage, no transcriptomic datasets have been reported from any bat species. In this study, we describe the immune transcriptome of the Australian flying fox, Pteropus alecto, providing an important resource for identification of genes involved in a range of activities including antiviral immunity. Results Towards understanding the adaptations that have allowed bats to coexist with viruses, we have de novo assembled transcriptome sequence from immune tissues and stimulated cells from P. alecto. We identified about 18,600 genes involved in a broad range of activities with the most highly expressed genes involved in cell growth and maintenance, enzyme activity, cellular components and metabolism and energy pathways. 3.5% of the bat transcribed genes corresponded to immune genes and a total of about 500 immune genes were identified, providing an overview of both innate and adaptive immunity. A small proportion of transcripts found no match with annotated sequences in any of the public databases and may represent bat-specific transcripts. Conclusions This study represents the first reported bat transcriptome dataset and provides a survey of expressed bat genes that complement existing bat genomic data. In addition, these data provide insight into genes relevant to the antiviral responses of bats, and form a basis for examining the roles of these molecules in immune response to viral infection. PMID:22716473
OKI, Kaihei; DUGERSUREN, Jamyan; DEMBEREL, Shirchin; WATANABE, Koichi
2014-01-01
Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare’s milk), 5 Khoormog (fermented camel’s milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 phyla. The genus Lactobacillus is a core bacterial component of Mongolian fermented milks, and Lactobacillus helveticus, Lactobacillus kefiranofaciens and Lactobacillus delbrueckii were the predominant species of lactic acid bacteria (LAB) in the Airag, Khoormog and Tarag samples, respectively. By using this pyrosequencing approach, we successfully detected most LAB species that had been isolated previously, as well as seven LAB species that were not found in our previous culture-based study. A subsequent principal component analysis of the samples revealed that L. delbrueckii, L. helveticus, L. kefiranofaciens and Streptococcus thermophilus were the main factors influencing the microbial diversity of these Mongolian traditional fermented dairy products and that this diversity correlated with the animal species from which the milk was sourced. PMID:25003019
Zu, Qianhui; Fang, Huan; Zhou, Hu; Zhang, Jianwei; Peng, Xinhua; Lin, Xiangui; Feng, Youzhi
2016-01-04
X-ray micro-computed tomography (micro-CT) technology, as used in the in situ and nondestructive analysis of soil physical structure, makes it possible to link soil physical and biological assays. Due to the high heterogeneity of the soil matrix, X-ray micro-CT scanning and soil microbial assays should be conducted on the same soil sample. This raises the question of whether X-ray micro-CT influences the microbial function and diversity of the soil sample to be analyzed. To address this question, we used plate counting, microcalorimetry and pyrosequencing approaches to evaluate the effect of X-ray radiation, at doses typically used in micro-CT, on soil microorganisms in two soils: a Fluvo-aquic soil typical of the North China Plain and an Ultisol typical of subtropical China. In both soils, radiation decreased the number of viable soil bacteria and disturbed their thermogenic profiles. At the DNA level, pyrosequencing revealed that the alpha diversities of the two soils' biota were influenced in opposite ways, while beta diversity was not affected, although the relative abundances of some guilds were changed. These findings indicate that assays of the metabolically active aspects of soil biota are not compatible with X-ray micro-CT, whereas beta molecular diversity based on pyrosequencing could be compatible.
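For readers less familiar with the two diversity measures, the sketch below shows one common way each is computed from taxon read counts. The abstract does not say which indices were used, so the Shannon index (alpha diversity) and Bray-Curtis dissimilarity (beta diversity) are assumptions here, and the counts are invented.

```python
import math


def shannon(counts):
    """Shannon index H' (natural log) for one sample's taxon counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def bray_curtis(sample_a, sample_b):
    """Bray-Curtis dissimilarity between two samples over the same taxa."""
    shared_total = sum(sample_a) + sum(sample_b)
    if shared_total == 0:
        return 0.0
    diff = sum(abs(x - y) for x, y in zip(sample_a, sample_b))
    return diff / shared_total


# Hypothetical read counts for four taxa, before and after irradiation.
control = [10, 10, 10, 10]
irradiated = [25, 5, 5, 5]

print(round(shannon(control), 3))     # evenness is maximal: ln(4)
print(round(shannon(irradiated), 3))  # lower, one taxon dominates
print(bray_curtis(control, irradiated))
```

Alpha diversity summarises a single sample, while beta diversity compares samples, which is why a treatment can shift one without a detectable change in the other.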
Gilling, Damian H.; Luna, Vicki Ann; Pflugradt, Cori
2014-01-01
The etiologic agents of glanders and melioidosis, Burkholderia mallei and Burkholderia pseudomallei, respectively, are genetically similar, making identification and differentiation from other Burkholderia species and from each other challenging. We used pyrosequencing to determine the presence or absence of an insertion sequence IS407A within the flagellin P (fliP) gene and to exploit the difference in orientation of this gene in the two species. Oligonucleotide primers were designed to selectively target the IS407A-fliP interface in B. mallei and the fliP gene specifically at the insertion point in B. pseudomallei. We then examined DNA from ten B. mallei, ten B. pseudomallei, 14 B. cepacia, eight other Burkholderia spp., and 17 other bacteria. Resultant pyrograms encompassed the target sequence that contained either the fliP gene with the IS407A interruption or the fully intact fliP gene with 100% sensitivity and 100% specificity. These pyrosequencing assays based upon a single gene enable investigators to reliably identify the two species. The information obtained by these assays provides more knowledge of the genomic reduction that created the new species B. mallei from B. pseudomallei and may point to new targets that can be exploited in the future. PMID:27350960
Rapid Detection and Subtyping of Human Influenza A Viruses and Reassortants by Pyrosequencing
Deng, Yi-Mo; Caldwell, Natalie; Barr, Ian G.
2011-01-01
Background Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. Methodology/Principal Findings A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. Conclusions/Significance In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches. PMID:21886790
Jo, Hyun Jin; Kim, Jaeyeon; Kim, Nayoung; Park, Ji Hyun; Nam, Ryoung Hee; Seok, Yeong-Jae; Kim, Yeon-Ran; Kim, Joo Sung; Kim, Jung Mogg; Kim, Jung Min; Lee, Dong Ho; Jung, Hyun Chae
2016-10-01
Little is known about the role of gastric microbiota other than Helicobacter pylori (HP) in human health and disease. We compared human gastric microbiota according to gastric cancer or control status and HP infection status and assessed the role of bacteria other than HP. Gastric microbiota of 63 antral mucosal and 18 corpus mucosal samples were analyzed by bar-coded 454 pyrosequencing of the 16S rRNA gene. Antral samples were divided into four subgroups based on HP positivity in pyrosequencing and the presence of cancer. The analysis was focused on bacteria other than HP, especially nitrosating or nitrate-reducing bacteria (NB). The changes of NB in antral mucosa of 16 subjects were followed up. The number of NB other than HP (non-HP-NB) was two times higher in the cancer groups than in the control groups, but the difference did not reach statistical significance. The number of non-HP-NB tended to increase over time; this increase was prevented by HP eradication in the HP-positive control group but not in the HP-positive cancer group. We could not find a significant role for bacteria other than HP in gastric carcinogenesis.
Adult Mouse Cortical Cell Taxonomy by Single Cell Transcriptomics
Tasic, Bosiljka; Menon, Vilas; Nguyen, Thuc Nghi; Kim, Tae Kyung; Jarsky, Tim; Yao, Zizhen; Levi, Boaz; Gray, Lucas T.; Sorensen, Staci A.; Dolbeare, Tim; Bertagnolli, Darren; Goldy, Jeff; Shapovalova, Nadiya; Parry, Sheana; Lee, Changkyu; Smith, Kimberly; Bernard, Amy; Madisen, Linda; Sunkin, Susan M.; Hawrylycz, Michael; Koch, Christof; Zeng, Hongkui
2016-01-01
Nervous systems are composed of various cell types, but the extent of cell type diversity is poorly understood. Here, we construct a cellular taxonomy of one cortical region, primary visual cortex, in adult mice based on single cell RNA-sequencing. We identify 49 transcriptomic cell types including 23 GABAergic, 19 glutamatergic and seven non-neuronal types. We also analyze cell-type specific mRNA processing and characterize genetic access to these transcriptomic types by many transgenic Cre lines. Finally, we show that some of our transcriptomic cell types display specific and differential electrophysiological and axon projection properties, thereby confirming that the single cell transcriptomic signatures can be associated with specific cellular properties. PMID:26727548
NASA Astrophysics Data System (ADS)
Liu, Yun; Song, Shuqun; Chen, Tiantian; Li, Caiwen
2017-04-01
Pyrosequencing of the 18S rRNA gene has been widely adopted to study eukaryotic diversity in various types of environments, and has an advantage over traditional morphology methods in exploring unknown microbial communities. To comprehensively assess the diversity and community composition of marine protists in the coastal waters of China, we applied both morphological observations and high-throughput sequencing of the V2 and V3 regions of 18S rDNA simultaneously to analyze samples collected from the surface layer of the Yellow and East China Seas. Dinoflagellates, diatoms and ciliates were the three dominant protistan groups as revealed by the two methods. Diatoms were the most dominant protistan group in the microscopic observations, with Skeletonema mainly distributed in the nearshore eutrophic waters and Chaetoceros in warmer, higher-pH waters. The mixotrophic dinoflagellates, Gymnodinium and Gyrodinium, were more competitive in the oligotrophic waters. The pyrosequencing method revealed an extensive diversity of dinoflagellates. Chaetoceros was the only dominant diatom group in the pyrosequencing dataset. Gyrodinium represented the most abundant reads and dominated the offshore oligotrophic protistan community, as it did in the microscopic observations. The dominance of parasitic dinoflagellates in the pyrosequencing dataset, which were overlooked in the morphological observations, indicates that more attention should be paid to the potential role of this group. Both methods provide coherent clustering of samples. Nutrient levels, salinity and pH were the main factors influencing the distribution of protists. This study demonstrates that different primer pairs used in pyrosequencing will indicate different protistan community structures. A suitable marker may reveal a more comprehensive composition of protists and provide valuable information on environmental drivers.
Debode, Frederic; Janssen, Eric; Bragard, Claude; Berben, Gilbert
2017-08-01
The presence of genetically modified organisms (GMOs) in food and feed is mainly detected by the use of targets focusing on promoters and terminators. As some genes are frequently used in genetically modified (GM) constructs, they also constitute excellent screening elements and their use is increasing. In this paper we propose a new target for the detection of cry1Ab and cry1Ac genes by real-time polymerase chain reaction (PCR) and pyrosequencing. The specificity, sensitivity and robustness of the real-time PCR method were tested following the recommendations of international guidelines and the method met the expected performance criteria. This paper also shows how the robustness testing was assessed. This new cry1Ab/Ac method can provide a positive signal with a larger number of GM events than do the other existing methods using double dye-probes. The method permits the analysis of results with less ambiguity than the SYBRGreen method recommended by the European Reference Laboratory (EURL) GM Food and Feed (GMFF). A pyrosequencing method was also developed to gain additional information from the sequence of the amplicon. This sequencing-by-synthesis method can determine the sequence between the primers used for PCR. Pyrosequencing showed that the sequences internal to the primers differ depending on the GM event considered, and three different sequences were observed. The sensitivity of the pyrosequencing was tested on reference flours with a low percentage GM content and different copy numbers. Improvements in the pyrosequencing protocol provided correct sequences with 50 copies of the target. Below this copy number, sequence quality was less consistent.
Melicher, Dacotah; Torson, Alex S; Dworkin, Ian; Bowsher, Julia H
2014-03-12
The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.
Lung Transcriptomics during Protective Ventilatory Support in Sepsis-Induced Acute Lung Injury
Acosta-Herrera, Marialbert; Lorenzo-Diaz, Fabian; Pino-Yanes, Maria; Corrales, Almudena; Valladares, Francisco; Klassert, Tilman E.; Valladares, Basilio; Slevogt, Hortense; Ma, Shwu-Fan
2015-01-01
Acute lung injury (ALI) is a severe inflammatory process of the lung. The only proven life-saving support is mechanical ventilation (MV) using low tidal volumes (LVT) plus moderate to high levels of positive end-expiratory pressure (PEEP). However, it is currently unknown how they exert the protective effects. To identify the molecular mechanisms modulated by protective MV, this study reports transcriptomic analyses based on microarray and microRNA sequencing in lung tissues from a clinically relevant animal model of sepsis-induced ALI. Sepsis was induced by cecal ligation and puncture (CLP) in male Sprague-Dawley rats. At 24 hours post-CLP, septic animals were randomized to three ventilatory strategies: spontaneous breathing, LVT (6 ml/kg) plus 10 cmH2O PEEP and high tidal volume (HVT, 20 ml/kg) plus 2 cmH2O PEEP. Healthy, non-septic, non-ventilated animals served as controls. After 4 hours of ventilation, lung samples were obtained for histological examination and gene expression analysis using microarray and microRNA sequencing. Validations were assessed using parallel analyses on existing publicly available genome-wide association study findings and transcriptomic human data. The catalogue of deregulated processes differed among experimental groups. The ‘response to microorganisms’ was the most prominent biological process in septic, non-ventilated and in HVT animals. Unexpectedly, the ‘neuron projection morphogenesis’ process was one of the most significantly deregulated in LVT. Further support for the key role of the latter process was obtained by microRNA studies, as four species targeting many of its genes (Mir-27a, Mir-103, Mir-17-5p and Mir-130a) were found deregulated. Additional analyses revealed 'VEGF signaling' as a central underlying response mechanism to all the septic groups (spontaneously breathing or mechanically ventilated). 
Based on these data, we conclude that a co-deregulation of 'VEGF signaling' along with 'neuron projection morphogenesis', which have never been anticipated in ALI pathogenesis, promotes the lung-protective effects of LVT with high levels of PEEP. PMID:26147972
Zhao, Chao; Chu, Yanan; Li, Yanhong; Yang, Chengfeng; Chen, Yuqing; Wang, Xumin; Liu, Bin
2017-01-01
This study analyzed the microbial diversity and gene content of a thermophilic cellulose-degrading consortium from hot springs in Xiamen, China, using 454 pyrosequencing to discover cellulolytic enzyme resources. The thermophilic cellulose-degrading consortium XM70, which was isolated from a hot spring, used sugarcane bagasse as its sole carbon and energy source. DNA sequencing of the XM70 sample resulted in 349,978 reads with an average read length of 380 bases, accounting for 133,896,867 bases of sequence information. The characterization of sequencing reads and assembled contigs revealed that most microbes were derived from four phyla: Geobacillus (Firmicutes), Thermus, Bacillus, and Anoxybacillus. Twenty-eight homologous genes belonging to 15 glycoside hydrolase families were detected, including several cellulase genes. A novel hot spring metagenome-derived thermophilic cellulase was expressed and characterized. The application value of thermostable sugarcane bagasse-degrading enzymes is shown for production of cellulosic biofuel. The practical power of using a short-read-based metagenomic approach for harvesting novel microbial genes is also demonstrated.
Zhao, Yuancun; Chen, Xiaogang; Yang, Yiwen; Zhao, Xiaohong; Zhang, Shu; Gao, Zehua; Fang, Ting; Wang, Yufang; Zhang, Ji
2018-05-07
Diatom examination has long been used for the diagnosis of drowning in forensic practice. However, traditional examination of the microscopic features of diatom frustules is time-consuming and requires taxonomic expertise. In this study, we demonstrate a potential DNA-based method of inferring the suspected drowning site using pyrosequencing (PSQ) of the V7 region of 18S ribosomal DNA (18S rDNA) as a diatom DNA barcode. By employing a sparse representation-based AdvISER-M-PYRO algorithm, the original PSQ signals of diatom DNA mixtures were deciphered to determine the corresponding taxa of the composite diatoms. Additionally, we evaluated the possibility of correlating water samples to collection sites by analyzing the PSQ signal profiles of diatom mixtures contained in the water samples via multidimensional scaling. The results suggest that diatomaceous PSQ profile analysis could be used as a cost-effective method to deduce the geographical origin of an environmental bio-sample.
Transcriptomics provides unique solutions for understanding the impact of complex mixtures and their components on aquatic systems. Here we describe the application of transcriptomics analysis of in situ fathead minnow exposures for assessing biological impacts of wastewater trea...
Molecular techniques are an alternative to culturing and counting methods in quantifying indoor fungal contamination. Pyrosequencing offers the possibility of identifying unexpected indoor fungi. In this study, 50 house dust samples were collected from homes in the Yakima Valley,...
Zhao, Chanjuan; Xie, Junqi; Li, Li; Cao, Chongjiang
2017-09-20
The transcriptomes of paddy rice in response to high temperature and humidity were studied using a high-throughput RNA sequencing approach. Effects of high temperature and humidity on the sucrose and starch contents and α/β-amylase activity were also investigated. Results showed that 6876 differentially expressed genes (DEGs) were identified in paddy rice under high temperature and humidity storage. Importantly, 12 DEGs that were downregulated fell into the "starch and sucrose pathway". The quantitative real-time polymerase chain reaction assays indicated that expression of these 12 DEGs was significantly decreased, which was in parallel with the reduced level of enzyme activities and the contents of sucrose and starch in paddy rice stored at high temperature and humidity conditions compared to the control group. Taken together, high temperature and humidity influence the quality of paddy rice at least partially by downregulating the expression of genes encoding sucrose transferases and hydrolases, which might result in the decrease of starch and sucrose contents.
Durack, Juliana; Ross, Tom; Bowman, John P.
2013-01-01
The ability of Listeria monocytogenes to adapt to various food and food-processing environments has been attributed to its robustness, persistence and prevalence in the food supply chain. To improve the present understanding of molecular mechanisms involved in hyperosmotic and low-temperature stress adaptation of L. monocytogenes, we undertook transcriptomics analysis on three strains adapted to sub-lethal levels of these stress stimuli and assessed functional gene response. Adaptation to hyperosmotic and cold-temperature stress has revealed many parallels in terms of gene expression profiles in strains possessing different levels of stress tolerance. Gene sets associated with ribosomes and translation, transcription, cell division as well as fatty acid biosynthesis and peptide transport showed activation in cells adapted to either cold or hyperosmotic stress. Repression of genes associated with carbohydrate metabolism and transport as well as flagella was evident in stressed cells, likely linked to activation of CodY regulon and consequential cellular energy conservation. PMID:24023890
Transcriptome of the Caribbean stony coral Porites astreoides from three developmental stages.
Mansour, Tamer A; Rosenthal, Joshua J C; Brown, C Titus; Roberson, Loretta M
2016-08-02
Porites astreoides is a ubiquitous species of coral on modern Caribbean reefs that is resistant to increasing temperatures, overfishing, and other anthropogenic impacts that have threatened most other coral species. We assembled and annotated a transcriptome from this coral using Illumina sequences from three different developmental stages collected over several years: free-swimming larvae, newly settled larvae, and adults (>10 cm in diameter). This resource will aid understanding of coral calcification, larval settlement, and host-symbiont interactions. A de novo transcriptome for the P. astreoides holobiont (coral plus algal symbiont) was assembled using 594 Mbp of raw Illumina sequencing data generated from five age-specific cDNA libraries. The new transcriptome consists of 867 255 transcript elements with an average length of 685 bases. The isolated P. astreoides assembly consists of 129 718 transcript elements with an average length of 811 bases, and the isolated Symbiodinium sp. assembly had 186 177 transcript elements with an average length of 1105 bases. This contribution to coral transcriptome data provides a valuable resource for researchers studying the ontogeny of gene expression patterns within both the coral and its dinoflagellate symbiont.
An Insight into the Transcriptome of the Digestive Tract of the Bloodsucking Bug, Rhodnius prolixus
Ribeiro, José M. C.; Genta, Fernando A.; Sorgine, Marcos H. F.; Logullo, Raquel; Mesquita, Rafael D.; Paiva-Silva, Gabriela O.; Majerowicz, David; Medeiros, Marcelo; Koerich, Leonardo; Terra, Walter R.; Ferreira, Clélia; Pimentel, André C.; Bisch, Paulo M.; Leite, Daniel C.; Diniz, Michelle M. P.; Junior, João Lídio da S. G. V.; Da Silva, Manuela L.; Araujo, Ricardo N.; Gandara, Ana Caroline P.; Brosson, Sébastien; Salmon, Didier; Bousbata, Sabrina; González-Caballero, Natalia; Silber, Ariel Mariano; Alves-Bezerra, Michele; Gondim, Katia C.; Silva-Neto, Mário Alberto C.; Atella, Georgia C.; Araujo, Helena; Dias, Felipe A.; Polycarpo, Carla; Vionette-Amaral, Raquel J.; Fampa, Patrícia; Melo, Ana Claudia A.; Tanaka, Aparecida S.; Balczun, Carsten; Oliveira, José Henrique M.; Gonçalves, Renata L. S.; Lazoski, Cristiano; Rivera-Pomar, Rolando; Diambra, Luis; Schaub, Günter A.; Garcia, Elói S.; Azambuja, Patrícia; Braz, Glória R. C.; Oliveira, Pedro L.
2014-01-01
The bloodsucking hemipteran Rhodnius prolixus is a vector of Chagas' disease, which affects 7–8 million people today in Latin America. In contrast to other hematophagous insects, the triatomine gut is compartmentalized into three segments that perform different functions during blood digestion. Here we report analysis of transcriptomes for each of the segments using pyrosequencing technology. Comparison of transcript frequency in digestive libraries with a whole-body library was used to evaluate expression levels. All classes of digestive enzymes were highly expressed, with a predominance of cysteine and aspartic proteinases, the latter showing a significant expansion through gene duplication. Although no protein digestion is known to occur in the anterior midgut (AM), protease transcripts were found, suggesting secretion as pro-enzymes, being possibly activated in the posterior midgut (PM). As expected, genes related to cytoskeleton, protein synthesis apparatus, protein traffic, and secretion were abundantly transcribed. Despite the absence of a chitinous peritrophic membrane in hemipterans - which have instead a lipidic perimicrovillar membrane lining over midgut epithelia - several gut-specific peritrophin transcripts were found, suggesting that these proteins perform functions other than being a structural component of the peritrophic membrane. Among immunity-related transcripts, while lysozymes and lectins were the most highly expressed, several genes belonging to the Toll pathway - found at low levels in the gut of most insects - were identified, contrasting with a low abundance of transcripts from IMD and STAT pathways. Analysis of transcripts related to lipid metabolism indicates that lipids play multiple roles, being a major energy source, a substrate for perimicrovillar membrane formation, and a source for hydrocarbons possibly to produce the wax layer of the hindgut. 
Transcripts related to amino acid metabolism showed an unanticipated priority for degradation of tyrosine, phenylalanine, and tryptophan. Analysis of transcripts related to signaling pathways suggested a role for MAP kinases, GTPases, and LKBP1/AMP kinases related to control of cell shape and polarity, possibly in connection with regulation of cell survival and the response to pathogens and nutrients. Together, our findings present a new view of the triatomine digestive apparatus and will help us understand trypanosome interaction and allow insights into hemipteran metabolic adaptations to a blood-based diet. PMID:24416461
Luo, Wei; Hu, Qiang; Wang, Dan; Deeb, Kristin K.; Ma, Yingyu; Morrison, Carl D.; Liu, Song; Johnson, Candace S.; Trump, Donald L.
2013-01-01
Endothelial cells (ECs) are an important component involved in the angiogenesis. Little is known about the global gene expression and epigenetic regulation in tumor endothelial cells. The identification of gene expression and epigenetic difference between human prostate tumor-derived endothelial cells (TdECs) and those in normal tissues may uncover unique biological features of TdEC and facilitate the discovery of new anti-angiogenic targets. We established a method for isolation of CD31+ endothelial cells from malignant and normal prostate tissues obtained at prostatectomy. TdECs and normal-derived ECs (NdECs) showed >90% enrichment in primary culture and demonstrated microvascular endothelial cell characteristics such as cobblestone morphology in monolayer culture, diI-acetyl-LDL uptake and capillary-tube like formation in Matrigel®. In vitro primary cultures of ECs maintained expression of endothelial markers such as CD31, von Willebrand factor, intercellular adhesion molecule, vascular endothelial growth factor receptor 1, and vascular endothelial growth factor receptor 2. We then conducted a pilot study of transcriptome and methylome analysis of TdECs and matched NdECs from patients with prostate cancer. We observed a wide spectrum of differences in gene expression and methylation patterns in endothelial cells, between malignant and normal prostate tissues. Array-based expression and methylation data were validated by qRT-PCR and bisulfite DNA pyrosequencing. Further analysis of transcriptome and methylome data revealed a number of differentially expressed genes with loci whose methylation change is accompanied by an inverse change in gene expression. Our study demonstrates the feasibility of isolation of ECs from histologically normal prostate and prostate cancer via CD31+ selection. The data, although preliminary, indicates that there exist widespread differences in methylation and transcription between TdECs and NdECs. 
Interestingly, only a small proportion of perturbed genes overlapped between African American (AA) and Caucasian American (CA) patients with prostate cancer. Our study indicates that identifying gene expression and/or epigenetic differences between TdECs and NdECs may provide us with new anti-angiogenic targets. Future studies will be required to further characterize the isolated ECs and determine the biological features that can be exploited in the prognosis and therapy of prostate cancer. PMID:23978847
Solanum torvum responses to the root-knot nematode Meloidogyne incognita
2013-01-01
Background Solanum torvum Sw is employed worldwide as a rootstock for eggplant cultivation because of its vigour and its resistance/tolerance to the most serious soil-borne diseases, such as bacterial and fungal wilts and root-knot nematodes. The scarcity of information on Solanum torvum (hereafter Torvum) resistance mechanisms is mostly attributable to the lack of genomic tools (e.g. a dedicated microarray) as well as to the paucity of database information, which limits high-throughput expression studies in Torvum. Results As a first step towards transcriptome profiling of Torvum inoculated with the nematode M. incognita, we built a Torvum 3’ transcript catalogue. One-quarter of a 454 full run resulted in 205,591 quality-filtered reads. De novo assembly yielded 24,922 contigs and 11,875 singletons. Similarity searches of the S. torvum transcript tags catalogue produced 12,344 annotations. A 30,000-feature custom Combimatrix chip was then designed, and microarray hybridizations were conducted for both control root samples and root samples infected with Meloidogyne incognita at 14 dpi (days post inoculation), resulting in 390 differentially expressed genes (DEG). We also tested the chip with samples from the phylogenetically related, nematode-susceptible eggplant species Solanum melongena. An in silico validation strategy was developed based on assessment of sequence similarity among Torvum probes and eggplant expressed sequences available in public repositories. GO term enrichment analyses with the 390 Torvum DEG revealed enhancement of several processes, such as chitin catabolism and sesquiterpenoid biosynthesis, while no GO term enrichment was found with eggplant DEG. The genes identified from the S. torvum catalogue that bear high similarity to known nematode resistance genes were further investigated in view of their potential role in the nematode resistance mechanism.
Conclusions By combining 454 pyrosequencing and microarray technology we were able to conduct a cost-effective global transcriptome profiling in a non-model species. In addition, the development of an in silico validation strategy allowed to further extend the use of the custom chip to a related species and to assess by comparison the expression of selected genes without major concerns of artifacts. The expression profiling of S. torvum responses to nematode infection points to sesquiterpenoids and chitinases as major effectors of nematode resistance. The availability of the long sequence tags in S. torvum catalogue will allow precise identification of active nematocide/nematostatic compounds and associated enzymes posing the basis for exploitation of these resistance mechanisms in other species. PMID:23937585
Optimized Probe Masking for Comparative Transcriptomics of Closely Related Species
Poeschl, Yvonne; Delker, Carolin; Trenner, Jana; Ullrich, Kristian Karsten; Quint, Marcel; Grosse, Ivo
2013-01-01
Microarrays are commonly applied to study the transcriptome of specific species. However, many available microarrays are restricted to model organisms, and the design of custom microarrays for other species is often not feasible. Hence, transcriptomics approaches for non-model organisms, as well as comparative transcriptomics studies among two or more species, often rely on cost-intensive RNAseq studies or, alternatively, on hybridizing transcripts of a query species to a microarray of a closely related species. When analyzing these cross-species microarray expression data, differences in the transcriptome of the query species can cause problems, such as the following: (i) lower hybridization accuracy of probes due to mismatches or deletions, (ii) probes binding multiple transcripts of different genes, and (iii) probes binding transcripts of non-orthologous genes. So far, methods for (i) exist, but these neglect (ii) and (iii). Here, we propose an approach for comparative transcriptomics addressing problems (i) to (iii), which retains only transcript-specific probes binding transcripts of orthologous genes. We apply this approach to an Arabidopsis lyrata expression data set measured on a microarray designed for Arabidopsis thaliana, and compare it to two alternative approaches, a sequence-based approach and a genomic DNA hybridization-based approach. We investigate the number of retained probe sets, and we validate the resulting expression responses by qRT-PCR. We find that the proposed approach combines the benefit of sequence-based stringency and accuracy while allowing the expression analysis of many more genes than the alternative sequence-based approach. As an added benefit, the proposed approach requires probes to detect transcripts of orthologous genes only, which provides a superior base for biological interpretation of the measured expression responses. PMID:24260119
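The three probe problems listed in the abstract above can be illustrated with a small filter. This is a minimal sketch and not the authors' implementation: the probe record layout, the `orthologs` mapping, the gene identifiers, and the `max_mismatches` parameter are all hypothetical names chosen for illustration.

```python
def keep_probe(probe, orthologs, max_mismatches=0):
    """Return True if a probe passes all three checks.

    probe: dict with 'target_gene' (reference-species gene the probe was
           designed for) and 'hits', a list of (query_gene, mismatches).
    orthologs: dict mapping query-species gene -> reference-species gene.
    """
    # (i) keep only hits that hybridize accurately enough
    good = {gene for gene, mm in probe["hits"] if mm <= max_mismatches}
    # (ii) the probe must bind transcripts of exactly one gene
    if len(good) != 1:
        return False
    (gene,) = good
    # (iii) that gene must be the ortholog of the probe's design target
    return orthologs.get(gene) == probe["target_gene"]


# Hypothetical example data.
orthologs = {"AL1": "AT1", "AL2": "AT1", "AL3": "AT9", "AL4": "AT4"}
probes = {
    "p1": {"target_gene": "AT1", "hits": [("AL1", 0)]},              # kept
    "p2": {"target_gene": "AT1", "hits": [("AL1", 0), ("AL2", 0)]},  # fails (ii)
    "p3": {"target_gene": "AT2", "hits": [("AL3", 0)]},              # fails (iii)
    "p4": {"target_gene": "AT4", "hits": [("AL4", 3)]},              # fails (i)
}

retained = [name for name, p in probes.items() if keep_probe(p, orthologs)]
print(retained)
```

Raising `max_mismatches` in this sketch would retain more probes at the cost of stringency, which is the trade-off the abstract describes between the number of retained probe sets and accuracy.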
Quantitative phenotyping via deep barcode sequencing.
Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey
2009-10-01
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.
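The core counting step behind barcode sequencing can be sketched in a few lines. This is an illustrative simplification, not the published Bar-seq pipeline: it assumes each read starts with a fixed-length barcode and uses exact matching only, and the barcodes, strain names, and reads are hypothetical.

```python
from collections import Counter


def count_barcodes(reads, barcode_to_strain, barcode_len):
    """Count reads per strain by exact match of the leading barcode."""
    counts = Counter()
    for read in reads:
        strain = barcode_to_strain.get(read[:barcode_len])
        if strain is not None:  # reads with unknown barcodes are ignored
            counts[strain] += 1
    return counts


# Hypothetical barcodes and reads, for illustration only.
barcodes = {"ACGTAC": "strain_A", "TTGCAA": "strain_B"}
reads = ["ACGTACGGGT", "ACGTACTTTA", "TTGCAAGGGT", "NNNNNNGGGT"]

counts = count_barcodes(reads, barcodes, barcode_len=6)
for strain, n in sorted(counts.items()):
    print(strain, n)
```

Because the lookup is exact, the quality of the barcode table directly limits how many reads are assigned, which is consistent with the abstract's point that a revised barcode list improved data quality.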
Parallel RNA extraction using magnetic beads and a droplet array.
Shi, Xu; Chen, Chun-Hong; Gao, Weimin; Chao, Shih-Hui; Meldrum, Deirdre R
2015-02-21
Nucleic acid extraction is a necessary step for most genomic/transcriptomic analyses, but it often requires complicated mechanisms to be integrated into a lab-on-a-chip device. Here, we present a simple, effective configuration for rapidly obtaining purified RNA from low concentration cell medium. This Total RNA Extraction Droplet Array (TREDA) utilizes an array of surface-adhering droplets to facilitate the transportation of magnetic purification beads seamlessly through individual buffer solutions without solid structures. The fabrication of TREDA chips is rapid and does not require a microfabrication facility or expertise. The process takes less than 5 minutes. When purifying mRNA from bulk marine diatom samples, its repeatability and extraction efficiency are comparable to conventional tube-based operations. We demonstrate that TREDA can extract the total mRNA of about 10 marine diatom cells, indicating that the sensitivity of TREDA approaches single-digit cell numbers.
Parallel RNA extraction using magnetic beads and a droplet array
Shi, Xu; Chen, Chun-Hong; Gao, Weimin; Meldrum, Deirdre R.
2015-01-01
Nucleic acid extraction is a necessary step for most genomic/transcriptomic analyses, but it often requires complicated mechanisms to be integrated into a lab-on-a-chip device. Here, we present a simple, effective configuration for rapidly obtaining purified RNA from low concentration cell medium. This Total RNA Extraction Droplet Array (TREDA) utilizes an array of surface-adhering droplets to facilitate the transportation of magnetic purification beads seamlessly through individual buffer solutions without solid structures. The fabrication of TREDA chips is rapid and does not require a microfabrication facility or expertise. The process takes less than 5 minutes. When purifying mRNA from bulk marine diatom samples, its repeatability and extraction efficiency are comparable to conventional tube-based operations. We demonstrate that TREDA can extract the total mRNA of about 10 marine diatom cells, indicating that the sensitivity of TREDA approaches single-digit cell numbers. PMID:25519439
LRRTM1 underlies synaptic convergence in visual thalamus
Monavarfeshani, Aboozar; Stanton, Gail; Van Name, Jonathan; Su, Kaiwen; Mills, William A; Swilling, Kenya; Kerr, Alicia; Huebschman, Natalie A; Su, Jianmin
2018-01-01
It has long been thought that the mammalian visual system is organized into parallel pathways, with incoming visual signals being parsed in the retina based on feature (e.g. color, contrast and motion) and then transmitted to the brain in unmixed, feature-specific channels. To faithfully convey feature-specific information from retina to cortex, thalamic relay cells must receive inputs from only a small number of functionally similar retinal ganglion cells. However, recent studies challenged this by revealing substantial levels of retinal convergence onto relay cells. Here, we sought to identify mechanisms responsible for the assembly of such convergence. Using an unbiased transcriptomics approach and targeted mutant mice, we discovered a critical role for the synaptic adhesion molecule Leucine Rich Repeat Transmembrane Neuronal 1 (LRRTM1) in the emergence of retinothalamic convergence. Importantly, LRRTM1 mutant mice display impairment in visual behaviors, suggesting a functional role of retinothalamic convergence in vision. PMID:29424692
The Complete Chloroplast Genome Sequence of Date Palm (Phoenix dactylifera L.)
Yang, Meng; Zhang, Xiaowei; Liu, Guiming; Yin, Yuxin; Chen, Kaifu; Yun, Quanzheng; Zhao, Duojun; Al-Mssallem, Ibrahim S.; Yu, Jun
2010-01-01
Background Date palm (Phoenix dactylifera L.), a member of the Arecaceae family, is one of the three major economically important woody palms (the two others being oil palm and coconut tree), and its fruit is a staple food among Middle East and North African nations, as well as many other tropical and subtropical regions. Here we report a complete sequence of the date palm chloroplast (cp) genome based on pyrosequencing. Methodology/Principal Findings After extracting 369,022 cp sequencing reads from our whole-genome-shotgun data, we put together an assembly and validated it with intensive PCR-based verification, coupled with PCR product sequencing. The date palm cp genome is 158,462 bp in length and has a typical quadripartite structure of the large (LSC, 86,198 bp) and small single-copy (SSC, 17,712 bp) regions separated by a pair of inverted repeats (IRs, 27,276 bp). Similar to what has been found among most angiosperms, the date palm cp genome harbors 112 unique genes and 19 duplicated fragments in the IR regions. The junctions between LSC/IRs and SSC/IRs show different features of sequence expansion in evolution. We identified 78 SNPs as major intravarietal polymorphisms within the population of a specific cp genome, most of which were located in genes with vital functions. Based on RNA-sequencing data, we also found 18 polycistronic transcription units and three highly expression-biased genes: atpF, trnA-UGC, and rrn23. Conclusions Unlike most monocots, date palm has a typical cp genome similar to that of tobacco, with little rearrangement and gene loss or gain. High-throughput sequencing technology facilitates the identification of intravarietal variations in cp genomes among different cultivars. Moreover, transcriptomic analysis of cp genes provides clues for uncovering regulatory mechanisms of transcription and translation in chloroplasts. PMID:20856810
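The region lengths reported in the abstract above can be checked against the stated genome size. The short sketch below assumes that the 27,276 bp figure is the length of each inverted repeat copy, an assumption that the arithmetic supports.

```python
# Region lengths in bp, taken from the abstract.
LSC = 86_198             # large single-copy region
SSC = 17_712             # small single-copy region
IR = 27_276              # one inverted repeat copy (assumed per-copy length)
REPORTED_TOTAL = 158_462


def quadripartite_length(lsc, ssc, ir):
    """Length of a quadripartite genome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total, total == REPORTED_TOTAL)
```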
Petitot, Anne-Sophie; Kyndt, Tina; Haidar, Rana; Dereeper, Alexis; Collin, Myriam; de Almeida Engler, Janice; Gheysen, Godelieve
2017-01-01
Abstract Background and Aims The root-knot nematode Meloidogyne graminicola is responsible for production losses in rice (Oryza sativa) in Asia and Latin America. The accession TOG5681 of African rice, O. glaberrima, presents improved resistance to several biotic and abiotic factors, including nematodes. The aim of this study was to assess the cytological and molecular mechanisms underlying nematode resistance in this accession. Methods Penetration and development of M. graminicola in TOG5681 and the susceptible O. sativa genotype ‘Nipponbare’ were compared by microscopic observation of infected roots and histological analysis of galls. In parallel, host molecular responses to M. graminicola were assessed by root transcriptome profiling at 2, 4 and 8 d post-infection (dpi). Specific treatments with hormone inhibitors were conducted in TOG5681 to assess the impact of the jasmonic acid and salicylic acid pathways on nematode penetration and reproduction. Key Results Penetration and development of M. graminicola juveniles were reduced in the resistant TOG5681 in comparison with the susceptible accession, with degeneration of giant cells observed in the resistant genotype from 15 dpi onwards. Transcriptome changes were observed as early as 2 dpi, with genes predicted to be involved in defence responses, phenylpropanoid and hormone pathways strongly induced in TOG5681, in contrast to ‘Nipponbare’. No specific hormonal pathway could be identified as the major determinant of resistance in the rice-nematode incompatible interaction. Candidate genes proposed as involved in resistance to M. graminicola in TOG5681 were identified based on their expression pattern and quantitative trait locus (QTL) position, including chalcone synthase, isoflavone reductase, phenylalanine ammonia lyase, WRKY62 transcription factor, thionin, stripe rust resistance protein, thaumatins and ATPase3. Conclusions This study provides a novel set of candidate genes for O. glaberrima resistance to nematodes and highlights the rice-M. graminicola pathosystem as a model to study plant-nematode incompatible interactions. PMID:28334204
Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru
2018-01-01
Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score (ΔRAS¯) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing ΔRAS¯ >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions was analyzed. We extracted six candidate ASM SNPs around IBD susceptibility genes. The highest ΔRAS¯ (0.23) was observed for rs1130368, located on HLA-DQB1. ASM around rs36221701 (ΔRAS¯ = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016).
We confirmed the existence of cis-regulated ASM around IBD susceptibility genes and the association between ASM SNP (rs36221701) genotype and SMAD3 expression, a susceptibility gene for IBD. These results give us supporting evidence that DNA methylation mediates genetic effects on disease susceptibility.
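The thresholding rule described in this abstract (an average relative allele score change greater than 0.1 after digestion) can be sketched as a simple filter. Only the two scores quoted in the abstract are real values; the third SNP is a hypothetical placeholder added to show a rejected case, and the function name is illustrative rather than part of the study's software.

```python
ASM_THRESHOLD = 0.1  # cutoff stated in the abstract


def candidate_asm_snps(delta_ras, threshold=ASM_THRESHOLD):
    """Return SNP IDs whose score change exceeds the threshold, highest first."""
    passing = [snp for snp, value in delta_ras.items() if value > threshold]
    return sorted(passing, key=lambda snp: delta_ras[snp], reverse=True)


delta_ras = {
    "rs1130368": 0.23,         # reported in the abstract (HLA-DQB1)
    "rs36221701": 0.14,        # reported in the abstract (near SMAD3)
    "rs_hypothetical": 0.05,   # hypothetical value below the cutoff
}

print(candidate_asm_snps(delta_ras))
```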
Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru
2018-01-01
Background Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expression in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expression. Methods CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score (ΔRAS¯) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing ΔRAS¯ >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite pyrosequencing was performed. Transcriptome analysis was performed in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expression was analyzed. Results We extracted six candidate ASM SNPs around IBD susceptibility genes. The highest ΔRAS¯ (0.23) was observed for rs1130368, located on HLA-DQB1. ASM around rs36221701 (ΔRAS¯ = 0.14), located near SMAD3, was validated using bisulfite pyrosequencing. SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016).
Conclusions We confirmed the existence of cis-regulated ASM around IBD susceptibility genes and the association between ASM SNP (rs36221701) genotype and SMAD3 expression, a susceptibility gene for IBD. These results give us supporting evidence that DNA methylation mediates genetic effects on disease susceptibility. PMID:29547621
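The ASM criterion in the abstract above (average change in relative allele score greater than 0.1 after digestion) can be illustrated with a minimal Python sketch. The abstract does not give the formula for the relative allele score, so the definition used here, RAS = A / (A + B) from the two allele signal intensities, the use of the absolute change, and all function names are assumptions made for illustration only.

```python
from statistics import mean

def relative_allele_score(a_signal, b_signal):
    # Assumed definition: fraction of total probe signal from allele A.
    return a_signal / (a_signal + b_signal)

def mean_delta_ras(undigested, digested):
    # Each argument is a list of (A, B) signal pairs, one pair per sample,
    # measured before and after methylation-sensitive digestion.
    deltas = [
        abs(relative_allele_score(*after) - relative_allele_score(*before))
        for before, after in zip(undigested, digested)
    ]
    return mean(deltas)

def is_candidate_asm(undigested, digested, threshold=0.1):
    # The 0.1 threshold is the one stated in the abstract.
    return mean_delta_ras(undigested, digested) > threshold

# Hypothetical signal intensities for two heterozygous samples.
before = [(100, 100), (120, 80)]
after = [(150, 50), (160, 40)]
print(round(mean_delta_ras(before, after), 3))
print(is_candidate_asm(before, after))
```

With these made-up values the allele balance shifts by roughly 0.2 to 0.25 per sample, so the SNP would pass the 0.1 threshold; real array data would need the normalization used by the original study.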
Scharf, Michael; Sethi, Amit
2016-09-13
Termites have specialized digestive systems that overcome the lignin barrier in wood to release fermentable simple sugars. Using the termite Reticulitermes flavipes and its gut symbionts, high-throughput titanium pyrosequencing and proteomics approaches experimentally compared the effects of lignin-containing diets on host-symbiont digestome composition. Proteomic investigations and functional digestive studies with recombinant lignocellulases conducted in parallel provided strong evidence of congruence at the transcriptional and translational levels and provided enzymatic strategies for overcoming recalcitrant lignin barriers in biofuel feedstocks. Briefly described, therefore, the disclosure provides a system for generating a fermentable product from a lignified plant material, the system comprising a cooperating series of at least two catalytically active polypeptides, where said catalytically active polypeptides are selected from the group consisting of: cellulase Cell-1, β-glu cellulase, an aldo-keto-reductase, a catalase, a laccase, and an endo-xylanase.
Kukekova, Anna V; Johnson, Jennifer L; Teiling, Clotilde; Li, Lewyn; Oskina, Irina N; Kharlamova, Anastasiya V; Gulevich, Rimma G; Padte, Ravee; Dubreuil, Michael M; Vladimirova, Anastasiya V; Shepeleva, Darya V; Shikhevich, Svetlana G; Sun, Qi; Ponnala, Lalit; Temnykh, Svetlana V; Trut, Lyudmila N; Acland, Gregory M
2011-10-03
Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome. Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information.
de Marcos, Alberto; Triviño, Magdalena; Pérez-Bueno, María Luisa; Ballesteros, Isabel; Barón, Matilde; Mena, Montaña; Fenoll, Carmen
2015-01-01
Loss of function of the positive stomata development regulators SPCH or MUTE in Arabidopsis thaliana renders stomataless plants; spch-3 and mute-3 mutants are extreme dwarfs, but produce cotyledons and tiny leaves, providing a system to interrogate plant life in the absence of stomata. To this end, we compared their cotyledon transcriptomes with that of wild-type plants. K-means clustering of differentially expressed genes generated four clusters: clusters 1 and 2 grouped genes commonly regulated in the mutants, while clusters 3 and 4 contained genes distinctively regulated in mute-3. Classification in functional categories and metabolic pathways of genes in clusters 1 and 2 suggested that both mutants had depressed secondary, nitrogen and sulfur metabolisms, while only a few photosynthesis-related genes were down-regulated. In situ quenching analysis of chlorophyll fluorescence revealed limited inhibition of photosynthesis. This and other fluorescence measurements matched the mutant transcriptomic features. Differential transcriptomes of both mutants were enriched in growth-related genes, including known stomata development regulators, which paralleled their epidermal phenotypes. Analysis of cluster 3 was not informative for developmental aspects of mute-3. Cluster 4 comprised genes differentially up-regulated in mute-3, 35% of which were direct targets for SPCH and may relate to the unique cell types of mute-3. A screen of T-DNA insertion lines in genes differentially expressed in the mutants identified a gene putatively involved in stomata development. A collection of lines for conditional overexpression of transcription factors differentially expressed in the mutants rendered distinct epidermal phenotypes, suggesting that these proteins may be novel stomatal development regulators. Thus, our transcriptome analysis represents a useful source of new genes for the study of stomata development and for characterizing physiology and growth in the absence of stomata.
PMID:26157447
2011-01-01
Background Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. Results cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome. Conclusions Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information. PMID:21967120
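The differential-expression criterion in the fox abstract above (at least a 2-fold difference with p < 0.05) and its "fewer than 3%" figure can be checked with a short Python sketch. The function name, the handling of zero expression, and the example values are assumptions for illustration; the abstract does not say how expression was normalized or how p-values were computed.

```python
def is_differential(expr_tame, expr_aggressive, p_value,
                    min_fold=2.0, alpha=0.05):
    # Fold change is taken in either direction (up or down).
    low, high = sorted((expr_tame, expr_aggressive))
    if low <= 0:
        # Assumption: genes with no detectable expression in one
        # sample are left out rather than given an infinite fold change.
        return False
    return high / low >= min_fold and p_value < alpha

# Arithmetic behind "fewer than 3%": 335 genes out of roughly 14,000.
share = 335 / 14000
print(f"{share:.1%}")

# Hypothetical expression values for three genes.
print(is_differential(10.0, 25.0, 0.01))
print(is_differential(10.0, 15.0, 0.01))
print(is_differential(10.0, 25.0, 0.20))
```

Because the gene total is given only as "approximately 14,000", the computed share is itself approximate, but it stays well under the 3% bound quoted in the abstract.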
Mittapalli, Omprakash; Bai, Xiaodong; Bonello, Pierluigi; Herms, Daniel A.
2010-01-01
Background The insect midgut and fat body represent major tissue interfaces that deal with several important physiological functions including digestion, detoxification and immune response. The emerald ash borer (Agrilus planipennis) is an exotic invasive insect pest that has killed millions of ash trees (Fraxinus spp.) primarily in the Midwestern United States and Ontario, Canada. However, despite its high impact status little knowledge exists for A. planipennis at the molecular level. Methodology and Principal Findings Newer-generation Roche-454 pyrosequencing was used to obtain 126,185 reads for the midgut and 240,848 reads for the fat body, which were assembled into 25,173 and 37,661 high quality expressed sequence tags (ESTs) for the midgut and the fat body of A. planipennis larvae, respectively. Among these ESTs, 36% of the midgut and 38% of the fat body sequences showed similarity to proteins in the GenBank nr database. A high number of the midgut sequences contained chitin-binding peritrophin (248) and trypsin (98) domains, while the fat body sequences showed high occurrence of cytochrome P450s (85) and protein kinase (123) domains. Further, the midgut transcriptome of A. planipennis revealed putative microbial transcripts encoding cell-wall degrading enzymes such as polygalacturonases and endoglucanases. A significant number of SNPs (137 in midgut and 347 in fat body) and microsatellite loci (317 in midgut and 571 in fat body) were predicted in the A. planipennis transcripts. An initial assessment of cytochrome P450s belonging to various CYP clades revealed distinct expression patterns at the tissue level. Conclusions and Significance To our knowledge this study is one of the first to illuminate tissue-specific gene expression in an invasive insect of high ecological and economic consequence. These findings will lay the foundation for future gene expression and functional studies in A. planipennis. PMID:21060843
Transcriptomic Signatures of Ash (Fraxinus spp.) Phloem
Mamidala, Praveen; Bonello, Pierluigi; Herms, Daniel A.; Mittapalli, Omprakash
2011-01-01
Background Ash (Fraxinus spp.) is a dominant tree species throughout urban and forested landscapes of North America (NA). The rapid invasion of NA by emerald ash borer (Agrilus planipennis), a wood-boring beetle endemic to Eastern Asia, has resulted in the death of millions of ash trees and threatens billions more. Larvae feed primarily on phloem tissue, which girdles and kills the tree. While NA ash species including black (F. nigra), green (F. pennsylvanica) and white (F. americana) are highly susceptible, the Asian species Manchurian ash (F. mandshurica) is resistant to A. planipennis perhaps due to their co-evolutionary history. Little is known about the molecular genetics of ash. Hence, we undertook a functional genomics approach to identify the repertoire of genes expressed in ash phloem. Methodology and Principal Findings Using 454 pyrosequencing we obtained 58,673 high quality ash sequences from pooled phloem samples of green, white, black, blue and Manchurian ash. Intriguingly, 45% of the deduced proteins were not significantly similar to any sequences in the GenBank non-redundant database. KEGG analysis of the ash sequences revealed a high occurrence of defense related genes. Expression analysis of early regulators potentially involved in plant defense (i.e. transcription factors, calcium dependent protein kinases and a lipoxygenase 3) revealed higher mRNA levels in resistant ash compared to susceptible ash species. Lastly, we predicted a total of 1,272 single nucleotide polymorphisms and 980 microsatellite loci, among which seven microsatellite loci showed polymorphism between different ash species. Conclusions and Significance The current transcriptomic data provide an invaluable resource for understanding the genetic make-up of ash phloem, the target tissue of A. planipennis. These data along with future functional studies could lead to the identification/characterization of defense genes involved in resistance of ash to A. planipennis, and in future ash breeding programs for marker development. PMID:21283712
The molecular analysis of drinking water microbial communities has focused primarily on 16S rRNA gene sequence analysis. Since this approach provides limited information on function potential of microbial communities, analysis of whole-metagenome pyrosequencing data was used to...
Histological and transcriptomic effects of 17α-methyltestosterone on zebrafish gonad development.
Lee, Stephanie Ling Jie; Horsfield, Julia A; Black, Michael A; Rutherford, Kim; Fisher, Amanda; Gemmell, Neil J
2017-07-24
Sex hormones play important roles in teleost ovarian and testicular development. In zebrafish, ovarian differentiation appears to be dictated by an oocyte-derived signal via Cyp19a1a aromatase-mediated estrogen production. Androgens and aromatase inhibitors can induce female-to-male sex reversal, however, the mechanisms underlying gonadal masculinisation are poorly understood. We used histological analyses together with RNA sequencing to characterise zebrafish gonadal transcriptomes and investigate the effects of 17α-methyltestosterone on gonadal differentiation. At a morphological level, 17α-methyltestosterone (MT) masculinised gonads and accelerated spermatogenesis, and these changes were paralleled in masculinisation and de-feminisation of gonadal transcriptomes. MT treatment upregulated expression of genes involved in male sex determination and differentiation (amh, dmrt1, gsdf and wt1a) and those involved in 11-oxygenated androgen production (cyp11c1 and hsd11b2). It also repressed expression of ovarian development and folliculogenesis genes (bmp15, gdf9, figla, zp2.1 and zp3b). Furthermore, MT treatment altered epigenetic modification of histones in zebrafish gonads. Contrary to expectations, higher levels of cyp19a1a or foxl2 expression in control ovaries compared to MT-treated testes and control testes were not statistically significant during early gonad development (40 dpf). Our study suggests that both androgen production and aromatase inhibition are important for androgen-induced gonadal masculinisation and natural testicular differentiation in zebrafish.
RISC RNA sequencing for context-specific identification of in vivo miR targets
Matkovich, Scot J; Van Booven, Derek J; Eschenbacher, William H; Dorn, Gerald W
2010-01-01
Rationale MicroRNAs (miRs) are expanding our understanding of cardiac disease and have the potential to transform cardiovascular therapeutics. One miR can target hundreds of individual mRNAs, but existing methodologies are not sufficient to accurately and comprehensively identify these mRNA targets in vivo. Objective To develop methods permitting identification of in vivo miR targets in an unbiased manner, using massively parallel sequencing of mouse cardiac transcriptomes in combination with sequencing of mRNA associated with mouse cardiac RNA-induced silencing complexes (RISCs). Methods and Results We optimized techniques for expression profiling small amounts of RNA without introducing amplification bias, and applied this to anti-Argonaute 2 immunoprecipitated RISCs (RISC-Seq) from mouse hearts. By comparing RNA-sequencing results of cardiac RISC and transcriptome from the same individual hearts, we defined 1,645 mRNAs consistently targeted to mouse cardiac RISCs. We employed this approach in hearts overexpressing miRs from Myh6 promoter-driven precursors (programmed RISC-Seq) to identify 209 in vivo targets of miR-133a and 81 in vivo targets of miR-499. Consistent with the fact that miR-133a and miR-499 have widely differing ‘seed’ sequences and belong to different miR families, only 6 targets were common to miR-133a- and miR-499-programmed hearts. Conclusions RISC-sequencing is a highly sensitive method for general RISC profiling and individual miR target identification in biological context, and is applicable to any tissue and any disease state. Summary MicroRNAs (miRs) are key regulators of mRNA translation in health and disease. While bioinformatic predictions suggest that a single miR may target hundreds of mRNAs, the number of experimentally verified targets of miRs is low. 
To enable comprehensive, unbiased examination of miR targets, we have performed deep RNA sequencing of cardiac transcriptomes in parallel with cardiac RNA-induced silencing complex (RISC)-associated RNAs (the RISCome), called RISC sequencing. We developed methods that did not require cross-linking of RNAs to RISCs or amplification of mRNA prior to sequencing, making it possible to rapidly perform RISC sequencing from intact tissue while avoiding amplification bias. Comparison of RISCome with transcriptome expression defined the degree of RISC enrichment for each mRNA. The majority of the mRNAs enriched in wild-type cardiac RISComes compared to transcriptomes were bioinformatically predicted to be targets of at least 1 of 139 cardiac-expressed miRs. Programming cardiomyocyte RISCs via transgenic overexpression in adult hearts of miR-133a or miR-499, two miRs that contain entirely different ‘seed’ sequences, elicited differing profiles of RISC-targeted mRNAs. Thus, RISC sequencing represents a highly sensitive method for general RISC profiling and individual miR target identification in biological context. PMID:21030712
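The comparison of RISCome with transcriptome expression described above, which "defined the degree of RISC enrichment for each mRNA", can be sketched as a per-gene ratio. The log2 ratio with a pseudocount used here is an assumed, commonly used formulation rather than the authors' stated method, and the function name and read counts are hypothetical.

```python
import math

def risc_enrichment(risc_counts, transcriptome_counts, pseudocount=1.0):
    # Assumed metric: log2 of RISC-associated abundance over total
    # transcriptome abundance for each mRNA. The pseudocount avoids
    # division by zero for mRNAs absent from one of the libraries.
    return {
        gene: math.log2(
            (risc_counts.get(gene, 0.0) + pseudocount)
            / (total + pseudocount)
        )
        for gene, total in transcriptome_counts.items()
    }

# Hypothetical normalized read counts from the same heart.
risc = {"geneA": 31.0, "geneB": 3.0}
transcriptome = {"geneA": 7.0, "geneB": 15.0}

scores = risc_enrichment(risc, transcriptome)
for gene, score in sorted(scores.items()):
    print(gene, score)
```

A positive score marks an mRNA that is over-represented in RISCs relative to its overall abundance (geneA here), which is the pattern expected of a miR target; a negative score marks one that is depleted.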
USDA-ARS's Scientific Manuscript database
Marek’s disease virus (MDV-1) is a cell-associated alphaherpesvirus that induces rapid-onset T-cell lymphomas in poultry. The genomes of 6 strains have been sequenced using both Sanger dideoxy sequencing and 454 Life Science pyrosequencing. These genomes largely represent cell culture adapted strains...
USDA-ARS's Scientific Manuscript database
Impacts of integrated livestock-crop production systems compared to specialized systems on soil bacterial diversity have not been well documented. We used a bacterial tag encoded FLX amplicon pyrosequencing (bTEFAP) method to evaluate bacterial diversity of a clay loam soil (Fine, mixed, thermic To...
Cassin, Andrew M.; Soltis, Douglas E.; Miles, Nicholas W.; Melkonian, Michael; Melkonian, Barbara; Wu, Shuangxiu; Edger, Patrick P.; Carpenter, Eric J.
2017-01-01
The carbohydrate-rich cell walls of land plants and algae have been the focus of much interest given the value of cell wall-based products to our current and future economies. Hydroxyproline-rich glycoproteins (HRGPs), a major group of wall glycoproteins, play important roles in plant growth and development, yet little is known about how they have evolved in parallel with the polysaccharide components of walls. We investigate the origins and evolution of the HRGP superfamily, which is commonly divided into three major multigene families: the arabinogalactan proteins (AGPs), extensins (EXTs), and proline-rich proteins. Using motif and amino acid bias, a newly developed bioinformatics pipeline, we identified HRGPs in sequences from the 1000 Plants transcriptome project (www.onekp.com). Our analyses provide new insights into the evolution of HRGPs across major evolutionary milestones, including the transition to land and the early radiation of angiosperms. Significantly, data mining reveals the origin of glycosylphosphatidylinositol (GPI)-anchored AGPs in green algae and a 3- to 4-fold increase in GPI-AGPs in liverworts and mosses. The first detection of cross-linking (CL)-EXTs is observed in bryophytes, which suggests that CL-EXTs arose through the juxtaposition of preexisting SPn EXT glycomotifs with refined Y-based motifs. We also detected the loss of CL-EXT in a few lineages, including the grass family (Poaceae), that have a cell wall composition distinct from other monocots and eudicots. A key challenge in HRGP research is tracking individual HRGPs throughout evolution. Using the 1000 Plants output, we were able to find putative orthologs of Arabidopsis pollen-specific GPI-AGPs in basal eudicots. PMID:28446636
Insights into the Evolution of Hydroxyproline-Rich Glycoproteins from 1000 Plant Transcriptomes.
Johnson, Kim L; Cassin, Andrew M; Lonsdale, Andrew; Wong, Gane Ka-Shu; Soltis, Douglas E; Miles, Nicholas W; Melkonian, Michael; Melkonian, Barbara; Deyholos, Michael K; Leebens-Mack, James; Rothfels, Carl J; Stevenson, Dennis W; Graham, Sean W; Wang, Xumin; Wu, Shuangxiu; Pires, J Chris; Edger, Patrick P; Carpenter, Eric J; Bacic, Antony; Doblin, Monika S; Schultz, Carolyn J
2017-06-01
The carbohydrate-rich cell walls of land plants and algae have been the focus of much interest given the value of cell wall-based products to our current and future economies. Hydroxyproline-rich glycoproteins (HRGPs), a major group of wall glycoproteins, play important roles in plant growth and development, yet little is known about how they have evolved in parallel with the polysaccharide components of walls. We investigate the origins and evolution of the HRGP superfamily, which is commonly divided into three major multigene families: the arabinogalactan proteins (AGPs), extensins (EXTs), and proline-rich proteins. Using motif and amino acid bias, a newly developed bioinformatics pipeline, we identified HRGPs in sequences from the 1000 Plants transcriptome project (www.onekp.com). Our analyses provide new insights into the evolution of HRGPs across major evolutionary milestones, including the transition to land and the early radiation of angiosperms. Significantly, data mining reveals the origin of glycosylphosphatidylinositol (GPI)-anchored AGPs in green algae and a 3- to 4-fold increase in GPI-AGPs in liverworts and mosses. The first detection of cross-linking (CL)-EXTs is observed in bryophytes, which suggests that CL-EXTs arose through the juxtaposition of preexisting SPn EXT glycomotifs with refined Y-based motifs. We also detected the loss of CL-EXT in a few lineages, including the grass family (Poaceae), that have a cell wall composition distinct from other monocots and eudicots. A key challenge in HRGP research is tracking individual HRGPs throughout evolution. Using the 1000 Plants output, we were able to find putative orthologs of Arabidopsis pollen-specific GPI-AGPs in basal eudicots. © 2017 American Society of Plant Biologists. All Rights Reserved.
Fasoli, Marianna; Dal Santo, Silvia; Zenoni, Sara; Tornielli, Giovanni Battista; Farina, Lorenzo; Zamboni, Anita; Porceddu, Andrea; Venturini, Luca; Bicego, Manuele; Murino, Vittorio; Ferrarini, Alberto; Delledonne, Massimo; Pezzotti, Mario
2012-09-01
We developed a genome-wide transcriptomic atlas of grapevine (Vitis vinifera) based on 54 samples representing green and woody tissues and organs at different developmental stages as well as specialized tissues such as pollen and senescent leaves. Together, these samples expressed ∼91% of the predicted grapevine genes. Pollen and senescent leaves had unique transcriptomes reflecting their specialized functions and physiological status. However, microarray and RNA-seq analysis grouped all the other samples into two major classes based on maturity rather than organ identity, namely, the vegetative/green and mature/woody categories. This division represents a fundamental transcriptomic reprogramming during the maturation process and was highlighted by three statistical approaches identifying the transcriptional relationships among samples (correlation analysis), putative biomarkers (O2PLS-DA approach), and sets of strongly and consistently expressed genes that define groups (topics) of similar samples (biclustering analysis). Gene coexpression analysis indicated that the mature/woody developmental program results from the reiterative coactivation of pathways that are largely inactive in vegetative/green tissues, often involving the coregulation of clusters of neighboring genes and global regulation based on codon preference. This global transcriptomic reprogramming during maturation has not been observed in herbaceous annual species and may be a defining characteristic of perennial woody plants.
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
2009-01-01
Background Recent studies have shown that the fecal microbiota is generally resilient to short-term antibiotic administration, but some bacterial taxa may remain depressed for several months. Limited information is available about the effect of antimicrobials on small intestinal microbiota, an important contributor to gastrointestinal health. The antibiotic tylosin is often successfully used for the treatment of chronic diarrhea in dogs, but its exact mode of action and its effect on the intestinal microbiota remain unknown. The aim of this study was to evaluate the effect of tylosin on canine jejunal microbiota. Tylosin was administered at 20 to 22 mg/kg q 24 hr for 14 days to five healthy dogs, each with a pre-existing jejunal fistula. Jejunal brush samples were collected through the fistula on days 0, 14, and 28 (14 days after withdrawal of tylosin). Bacterial diversity was characterized using massive parallel 16S rRNA gene pyrosequencing. Results Pyrosequencing revealed a previously unrecognized species richness in the canine small intestine. Ten bacterial phyla were identified. Microbial populations were phylogenetically more similar during tylosin treatment. However, a remarkable inter-individual response was observed for specific taxa. Fusobacteria, Bacteroidales, and Moraxella tended to decrease. The proportions of Enterococcus-like organisms, Pasteurella spp., and Dietzia spp. increased significantly during tylosin administration (p < 0.05). The proportion of Escherichia coli-like organisms increased by day 28 (p = 0.04). These changes were not accompanied by any obvious clinical effects. On day 28, the phylogenetic composition of the microbiota was similar to day 0 in only 2 of 5 dogs. Bacterial diversity resembled the pre-treatment state in 3 of 5 dogs. Several bacterial taxa such as Spirochaetes, Streptomycetaceae, and Prevotellaceae failed to recover at day 28 (p < 0.05). 
Several bacterial groups considered to be sensitive to tylosin increased in their proportions. Conclusion Tylosin may lead to prolonged effects on the composition and diversity of jejunal microbiota. However, these changes were not associated with any short-term clinical signs of gastrointestinal disease in healthy dogs. Our results illustrate the complexity of the intestinal microbiota and the challenges associated with evaluating the effect of antibiotic administration on the various bacterial groups and their potential interactions. PMID:19799792
Polyploid Evolution of the Brassicaceae during the Cenozoic Era
Kagale, Sateesh; Robinson, Stephen J.; Nixon, John; Xiao, Rong; Huebert, Terry; Condie, Janet; Kessler, Dallas; Clarke, Wayne E.; Edger, Patrick P.; Links, Matthew G.; Sharpe, Andrew G.; Parkin, Isobel A.P.
2014-01-01
The Brassicaceae (Cruciferae) family, owing to its remarkable species, genetic, and physiological diversity as well as its significant economic potential, has become a model for polyploidy and evolutionary studies. Utilizing extensive transcriptome pyrosequencing of diverse taxa, we established a resolved phylogeny of a subset of crucifer species. We elucidated the frequency, age, and phylogenetic position of polyploidy and lineage separation events that have marked the evolutionary history of the Brassicaceae. Besides the well-known ancient α (47 million years ago [Mya]) and β (124 Mya) paleopolyploidy events, several species were shown to have undergone a further more recent (∼7 to 12 Mya) round of genome multiplication. We identified eight whole-genome duplications corresponding to at least five independent neo/mesopolyploidy events. Although the Brassicaceae family evolved from other eudicots at the beginning of the Cenozoic era of the Earth (60 Mya), major diversification occurred only during the Neogene period (0 to 23 Mya). Remarkably, the widespread species divergence, major polyploidy, and lineage separation events during Brassicaceae evolution are clustered in time around epoch transitions characterized by prolonged unstable climatic conditions. The synchronized diversification of Brassicaceae species suggests that polyploid events may have conferred higher adaptability and increased tolerance toward the drastically changing global environment, thus facilitating species radiation. PMID:25035408
Polyploid evolution of the Brassicaceae during the Cenozoic era.
Kagale, Sateesh; Robinson, Stephen J; Nixon, John; Xiao, Rong; Huebert, Terry; Condie, Janet; Kessler, Dallas; Clarke, Wayne E; Edger, Patrick P; Links, Matthew G; Sharpe, Andrew G; Parkin, Isobel A P
2014-07-01
The Brassicaceae (Cruciferae) family, owing to its remarkable species, genetic, and physiological diversity as well as its significant economic potential, has become a model for polyploidy and evolutionary studies. Utilizing extensive transcriptome pyrosequencing of diverse taxa, we established a resolved phylogeny of a subset of crucifer species. We elucidated the frequency, age, and phylogenetic position of polyploidy and lineage separation events that have marked the evolutionary history of the Brassicaceae. Besides the well-known ancient α (47 million years ago [Mya]) and β (124 Mya) paleopolyploidy events, several species were shown to have undergone a further more recent (∼7 to 12 Mya) round of genome multiplication. We identified eight whole-genome duplications corresponding to at least five independent neo/mesopolyploidy events. Although the Brassicaceae family evolved from other eudicots at the beginning of the Cenozoic era of the Earth (60 Mya), major diversification occurred only during the Neogene period (0 to 23 Mya). Remarkably, the widespread species divergence, major polyploidy, and lineage separation events during Brassicaceae evolution are clustered in time around epoch transitions characterized by prolonged unstable climatic conditions. The synchronized diversification of Brassicaceae species suggests that polyploid events may have conferred higher adaptability and increased tolerance toward the drastically changing global environment, thus facilitating species radiation. © 2014 American Society of Plant Biologists. All rights reserved.
Martins, Patrícia; Cleary, Daniel F R; Pires, Ana C C; Rodrigues, Ana Maria; Quintino, Victor; Calado, Ricardo; Gomes, Newton C M
2013-01-01
The present study combined a DGGE and barcoded 16S rRNA pyrosequencing approach to assess bacterial composition in the water of a recirculating aquaculture system (RAS) with a shallow raceway system (SRS) for turbot (Scophthalmus maximus) and sole (Solea senegalensis). Barcoded pyrosequencing results were also used to determine the potential pathogen load in the RAS studied. Samples were collected from the water supply pipeline (Sup), fish production tanks (Pro), sedimentation filter (Sed), biofilter tank (Bio), and protein skimmer (Ozo; also used as an ozone reaction chamber) of twin RAS operating in parallel (one for each fish species). Our results revealed pronounced differences in bacterial community composition between turbot and sole RAS, suggesting that in the systems studied there is a strong species-specific effect on water bacterial communities. Proteobacteria was the most abundant phylum in the water supply and all RAS compartments. Other important taxonomic groups included the phylum Bacteroidetes. The saltwater supplied displayed a markedly lower richness and appeared to have very little influence on bacterial composition. The following potentially pathogenic species were detected: Photobacterium damselae in turbot (all compartments), Tenacibaculum discolor in turbot and sole (all compartments), Tenacibaculum soleae in turbot (all compartments) and sole (Pro, Sed and Bio), and Serratia marcescens in turbot (Sup, Sed, Bio and Ozo) and sole (only Sed) RAS. Despite the presence of these pathogens, no symptomatic fish were observed. Although we were able to identify potential pathogens, this approach should be employed with caution when monitoring aquaculture systems, as the required phylogenetic resolution for reliable identification of pathogens may not always be possible to achieve when employing 16S rRNA gene fragments.
Martins, Patrícia; Cleary, Daniel F. R.; Pires, Ana C. C.; Rodrigues, Ana Maria; Quintino, Victor; Calado, Ricardo; Gomes, Newton C. M.
2013-01-01
The present study combined a DGGE and barcoded 16S rRNA pyrosequencing approach to assess bacterial composition in the water of a recirculating aquaculture system (RAS) with a shallow raceway system (SRS) for turbot (Scophthalmus maximus) and sole (Solea senegalensis). Barcoded pyrosequencing results were also used to determine the potential pathogen load in the RAS studied. Samples were collected from the water supply pipeline (Sup), fish production tanks (Pro), sedimentation filter (Sed), biofilter tank (Bio), and protein skimmer (Ozo; also used as an ozone reaction chamber) of twin RAS operating in parallel (one for each fish species). Our results revealed pronounced differences in bacterial community composition between turbot and sole RAS, suggesting that in the systems studied there is a strong species-specific effect on water bacterial communities. Proteobacteria was the most abundant phylum in the water supply and all RAS compartments. Other important taxonomic groups included the phylum Bacteroidetes. The saltwater supplied displayed a markedly lower richness and appeared to have very little influence on bacterial composition. The following potentially pathogenic species were detected: Photobacterium damselae in turbot (all compartments), Tenacibaculum discolor in turbot and sole (all compartments), Tenacibaculum soleae in turbot (all compartments) and sole (Pro, Sed and Bio), and Serratia marcescens in turbot (Sup, Sed, Bio and Ozo) and sole (only Sed) RAS. Despite the presence of these pathogens, no symptomatic fish were observed. Although we were able to identify potential pathogens, this approach should be employed with caution when monitoring aquaculture systems, as the required phylogenetic resolution for reliable identification of pathogens may not always be possible to achieve when employing 16S rRNA gene fragments. PMID:24278329
Integrated Analysis of Transcriptomic and Proteomic Data
Haider, Saad; Pal, Ranadip
2013-01-01
Until recently, understanding the regulatory behavior of cells has been pursued through independent analysis of the transcriptome or the proteome. Based on the central dogma, it was generally assumed that there exists a direct correspondence between mRNA transcripts and the resulting protein expression. However, recent studies have shown that the correlation between mRNA and protein expression can be low due to various factors such as different half-lives and post-transcriptional machinery. Thus, a joint analysis of the transcriptomic and proteomic data can provide useful insights that may not be deciphered from individual analysis of mRNA or protein expression. This article reviews the existing major approaches for joint analysis of transcriptomic and proteomic data. We categorize the different approaches into eight main categories based on the initial algorithm and final analysis goal. We further present analogies with other domains and discuss the existing research problems in this area. PMID:24082820
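The mRNA-protein correlation mentioned in this abstract can be illustrated with a minimal sketch. It computes a plain Pearson correlation coefficient between matched mRNA and protein abundance values. The function name `pearson` and the four-gene toy data are assumptions made purely for illustration; they are not taken from the review, which surveys many more elaborate joint-analysis approaches.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical toy values for four genes (arbitrary units, not real data).
mrna_levels = [1.0, 2.0, 3.0, 4.0]
protein_levels = [2.0, 1.0, 4.0, 3.0]

r = pearson(mrna_levels, protein_levels)
print(f"mRNA-protein Pearson r = {r:.2f}")
```

With these made-up values the two measurements agree only partially, which is the kind of situation that motivates analyzing both data types jointly rather than inferring one from the other.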
Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris
2005-12-01
Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.
Brownian model of transcriptome evolution and phylogenetic network visualization between tissues.
Gu, Xun; Ruan, Hang; Su, Zhixi; Zou, Yangyun
2017-09-01
While phylogenetic analysis of transcriptomes of the same tissue is usually congruent with the species tree, controversy emerges when multiple tissues are included, that is, whether samples of the same tissue from different species cluster together, or different tissues from the same species cluster together. Recent studies have suggested that a phylogenetic network approach may shed some light on our understanding of multi-tissue transcriptome evolution; yet the underlying evolutionary mechanism remains unclear. In this paper we develop a Brownian-based model of transcriptome evolution under the phylogenetic network that can statistically distinguish between the patterns of species-clustering and tissue-clustering. Our model can be used as a null hypothesis (neutral transcriptome evolution) for testing any correlation in tissue evolution, can be applied to cancer transcriptome evolution to study whether two tumors of an individual appeared independently or via metastasis, and can be useful to detect convergent evolution at the transcriptional level. Copyright © 2017. Published by Elsevier Inc.
Microbiome Analysis of Stool Samples from African Americans with Colon Polyps
Brim, Hassan; Yooseph, Shibu; Zoetendal, Erwin G.; Lee, Edward; Torralbo, Manolito; Laiyemo, Adeyinka O.; Shokrani, Babak; Nelson, Karen; Ashktorab, Hassan
2013-01-01
Background Colonic polyps are common tumors occurring in ~50% of Western populations with ~10% risk of malignant progression. Dietary agents have been considered the primary environmental exposure to promote colorectal cancer (CRC) development. However, the colonic mucosa is permanently in contact with the microbiota and its metabolic products including toxins that also have the potential to trigger oncogenic transformation. Aim To analyze fecal DNA for microbiota composition and functional potential in African Americans with pre-neoplastic lesions. Materials & Methods We analyzed the bacterial composition of stool samples from 6 healthy individuals and 6 patients with colon polyps using a 16S ribosomal RNA-based phylogenetic microarray, the Human Intestinal Tract Chip (HITChip), and 16S rRNA gene barcoded 454 pyrosequencing. The functional potential was determined by sequence-based metagenomics using 454 pyrosequencing. Results Fecal microbiota profiling of samples from the healthy individuals and polyp patients using both the phylogenetic microarray (HITChip) and barcoded 454 pyrosequencing generated similar results. A distinction between the two sets of samples was only obtained when the analysis was performed at the sub-genus level. Most of the species leading to the dissociation were from the Bacteroides group. The metagenomic analysis did not reveal major differences in bacterial gene prevalence/abundances between the two groups even when the analysis and comparisons were restricted to available Bacteroides genomes. Conclusion This study reveals that at the pre-neoplastic stages, there is a trend showing microbiota changes between healthy and colon polyp patients at the sub-genus level. These differences were not reflected at the genome/functions levels. Bacteria and associated functions within the Bacteroides group need to be further analyzed and dissected in a larger sample to pinpoint potential actors in the early colon oncogenic transformation. PMID:24376500
NASA Astrophysics Data System (ADS)
Dannemiller, Karen C.; Lang-Yona, Naama; Yamamoto, Naomichi; Rudich, Yinon; Peccia, Jordan
2014-02-01
We examined fungal communities associated with the PM10 mass of Rehovot, Israel outdoor air samples collected in the spring and fall seasons. Fungal communities were described by 454 pyrosequencing of the internal transcribed spacer (ITS) region of the fungal ribosomal RNA encoding gene. To allow for a more quantitative comparison of fungal exposure in humans, the relative abundance values of specific taxa were transformed to absolute concentrations through multiplying these values by the sample's total fungal spore concentration (derived from universal fungal qPCR). Next, the sequencing-based absolute concentrations for Alternaria alternata, Cladosporium cladosporioides, Epicoccum nigrum, and Penicillium/Aspergillus spp. were compared to taxon-specific qPCR concentrations for A. alternata, C. cladosporioides, E. nigrum, and Penicillium/Aspergillus spp. derived from the same spring and fall aerosol samples. Results of these comparisons showed that the absolute concentration values generated from pyrosequencing were strongly associated with the concentration values derived from taxon-specific qPCR (for all four species, p < 0.005, all R > 0.70). The correlation coefficients were greater for species present in higher concentrations. Our microbial aerosol population analyses demonstrated that fungal diversity (number of fungal operational taxonomic units) was higher in the spring compared to the fall (p = 0.02), and principal coordinate analysis showed distinct seasonal differences in taxa distribution (ANOSIM p = 0.004). Among genera containing allergenic and/or pathogenic species, the absolute concentrations of Alternaria, Aspergillus, Fusarium, and Cladosporium were greater in the fall, while Cryptococcus, Penicillium, and Ulocladium concentrations were greater in the spring. 
The transformation of pyrosequencing fungal population relative abundance data to absolute concentrations can improve next-generation DNA sequencing-based quantitative aerosol exposure assessment.
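The relative-to-absolute transformation described in this abstract amounts to a per-taxon multiplication, sketched below. The function name `to_absolute`, the taxon fractions, and the total concentration are hypothetical illustration values, not data from the study; in the study the total came from universal fungal qPCR.

```python
def to_absolute(relative_abundance, total_concentration):
    """Scale per-taxon relative abundances (fractions of all reads) by the
    sample's total fungal spore concentration to get absolute values."""
    return {
        taxon: fraction * total_concentration
        for taxon, fraction in relative_abundance.items()
    }

# Hypothetical sample: fractions of sequence reads assigned to each taxon.
sample_fractions = {"Alternaria": 0.25, "Cladosporium": 0.5, "Other": 0.25}
# Hypothetical total fungal spore concentration (spore equivalents per m^3).
total_spores = 1000.0

absolute = to_absolute(sample_fractions, total_spores)
for taxon, concentration in absolute.items():
    print(f"{taxon}: {concentration:.1f} spore equivalents per m^3")
```

Because every taxon in a sample is scaled by the same factor, the transformation leaves within-sample proportions unchanged while making samples with different total fungal loads comparable.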
Employing machine learning for reliable miRNA target identification in plants
2011-01-01
Background miRNAs are ~21-nucleotide-long small noncoding RNA molecules, formed endogenously in most eukaryotes, which mainly control their target genes post-transcriptionally by interacting with and silencing them. While many tools have been developed for animal miRNA target identification, plant miRNA target identification has witnessed limited development. Most of the plant tools have centered on exact complementarity matches. Very few of them considered other factors like multiple target sites and the role of flanking regions. Result In the present work, a Support Vector Regression (SVR) approach has been implemented for plant miRNA target identification, utilizing position-specific dinucleotide density variation information around the target sites, to yield highly reliable results. It has been named p-TAREF (plant-Target Refiner). The performance of p-TAREF was compared rigorously with other prediction tools for plants, and p-TAREF was found to perform better in several aspects. Further, p-TAREF was run over the experimentally validated miRNA targets from species like Arabidopsis, Medicago, Rice and Tomato, and detected them accurately, suggesting gross usability of p-TAREF for plant species. Using p-TAREF, target identification was done for the complete Rice transcriptome, supported by expression and degradome-based data. miR156 was found to be an important component of the Rice regulatory system, where control of genes associated with growth and transcription looked predominant. The entire methodology has been implemented in a multi-threaded parallel architecture in Java, to enable fast processing for the web-server version as well as the standalone version. This also allows it to run even on a simple desktop computer in concurrent mode. It also provides a facility to gather experimental support for predictions made, through on-the-spot expression data analysis, in its web-server version. 
Conclusion A machine learning multivariate feature tool has been implemented in parallel and locally installable form, for plant miRNA target identification. The performance was assessed and compared through comprehensive testing and benchmarking, suggesting a reliable performance and gross usability for transcriptome wide plant miRNA target identification. PMID:22206472
Employing machine learning for reliable miRNA target identification in plants.
Jha, Ashwani; Shankar, Ravi
2011-12-29
miRNAs are ~21-nucleotide-long small noncoding RNA molecules, formed endogenously in most eukaryotes, which mainly control their target genes post-transcriptionally by interacting with and silencing them. While many tools have been developed for animal miRNA target identification, plant miRNA target identification has witnessed limited development. Most of the plant tools have centered on exact complementarity matches. Very few of them considered other factors like multiple target sites and the role of flanking regions. In the present work, a Support Vector Regression (SVR) approach has been implemented for plant miRNA target identification, utilizing position-specific dinucleotide density variation information around the target sites, to yield highly reliable results. It has been named p-TAREF (plant-Target Refiner). The performance of p-TAREF was compared rigorously with other prediction tools for plants, and p-TAREF was found to perform better in several aspects. Further, p-TAREF was run over the experimentally validated miRNA targets from species like Arabidopsis, Medicago, Rice and Tomato, and detected them accurately, suggesting gross usability of p-TAREF for plant species. Using p-TAREF, target identification was done for the complete Rice transcriptome, supported by expression and degradome-based data. miR156 was found to be an important component of the Rice regulatory system, where control of genes associated with growth and transcription looked predominant. The entire methodology has been implemented in a multi-threaded parallel architecture in Java, to enable fast processing for the web-server version as well as the standalone version. This also allows it to run even on a simple desktop computer in concurrent mode. It also provides a facility to gather experimental support for predictions made, through on-the-spot expression data analysis, in its web-server version. 
A machine learning multivariate feature tool has been implemented in parallel and locally installable form, for plant miRNA target identification. The performance was assessed and compared through comprehensive testing and benchmarking, suggesting a reliable performance and gross usability for transcriptome wide plant miRNA target identification.
454 pyrosequencing project identifying expressed genes from the horn fly, Haematobia irritans
USDA-ARS's Scientific Manuscript database
We used an EST approach to initiate a study of the genome of the horn fly, Haematobia irritans and have used 454 pyrosequencing techniques to sequence 73,512, 100,603, 71,550, and 85,769 expressed genes from the egg, first instar larvae, adult male, and adult female lifestages of the horn fly. cD...
Lisa W. Alexander; Keith E. Woeste
2014-01-01
Given the low intraspecific chloroplast diversity detected in northern red oak (Quercus rubra L.), more powerful genetic tools are necessary to accurately characterize Q. rubra chloroplast diversity and structure. We report the sequencing, assembly, and annotation of the chloroplast genome of northern red oak via pyrosequencing and...
Assessment of bacterial contamination of lipstick using pyrosequencing.
Lee, So Y; Lee, Si Y
As soon as they are exposed to the environment, cosmetics become contaminated with microorganisms, and this contamination accumulates with increased use. In this study, we employed pyrosequencing to investigate the diversity of bacteria found on lipstick. Bacterial DNA was extracted from 20 lipstick samples and mixed in equal ratios for pyrosequencing analysis. As a result, 105 bacterial genera were detected, four of which (Leifsonia, Methylobacterium, Streptococcus, and Haemophilus) were predominant in 92% of the 19,863 total sequence reads. Potentially pathogenic genera such as Staphylococcus, Pseudomonas, Escherichia, Salmonella, Corynebacterium, Mycobacterium, and Neisseria accounted for 27.6% of the 105 genera. The most commonly identified oral bacteria belonged to the Streptococcus genus, although other oral genera such as Actinomyces, Fusobacterium, Porphyromonas, and Lactobacillus were also detected.
Re-evaluating microglia expression profiles using RiboTag and cell isolation strategies.
Haimon, Zhana; Volaski, Alon; Orthgiess, Johannes; Boura-Halfon, Sigalit; Varol, Diana; Shemer, Anat; Yona, Simon; Zuckerman, Binyamin; David, Eyal; Chappell-Maor, Louise; Bechmann, Ingo; Gericke, Martin; Ulitsky, Igor; Jung, Steffen
2018-06-01
Transcriptome profiling is widely used to infer functional states of specific cell types, as well as their responses to stimuli, to define contributions to physiology and pathophysiology. Focusing on microglia, the brain's macrophages, we report here a side-by-side comparison of classical cell-sorting-based transcriptome sequencing and the 'RiboTag' method, which avoids cell retrieval from tissue context and yields translatome sequencing information. Conventional whole-cell microglial transcriptomes were found to be significantly tainted by artifacts introduced by tissue dissociation, cargo contamination and transcripts sequestered from ribosomes. Conversely, our data highlight the added value of RiboTag profiling for assessing the lineage accuracy of Cre recombinase expression in transgenic mice. Collectively, this study indicates method-based biases, reveals observer effects and establishes RiboTag-based translatome profiling as a valuable complement to standard sorting-based profiling strategies.
Wong, Kim; Navarro, José Fernández; Bergenstråhle, Ludvig; Ståhl, Patrik L; Lundeberg, Joakim
2018-06-01
Spatial Transcriptomics (ST) is a method which combines high-resolution tissue imaging with high-throughput transcriptome sequencing data. These data must be aligned with the images for correct visualization, a process that involves several manual steps. Here we present ST Spot Detector, a web tool that automates and facilitates this alignment through a user-friendly interface. Contact: jose.fernandez.navarro@scilifelab.se. Supplementary data are available at Bioinformatics online.
Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B
2015-01-01
Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816
Tanase, Koji; Nishitani, Chikako; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Ohmiya, Akemi; Onozaki, Takashi
2012-07-02
Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. We constructed a normalized cDNA library and a 3'-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant.
2012-01-01
Background Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. Results We constructed a normalized cDNA library and a 3’-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. Conclusions We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant. PMID:22747974
Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa
Morin, Ryan D.; Aksay, Gozde; Dolgosheina, Elena; Ebhardt, H. Alexander; Magrini, Vincent; Mardis, Elaine R.; Sahinalp, S. Cenk; Unrau, Peter J.
2008-01-01
The diversity of microRNAs and small-interfering RNAs has been extensively explored within angiosperms by focusing on a few key organisms such as Oryza sativa and Arabidopsis thaliana. A deeper division of the plants is defined by the radiation of the angiosperms and gymnosperms, with the latter comprising the commercially important conifers. The conifers are expected to provide important information regarding the evolution of highly conserved small regulatory RNAs. Deep sequencing provides the means to characterize and quantitatively profile small RNAs in understudied organisms such as these. Pyrosequencing of small RNAs from O. sativa revealed, as expected, ∼21- and ∼24-nt RNAs. The former contained known microRNAs, and the latter largely comprised intergenic-derived sequences likely representing heterochromatin siRNAs. In contrast, sequences from Pinus contorta were dominated by 21-nt small RNAs. Using a novel sequence-based clustering algorithm, we identified sequences belonging to 18 highly conserved microRNA families in P. contorta as well as numerous clusters of conserved small RNAs of unknown function. Using multiple methods, including expressed sequence folding and machine learning algorithms, we found a further 53 candidate novel microRNA families, 51 appearing specific to the P. contorta library. In addition, alignment of small RNA sequences to the O. sativa genome revealed six perfectly conserved classes of small RNA that included chloroplast transcripts and specific types of genomic repeats. The conservation of microRNAs and other small RNAs between the conifers and the angiosperms indicates that important RNA silencing processes were highly developed in the earliest spermatophytes. Genomic mapping of all sequences to the O. sativa genome can be viewed at http://microrna.bcgsc.ca/cgi-bin/gbrowse/rice_build_3/. PMID:18323537
Microbiota diversity and gene expression dynamics in human oral biofilms
2014-01-01
Background Micro-organisms inhabiting teeth surfaces grow on biofilms where a specific and complex succession of bacteria has been described by co-aggregation tests and DNA-based studies. Although the composition of oral biofilms is well established, the active portion of the bacterial community and the patterns of gene expression in vivo have not been studied. Results Using RNA-sequencing technologies, we present the first metatranscriptomic study of human dental plaque, performed by two different approaches: (1) A short-reads, high-coverage approach by Illumina sequencing to characterize the gene activity repertoire of the microbial community during biofilm development; (2) A long-reads, lower-coverage approach by pyrosequencing to determine the taxonomic identity of the active microbiome before and after a meal ingestion. The high-coverage approach allowed us to analyze over 398 million reads, revealing that microbial communities are individual-specific and no bacterial species was detected as key player at any time during biofilm formation. We could identify some gene expression patterns characteristic for early and mature oral biofilms. The transcriptomic profile of several adhesion genes was confirmed through qPCR by measuring expression of fimbriae-associated genes. In addition to the specific set of gene functions overexpressed in early and mature oral biofilms, as detected through the short-reads dataset, the long-reads approach detected specific changes when comparing the metatranscriptome of the same individual before and after a meal, which can narrow down the list of organisms responsible for acid production and therefore potentially involved in dental caries. Conclusions The bacteria changing activity during biofilm formation and after meal ingestion were person-specific. 
Interestingly, some individuals showed extreme homeostasis with virtually no changes in the active bacterial population after food ingestion, suggesting the presence of a microbial community which could be associated with dental health. PMID:24767457
Microbiota diversity and gene expression dynamics in human oral biofilms.
Benítez-Páez, Alfonso; Belda-Ferre, Pedro; Simón-Soro, Aurea; Mira, Alex
2014-04-27
Micro-organisms inhabiting teeth surfaces grow on biofilms where a specific and complex succession of bacteria has been described by co-aggregation tests and DNA-based studies. Although the composition of oral biofilms is well established, the active portion of the bacterial community and the patterns of gene expression in vivo have not been studied. Using RNA-sequencing technologies, we present the first metatranscriptomic study of human dental plaque, performed by two different approaches: (1) A short-reads, high-coverage approach by Illumina sequencing to characterize the gene activity repertoire of the microbial community during biofilm development; (2) A long-reads, lower-coverage approach by pyrosequencing to determine the taxonomic identity of the active microbiome before and after a meal ingestion. The high-coverage approach allowed us to analyze over 398 million reads, revealing that microbial communities are individual-specific and no bacterial species was detected as key player at any time during biofilm formation. We could identify some gene expression patterns characteristic for early and mature oral biofilms. The transcriptomic profile of several adhesion genes was confirmed through qPCR by measuring expression of fimbriae-associated genes. In addition to the specific set of gene functions overexpressed in early and mature oral biofilms, as detected through the short-reads dataset, the long-reads approach detected specific changes when comparing the metatranscriptome of the same individual before and after a meal, which can narrow down the list of organisms responsible for acid production and therefore potentially involved in dental caries. The bacteria changing activity during biofilm formation and after meal ingestion were person-specific. 
Interestingly, some individuals showed extreme homeostasis with virtually no changes in the active bacterial population after food ingestion, suggesting the presence of a microbial community which could be associated with dental health.
Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures
Park, Paul J.; Fuchs, Robert; Wei, Lai; Jorgensen, Brian G.; Redelman, Doug; Ward, Sean M.; Sanders, Kenton M.
2017-01-01
Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies. PMID:28426719
Egge, Elianne; Bittner, Lucie; Andersen, Tom; Audic, Stéphane; de Vargas, Colomban; Edvardsen, Bente
2013-01-01
Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000–20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs) at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing. PMID:24069303
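The rarefaction analysis mentioned in the abstract above (expected number of species recovered as a function of sampling depth) can be sketched as follows. This is a minimal illustration, assuming the standard analytical (Hurlbert) rarefaction formula, which the abstract does not name; the function name `expected_species` and the read counts are hypothetical and are not taken from the study.

```python
from math import comb


def expected_species(counts, depth):
    """Expected number of species seen when `depth` reads are drawn
    without replacement from a sample with the given per-species counts.

    Uses E[S_n] = sum_i (1 - C(N - N_i, n) / C(N, n)), where N is the total
    number of reads, N_i the reads of species i, and n the sampling depth.
    """
    total = sum(counts)
    if not 0 <= depth <= total:
        raise ValueError("depth must be between 0 and the total read count")
    denom = comb(total, depth)
    # comb(k, n) is 0 when n > k, so a species is certain to be seen
    # once the depth exceeds the number of reads belonging to other species.
    return sum(1 - comb(total - c, depth) / denom for c in counts)


# Hypothetical, uneven mock community of 11 species (not the study's data).
mock_counts = [500, 300, 200, 150, 100, 80, 60, 40, 20, 10, 5]
for n in (10, 100, 500, 1465):
    print(n, round(expected_species(mock_counts, n), 2))
```

Reading off the smallest depth at which the expected value reaches the known number of species gives an estimate of the minimum read count needed, which is the kind of figure the abstract reports.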
Between a Pod and a Hard Test: The Deep Evolution of Amoebae
Kang, Seungho; Tice, Alexander K.; Spiegel, Frederick W.; Silberman, Jeffrey D.; Pánek, Tomáš; Čepička, Ivan; Kostka, Martin; Kosakyan, Anush; Alcântara, Daniel M.C.; Roger, Andrew J.; Shadwick, Lora L.; Smirnov, Alexey; Kudryavtsev, Alexander; Lahr, Daniel J.G.; Brown, Matthew W.
2017-01-01
Abstract Amoebozoa is the eukaryotic supergroup sister to Obazoa, the lineage that contains the animals and Fungi, as well as their protistan relatives, and the breviate and apusomonad flagellates. Amoebozoa is extraordinarily diverse, encompassing important model organisms and significant pathogens. Although amoebozoans are integral to global nutrient cycles and present in nearly all environments, they remain vastly understudied. We present a robust phylogeny of Amoebozoa based on broad representative set of taxa in a phylogenomic framework (325 genes). By sampling 61 taxa using culture-based and single-cell transcriptomics, our analyses show two major clades of Amoebozoa, Discosea, and Tevosa. This phylogeny refutes previous studies in major respects. Our results support the hypothesis that the last common ancestor of Amoebozoa was sexual and flagellated, it also may have had the ability to disperse propagules from a sporocarp-type fruiting body. Overall, the main macroevolutionary patterns in Amoebozoa appear to result from the parallel losses of homologous characters of a multiphase life cycle that included flagella, sex, and sporocarps rather than independent acquisition of convergent features. PMID:28505375
Jeong, Ji Hun; Park, Soon Ho; Park, Mi Jung; Kim, Moon Jin; Kim, Kyung Hee; Park, Pil Whan; Seo, Yiel Hea; Lee, Jae Hoon; Park, Jinny; Hong, Junshik
2013-01-01
Background N-ras mutations are one of the most commonly detected abnormalities of myeloid origin. N-ras mutations result in a constitutively active N-ras protein that induces uncontrolled cell proliferation and inhibits apoptosis. We analyzed N-ras mutations in adult patients with AML at a particular institution and compared pyrosequencing analysis with a direct sequencing method for the detection of N-ras mutations. Methods We analyzed 90 bone marrow samples from 83 AML patients. We detected N-ras mutations in codons 12, 13, and 61 using the pyrosequencing method and subsequently confirmed all data by direct sequencing. Using these methods, we screened the N-ras mutation quantitatively and determined the incidence and characteristics of N-ras mutation. Results The incidence of N-ras mutation was 7.2% in adult AML patients. The patients with N-ras mutations showed significantly higher hemoglobin levels (P=0.022) and an increased incidence of FLT3 mutations (P=0.003). We observed 3 cases with N-ras mutations in codon 12 (3.6%), 2 cases in codon 13 (2.4%), and 1 case in codon 61 (1.2%). All the mutations disappeared during chemotherapy. Conclusions There is a low incidence (7.2%) of N-ras mutations in AML patients compared with other populations. Similar data were obtained by both pyrosequencing and direct sequencing. This study showed the correlation between the N-ras mutation and the therapeutic response. However, pyrosequencing provides quantitative data and is useful for monitoring therapeutic responses. PMID:23667841
USDA-ARS's Scientific Manuscript database
Complete surveys of insect endosymbionts including species of economic importance have until recently been hampered by a lack of high-throughput genetic assays. We used 454-pyrosequencing of the 16S rRNA gene amplicon of adult spotted wing Drosophila (SWD) Drosophila suzukii (Matsumura) from souther...
Development of colonic microflora as assessed by pyrosequencing in dairy calves fed waste milk
USDA-ARS's Scientific Manuscript database
The objective of the current study was to examine the effect of pasteurization of waste milk used to feed dairy calves on the bacterial diversity of their lower gut. Using 16S rDNA bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP), fecal samples from dairy calves aging from 1 week to 6 mon...
Leite, A M O; Mayo, B; Rachid, C T C C; Peixoto, R S; Silva, J T; Paschoalin, V M F; Delgado, S
2012-09-01
The microbial diversity and community structure of three different kefir grains from different parts of Brazil were examined via the combination of two culture-independent methods: PCR-denaturing gradient gel electrophoresis (PCR-DGGE) and pyrosequencing. PCR-DGGE showed Lactobacillus kefiranofaciens and Lactobacillus kefiri to be the major bacterial populations in all three grains. The yeast community was dominated by Saccharomyces cerevisiae. Pyrosequencing produced a total of 14,314 partial 16S rDNA sequence reads from the three grains. Sequence analysis grouped the reads into three phyla, of which Firmicutes was dominant. Members of the genus Lactobacillus were the most abundant operational taxonomic units (OTUs) in all samples, accounting for up to 96% of the sequences. OTUs belonging to other lactic and acetic acid bacteria genera, such as Lactococcus, Leuconostoc, Streptococcus and Acetobacter, were also identified at low levels. Two of the grains showed identical DGGE profiles and a similar number of OTUs, while the third sample showed the highest diversity by both techniques. Pyrosequencing allowed the identification of bacteria that were present in small numbers and rarely associated with the microbial community of this complex ecosystem. Copyright © 2012 Elsevier Ltd. All rights reserved.
Isern, Joan; He, Zhiyong; Fraser, Stuart T.; Nowotschin, Sonja; Ferrer-Vaquer, Anna; Moore, Rebecca; Hadjantonakis, Anna-Katerina; Schulz, Vincent; Tuck, David; Gallagher, Patrick G.
2011-01-01
Primitive erythroid (EryP) progenitors are the first cell type specified from the mesoderm late in gastrulation. We used a transgenic reporter to image and purify the earliest blood progenitors and their descendants from developing mouse embryos. EryP progenitors exhibited remarkable proliferative capacity in the yolk sac immediately before the onset of circulation, when these cells comprise nearly half of all cells of the embryo. Global expression profiles generated at 24-hour intervals from embryonic day 7.5 through 12.5 revealed 2 abrupt changes in transcript diversity that coincided with the entry of EryPs into the circulation and with their late maturation and enucleation, respectively. These changes were paralleled by the expression of critical regulatory factors. Experiments designed to test predictions from these data demonstrated that the Wnt-signaling pathway is active in EryP progenitors, which display an aerobic glycolytic profile and the numbers of which are regulated by transforming growth factor-β1 and hypoxia. This is the first transcriptome assembled for a single hematopoietic lineage of the embryo over the course of its differentiation. PMID:21263157
Peters, Linda M.; Belyantseva, Inna A.; Lagziel, Ayala; Battey, James F.; Friedman, Thomas B.; Morell, Robert J.
2007-01-01
Specialization in cell function and morphology is influenced by the differential expression of mRNAs, many of which are expressed at low abundance and restricted to certain cell types. Detecting such transcripts in cDNA libraries may require sequencing millions of clones. Massively parallel signature sequencing (MPSS) is well-suited for identifying transcripts that are expressed in discrete cell types and in low abundance. We have made MPSS libraries from microdissections of three inner ear tissues. By comparing these MPSS libraries to those of 87 other tissues included in the Mouse Reference Transcriptome (MRT) online resource, we have identified genes that are highly enriched in, or specific to, the inner ear. We show by RT-PCR and in situ hybridization that signatures unique to the inner ear libraries identify transcripts with highly specific cell-type localizations. These transcripts serve to illustrate the utility of a resource that is available to the research community. Utilization of these resources will increase the number of known transcription units and expand our knowledge of the tissue-specific regulation of the transcriptome. PMID:17049805
Tang, Qin; Iyer, Sowmya; Lobbardi, Riadh; Moore, John C; Chen, Huidong; Lareau, Caleb; Hebert, Christine; Shaw, McKenzie L; Neftel, Cyril; Suva, Mario L; Ceol, Craig J; Bernards, Andre; Aryee, Martin; Pinello, Luca; Drummond, Iain A; Langenau, David M
2017-10-02
Recent advances in single-cell, transcriptomic profiling have provided unprecedented access to investigate cell heterogeneity during tissue and organ development. In this study, we used massively parallel, single-cell RNA sequencing to define cell heterogeneity within the zebrafish kidney marrow, constructing a comprehensive molecular atlas of definitive hematopoiesis and functionally distinct renal cells found in adult zebrafish. Because our method analyzed blood and kidney cells in an unbiased manner, our approach was useful in characterizing immune-cell deficiencies within DNA-protein kinase catalytic subunit (prkdc), interleukin-2 receptor γ a (il2rga), and double-homozygous-mutant fish, identifying blood cell losses in T, B, and natural killer cells within specific genetic mutants. Our analysis also uncovered novel cell types, including two classes of natural killer immune cells, classically defined and erythroid-primed hematopoietic stem and progenitor cells, mucin-secreting kidney cells, and kidney stem/progenitor cells. In total, our work provides the first, comprehensive, single-cell, transcriptomic analysis of kidney and marrow cells in the adult zebrafish. © 2017 Tang et al.
Iyer, Sowmya; Lobbardi, Riadh; Chen, Huidong; Hebert, Christine; Shaw, McKenzie L.; Neftel, Cyril; Suva, Mario L.; Bernards, Andre; Aryee, Martin; Drummond, Iain A.
2017-01-01
Recent advances in single-cell, transcriptomic profiling have provided unprecedented access to investigate cell heterogeneity during tissue and organ development. In this study, we used massively parallel, single-cell RNA sequencing to define cell heterogeneity within the zebrafish kidney marrow, constructing a comprehensive molecular atlas of definitive hematopoiesis and functionally distinct renal cells found in adult zebrafish. Because our method analyzed blood and kidney cells in an unbiased manner, our approach was useful in characterizing immune-cell deficiencies within DNA–protein kinase catalytic subunit (prkdc), interleukin-2 receptor γ a (il2rga), and double-homozygous–mutant fish, identifying blood cell losses in T, B, and natural killer cells within specific genetic mutants. Our analysis also uncovered novel cell types, including two classes of natural killer immune cells, classically defined and erythroid-primed hematopoietic stem and progenitor cells, mucin-secreting kidney cells, and kidney stem/progenitor cells. In total, our work provides the first, comprehensive, single-cell, transcriptomic analysis of kidney and marrow cells in the adult zebrafish. PMID:28878000
High-confidence coding and noncoding transcriptome maps
2017-01-01
The advent of high-throughput RNA sequencing (RNA-seq) has led to the discovery of unprecedentedly immense transcriptomes encoded by eukaryotic genomes. However, the transcriptome maps are still incomplete partly because they were mostly reconstructed based on RNA-seq reads that lack their orientations (known as unstranded reads) and certain boundary information. Methods to expand the usability of unstranded RNA-seq data by predetermining the orientation of the reads and precisely determining the boundaries of assembled transcripts could significantly benefit the quality of the resulting transcriptome maps. Here, we present a high-performing transcriptome assembly pipeline, called CAFE, that significantly improves the original assemblies, respectively assembled with stranded and/or unstranded RNA-seq data, by orienting unstranded reads using the maximum likelihood estimation and by integrating information about transcription start sites and cleavage and polyadenylation sites. Applying large-scale transcriptomic data comprising 230 billion RNA-seq reads from the ENCODE, Human BodyMap 2.0, The Cancer Genome Atlas, and GTEx projects, CAFE enabled us to predict the directions of about 220 billion unstranded reads, which led to the construction of more accurate transcriptome maps, comparable to the manually curated map, and a comprehensive lncRNA catalog that includes thousands of novel lncRNAs. Our pipeline should not only help to build comprehensive, precise transcriptome maps from complex genomes but also to expand the universe of noncoding genomes. PMID:28396519
Informatics for RNA Sequencing: A Web Resource for Analysis on the Cloud
Griffith, Malachi; Walker, Jason R.; Spies, Nicholas C.; Ainscough, Benjamin J.; Griffith, Obi L.
2015-01-01
Massively parallel RNA sequencing (RNA-seq) has rapidly become the assay of choice for interrogating RNA transcript abundance and diversity. This article provides a detailed introduction to fundamental RNA-seq molecular biology and informatics concepts. We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at www.rnaseq.wiki. PMID:26248053
Zhao, Shanrong; Prenger, Kurt; Smith, Lance
2013-01-01
RNA-Seq is becoming a promising replacement to microarrays in transcriptome profiling and differential gene expression study. Technical improvements have decreased sequencing costs and, as a result, the size and number of RNA-Seq datasets have increased rapidly. However, the increasing volume of data from large-scale RNA-Seq studies poses a practical challenge for data analysis in a local environment. To meet this challenge, we developed Stormbow, a cloud-based software package, to process large volumes of RNA-Seq data in parallel. The performance of Stormbow has been tested by practically applying it to analyse 178 RNA-Seq samples in the cloud. In our test, it took 6 to 8 hours to process an RNA-Seq sample with 100 million reads, and the average cost was $3.50 per sample. Utilizing Amazon Web Services as the infrastructure for Stormbow allows us to easily scale up to handle large datasets with on-demand computational resources. Stormbow is a scalable, cost effective, and open-source based tool for large-scale RNA-Seq data analysis. Stormbow can be freely downloaded and can be used out of box to process Illumina RNA-Seq datasets. PMID:25937948
Hou, Yu; Guo, Huahu; Cao, Chen; Li, Xianlong; Hu, Boqiang; Zhu, Ping; Wu, Xinglong; Wen, Lu; Tang, Fuchou; Huang, Yanyi; Peng, Jirun
2016-01-01
Single-cell genome, DNA methylome, and transcriptome sequencing methods have been separately developed. However, to accurately analyze the mechanism by which transcriptome, genome and DNA methylome regulate each other, these omic methods need to be performed in the same single cell. Here we demonstrate a single-cell triple omics sequencing technique, scTrio-seq, that can be used to simultaneously analyze the genomic copy-number variations (CNVs), DNA methylome, and transcriptome of an individual mammalian cell. We show that large-scale CNVs cause proportional changes in RNA expression of genes within the gained or lost genomic regions, whereas these CNVs generally do not affect DNA methylation in these regions. Furthermore, we applied scTrio-seq to 25 single cancer cells derived from a human hepatocellular carcinoma tissue sample. We identified two subpopulations within these cells based on CNVs, DNA methylome, or transcriptome of individual cells. Our work offers a new avenue of dissecting the complex contribution of genomic and epigenomic heterogeneities to the transcriptomic heterogeneity within a population of cells. PMID:26902283
FIT: statistical modeling tool for transcriptome dynamics under fluctuating field conditions
Iwayama, Koji; Aisaka, Yuri; Kutsuna, Natsumaro
2017-01-01
Abstract Motivation: Considerable attention has been given to the quantification of environmental effects on organisms. In natural conditions, environmental factors are continuously changing in a complex manner. To reveal the effects of such environmental variations on organisms, transcriptome data in field environments have been collected and analyzed. Nagano et al. proposed a model that describes the relationship between transcriptomic variation and environmental conditions and demonstrated the capability to predict transcriptome variation in rice plants. However, the computational cost of parameter optimization has prevented its wide application. Results: We propose a new statistical model and efficient parameter optimization based on the previous study. We developed and released FIT, an R package that offers functions for parameter optimization and transcriptome prediction. The proposed method achieves comparable or better prediction performance within a shorter computational time than the previous method. The package will facilitate the study of the environmental effects on transcriptomic variation in field conditions. Availability and Implementation: Freely available from CRAN (https://cran.r-project.org/web/packages/FIT/). Contact: anagano@agr.ryukoku.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online PMID:28158396
Phelix, C F; Feltus, F A
2015-01-01
Measuring biomarkers from plant tissue samples is challenging and expensive when the desire is to integrate transcriptomics, fluxomics, metabolomics, lipidomics, proteomics, physiomics and phenomics. We present a computational biology method where only the transcriptome needs to be measured and is used to derive a set of parameters for deterministic kinetic models of metabolic pathways. The technology is called Transcriptome-To-Metabolome (TTM) biosimulations, currently under commercial development, but available for non-commercial use by researchers. The simulated results on metabolites of 30 primary and secondary metabolic pathways in rice (Oryza sativa) were used as the biomarkers to predict whether the transcriptome was from a plant that had been under drought conditions. The rice transcriptomes were accessed from public archives and each individual plant was simulated. This unique quality of the TTM technology allows standard analyses on biomarker assessments, i.e. sensitivity, specificity, positive and negative predictive values, accuracy, receiver operator characteristics (ROC) curve and area under the ROC curve (AUC). Two validation methods were also used, the holdout and 10-fold cross validations. Initially 17 metabolites were identified as candidate biomarkers based on either statistical significance on binary phenotype when compared with control samples or recognition from the literature. The top three biomarkers based on AUC were gibberellic acid 12 (0.89), trehalose (0.80) and sn1-palmitate-sn2-oleic-phosphatidylglycerol (0.70). Neither heat map analyses of transcriptomes nor all 300 metabolites clustered the stressed and control groups effectively. The TTM technology allows the emergent properties of the integrated system to generate unique and useful 'Omics' information. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.
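The standard biomarker assessments listed in the abstract above (sensitivity, specificity, predictive values, accuracy, and area under the ROC curve) can be sketched in a few lines. This is a generic illustration, not the TTM software: the function names, the 0.5 decision threshold, and the six example scores are all hypothetical, and the AUC is computed with the rank-based (Mann-Whitney) formulation rather than whatever method the authors used.

```python
def confusion_metrics(labels, predictions):
    """Binary classification metrics from true labels and 0/1 predictions.

    Assumes both classes occur and both prediction values occur, so that
    no denominator is zero.
    """
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y and p)
    fn = sum(1 for y, p in pairs if y and not p)
    tn = sum(1 for y, p in pairs if not y and not p)
    fp = sum(1 for y, p in pairs if not y and p)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / len(pairs),
    }


def roc_auc(labels, scores):
    """AUC as the probability that a random positive outscores a random
    negative, counting ties as one half."""
    pos = [s for y, s in zip(labels, scores) if y]
    neg = [s for y, s in zip(labels, scores) if not y]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical data: 1 = drought-stressed plant, 0 = control.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]  # simulated metabolite level, rescaled
predictions = [1 if s >= 0.5 else 0 for s in scores]

print(confusion_metrics(labels, predictions))
print(roc_auc(labels, scores))
```

An AUC of 0.5 corresponds to a marker with no discriminating power and 1.0 to perfect separation, which puts the reported values of 0.89, 0.80 and 0.70 in context.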
Draht, Muriel X G; Smits, Kim M; Jooste, Valérie; Tournier, Benjamin; Vervoort, Martijn; Ramaekers, Chantal; Chapusot, Caroline; Weijenberg, Matty P; van Engeland, Manon; Melotte, Veerle
2016-01-01
Already since the 1990s, promoter CpG island methylation markers have been considered promising diagnostic, prognostic, and predictive cancer biomarkers. However, so far, only a limited number of DNA methylation markers have been introduced into clinical practice. One reason why the vast majority of methylation markers do not translate into clinical applications is lack of independent validation of methylation markers, often caused by differences in methylation analysis techniques. We recently described RET promoter CpG island methylation as a potential prognostic marker in stage II colorectal cancer (CRC) patients of two independent series. In the current study, we analyzed the RET promoter CpG island methylation of 241 stage II colon cancer patients by direct methylation-specific PCR (MSP), nested-MSP, pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM). All primers were designed as close as possible to the same genomic region. In order to investigate the effect of different DNA methylation assays on patient outcome, we assessed the clinical sensitivity and specificity as well as the association of RET methylation with overall survival for three and five years of follow-up. Using direct-MSP and nested-MSP, 12.0 % (25/209) and 29.6 % (71/240) of the patients showed RET promoter CpG island methylation. Methylation frequencies detected by pyrosequencing were related to the threshold for positivity that defined RET methylation. Methylation frequencies obtained by pyrosequencing (threshold for positivity at 20 %) and MS-HRM were 13.3 % (32/240) and 13.8 % (33/239), respectively. The pyrosequencing threshold for positivity of 20 % showed the best correlation with MS-HRM and direct-MSP results. Nested-MSP detected RET promoter CpG island methylation in deceased patients with a higher sensitivity (33.1 %) compared to direct-MSP (10.7 %), pyrosequencing (14.4 %), and MS-HRM (15.4 %). 
While RET methylation frequencies detected by nested-MSP, pyrosequencing, and MS-HRM varied, the prognostic effect seemed similar (HR 1.74, 95 % CI 0.97-3.15; HR 1.85, 95 % CI 0.93-3.86; HR 1.83, 95 % CI 0.92-3.65, respectively). Our results show that upon optimizing and aligning four RET methylation assays with regard to primer location and sensitivity, differences in methylation frequencies and clinical sensitivities are observed; however, the effect on the marker's prognostic outcome is minimal.
The present study investigated whether combining of targeted analytical chemistry methods with unsupervised, data-rich methodologies (i.e. transcriptomics) can be utilized to evaluate relative contributions of wastewater treatment plant (WWTP) effluents to biological effects. The...
Lee, Sung Hak; Chung, Arthur Minwoo; Lee, Ahwon; Oh, Woo Jin; Choi, Yeong Jin; Lee, Youn-Soo; Jung, Eun Sun
2017-01-01
Mutations in the KRAS gene have been identified in approximately 50% of colorectal cancers (CRCs). KRAS mutations are well established biomarkers in anti-epidermal growth factor receptor therapy. Therefore, assessment of KRAS mutations is needed in CRC patients to ensure appropriate treatment. We compared the analytical performance of the cobas test to Sanger sequencing in 264 CRC cases. In addition, discordant specimens were evaluated by 454 pyrosequencing. KRAS mutations for codons 12/13 were detected in 43.2% of cases (114/264) by Sanger sequencing. Of 257 evaluable specimens for comparison, KRAS mutations were detected in 112 cases (43.6%) by Sanger sequencing and 118 cases (45.9%) by the cobas test. Concordance between the cobas test and Sanger sequencing for each lot was 93.8% positive percent agreement (PPA) and 91.0% negative percent agreement (NPA) for codons 12/13. Results from the cobas test and Sanger sequencing were discordant for 20 cases (7.8%). Twenty discrepant cases were subsequently subjected to 454 pyrosequencing. After comprehensive analysis of the results from combined Sanger sequencing-454 pyrosequencing and the cobas test, PPA was 97.5% and NPA was 100%. The cobas test is an accurate and sensitive test for detecting KRAS-activating mutations and has analytical power equivalent to Sanger sequencing. Prescreening using the cobas test with subsequent application of Sanger sequencing is the best strategy for routine detection of KRAS mutations in CRC.
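The positive and negative percent agreement figures in the abstract above follow from a 2x2 comparison of the cobas test against Sanger sequencing as the reference. The sketch below shows the arithmetic. The four cell counts are not stated in the abstract; they are back-calculated here from the reported totals (257 evaluable specimens, 112 Sanger-positive, 118 cobas-positive, 20 discordant) and should be read as an inference. The function name `percent_agreement` is hypothetical.

```python
def percent_agreement(both_pos, ref_pos_only, test_pos_only, both_neg):
    """Return (PPA, NPA) of a test relative to a reference method.

    PPA = test-positive share of reference-positive specimens.
    NPA = test-negative share of reference-negative specimens.
    """
    ppa = both_pos / (both_pos + ref_pos_only)
    npa = both_neg / (both_neg + test_pos_only)
    return ppa, npa


# Inferred cells (assumption): 105 positive by both methods, 7 Sanger-only,
# 13 cobas-only, 132 negative by both. They sum to 257 specimens, with
# 7 + 13 = 20 discordant cases.
ppa, npa = percent_agreement(both_pos=105, ref_pos_only=7,
                             test_pos_only=13, both_neg=132)
print(f"PPA = {ppa:.4f}, NPA = {npa:.4f}")
```

With these inferred counts the ratios are 105/112 and 132/145, which agree with the reported 93.8% and 91.0% to one decimal place.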
Logares, Ramiro; Audic, Stephane; Santini, Sebastien; Pernice, Massimo C; de Vargas, Colomban; Massana, Ramon
2012-01-01
Flagellated heterotrophic microeukaryotes have key roles for the functioning of marine ecosystems as they channel large amounts of organic carbon to the upper trophic levels and control the population sizes of bacteria and archaea. Still, we know very little on the diversity patterns of most groups constituting this evolutionary heterogeneous assemblage. Here, we investigate 11 groups of uncultured flagellates known as MArine STramenopiles (MASTs). MASTs are ecologically very important and branch at the base of stramenopiles. We explored the diversity patterns of MASTs using pyrosequencing (18S rDNA) in coastal European waters. We found that MAST groups range from highly to lowly diversified. Pyrosequencing (hereafter '454') allowed us to approach the limits of taxonomic diversity for all MAST groups, which varied in one order of magnitude (tens to hundreds) in terms of operational taxonomic units (98% similarity). We did not evidence large differences in activity, as indicated by ratios of DNA:RNA-reads. Most groups were strictly planktonic, although we found some groups that were active in sediments and even in anoxic waters. The proportion of reads per size fraction indicated that most groups were composed of very small cells (∼2–5 μm). In addition, phylogenetically different assemblages appeared to be present in different size fractions, depths and geographic zones. Thus, MAST diversity seems to be highly partitioned in spatial scales. Altogether, our results shed light on these ecologically very important but poorly known groups of uncultured marine flagellates. PMID:22534609
Pyrosequencing of prey DNA in reptile faeces: analysis of earthworm consumption by slow worms.
Brown, David S; Jarman, Simon N; Symondson, William O C
2012-03-01
Little quantitative ecological information exists on the diets of most invertebrate feeding reptiles, particularly nocturnal or elusive species that are difficult to observe. In the UK and elsewhere, reptiles are legally required to be relocated before land development can proceed, but without knowledge of their dietary requirements, the suitability of receptor sites cannot be known. Here, we tested the ability of non-invasive DNA-based molecular diagnostics (454 pyrosequencing) to analyse reptile diets, with the specific aims of determining which earthworm species are exploited by slow worms (the legless lizard Anguis fragilis) and whether they feed on the deeper-living earthworm species that only come to the surface at night. Slow worm faecal samples from four different habitats were analysed using earthworm-specific PCR primers. We found that 86% of slow worms (N=80) had eaten earthworms. In lowland heath and marshy/acid grassland, Lumbricus rubellus, a surface-dwelling epigeic species, dominated slow worm diet. In two other habitats, riverside pasture and calciferous coarse grassland, diet was dominated by deeper-living anecic and endogeic species. We conclude that all species of earthworm are exploited by these reptiles and lack of specialization allows slow worms to thrive in a wide variety of habitats. Pyrosequencing of prey DNA in faeces showed promise as a practical, rapid and relatively inexpensive means of obtaining detailed and valuable ecological information on the diets of reptiles. © 2011 Blackwell Publishing Ltd.
DOGMA: domain-based transcriptome and proteome quality assessment.
Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten
2016-09-01
Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Genetic adaptations of the plateau zokor in high-elevation burrows.
Shao, Yong; Li, Jin-Xiu; Ge, Ri-Li; Zhong, Li; Irwin, David M; Murphy, Robert W; Zhang, Ya-Ping
2015-11-25
The plateau zokor (Myospalax baileyi) spends its entire life underground in sealed burrows. Confronting limited oxygen, high carbon dioxide concentrations and complete darkness, it epitomizes a successful physiological adaptation. Here, we employ transcriptome sequencing to explore the genetic underpinnings of its adaptations to this unique habitat. Compared to Rattus norvegicus, genes belonging to GO categories related to energy metabolism (e.g. mitochondrion and fatty acid beta-oxidation) underwent accelerated evolution in the plateau zokor. Furthermore, the numbers of positively selected genes were significantly enriched in the gene categories involved in ATPase activity, blood vessel development and respiratory gaseous exchange, functional categories that are relevant to adaptation to high altitudes. Among the 787 genes with evidence of parallel evolution, and thus identified as candidate genes, several GO categories (e.g. response to hypoxia, oxygen homeostasis and erythrocyte homeostasis) are significantly enriched. Among these candidates are two genes, EPAS1 and AJUBA, involved in the response to hypoxia, where the parallel evolved sites are at positions that are highly conserved in sequence alignments from multiple species. Thus, accelerated evolution of GO categories, positive selection and parallel evolution at the molecular level provide evidence for parsing the genetic adaptations of the plateau zokor for living in high-elevation burrows.
Comparative transcriptomics of early dipteran development
2013-01-01
Background Modern sequencing technologies have massively increased the amount of data available for comparative genomics. Whole-transcriptome shotgun sequencing (RNA-seq) provides a powerful basis for comparative studies. In particular, this approach holds great promise for emerging model species in fields such as evolutionary developmental biology (evo-devo). Results We have sequenced early embryonic transcriptomes of two non-drosophilid dipteran species: the moth midge Clogmia albipunctata, and the scuttle fly Megaselia abdita. Our analysis includes a third, published, transcriptome for the hoverfly Episyrphus balteatus. These emerging models for comparative developmental studies close an important phylogenetic gap between Drosophila melanogaster and other insect model systems. In this paper, we provide a comparative analysis of early embryonic transcriptomes across species, and use our data for a phylogenomic re-evaluation of dipteran phylogenetic relationships. Conclusions We show how comparative transcriptomics can be used to create useful resources for evo-devo, and to investigate phylogenetic relationships. Our results demonstrate that de novo assembly of short (Illumina) reads yields high-quality, high-coverage transcriptomic data sets. We use these data to investigate deep dipteran phylogenetic relationships. Our results, based on a concatenation of 160 orthologous genes, provide support for the traditional view of Clogmia being the sister group of Brachycera (Megaselia, Episyrphus, Drosophila), rather than that of Culicomorpha (which includes mosquitoes and blackflies). PMID:23432914
Smola, Matthew J.; Rice, Greggory M.; Busan, Steven; Siegfried, Nathan A.; Weeks, Kevin M.
2016-01-01
SHAPE chemistries exploit small electrophilic reagents that react with the 2′-hydroxyl group to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues based on the ability of reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as for simple model RNAs. This protocol describes the experimental steps, implemented over three days, required to perform SHAPE probing and construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. These steps include RNA folding and SHAPE structure probing, mutational profiling by reverse transcription, library construction, and sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots, and provides useful troubleshooting information, often within an hour. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures, and visualize probable and alternative helices, often in under a day. We illustrate these algorithms with the E. coli thiamine pyrophosphate riboswitch, E. coli 16S rRNA, and HIV-1 genomic RNAs. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles, and entire transcriptomes. The straightforward MaP strategy greatly expands the number, length, and complexity of analyzable RNA structures. PMID:26426499
Naumenko, Sergey A; Logacheva, Maria D; Popova, Nina V; Klepikova, Anna V; Penin, Aleksey A; Bazykin, Georgii A; Etingova, Anna E; Mugue, Nikolai S; Kondrashov, Alexey S; Yampolsky, Lev Y
2017-01-01
Endemic species flocks inhabiting ancient lakes, oceanic islands and other long-lived isolated habitats are often interpreted as adaptive radiations. Yet molecular evidence for directional selection during species flocks radiation is scarce. Using partial transcriptomes of 64 species of Lake Baikal (Siberia, Russia) endemic amphipods and two nonendemic outgroups, we report a revised phylogeny of this species flock and analyse evidence for positive selection within the endemic lineages. We confirm two independent invasions of amphipods into Baikal and demonstrate that several morphological features of Baikal amphipods, such as body armour and reduction in appendages and sensory organs, evolved in several lineages in parallel. Radiation of Baikal amphipods has been characterized by short phylogenetic branches and frequent episodes of positive selection which tended to be more frequent in the early phase of the second invasion of amphipods into Baikal when the most intensive diversification occurred. Notably, signatures of positive selection are frequent in genes encoding mitochondrial membrane proteins with electron transfer chain and ATP synthesis functionality. In particular, subunits of both the membrane and substrate-level ATP synthases show evidence of positive selection in the plankton species Macrohectopus branickii, possibly indicating adaptation to active plankton lifestyle and to survival under conditions of low temperature and high hydrostatic pressures known to affect membranes functioning. Other functional categories represented among genes likely to be under positive selection include Ca-binding muscle-related proteins, possibly indicating adaptation to Ca-deficient low mineralization Baikal waters. © 2016 John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oberemm, A., E-mail: axel.oberemm@bfr.bund.d; Ahr, H.-J.; Bannasch, P.
2009-12-01
A common animal model of chemical hepatocarcinogenesis was used to examine the utility of transcriptomic and proteomic data to identify early biomarkers related to chemically induced carcinogenesis. N-nitrosomorpholine, a frequently used genotoxic model carcinogen, was applied via drinking water at 120 mg/L to male Wistar rats for 7 weeks followed by an exposure-free period of 43 weeks. Seven specimens of each treatment group (untreated control and 120 mg/L N-nitrosomorpholine in drinking water) were sacrificed at nine time points during and after N-nitrosomorpholine treatment. Individual samples from the liver were prepared for histological and toxicogenomic analyses. For histological detection of preneoplastic and neoplastic tissue areas, sections were stained using antibodies against the placental form of glutathione-S-transferase (GST-P). Gene and protein expression profiles of liver tissue homogenates were analyzed using RG-U34A Affymetrix rat gene chips and two-dimensional gel electrophoresis-based proteomics, respectively. In order to compare results obtained by histopathology, transcriptomics and proteomics, GST-P-stained liver sections were evaluated morphometrically, which revealed a parallel time course of the area fraction of preneoplastic lesions and gene plus protein expression patterns. On the transcriptional level, an increase of hepatic GST-P expression was detectable as early as 3 weeks after study onset. Comparing deregulated genes and proteins, eight species were identified which showed a corresponding expression profile on both expression levels. Functional analysis suggests that these genes and corresponding proteins may be useful as biomarkers of early hepatocarcinogenesis.
Fujita, Hiroto; Kataoka, Yuka; Tobita, Seiji; Kuwahara, Masayasu; Sugimoto, Naoki
2016-07-19
We have developed a novel RNA detection method, termed signal amplification by ternary initiation complexes (SATIC), in which an analyte sample is simply mixed with the relevant reagents and allowed to stand for a short time under isothermal conditions (37 °C). The advantage of the technique is that there is no requirement for (i) heat annealing, (ii) thermal cycling during the reaction, (iii) a reverse transcription step, or (iv) enzymatic or mechanical fragmentation of the target RNA. SATIC involves the formation of a ternary initiation complex between the target RNA, a circular DNA template, and a DNA primer, followed by rolling circle amplification (RCA) to generate multiple copies of G-quadruplex (G4) on a long DNA strand like beads on a string. The G4s can be specifically fluorescence-stained with N(3)-hydroxyethyl thioflavin T (ThT-HE), which emits weakly with single- and double-stranded RNA/DNA but strongly with parallel G4s. An improved dual SATIC system, which involves the formation of two different ternary initiation complexes in the RCA process, exhibited a wide quantitative detection range of 1-5000 pM. Furthermore, this enabled visual observation-based RNA detection, which is more rapid and convenient than conventional isothermal methods, such as reverse transcription-loop-mediated isothermal amplification, signal mediated amplification of RNA technology, and RNA-primed rolling circle amplification. Thus, SATIC methodology may serve as an on-site and real-time measurement technique for transcriptomic biomarkers for various diseases.
NASA Astrophysics Data System (ADS)
Graças, D. A.; Ramos, R. T.; Sá, P. G.; Baraúna, R. A.; Schneider, M. C.; Silva, A.
2013-05-01
The Amazon region has enormous hydro potential which is used for power generation. In fact, there are several hydroelectric power stations (HPS) already installed and many under construction or designed. The Tucuruí HPS, the fifth largest in the world, is located in the Amazon. The construction of this hydroelectric dam flooded 2,400 km2 of forest, which is decomposing and releasing greenhouse gases such as methane (CH4). Methane is the most abundant organic gas in the atmosphere and the second most important greenhouse gas. In this study, we use semiconductor sequencing to assess the bacterial diversity along a water column 70 meters deep in the Tucuruí reservoir. One liter of water was collected every 10 meters along the water column for total DNA extraction. A fragment of approximately 150 base pairs of the 16S rRNA gene was amplified by polymerase chain reaction using universal primers. These fragments were then sequenced in parallel on the Ion Torrent® platform using barcodes on the 316 chip. After the quality filters, about 237 thousand reads were obtained, representing more than 300 Mbp. For bacterial diversity analysis, we used only reads longer than 100 base pairs. The taxonomic diversity was obtained from the Ribosomal Database Project Classifier and alpha diversity analysis (diversity indices and rarefaction) was performed using the RDP pyrosequencing pipeline. Although it is recommended for pyrosequencing data, that pipeline can also process data obtained from semiconductor sequencing, since both are FASTA files. Over 75% of the sequences were not classified in any phylum, which leads us to believe that there is a huge diversity in the bacterial environment whose function is still unclear. Among the sequences that could be classified, there is a predominance of Proteobacteria in all layers, but in higher concentrations at the lower layers. 
Cyanobacteria accounted for about 3% in the layers of 0m and 10m, leading us to conclude that oxygen production is considerable in these layers. The oxygen produced by Cyanobacteria, together with atmospheric oxygen, provides the ideal environment for the methanotrophic bacteria to oxidize methane. Indeed, methanotrophic bacteria represented approximately 10% in the upper layers. Another bacterial phylum well represented in the upper layers was Bacteroidetes, which accounted for about 3% in the layers of 0-30m. Rarefaction analyses, using a cutoff of 3%, indicate 3212, 6657, 10171, 4209, 10533, 74, 24345 and 64683 OTUs for the layers of 0, 10, 20, 30, 40, 50, 60 and 70 meters, respectively. Bacterial diversity seems to increase with depth, probably due to the large amount of organic matter deposited in the pellet. The 50 meter depth layer showed the lowest diversity due to low quality sequencing of this barcode, which hampered the analysis. The abundance of methanotrophic bacteria shows that the microbial profile of the reservoir is able to consume much of the methane produced by methanogenic archaea in the sediment and that there is a huge diversity whose function is still unknown. The use of semiconductor sequencing proved to be a robust tool for analysis of the microbial community, as an alternative to pyrosequencing.
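The alpha diversity analysis named in the abstract above (diversity indices and rarefaction) can be illustrated with a minimal sketch. It does not reproduce the RDP pipeline: it assumes reads have already been assigned to OTUs, and the OTU labels, toy counts and function names are all hypothetical.

```python
import math
import random
from collections import Counter


def shannon_index(otu_counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over OTU relative abundances."""
    total = sum(otu_counts.values())
    return -sum((n / total) * math.log(n / total)
                for n in otu_counts.values() if n > 0)


def rarefaction_curve(otu_counts, depths, trials=20, seed=0):
    """Mean number of distinct OTUs seen when subsampling reads without replacement."""
    rng = random.Random(seed)
    # Expand the count table back into one OTU label per read.
    reads = [otu for otu, n in otu_counts.items() for _ in range(n)]
    curve = []
    for depth in depths:
        observed = [len(set(rng.sample(reads, depth))) for _ in range(trials)]
        curve.append((depth, sum(observed) / trials))
    return curve


# Toy OTU table for one depth layer (made-up numbers, 100 reads in total).
toy_layer = Counter({"OTU_a": 50, "OTU_b": 30, "OTU_c": 15, "OTU_d": 4, "OTU_e": 1})

print(shannon_index(toy_layer))
print(rarefaction_curve(toy_layer, depths=[10, 50, 100]))
```

A curve that is still rising at the largest depth suggests that sequencing has not saturated the OTU richness of that layer, which is one way to read the large OTU counts reported for the deeper layers.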
Zhao, Li; Wit, Janneke; Svetec, Nicolas; Begun, David J.
2015-01-01
Gene expression variation within species is relatively common; however, the role of natural selection in the maintenance of this variation is poorly understood. Here we investigate low and high latitude populations of Drosophila melanogaster and its sister species, D. simulans, to determine whether the two species show similar patterns of population differentiation, consistent with a role for spatially varying selection in maintaining gene expression variation. We compared the whole male transcriptomes of D. melanogaster and D. simulans sampled from Panama City (Panama) and Maine (USA) at two temperatures. We observed a significant excess of genes exhibiting differential expression in both species, consistent with parallel adaptation to heterogeneous environments. Moreover, the majority of genes showing parallel expression differentiation showed the same direction of differential expression in the two species and the magnitudes of expression differences between high and low latitude populations were correlated across species, further bolstering the conclusion that parallelism for expression phenotypes results from spatially varying selection. However, the species also exhibited important differences in expression phenotypes. For example, genotype × environment interaction was much more common across the genome in D. melanogaster. Highly differentiated SNPs between low and high latitudes were enriched in the 3' UTRs and CDS of the geographically differently expressed genes in both species, consistent with an important role for cis-acting variants in driving local adaptation for expression-related phenotypes. PMID:25950438
Targeted exploration and analysis of large cross-platform human transcriptomic compendia
Zhu, Qian; Wong, Aaron K; Krishnan, Arjun; Aure, Miriam R; Tadych, Alicja; Zhang, Ran; Corney, David C; Greene, Casey S; Bongo, Lars A; Kristensen, Vessela N; Charikar, Moses; Li, Kai; Troyanskaya, Olga G.
2016-01-01
We present SEEK (http://seek.princeton.edu), a query-based search engine across very large transcriptomic data collections, including thousands of human data sets from almost 50 microarray and next-generation sequencing platforms. SEEK uses a novel query-level cross-validation-based algorithm to automatically prioritize data sets relevant to the query and a robust search approach to identify query-coregulated genes, pathways, and processes. SEEK provides cross-platform handling, multi-gene query search, iterative metadata-based search refinement, and extensive visualization-based analysis options. PMID:25581801
Quantitative phenotyping via deep barcode sequencing
Smith, Andrew M.; Heisler, Lawrence E.; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J.; Chee, Mark; Roth, Frederick P.; Giaever, Guri; Nislow, Corey
2009-01-01
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that ∼20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene–environment interactions on a genome-wide scale. PMID:19622793
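The idea behind barcode sequencing as described above, counting strain-specific barcodes in sequencing reads and comparing their abundance between conditions, can be sketched as follows. This is a simplified illustration and not the authors' analysis routine: the barcode sequences, strain names, the assumption that the barcode sits at the start of each read, and the pseudocount are all hypothetical.

```python
import math
from collections import Counter


def count_barcodes(reads, barcode_to_strain):
    """Count reads per strain, assuming the barcode is the read's first k bases."""
    k = len(next(iter(barcode_to_strain)))
    counts = Counter()
    for read in reads:
        strain = barcode_to_strain.get(read[:k])
        if strain is not None:  # reads with unknown barcodes are ignored
            counts[strain] += 1
    return counts


def log2_ratios(control, treated, pseudocount=1.0):
    """log2(treated / control) of depth-normalized barcode frequencies."""
    strains = set(control) | set(treated)
    c_total = sum(control.values()) + pseudocount * len(strains)
    t_total = sum(treated.values()) + pseudocount * len(strains)
    return {
        s: math.log2(((treated.get(s, 0) + pseudocount) / t_total)
                     / ((control.get(s, 0) + pseudocount) / c_total))
        for s in strains
    }


# Hypothetical 4-base barcodes and toy reads.
barcodes = {"ACGT": "strainA", "TTGA": "strainB"}
control_reads = ["ACGTCC"] * 4 + ["TTGACC"] * 4 + ["GGGGCC"]
treated_reads = ["ACGTCC"] * 1 + ["TTGACC"] * 7

control_counts = count_barcodes(control_reads, barcodes)
treated_counts = count_barcodes(treated_reads, barcodes)
print(log2_ratios(control_counts, treated_counts))
```

In this toy example a negative ratio marks a strain that is depleted under treatment; in a chemogenomic assay such depletion is the signal used to flag candidate drug targets. Exact prefix matching is the simplest choice here and would miss barcodes carrying sequencing errors or deviating from the expected sequence.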
Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
2012-01-01
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095
Grace, Peter M.; Hurley, Daniel; Barratt, Daniel T.; Tsykin, Anna; Watkins, Linda R.; Rolan, Paul E.; Hutchinson, Mark R.
2017-01-01
A quantitative, peripherally accessible biomarker for neuropathic pain has great potential to improve clinical outcomes. Based on the premise that peripheral and central immunity contribute to neuropathic pain mechanisms, we hypothesized that biomarkers could be identified from the whole blood of adult male rats, by integrating graded chronic constriction injury (CCI), ipsilateral lumbar dorsal quadrant (iLDQ) and whole blood transcriptomes, and pathway analysis with pain behavior. Correlational bioinformatics identified a range of putative biomarker genes for allodynia intensity, many encoding for proteins with a recognized role in immune/nociceptive mechanisms. A selection of these genes was validated in a separate replication study. Pathway analysis of the iLDQ transcriptome identified Fcγ and Fcε signaling pathways, among others. This study is the first to employ the whole blood transcriptome to identify pain biomarker panels. The novel correlational bioinformatics, developed here, selected such putative biomarkers based on a correlation with pain behavior and formation of signaling pathways with iLDQ genes. Future studies may demonstrate the predictive ability of these biomarker genes across other models and additional variables. PMID:22697386
Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J
2015-07-01
Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristem (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Martinovic-Weigelt, Dalma; Mehinto, Alvine C.; Ankley, Gerald T.; Denslow, Nancy D.; Barber, Larry B.; Lee, Kathy E.; King, Ryan J.; Schoenfuss, Heiko L.; Schroeder, Anthony L.; Villeneuve, Daniel L.
2014-01-01
The present study investigated whether a combination of targeted analytical chemistry information with unsupervised, data-rich biological methodology (i.e., transcriptomics) could be utilized to evaluate relative contributions of wastewater treatment plant (WWTP) effluents to biological effects. The effects of WWTP effluents on fish exposed to ambient, receiving waters were studied at three locations with distinct WWTP and watershed characteristics. At each location, 4 d exposures of male fathead minnows to the WWTP effluent and upstream and downstream ambient waters were conducted. Transcriptomic analyses were performed on livers using 15 000 feature microarrays, followed by a canonical pathway and gene set enrichment analyses. Enrichment of gene sets indicative of teleost brain–pituitary–gonadal–hepatic (BPGH) axis function indicated that WWTPs serve as an important source of endocrine active chemicals (EACs) that affect the BPGH axis (e.g., cholesterol and steroid metabolism were altered). The results indicated that transcriptomics may even pinpoint pertinent adverse outcomes (i.e., liver vacuolization) and groups of chemicals that preselected chemical analytes may miss. Transcriptomic Effects-Based monitoring was capable of distinguishing sites, and it reflected chemical pollution gradients, thus holding promise for assessment of relative contributions of point sources to pollution and the efficacy of pollution remediation.
Zhou, Xiaoxu; Wang, Hongdi; Cui, Jun; Qiu, Xuemei; Chang, Yaqing; Wang, Xiuli
2016-12-01
Tube feet, one of the ambulacral appendage types in Aspidochirote holothurioids, are known for their functions in locomotion, feeding, chemoreception, light sensitivity and respiration. In this study, we explored the characteristics of the tube foot transcriptome of the sea cucumber (Apostichopus japonicus). Our results showed that, among 390 unigenes specifically expressed in the tube foot, 190 were annotated. Based on the assembled transcriptome, we found 219,860 SNPs from 34,749 unigenes; 97,683, 53,624, 27,767 and 40,786 were located in CDSs, 5'-UTRs, 3'-UTRs and non-CDS regions, respectively. Furthermore, 12,114 SSRs were detected from 7394 unigenes. Target genes of four miRNAs specifically expressed in the tube foot (miR-29a, miR-29b, miR-278-3p and miR-2005) were also predicted based on the transcriptome; these include immune-related factors (MBL, VLRA, AjC3, MyD88, CFB), a skin pigmentation factor (MITF), a candidate regeneration factor (TRP) and a holothurian autolysis-related factor (CL). These results provide a relatively large number of molecular markers and transcriptome resources, and will provide a foundation for further analyses of the function and molecular mechanisms underlying the A. japonicus tube foot. Copyright © 2016 Elsevier Inc. All rights reserved.
Prokaryotic microbiota in the digestive cavity of the jellyfish Cotylorhiza tuberculata.
Cortés-Lara, Sara; Urdiain, Mercedes; Mora-Ruiz, Merit; Prieto, Laura; Rosselló-Móra, Ramon
2015-10-01
The microbiota associated with the gastric cavity of four exemplars of the jellyfish Cotylorhiza tuberculata has been studied by means of culture-dependent and -independent methods. The pyrosequencing approach rendered a very reduced diversity of Bacteria with four major groups shared by the four exemplars that made up to 95% of the total diversity. The culturing approach recovered low-abundance organisms, some of which were also detected by the pyrosequencing approach. The major key organisms were related to the genera Spiroplasma, Thalassospira, Tenacibaculum (from the pyrosequencing data), and Vibrio (from the cultivable fraction). Altogether the results indicate that C. tuberculata harbors an associated microbiota of very reduced diversity. On the other hand, some of the major key players may be potential pathogens and the host may serve as a dispersal mechanism. Copyright © 2015 Elsevier GmbH. All rights reserved.
Schmidt, Anja; Wuest, Samuel E.; Vijverberg, Kitty; Baroux, Célia; Kleen, Daniela; Grossniklaus, Ueli
2011-01-01
Germ line specification is a crucial step in the life cycle of all organisms. For sexual plant reproduction, the megaspore mother cell (MMC) is of crucial importance: it marks the first cell of the plant “germline” lineage that gets committed to undergo meiosis. One of the meiotic products, the functional megaspore, subsequently gives rise to the haploid, multicellular female gametophyte that harbours the female gametes. The MMC is formed by selection and differentiation of a single somatic, sub-epidermal cell in the ovule. The transcriptional network underlying MMC specification and differentiation is largely unknown. We provide the first transcriptome analysis of an MMC using the model plant Arabidopsis thaliana with a combination of laser-assisted microdissection and microarray hybridizations. Statistical analyses identified an over-representation of translational regulation control pathways and a significant enrichment of DEAD/DEAH-box helicases in the MMC transcriptome, paralleling important features of the animal germline. Analysis of two independent T-DNA insertion lines suggests an important role of an enriched helicase, MNEME (MEM), in MMC differentiation and the restriction of the germline fate to only one cell per ovule primordium. In heterozygous mem mutants, additional enlarged MMC-like cells, which sometimes initiate female gametophyte development, were observed at higher frequencies than in the wild type. This closely resembles the phenotype of mutants affected in the small RNA and DNA-methylation pathways important for epigenetic regulation. Importantly, the mem phenotype shows features of apospory, as female gametophytes initiate from two non-sister cells in these mutants. Moreover, in mem gametophytic nuclei, both higher order chromatin structure and the distribution of LIKE HETEROCHROMATIN PROTEIN1 were affected, indicating epigenetic perturbations. 
In summary, the MMC transcriptome sets the stage for future functional characterization as illustrated by the identification of MEM, a novel gene involved in the restriction of germline fate. PMID:21949639
Zhang, Zhe; Tsukikawa, Mai; Peng, Min; Polyak, Erzsebet; Nakamaru-Ogiso, Eiko; Ostrovsky, Julian; McCormack, Shana; Place, Emily; Clarke, Colleen; Reiner, Gail; McCormick, Elizabeth; Rappaport, Eric; Haas, Richard; Baur, Joseph A.; Falk, Marni J.
2013-01-01
Primary mitochondrial respiratory chain (RC) diseases are heterogeneous in etiology and manifestations but collectively impair cellular energy metabolism. Mechanism(s) by which RC dysfunction causes global cellular sequelae are poorly understood. To identify a common cellular response to RC disease, integrated gene, pathway, and systems biology analyses were performed in human primary RC disease skeletal muscle and fibroblast transcriptomes. Significant changes were evident in muscle across diverse RC complex and genetic etiologies that were consistent with prior reports in other primary RC disease models and involved dysregulation of genes involved in RNA processing, protein translation, transport, and degradation, and muscle structure. Global transcriptional and post-transcriptional dysregulation was also found to occur in a highly tissue-specific fashion. In particular, RC disease muscle had decreased transcription of cytosolic ribosomal proteins suggestive of reduced anabolic processes, increased transcription of mitochondrial ribosomal proteins, shorter 5′-UTRs that likely improve translational efficiency, and stabilization of 3′-UTRs containing AU-rich elements. RC disease fibroblasts showed a strikingly similar pattern of global transcriptome dysregulation in a reverse direction. In parallel with these transcriptional effects, RC disease dysregulated the integrated nutrient-sensing signaling network involving FOXO, PPAR, sirtuins, AMPK, and mTORC1, which collectively sense nutrient availability and regulate cellular growth. Altered activities of central nodes in the nutrient-sensing signaling network were validated by phosphokinase immunoblot analysis in RC inhibited cells. Remarkably, treating RC mutant fibroblasts with nicotinic acid to enhance sirtuin and PPAR activity also normalized mTORC1 and AMPK signaling, restored NADH/NAD+ redox balance, and improved cellular respiratory capacity. 
These data specifically highlight a common pathogenesis extending across different molecular and biochemical etiologies of individual RC disorders that involves global transcriptome modifications. We further identify the integrated nutrient-sensing signaling network as a common cellular response that mediates, and may be amenable to targeted therapies for, tissue-specific sequelae of primary mitochondrial RC disease. PMID:23894440
Elucidating and mining the Tulipa and Lilium transcriptomes.
Moreno-Pachon, Natalia M; Leeggangers, Hendrika A C F; Nijveen, Harm; Severing, Edouard; Hilhorst, Henk; Immink, Richard G H
2016-10-01
Genome sequencing remains a challenge for species with large and complex genomes containing extensive repetitive sequences, of which the bulbous and monocotyledonous plants tulip and lily are examples. In such a case, sequencing of only the active part of the genome, represented by the transcriptome, is a good alternative to obtain information about gene content. In this study we aimed to generate a high quality transcriptome of tulip and lily and to make this data available as an open-access resource via a user-friendly web-based interface. The Illumina HiSeq 2000 platform was applied and the transcribed RNA was sequenced from a collection of different lily and tulip tissues, respectively. In order to obtain good transcriptome coverage and to facilitate effective data mining, assembly was done using different filtering parameters for clearing out contamination and noise of the RNAseq datasets. This analysis revealed limitations of commonly applied methods and parameter settings used in de novo transcriptome assembly. The final created transcriptomes are publicly available via a user friendly Transcriptome browser ( http://www.bioinformatics.nl/bulbs/db/species/index ). The usefulness of this resource has been exemplified by a search for all potential transcription factors in lily and tulip, with special focus on the TCP transcription factor family. This analysis and other quality parameters point out the quality of the transcriptomes, which can serve as a basis for further genomics studies in lily, tulip, and bulbous plants in general.
Korsak, N; Taminiau, B; Leclercq, M; Nezer, C; Crevecoeur, S; Ferauche, C; Detry, E; Delcenserie, V; Daube, G
2015-06-01
Milk kefir is produced by fermenting milk in the presence of kefir grains. This beverage has several benefits for human health. The aim of this experiment was to analyze 5 kefir grains (and their products) using a targeted metagenetic approach. Of the 5 kefir grains analyzed, 1 was purchased in a supermarket, 2 were provided by the Ministry of Agriculture (Namur, Belgium), and 2 were provided by individuals. The metagenetic approach targeted the V1-V3 fragment of the 16S ribosomal (r)DNA for the grains and the resulting beverages at 2 levels of grain incorporation (5 and 10%) to identify the bacterial species population. In contrast, the 26S rDNA pyrosequencing was performed only on kefir grains with the aim of assessing the yeast populations. In parallel, pH measurements were performed on the kefir obtained from the kefir grains using 2 incorporation rates. Regarding the bacterial population, 16S pyrosequencing revealed the presence of 20 main bacterial species, with a dominance of the following: Lactobacillus kefiranofaciens, Lactococcus lactis ssp. cremoris, Gluconobacter frateurii, Lactobacillus kefiri, Acetobacter orientalis, and Acetobacter lovaniensis. An important difference was noticed between the kefir samples: kefir grain purchased from a supermarket (sample E) harbored a much higher proportion of several operational taxonomic units of Lactococcus lactis and Leuconostoc mesenteroides. This sample of grain was macroscopically different from the others in terms of size, apparent cohesion of the grains, structure, and texture, probably associated with a lower level of Lactobacillus kefiranofaciens. The kefir (at an incorporation rate of 5%) produced from this sample of grain was characterized by a lower pH value (4.5) than the others. The other 4 samples of kefir (5%) had pH values above 5. Comparing the kefir grain and the kefir, an increase in the population of Gluconobacter in grain sample B was observed. 
This was also the case for Acetobacter orientalis in sample D. In relation to 26S pyrosequencing, our study revealed the presence of 3 main yeast species: Naumovozyma spp., Kluyveromyces marxianus, and Kazachstania khefir. For Naumovozyma, further studies are needed to assess the isolation of new species. In conclusion, this study has proved that it is possible to establish the patterns of bacterial and yeast composition of kefir and kefir grain. This was only achieved with the use of high-throughput sequencing techniques. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Rocke, Emma; Jing, Hongmei; Xia, Xiaomin; Liu, Hongbin
2016-07-01
Tolo Harbor, a subtropical semi-enclosed coastal water body, is surrounded by an expanding urban community, which contributes to large concentrations of nutrient runoff, leading to algal blooms and localized hypoxic episodes. Present knowledge of protist distributions in subtropical waters during hypoxic conditions is very limited. In this study, therefore, we combined parallel 454 pyrosequencing technology and denaturing gradient gel electrophoresis (DGGE) fingerprint analyses to reveal the protist community shifts before, during, and after a 2-week hypoxic episode during the summer of 2011. Hierarchical clustering for DGGE demonstrated similar grouping of hypoxic samples separately from oxic samples. Dissolved oxygen (DO) concentration and dissolved inorganic nitrogen:phosphate (DIN:PO4) concentrations significantly affected OTU distribution in 454 sequenced samples, and a shift toward a ciliate and marine alveolate clade II (MALV II) species composition occurred as waters shifted from oxic to hypoxic. These results suggest that the protist community shifts toward heterotrophic and parasitic tendencies, with decreased diversity and richness, in response to hypoxic outbreaks.
Descriptive Biomarkers for Assessing Breast Cancer Risk
2010-10-01
and we are making significant progress on Tasks 6 and 7. We completed methylation analyses of three genes (RASSF1, SFRP1 and GSTP1) on all samples...promoter hypermethylation; RASSF1, GSTP1, SFRP1 12 karcaro@nre.umass.edu Arcaro, Kathleen F Annual Report...methylation analysis by pyrosequencing. PCR amplification and pyrosequencing has been completed for three genes, RASSF1, SFRP1 and GSTP1 and have
Pu, Jian; Kazama, Shinobu; Miura, Takayuki; Azraini, Nabila Dhyan; Konta, Yoshimitsu; Ito, Hiroaki; Ueki, You; Cahyaningrum, Ermaya Eka; Omura, Tatsuo; Watanabe, Toru
2016-12-01
Norovirus GII.3, GII.4, and GII.17 were detected using pyrosequencing in sewage and oysters in January and February 2015 in Japan. The strains in sewage and oyster samples were genetically identical or similar, with the predominant strains belonging to the GII.17 Kawasaki 2014 lineage. This is the first report of GII.17 Kawasaki 2014 in oysters.
Thulin, Sara; Olcén, Per; Fredlund, Hans; Unemo, Magnus
2008-01-01
A segment of penA in Neisseria meningitidis strains (n = 127), including two nucleotide sites closely associated with reduced susceptibility to penicillins, was amplified and pyrosequenced. All results were in concordance with Sanger sequencing, and a high correlation between alterations in the two Peni-specific sites and reduced susceptibility to penicillins was identified. PMID:18070955
Trama, Jason P; Adelson, Martin E; Mordechai, Eli
2007-12-01
Laboratory diagnosis of molluscum contagiosum virus (MCV) is important as lesions can be confused with those caused by Cryptococcus neoformans, herpes simplex virus, human papillomavirus, and varicella-zoster virus. The objective was to develop a rapid method for identifying patients infected with MCV via swab sampling. Two dual-labeled probe real-time PCR assays, one homologous to the p43K gene and one to the MC080R gene, were designed. The p43K PCR was designed to be used in conjunction with Pyrosequencing for confirmation of PCR products and discrimination between MCV1 and MCV2. Both PCR assays were optimized with respect to reaction components, thermocycling parameters, and primer and probe concentrations. The specificities of both PCR assays were confirmed by non-amplification of 38 known human pathogens. Sensitivity assays demonstrated detection of as few as 10 copies per reaction. In testing of 703 swabs, concordance between the two real-time PCR assays was 99.9%. Under the developed conditions, Pyrosequencing of the p43K PCR product was capable of providing enough nucleotide sequence to definitively differentiate MCV1 and MCV2. These real-time PCR assays can be used for the rapid, sensitive, and specific detection of MCV and, when combined with Pyrosequencing, can further discriminate between MCV1 and MCV2.
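The concordance figure above is a simple agreement rate between two assays. The sketch below shows how such a rate could be computed and how it rounds for a panel of 703 swabs. The number of discordant swabs is not given in the abstract, so the values looped over here are assumptions used purely for illustration, and `concordance` is a hypothetical helper name.

```python
def concordance(n_total, n_discordant):
    """Fraction of samples on which two assays give the same call."""
    if n_total <= 0 or not 0 <= n_discordant <= n_total:
        raise ValueError("counts out of range")
    return (n_total - n_discordant) / n_total


N_SWABS = 703  # number of swabs tested, from the abstract

# Assumed discordant counts (not reported in the abstract), shown to see
# which of them would round to the quoted one-decimal percentage.
for n_discordant in range(4):
    rate = concordance(N_SWABS, n_discordant)
    print(f"{n_discordant} discordant -> {rate:.1%}")
```

Under these assumptions, a single discordant swab out of 703 is the case that rounds to 99.9%.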
Manousaki, Tereza; Hull, Pincelli M; Kusche, Henrik; Machado-Schiaffino, Gonzalo; Franchini, Paolo; Harrod, Chris; Elmer, Kathryn R; Meyer, Axel
2013-02-01
The study of parallel evolution facilitates the discovery of common rules of diversification. Here, we examine the repeated evolution of thick lips in Midas cichlid fishes (the Amphilophus citrinellus species complex) from two Great Lakes and two crater lakes in Nicaragua to assess whether similar changes in ecology, phenotypic trophic traits and gene expression accompany parallel trait evolution. Using next-generation sequencing technology, we characterize transcriptome-wide differential gene expression in the lips of wild-caught sympatric thick- and thin-lipped cichlids from all four instances of repeated thick-lip evolution. Six genes (apolipoprotein D, myelin-associated glycoprotein precursor, four-and-a-half LIM domain protein 2, calpain-9, GTPase IMAP family member 8-like and one hypothetical protein) are significantly underexpressed in the thick-lipped morph across all four lakes. However, other aspects of lips' gene expression in sympatric morphs differ in a lake-specific pattern, including the magnitude of differentially expressed genes (97-510). Generally, fewer genes are differentially expressed among morphs in the younger crater lakes than in those from the older Great Lakes. Body shape, lower pharyngeal jaw size and shape, and stable isotopes (δ13C and δ15N) differ between all sympatric morphs, with the greatest differentiation in the Great Lake Nicaragua. Some ecological traits evolve in parallel (those related to foraging ecology; e.g. lip size, body and head shape) but others, somewhat surprisingly, do not (those related to diet and food processing; e.g. jaw size and shape, stable isotopes). Taken together, this case of parallelism among thick- and thin-lipped cichlids shows a mosaic pattern of parallel and nonparallel evolution. © 2012 Blackwell Publishing Ltd.
Genome-wide analysis of miRNA and mRNA transcriptomes during amelogenesis.
Yin, Kaifeng; Hacia, Joseph G; Zhong, Zhe; Paine, Michael L
2014-11-19
In the rodent incisor during amelogenesis, as ameloblast cells transition from secretory stage to maturation stage, their morphology and transcriptome profiles change dramatically. Prior whole genome transcriptome analysis has given a broad picture of the molecular activities dominating both stages of amelogenesis, but this type of analysis has not included miRNA transcript profiling. In this study, we set out to document which miRNAs and corresponding target genes change significantly as ameloblasts transition from secretory- to maturation-stage amelogenesis. Total RNA samples from both secretory- and maturation-stage rat enamel organs were subjected to genome-wide miRNA and mRNA transcript profiling. We identified 59 miRNAs that were differentially expressed at the maturation stage relative to the secretory stage of enamel development (False Discovery Rate (FDR)<0.05, fold change (FC)≥1.8). In parallel, transcriptome profiling experiments identified 1,729 mRNA transcripts that were differentially expressed in the maturation stage compared to the secretory stage (FDR<0.05, FC≥1.8). Based on bioinformatics analyses, 5.8% (629 total) of these differentially expressed genes (DEGs) were highlighted as being the potential targets of 59 miRNAs that were differentially expressed in the opposite direction, in the same tissue samples. Although the number of predicted target DEGs was not higher than baseline expectations generated by examination of stably expressed miRNAs, Gene Ontology (GO) analysis showed that these 629 DEGs were enriched for ion transport, pH regulation, calcium handling, endocytotic, and apoptotic activities. Seven differentially expressed miRNAs (miR-21, miR-31, miR-488, miR-153, miR-135b, miR-135a and miR-298) in secretory- and/or maturation-stage enamel organs were confirmed by in situ hybridization. 
Further, we used luciferase reporter assays to provide evidence that two of these differentially expressed miRNAs, miR-153 and miR-31, are potential regulators for their predicted target mRNAs, Lamp1 (miR-153) and Tfrc (miR-31). In conclusion, these data indicate that miRNAs exhibit a dynamic expression pattern during the transition from secretory-stage to maturation-stage tooth enamel formation. Although they represent only one of numerous mechanisms influencing gene activities, miRNAs specific to the maturation stage could be involved in regulating several key processes of enamel maturation by influencing mRNA stability and translation.
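The selection criteria quoted above (FDR<0.05, FC≥1.8) amount to a two-condition filter on each transcript. The sketch below illustrates one way such a filter could be written. It assumes that fold change is a maturation/secretory ratio and that up- and down-regulation are treated symmetrically, neither of which is stated in the abstract; the transcript names and values are hypothetical placeholders, not data from the study.

```python
from dataclasses import dataclass


@dataclass
class Result:
    name: str
    fold_change: float  # assumed ratio maturation/secretory; below 1 means lower at maturation
    fdr: float          # multiple-testing-adjusted p-value


def is_differential(r, fc_cutoff=1.8, fdr_cutoff=0.05):
    """Apply the FDR<0.05 and FC>=1.8 thresholds quoted in the abstract."""
    if r.fold_change <= 0:
        raise ValueError("fold change must be a positive ratio")
    # Assumption: a ratio of 1/1.8 counts as a 1.8-fold decrease.
    magnitude = r.fold_change if r.fold_change >= 1 else 1 / r.fold_change
    return r.fdr < fdr_cutoff and magnitude >= fc_cutoff


# Hypothetical example records, for illustration only.
results = [
    Result("tx_A", 2.5, 0.01),   # passes both thresholds (up)
    Result("tx_B", 0.4, 0.02),   # passes both thresholds (down, 2.5-fold)
    Result("tx_C", 1.5, 0.001),  # significant but fold change too small
    Result("tx_D", 3.0, 0.20),   # large fold change but not significant
]

selected = [r.name for r in results if is_differential(r)]
print(selected)
```

Requiring both conditions keeps transcripts whose change is statistically supported and also large enough to be of practical interest.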
Petitot, Anne-Sophie; Kyndt, Tina; Haidar, Rana; Dereeper, Alexis; Collin, Myriam; de Almeida Engler, Janice; Gheysen, Godelieve; Fernandez, Diana
2017-03-01
The root-knot nematode Meloidogyne graminicola is responsible for production losses in rice (Oryza sativa) in Asia and Latin America. The accession TOG5681 of African rice, O. glaberrima, presents improved resistance to several biotic and abiotic factors, including nematodes. The aim of this study was to assess the cytological and molecular mechanisms underlying nematode resistance in this accession. Penetration and development of M. graminicola in TOG5681 and the susceptible O. sativa genotype 'Nipponbare' were compared by microscopic observation of infected roots and histological analysis of galls. In parallel, host molecular responses to M. graminicola were assessed by root transcriptome profiling at 2, 4 and 8 d post-infection (dpi). Specific treatments with hormone inhibitors were conducted in TOG5681 to assess the impact of the jasmonic acid and salicylic acid pathways on nematode penetration and reproduction. Penetration and development of M. graminicola juveniles were reduced in the resistant TOG5681 in comparison with the susceptible accession, with degeneration of giant cells observed in the resistant genotype from 15 dpi onwards. Transcriptome changes were observed as early as 2 dpi, with genes predicted to be involved in defence responses, phenylpropanoid and hormone pathways strongly induced in TOG5681, in contrast to 'Nipponbare'. No specific hormonal pathway could be identified as the major determinant of resistance in the rice-nematode incompatible interaction. Candidate genes proposed as involved in resistance to M. graminicola in TOG5681 were identified based on their expression pattern and quantitative trait locus (QTL) position, including chalcone synthase, isoflavone reductase, phenylalanine ammonia lyase, WRKY62 transcription factor, thionin, stripe rust resistance protein, thaumatins and ATPase3. This study provides a novel set of candidate genes for O. glaberrima resistance to nematodes and highlights the rice-M. graminicola pathosystem as a model to study plant-nematode incompatible interactions. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Welkie, David; Zhang, Xiaohui; Markillie, Meng Lye; Taylor, Ronald; Orr, Galya; Jacobs, Jon; Bhide, Ketaki; Thimmapuram, Jyothi; Gritsenko, Marina; Mitchell, Hugh; Smith, Richard D; Sherman, Louis A
2014-12-29
Cyanothece sp. PCC 7822 is an excellent cyanobacterial model organism with great potential to be applied as a biocatalyst for the production of high value compounds. Like other unicellular diazotrophic cyanobacterial species, it has a tightly regulated metabolism synchronized to the light-dark cycle. Utilizing transcriptomic and proteomic methods, we quantified the relationships between transcription and translation underlying central and secondary metabolism in response to nitrogen free, 12 hour light and 12 hour dark conditions. By combining mass-spectrometry based proteomics and RNA-sequencing transcriptomics, we quantitatively measured a total of 6766 mRNAs and 1322 proteins at four time points across a 24 hour light-dark cycle. Photosynthesis, nitrogen fixation, and carbon storage relevant genes were expressed during the preceding light or dark period, concurrent with measured nitrogenase activity in the late light period. We describe many instances of disparity in peak mRNA and protein abundances, and strong correlation of light dependent expression of both antisense and CRISPR-related gene expression. The proteins for nitrogenase and the pentose phosphate pathway were highest in the dark, whereas those for glycolysis and the TCA cycle were more prominent in the light. Interestingly, one copy of the psbA gene encoding the photosystem II (PSII) reaction center protein D1 (psbA4) was highly upregulated only in the dark. This protein likely cannot catalyze O2 evolution and so may be used by the cell to keep PSII intact during N2 fixation. The CRISPR elements were found exclusively at the ends of the large plasmid and we speculate that their presence is crucial to the maintenance of this plasmid. This investigation of parallel transcriptional and translational activity within Cyanothece sp. PCC 7822 provided quantitative information on expression levels of metabolic pathways relevant to engineering efforts. 
The identification of expression patterns for both mRNA and protein affords a basis for improving biofuel production in this strain and for further genetic manipulations. Expression analysis of the genes encoded on the 6 plasmids provided insight into the possible acquisition and maintenance of some of these extra-chromosomal elements.
2014-01-01
Background The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. Description We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215–364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. 
Conclusions The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a “non-model system.” PMID:24467778
Stefanik, Derek J; Lubinski, Tristan J; Granger, Brian R; Byrd, Allyson L; Reitzel, Adam M; DeFilippo, Lukas; Lorenc, Allison; Finnerty, John R
2014-01-28
The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215-364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. 
In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a "non-model system."
Makita, Yuko; Kawashima, Mika; Lau, Nyok Sean; Othman, Ahmad Sofiman; Matsui, Minami
2018-01-19
Natural rubber is an economically important material. Currently the Pará rubber tree, Hevea brasiliensis, is the main commercial source. Little is known about rubber biosynthesis at the molecular level. Next-generation sequencing (NGS) technologies have yielded draft genomes of three rubber cultivars and a variety of RNA sequencing (RNA-seq) data. However, no current genome or transcriptome databases (DB) are organized by gene. A gene-oriented database is a valuable support for rubber research. Based on our original draft genome sequence of H. brasiliensis RRIM600, we constructed a rubber tree genome and transcriptome DB. Our DB provides genome information including gene functional annotations and multi-transcriptome data of RNA-seq, full-length cDNAs including PacBio Isoform sequencing (Iso-Seq), ESTs and genome-wide transcription start sites (TSSs) derived from CAGE technology. Using our original and publicly available RNA-seq data, we calculated co-expressed genes for identifying functionally related gene sets and/or genes regulated by the same transcription factor (TF). Users can access multi-transcriptome data through both a gene-oriented web page and a genome browser. For the gene searching system, we provide keyword search, sequence homology search and gene expression search; users can also select their expression threshold easily. The rubber genome and transcriptome DB provides rubber tree genome sequence and multi-transcriptomics data. This DB is useful for comprehensive understanding of the rubber transcriptome. This will assist both industrial and academic researchers working on rubber and on economically important close relatives such as R. communis, M. esculenta and J. curcas. The Rubber Transcriptome DB release 2017.03 is accessible at http://matsui-lab.riken.jp/rubber/ .
Sequencing, Annotation and Analysis of the Syrian Hamster (Mesocricetus auratus) Transcriptome
Tchitchek, Nicolas; Safronetz, David; Rasmussen, Angela L.; Martens, Craig; Virtaneva, Kimmo; Porcella, Stephen F.; Feldmann, Heinz
2014-01-01
Background The Syrian hamster (golden hamster, Mesocricetus auratus) is gaining importance as a new experimental animal model for multiple pathogens, including emerging zoonotic diseases such as Ebola. Nevertheless, there are currently no publicly available transcriptome reference sequences or genome for this species. Results A cDNA library derived from mRNA and snRNA isolated and pooled from the brains, lungs, spleens, kidneys, livers, and hearts of three adult female Syrian hamsters was sequenced. Sequence reads were assembled into 62,482 contigs, and 111,796 reads remained unassembled (singletons). This combined contig/singleton dataset, designated as the Syrian hamster transcriptome, represents a total of 60,117,204 nucleotides. Our Syrian hamster (Mesocricetus auratus) transcriptome mapped to 11,648 mouse transcripts representing 9,562 distinct genes, and mapped to a similar number of transcripts and genes in the rat. We identified 214 quasi-complete transcripts based on mouse annotations. Canonical pathways involved in a broad spectrum of fundamental biological processes were significantly represented in the library. The Syrian hamster transcriptome was aligned to the current release of the Chinese hamster ovary (CHO) cell transcriptome and genome to improve the genomic annotation of this species. Finally, our Syrian hamster transcriptome was aligned against 14 other rodent, primate and laurasiatherian species to gain insights about the genetic relatedness and placement of this species. Conclusions This Syrian hamster transcriptome dataset significantly improves our knowledge of the Syrian hamster's transcriptome, especially towards its future use in infectious disease research. Moreover, this library is an important resource for the wider scientific community to help improve genome annotation of the Syrian hamster and other closely related species. 
Furthermore, these data provide the basis for development of expression microarrays that can be used in functional genomics studies. PMID:25398096
Subha, Bakthavachallam; Song, Young Chae; Woo, Jung Hui
2015-09-15
The present study aims to optimize the slow release biostimulant ball (BSB) for bioremediation of contaminated coastal sediment using response surface methodology (RSM). Different bacterial communities were evaluated using a pyrosequencing-based approach in contaminated coastal sediments. The effects of BSB size (1-5cm), distance (1-10cm) and time (1-4months) on changes in chemical oxygen demand (COD) and volatile solid (VS) reduction were determined. Maximum reductions of COD and VS, 89.7% and 78.8%, respectively, were observed at a 3cm ball size, 5.5cm distance and 4months; these values are the optimum conditions for effective treatment of contaminated coastal sediment. Most of the variance in COD and VS (0.9291 and 0.9369, respectively) was explained in our chosen models. BSB is a promising method for COD and VS reduction and enhancement of SRB diversity. Copyright © 2015 Elsevier Ltd. All rights reserved.
Universal DNA-based methods for assessing the diet of grazing livestock and wildlife from feces.
Pegard, Anthony; Miquel, Christian; Valentini, Alice; Coissac, Eric; Bouvier, Frédéric; François, Dominique; Taberlet, Pierre; Engel, Erwan; Pompanon, François
2009-07-08
Because of the demand for controlling livestock diets, two methods that characterize the DNA of plants present in feces were developed. After DNA extraction from fecal samples, a short fragment of the chloroplastic trnL intron was amplified by PCR using a universal primer pair for plants. The first method generates a signature that is the electrophoretic migration pattern of the PCR product. The second method consists of sequencing several hundred DNA fragments from the PCR product through pyrosequencing. These methods were validated with a blind analysis of feces from concentrate- and pasture-fed lambs. The signature method allowed differentiation of the two diets and confirmed the presence of concentrate in one of them. The pyrosequencing method allowed the identification of up to 25 taxa in a diet. These methods are complementary to the chemical methods already used. They could be applied to the control of diets and the study of food preferences.
Duarte, A P M; Ferro, M; Rodrigues, A; Bacci, M; Nagamoto, N S; Forti, L C; Pagnocca, F C
2016-09-01
The relationship of attine ants with their mutualistic fungus and other microorganisms has been studied during the last two centuries. However, previous studies about the diversity of fungi in the ants' microenvironment are based mostly on culture-dependent approaches, lacking a broad characterization of the fungal ant-associated community. Here, we analysed the fungal diversity found on the integument of Atta capiguara and Atta laevigata alate ants using 454 pyrosequencing. We obtained 35,453 ITS reads grouped into 99 molecular operational taxonomic units (MOTUs). Data analysis revealed that A. capiguara drones had the highest diversity of MOTUs. Besides the occurrence of several uncultured fungi, the mycobiota analysis revealed that the most abundant taxa were the Cladosporium-complex, Cryptococcus laurentii and Epicoccum sp. Taxa in the genus Cladosporium were predominant in all samples, comprising 67.9 % of all reads. The remarkable presence of the genus Cladosporium on the integument of leaf-cutting ant alates from distinct ant species suggests that this fungus is favored in this microenvironment.
SC3 - consensus clustering of single-cell RNA-Seq data
Kiselev, Vladimir Yu.; Kirschner, Kristina; Schaub, Michael T.; Andrews, Tallulah; Yiu, Andrew; Chandra, Tamir; Natarajan, Kedar N; Reik, Wolf; Barahona, Mauricio; Green, Anthony R; Hemberg, Martin
2017-01-01
Single-cell RNA-seq (scRNA-seq) enables a quantitative cell-type characterisation based on global transcriptome profiles. We present Single-Cell Consensus Clustering (SC3), a user-friendly tool for unsupervised clustering which achieves high accuracy and robustness by combining multiple clustering solutions through a consensus approach. We demonstrate that SC3 is capable of identifying subclones based on the transcriptomes from neoplastic cells collected from patients. PMID:28346451
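The consensus idea described in the abstract above can be illustrated with a minimal sketch. This is not the SC3 implementation: the three label vectors are hypothetical, and the grouping step (linking cells that co-cluster in more than half of the runs) is a simplification chosen here for brevity. It only shows how several clustering solutions can be combined into one co-clustering matrix.

```python
def consensus_matrix(labelings):
    """Fraction of clustering runs in which each pair of cells shares a cluster."""
    n = len(labelings[0])
    counts = [[0] * n for _ in range(n)]
    for labels in labelings:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[c / len(labelings) for c in row] for row in counts]


def consensus_groups(matrix, threshold=0.5):
    """Link cells whose co-clustering fraction exceeds the threshold (union-find)."""
    n = len(matrix)
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if matrix[i][j] > threshold:
                parent[find(j)] = find(i)
    return [find(i) for i in range(n)]


# Hypothetical cluster labels for five cells from three clustering runs.
run_a = [0, 0, 0, 1, 1]
run_b = [1, 1, 0, 0, 0]
run_c = [0, 0, 0, 1, 1]

matrix = consensus_matrix([run_a, run_b, run_c])
groups = consensus_groups(matrix)
```

Cells that receive the same value in `groups` were assigned together in the majority of runs, even though the individual runs disagree about the third cell.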
Yamamuro, Ayaka; Kouzuma, Atsushi; Abe, Takashi; Watanabe, Kazuya
2014-01-01
Methanol is widely used in industrial processes, and as such, is discharged in large quantities in wastewater. Microbial fuel cells (MFCs) have the potential to recover electric energy from organic pollutants in wastewater; however, the use of MFCs to generate electricity from methanol has not been reported. In the present study, we developed single-chamber MFCs that generated electricity from methanol at the maximum power density of 220 mW m−2 (based on the projected area of the anode). In order to reveal how microbes generate electricity from methanol, pyrosequencing of 16S rRNA-gene amplicons and Illumina shotgun sequencing of metagenome were conducted. The pyrosequencing detected in abundance Dysgonomonas, Sporomusa, and Desulfovibrio in the electrolyte and anode and cathode biofilms, while Geobacter was detected only in the anode biofilm. Based on known physiological properties of these bacteria, it is considered that Sporomusa converts methanol into acetate, which is then utilized by Geobacter to generate electricity. This speculation is supported by results of shotgun metagenomics of the anode-biofilm microbes, which reconstructed relevant catabolic pathways in these bacteria. These results suggest that methanol is anaerobically catabolized by syntrophic bacterial consortia with electrodes as electron acceptors. PMID:24852573
Pyrosequencing Based Microbial Community Analysis of Stabilized Mine Soils
NASA Astrophysics Data System (ADS)
Park, J. E.; Lee, B. T.; Son, A.
2015-12-01
Heavy metals leached from exhausted mines have been causing severe environmental problems in nearby soils and groundwater. In Korea, environmental mitigation has been performed by stabilizing heavy metals with calcite and steel slag. Since soil stabilization only temporarily immobilizes the contaminants in the soil matrix, the risk of heavy metals re-leaching still exists. Follow-up management of stabilized soils and corresponding evaluation methods are therefore required to avoid subsequent contamination from the stabilized soils. In this study, microbial community analysis using pyrosequencing was performed to assess the potential leaching of the stabilized soils. Rarefaction curves and the Chao1 and Shannon indices showed that the stabilized soil had lower richness and diversity than the non-contaminated negative control. At the phylum level, most phyla decreased as the degree of contamination increased, with the sole exception of Proteobacteria, which increased. Among the Proteobacteria, Gammaproteobacteria increased with heavy metal contamination. At the species level, Methylobacter tundripaludum, a gammaproteobacterium, made up the largest relative proportion of the microbial community, indicating that methanotrophs may play an important role in either solubilization or immobilization of heavy metals in stabilized soils.
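The richness and diversity indices named in the abstract above can be computed from a vector of per-taxon read counts. The sketch below uses the standard Shannon formula and the bias-corrected form of Chao1; the two count vectors are hypothetical and are not data from the study.

```python
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over taxa with nonzero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def chao1(counts):
    """Bias-corrected Chao1 richness: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)),
    where F1 and F2 are the numbers of taxa seen exactly once and twice."""
    observed = sum(1 for c in counts if c > 0)
    singletons = sum(1 for c in counts if c == 1)
    doubletons = sum(1 for c in counts if c == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


# Hypothetical read counts per taxon for two samples (illustration only).
control = [10, 8, 6, 5, 3, 2, 2, 1, 1, 1]
stabilized = [25, 10, 2, 1, 1]

for name, counts in (("control", control), ("stabilized", stabilized)):
    print(name, round(chao1(counts), 2), round(shannon_index(counts), 3))
```

A sample dominated by a few taxa, like the hypothetical `stabilized` vector, yields lower values for both indices than a more even sample with more taxa.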
Pyrosequencing reveals regional differences in fruit-associated fungal communities
Taylor, Michael W; Tsai, Peter; Anfang, Nicole; Ross, Howard A; Goddard, Matthew R
2014-01-01
We know relatively little of the distribution of microbial communities generally. Significant work has examined a range of bacterial communities, but the distribution of microbial eukaryotes is less well characterized. Humans have an ancient association with grape vines (Vitis vinifera) and have been making wine since the dawn of civilization, and fungi drive this natural process. While the molecular biology of certain fungi naturally associated with vines and wines is well characterized, complementary investigations into the ecology of fungi associated with fruiting plants is largely lacking. DNA sequencing technologies allow the direct estimation of microbial diversity from a given sample, avoiding culture-based biases. Here, we use deep community pyrosequencing approaches, targeted at the 26S rRNA gene, to examine the richness and composition of fungal communities associated with grapevines and test for geographical community structure among four major regions in New Zealand (NZ). We find over 200 taxa using this approach, which is 10-fold more than previously recovered using culture-based methods. Our analyses allow us to reject the null hypothesis of homogeneity in fungal species richness and community composition across NZ and reveal significant differences between major areas. PMID:24650123
Direct RNA-Based Detection and Differentiation of CTX-M-Type Extended-Spectrum β-Lactamases (ESBL)
Stein, Claudia; Makarewicz, Oliwia; Pfeifer, Yvonne; Brandt, Christian; Ramos, João Costa; Klinger, Mareike; Pletz, Mathias W.
2013-01-01
The current global spread of multi-resistant Gram-negatives, particularly bacteria expressing extended-spectrum β-lactamases, increases the likelihood of inappropriate empiric treatment of critically ill patients with subsequently increased mortality. From a clinical perspective, fast detection of resistant pathogens would allow a pre-emptive correction of an initially inappropriate treatment. Here we present a diagnostic amplification-sequencing approach as a proof of principle, based on the fast molecular detection and correct discrimination of CTX-M-β-lactamases, the most frequent ESBL family. The workflow consists of the isolation of total mRNA and CTX-M-specific reverse transcription (RT), amplification and pyrosequencing. Due to the high variability of the CTX-M-β-lactamase genes, degenerate primers were used for RT, qRT and pyrosequencing, and the suitability and discriminatory performance of two conserved positions within the CTX-M genes were analyzed, using one protocol for all isolates and positions, respectively. With this approach, no information regarding the expected CTX-M variant is needed, since all sequences are covered by these degenerate primers. The presented workflow can be conducted within eight hours and has the potential to be expanded to other β-lactamase families. PMID:24224038
Campana, Davide; Walter, Thomas; Pusceddu, Sara; Gelsomino, Fabio; Graillot, Emmanuelle; Prinzi, Natalie; Spallanzani, Andrea; Fiorentino, Michelangelo; Barritault, Marc; Dall'Olio, Filippo; Brighi, Nicole; Biasco, Guido
2018-06-01
Temozolomide (TEM)-based therapy has been reported to be effective in the treatment of metastatic neuroendocrine neoplasms (NEN), with response rates ranging from 30 to 70%. Among patients affected by advanced glioblastoma or melanoma and treated with TEM, loss of tumoral O6-methylguanine DNA methyltransferase (MGMT) is correlated with improved survival. In NEN patients, the role of MGMT deficiency in predicting clinical outcomes of TEM treatment is still under debate. In this study we evaluated 95 patients with advanced NENs undergoing treatment with TEM-based therapy. MGMT promoter methylation status was evaluated with two techniques: methylation-specific polymerase chain reaction or pyrosequencing. Treatment with TEM-based therapy was associated with an overall response rate of 27.4% according to RECIST criteria (51.8% of patients with and 17.7% without MGMT promoter methylation). Response to therapy, progression-free survival (PFS) and overall survival (OS) were correlated with MGMT status in univariate and multivariate analysis. Methylation of the MGMT promoter could be a strong predictive factor of objective response and an important prognostic factor of longer PFS and OS. According to our results, MGMT methylation status, evaluated with methylation-specific polymerase chain reaction or pyrosequencing, should have an important role in guiding therapeutic options in patients with metastatic NENs. These results need further confirmation with prospective studies.
How-Kit, Alexandre; Tost, Jörg
2015-01-01
A number of molecular diagnostic assays have been developed in recent years for mutation detection. Although these methods have become increasingly sensitive, most of them are incompatible with a sequencing-based readout and require prior knowledge of the mutation present in the sample. Consequently, coamplification at low denaturation (COLD)-PCR-based methods have been developed; they combine a high analytical sensitivity, due to mutation enrichment in the sample, with the identification of known or unknown mutations by downstream sequencing experiments. Among these methods, the recently developed Enhanced-ice-COLD-PCR (E-ice-COLD-PCR) appeared as the most powerful, as it outperformed the other COLD-PCR-based methods in terms of mutation enrichment and the simplicity of the experimental setup of the assay. Indeed, E-ice-COLD-PCR is very versatile, as it can be used on all types of PCR platforms and is applicable to different types of samples including fresh frozen, FFPE, and plasma samples. The technique relies on the incorporation of an LNA-containing blocker probe in the PCR reaction, followed by selective heteroduplex denaturation that enables amplification of the mutant allele while amplification of the wild-type allele is prevented. Combined with Pyrosequencing®, which is a very quantitative high-resolution sequencing technology, E-ice-COLD-PCR can detect and identify mutations with a limit of detection down to 0.01 %.
Mulindwa, Julius; Leiss, Kevin; Ibberson, David; Kamanyi Marucha, Kevin; Helbig, Claudia; Melo do Nascimento, Larissa; Silvester, Eleanor; Matthews, Keith; Matovu, Enock; Enyaru, John
2018-01-01
All of our current knowledge of African trypanosome metabolism is based on results from trypanosomes grown in culture or in rodents. Drugs against sleeping sickness must however treat trypanosomes in humans. We here compare the transcriptomes of Trypanosoma brucei rhodesiense from the blood and cerebrospinal fluid of human patients with those of trypanosomes from culture and rodents. The data were aligned and analysed using new user-friendly applications designed for Kinetoplastid RNA-Seq data. The transcriptomes of trypanosomes from human blood and cerebrospinal fluid did not predict major metabolic differences that might affect drug susceptibility. Usefully, there were relatively few differences between the transcriptomes of trypanosomes from patients and those of similar trypanosomes grown in rats. Transcriptomes of monomorphic laboratory-adapted parasites grown in in vitro culture closely resembled those of the human parasites, but some differences were seen. In poly(A)-selected mRNA transcriptomes, mRNAs encoding some protein kinases and RNA-binding proteins were under-represented relative to mRNA that had not been poly(A) selected; further investigation revealed that the selection tends to result in loss of longer mRNAs. PMID:29474390
Single-cell transcriptomics for microbial eukaryotes.
Kolisko, Martin; Boscaro, Vittorio; Burki, Fabien; Lynn, Denis H; Keeling, Patrick J
2014-11-17
One of the greatest hindrances to a comprehensive understanding of microbial genomics, cell biology, ecology, and evolution is that most microbial life is not in culture. Solutions to this problem have mainly focused on whole-community surveys like metagenomics, but these analyses inevitably lose information and present particular challenges for eukaryotes, which are relatively rare and possess large, gene-sparse genomes. Single-cell analyses present an alternative solution that allows for specific species to be targeted, while retaining information on cellular identity, morphology, and partitioning of activities within microbial communities. Single-cell transcriptomics, pioneered in medical research, offers particular potential advantages for uncultivated eukaryotes, but the efficiency and biases have not been tested. Here we describe a simple and reproducible method for single-cell transcriptomics using manually isolated cells from five model ciliate species; we examine impacts of amplification bias and contamination, and compare the efficacy of gene discovery to traditional culture-based transcriptomics. Gene discovery using single-cell transcriptomes was found to be comparable to mass-culture methods, suggesting single-cell transcriptomics is an efficient entry point into genomic data from the vast majority of eukaryotic biodiversity. Copyright © 2014 Elsevier Ltd. All rights reserved.
Mulindwa, Julius; Leiss, Kevin; Ibberson, David; Kamanyi Marucha, Kevin; Helbig, Claudia; Melo do Nascimento, Larissa; Silvester, Eleanor; Matthews, Keith; Matovu, Enock; Enyaru, John; Clayton, Christine
2018-02-01
All of our current knowledge of African trypanosome metabolism is based on results from trypanosomes grown in culture or in rodents. Drugs against sleeping sickness must however treat trypanosomes in humans. We here compare the transcriptomes of Trypanosoma brucei rhodesiense from the blood and cerebrospinal fluid of human patients with those of trypanosomes from culture and rodents. The data were aligned and analysed using new user-friendly applications designed for Kinetoplastid RNA-Seq data. The transcriptomes of trypanosomes from human blood and cerebrospinal fluid did not predict major metabolic differences that might affect drug susceptibility. Usefully, there were relatively few differences between the transcriptomes of trypanosomes from patients and those of similar trypanosomes grown in rats. Transcriptomes of monomorphic laboratory-adapted parasites grown in in vitro culture closely resembled those of the human parasites, but some differences were seen. In poly(A)-selected mRNA transcriptomes, mRNAs encoding some protein kinases and RNA-binding proteins were under-represented relative to mRNA that had not been poly(A) selected; further investigation revealed that the selection tends to result in loss of longer mRNAs.
2010-01-01
Background The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the genus Quercus. These oaks are often a source of raw material for biomass, wood and fiber. Pedunculate and sessile oaks are among the most important deciduous forest tree species in Europe. Despite their ecological and economic importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. Results We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking led us to eliminate 19,941 sequences. Finally, a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using the TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. A BLAST search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but a more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoid biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. 
Comparative orthologous sequences (COS) with other plant gene models were identified, making it possible to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched for, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. Conclusions This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. PMID:21092232
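The assembly totals reported in the abstract above can be cross-checked with a few lines of arithmetic. The sketch below uses only figures quoted in the abstract; the variable names are chosen here for illustration.

```python
# Figures quoted in the abstract.
sanger_ests = 125_925        # ESTs retained from Sanger sequencing
pyro_sequences = 1_578_192   # sequences retained from pyrosequencing
pyro_reads = 1_948_579       # raw pyrosequencing reads
pyro_removed = 370_566       # pyrosequencing sequences eliminated
contigs = 69_154             # tentative contigs after TGICL assembly
singletons = 153_517         # sequences left unassembled
swissprot_hits = 32_810      # unigene elements with SWISS-PROT homologs

assembly_input = sanger_ests + pyro_sequences
unigenes = contigs + singletons
removed_pct = 100 * pyro_removed / pyro_reads
swissprot_pct = 100 * swissprot_hits / unigenes

print(f"sequences entering assembly: {assembly_input:,}")
print(f"non-redundant sequences:     {unigenes:,}")
print(f"pyrosequencing reads removed: {removed_pct:.1f}%")
print(f"unigenes with SWISS-PROT hit: {swissprot_pct:.1f}%")
```

The sums reproduce the 1,704,117 input sequences and 222,671 non-redundant sequences, and the two percentages round to the 19.0% and 14.7% stated in the abstract.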
Community Structures of Fecal Bacteria in Cattle from Different Animal Feeding Operations▿†
Shanks, Orin C.; Kelty, Catherine A.; Archibeque, Shawn; Jenkins, Michael; Newton, Ryan J.; McLellan, Sandra L.; Huse, Susan M.; Sogin, Mitchell L.
2011-01-01
The fecal microbiome of cattle plays a critical role not only in animal health and productivity but also in food safety, pathogen shedding, and the performance of fecal pollution detection methods. Unfortunately, most published molecular surveys fail to provide adequate detail about variability in the community structures of fecal bacteria within and across cattle populations. Using massively parallel pyrosequencing of a hypervariable region of the rRNA coding region, we profiled the fecal microbial communities of cattle from six different feeding operations where cattle were subjected to consistent management practices for a minimum of 90 days. We obtained a total of 633,877 high-quality sequences from the fecal samples of 30 adult beef cattle (5 individuals per operation). Sequence-based clustering and taxonomic analyses indicate less variability within a population than between populations. Overall, bacterial community composition correlated significantly with fecal starch concentrations, largely reflected in changes in the Bacteroidetes, Proteobacteria, and Firmicutes populations. In addition, network analysis demonstrated that annotated sequences clustered by management practice and fecal starch concentration, suggesting that the structures of bovine fecal bacterial communities can be dramatically different in different animal feeding operations, even at the phylum and family taxonomic levels, and that the feeding operation is a more important determinant of the cattle microbiome than is the geographic location of the feedlot. PMID:21378055
Microbial Ecology of Thailand Tsunami and Non-Tsunami Affected Terrestrials
Somboonna, Naraporn; Wilantho, Alisa; Jankaew, Kruawun; Assawamakin, Anunchai; Sangsrakru, Duangjai; Tangphatsornruang, Sithichoke; Tongsima, Sissades
2014-01-01
The effects of tsunamis on microbial ecologies have been ill-defined, especially in Phang Nga province, Thailand. This ecosystem was catastrophically impacted by the 2004 Indian Ocean tsunami as well as the 600-year-old tsunami on Phra Thong island, Phang Nga province. This study is the first to elucidate their effects on microbial ecology. We utilized metagenomics with 16S and 18S rDNA-barcoded pyrosequencing to obtain prokaryotic and eukaryotic profiles for this tsunami-affected terrestrial site (S1), as well as for a parallel terrestrial site unaffected by tsunamis (S2). S1 showed microbial community patterns distinct from those of S2. The dendrogram constructed using the prokaryotic profiles supported the distinctness of the S1 microbial communities. S1 contained greater proportions of the archaea and bacteria domains; specifically, species belonging to Bacteroidetes became more frequent, replacing other typical flora such as Proteobacteria, Acidobacteria and Basidiomycota. Pathogenic microbes, including Acinetobacter haemolyticus, Flavobacterium spp. and Photobacterium spp., were also found frequently in S1. Furthermore, differing metabolic potentials indicated that this change in the microbial community could affect the functional ecology of the site. Moreover, habitat prediction based on the percentage of indicator species for marine, brackish, freshwater and terrestrial niches showed that S1 largely comprised species indicative of marine habitats. PMID:24710002
Fehlbaum, Sophie; Chassard, Christophe; Haug, Martina C.; Fourmestraux, Candice; Derrien, Muriel; Lacroix, Christophe
2015-01-01
In vitro gut modeling is a useful approach to investigate some factors and mechanisms of the gut microbiota independent of the effects of the host. This study tested the use of immobilized fecal microbiota to develop different designs of continuous colonic fermentation models mimicking elderly gut fermentation. Model 1 was a three-stage fermentation mimicking the proximal, transverse and distal colon. Models 2 and 3 were based on the new PolyFermS platform composed of an inoculum reactor seeded with immobilized fecal microbiota and used to continuously inoculate with the same microbiota different second-stage reactors mounted in parallel. The main gut bacterial groups, microbial diversity and metabolite production were monitored in effluents of all reactors using quantitative PCR, 16S rRNA gene 454-pyrosequencing, and HPLC, respectively. In all models, a diverse microbiota resembling the one tested in donor’s fecal sample was established. Metabolic stability in inoculum reactors seeded with immobilized fecal microbiota was shown for operation times of up to 80 days. A high microbial and metabolic reproducibility was demonstrated for downstream control and experimental reactors of a PolyFermS model. The PolyFermS models tested here are particularly suited to investigate the effects of environmental factors, such as diet and drugs, in a controlled setting with the same microbiota source. PMID:26559530
Fujimura, Kei E.; Rauch, Marcus; Matsui, Elizabeth; Iwai, Shoko; Calatroni, Agustin; Lynn, Henry; Mitchell, Herman; Johnson, Christine C.; Gern, James E.; Togias, Alkis; Boushey, Homer A.; Kennedy, Suzanne; Lynch, Susan V.
2013-01-01
Summary Standardized studies examining environmental microbial exposure in populations at risk for asthma are necessary to improve our understanding of the role this factor plays in disease development. Here we describe studies aimed at developing guidelines for high-resolution culture-independent microbiome profiling, using a phylogenetic microarray (PhyloChip), of house dust samples in a cohort collected as part of the NIH-funded Inner City Asthma Consortium (ICAC). We demonstrate that though extracted DNA concentrations varied across dust samples, the majority produced sufficient 16S rRNA to be profiled by the array. Comparison of array and 454-pyrosequencing performed in parallel on a subset of samples, illustrated that increasingly deeper sequencing efforts validated greater numbers of array-detected taxa. Community composition agreement across samples exhibited a hierarchy in concordance, with the highest level of agreement in replicate array profiles followed by samples collected from adjacent 1×1 m2 sites in the same room, adjacent sites with different sized sampling quadrants (1×1 and 2×2 m2), different sites within homes (living and bedroom) to lowest in living room samples collected from different homes. The guidelines for sample collection and processing in this pilot study extend beyond PhyloChip based studies of house-associated microbiota, and bear relevance for other microbiome profiling approaches such as next-generation sequencing. PMID:22975469
Beigh, Mohammad Muzafar
2016-01-01
Humans have long postulated a relationship between heredity and disease. Only at the beginning of the last century did scientists begin to discover the associations between different genes and disease phenotypes. Recent trends in next-generation sequencing (NGS) technologies have brought great momentum to biomedical research, which in turn has remarkably augmented our basic understanding of human biology and its associated diseases. State-of-the-art next-generation biotechnologies have started making huge strides in our current understanding of the mechanisms of various chronic illnesses such as cancers, metabolic disorders, and neurodegenerative anomalies. We are experiencing a renaissance in biomedical research primarily driven by next-generation biotechnologies such as genomics, transcriptomics, proteomics, metabolomics, and lipidomics. Although genomic discoveries are at the forefront of next-generation omics technologies, their implementation in the clinical arena has been painstakingly slow, mainly because of high reaction costs and the unavailability of requisite computational tools for large-scale data analysis. However, rapid innovations and the steadily falling cost of sequence-based chemistries, along with the development of advanced bioinformatics tools, have lately prompted the launch and implementation of large-scale massively parallel genome sequencing programs in fields ranging from medical genetics and infectious biology to agricultural sciences. Recent advances in large-scale omics technologies are bringing healthcare research beyond the traditional “bench to bedside” approach to more of a continuum that will include improvements in public healthcare and will be primarily based on a predictive, preventive, personalized, and participatory (P4) medicine approach. 
Recent large-scale research projects in genetic and infectious disease biology have indicated that massively parallel whole-genome/whole-exome sequencing, transcriptome analysis, and other functional genomic tools can reveal a large number of unique functional elements and/or markers that would otherwise go undetected by traditional sequencing methodologies. Therefore, the latest trends in biomedical research are giving birth to a new branch of medicine commonly referred to as personalized and/or precision medicine. Developments in the post-genomic era are expected to completely restructure the present clinical pattern of disease prevention and treatment as well as methods of diagnosis and prognosis. The next important step in the direction of the precision/personalized medicine approach should be its early adoption in clinics for future medical interventions. Consequently, in the coming years next-generation biotechnologies will reorient medical practice more towards disease prediction and prevention rather than curing diseases at later stages of their development and progression, even at the wider population level for the general public healthcare system. PMID:28930123
2010-08-25
or intentional genetic modifications that circumvent the targets of the detection assays or in the case of a biological attack using an antibiotic ...genetic changes conferring antibiotic resistance can be deciphered rapidly and accurately using WGS. We demonstrate the utility of Roche 454...Rapid Identification of Genetic Modifications in Bacillus anthracis Using Whole Genome Draft Sequences Generated by 454 Pyrosequencing Peter E. Chen1
USDA-ARS's Scientific Manuscript database
The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in heterozygous species. This is a promising approach to improving the annotation of the reference genome sequence of grapevine (Vitis vinifera L.), a species of high-l...
USDA-ARS's Scientific Manuscript database
Diet, nutrition, and obesity are important topics of current research. While many insect genome and/or transcriptome models are based on dietary specialists, the lady beetle Coleomegilla maculata, a common New World species, is highly omnivorous. C. maculata feeds on plants, fungi, insects and other...
USDA-ARS's Scientific Manuscript database
An essential step to understanding the genomic biology of any organism is to comprehensively survey its transcriptome. We present the Bovine Gene Atlas (BGA) a compendium of over 7.2 million unique 20 base Illumina DGE tags representing 100 tissue transcriptomes collected primarily from L1 Dominette...
Perron, Gabrielle; Jandaghi, Pouria; Solanki, Shraddha; Safisamghabadi, Maryam; Storoz, Cristina; Karimzadeh, Mehran; Papadakis, Andreas I; Arseneault, Madeleine; Scelo, Ghislaine; Banks, Rosamonde E; Tost, Jorg; Lathrop, Mark; Tanguay, Simon; Brazma, Alvis; Huang, Sidong; Brimo, Fadi; Najafabadi, Hamed S; Riazalhosseini, Yasser
2018-05-08
Widespread remodeling of the transcriptome is a signature of cancer; however, little is known about the post-transcriptional regulatory factors, including RNA-binding proteins (RBPs) that regulate mRNA stability, and the extent to which RBPs contribute to cancer-associated pathways. Here, by modeling the global change in gene expression based on the effect of sequence-specific RBPs on mRNA stability, we show that RBP-mediated stability programs are recurrently deregulated in cancerous tissues. Particularly, we uncovered several RBPs that contribute to the abnormal transcriptome of renal cell carcinoma (RCC), including PCBP2, ESRP2, and MBNL2. Modulation of these proteins in cancer cell lines alters the expression of pathways that are central to the disease and highlights RBPs as driving master regulators of RCC transcriptome. This study presents a framework for the screening of RBP activities based on computational modeling of mRNA stability programs in cancer and highlights the role of post-transcriptional gene dysregulation in RCC. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.
Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun
2012-09-15
To address the impending need for exploring the rapidly increasing transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs; and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBrowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/. Contact: liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn. Supplementary data are available at Bioinformatics online.
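As an illustration of one pre-processing step named in the abstract above (detecting simple sequence repeats in FASTA contigs), a minimal Python sketch follows. It is not CBrowse's implementation: the function names, the regular expression, and the thresholds (motifs of 2-6 bases, at least 4 tandem copies) are assumptions chosen only for the example.

```python
import re

def parse_fasta(text):
    """Parse FASTA-formatted text into {contig_id: sequence}."""
    contigs = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the contig id.
            name = line[1:].split()[0]
            contigs[name] = []
        elif name is not None:
            contigs[name].append(line.upper())
    return {key: "".join(parts) for key, parts in contigs.items()}

# Hypothetical thresholds: motif of 2-6 bases, at least MIN_REPEATS tandem copies.
MIN_REPEATS = 4
SSR_PATTERN = re.compile(r"([ACGT]{2,6}?)\1{%d,}" % (MIN_REPEATS - 1))

def find_ssrs(seq):
    """Return (start, motif, copies) for each tandem repeat found in seq."""
    hits = []
    for match in SSR_PATTERN.finditer(seq):
        motif = match.group(1)
        if len(set(motif)) == 1:
            continue  # skip homopolymer runs such as GGGGGGGG
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits

if __name__ == "__main__":
    demo = ">contig1 toy example\nTTGACACA\nCACACGGT\n"
    for contig_id, seq in parse_fasta(demo).items():
        print(contig_id, find_ssrs(seq))
```

A real pipeline would also read the SAM/BAM alignments to call polymorphisms and sequencing errors; that part is left out here because it depends on an alignment-parsing library.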
Prosdocimi, Francisco; Bittencourt, Daniela; da Silva, Felipe Rodrigues; Kirst, Matias; Motta, Paulo C.; Rech, Elibio L.
2011-01-01
Characterized by distinctive evolutionary adaptations, spiders provide a comprehensive system for evolutionary and developmental studies of anatomical organs, including silk and venom production. Here we performed cDNA sequencing using massively parallel sequencers (454 GS-FLX Titanium) to generate ∼80,000 reads from the spinning gland of Actinopus spp. (infraorder: Mygalomorphae) and Gasteracantha cancriformis (infraorder: Araneomorphae, Orbiculariae clade). Actinopus spp. retains primitive characteristics on web usage and presents a single undifferentiated spinning gland while the orbiculariae spiders have seven differentiated spinning glands and complex patterns of web usage. MIRA, Celera Assembler and CAP3 software were used to cluster NGS reads for each spider. CAP3 unigenes passed through a pipeline for automatic annotation, classification by biological function, and comparative transcriptomics. Genes related to spider silks were manually curated and analyzed. Although a single spidroin gene family was found in Actinopus spp., a vast repertoire of specialized spider silk proteins was encountered in orbiculariae. Astacin-like metalloproteases (meprin subfamily) were shown to be some of the most sampled unigenes and duplicated gene families in G. cancriformis since its evolutionary split from mygalomorphs. Our results confirm that the evolution of the molecular repertoire of silk proteins was accompanied by the (i) anatomical differentiation of spinning glands and (ii) behavioral complexification in the web usage. Finally, a phylogenetic tree was constructed to cluster most of the known spidroins in gene clades. This is the first large-scale, multi-organism transcriptome for spider spinning glands and a first step into a broad understanding of spider web systems biology and evolution. PMID:21738742
Shi, Kui; Gu, Jiayu; Guo, Huijun; Zhao, Linshu; Xie, Yongdun; Xiong, Hongchun; Li, Junhui; Zhao, Shirong; Song, Xiyun; Liu, Luxiang
2017-01-01
Chloroplast development is an integral part of plant survival and growth, and occurs in parallel with chlorophyll biosynthesis. However, little is known about the mechanisms underlying chloroplast development in hexaploid wheat. Here, we obtained a spaceflight-induced wheat albino mutant, mta. Chloroplast ultrastructural observation showed that chloroplasts of mta exhibit abnormal morphology and distribution compared to wild type. Photosynthetic pigment content was also significantly decreased in mta. Transcriptome and chloroplast proteome profiling of mta and wild type were done to identify differentially expressed genes (DEGs) and proteins (DEPs), respectively. In total, 4,588 DEGs, including 1,980 up- and 2,608 down-regulated, and 48 chloroplast DEPs, including 15 up- and 33 down-regulated, were identified in mta. Classification of DEGs revealed that most were involved in chloroplast development, chlorophyll biosynthesis, or photosynthesis. In addition, transcription factors such as PIF3, GLK and MYB, which might participate in those pathways, were also identified. Correlation analysis between DEGs and DEPs revealed that genes whose transcript and protein abundances changed together fell into photosynthesis- and chloroplast-related functional groups. Real-time qPCR analysis validated that the expression level of genes encoding photosynthetic proteins was significantly decreased in mta. Together, our results suggest that the molecular mechanism for albino leaf color formation in mta is a thoroughly regulated and complicated process. The combined analysis of transcriptome and proteome affords comprehensive information for further research on the chloroplast development mechanism in wheat. In addition, spaceflight provides a potential means of mutagenesis in crop breeding.
Young, Ellen; Carey, Manus; Meharg, Andrew A; Meharg, Caroline
2018-03-20
Plants can adapt to edaphic stress, such as nutrient deficiency, toxicity and biotic challenges, by controlled transcriptomic responses, including microbiome interactions. Traditionally studied in model plant species with controlled microbiota inoculation treatments, molecular plant-microbiome interactions can be functionally investigated via RNA-Seq. Complex, natural plant-microbiome studies are limited, typically focusing on microbial rRNA and omitting functional microbiome investigations, presenting a fundamental knowledge gap. Here, root and shoot meta-transcriptome analyses, in tandem with shoot elemental content and root staining, were employed to investigate transcriptome responses in the wild grass Holcus lanatus and its associated natural multi-species eukaryotic microbiome. A full factorial reciprocal soil transplant experiment was employed, using plant ecotypes from two widely contrasting natural habitats, acid bog and limestone quarry soil, to investigate naturally occurring, and ecologically meaningful, edaphically driven molecular plant-microbiome interactions. Arbuscular mycorrhizal (AM) and non-AM fungal colonization was detected in roots in both soils. Staining showed greater levels of non-AM fungi, and transcriptomics indicated a predominance of Ascomycota-annotated genes. Roots in acid bog soil were dominated by Phialocephala-annotated transcripts, a putative growth-promoting endophyte, potentially involved in N nutrition and ion homeostasis. Limestone roots in acid bog soil had greater expression of other Ascomycete genera and Oomycetes and lower expression of Phialocephala-annotated transcripts compared to acid ecotype roots, which corresponded with reduced induction of pathogen defense processes, particularly lignin biosynthesis in limestone ecotypes. Ascomycota dominated in shoots and limestone soil roots, but Phialocephala-annotated transcripts were insignificant, and no single Ascomycete genus dominated. 
Fusarium-annotated transcripts were the most common genus in shoots, with Colletotrichum and Rhizophagus (AM fungi) most numerous in limestone soil roots. The latter coincided with upregulation of plant genes involved in AM symbiosis initiation and AM-based P acquisition in an environment where P availability is low. Meta-transcriptome analyses provided novel insights into H. lanatus transcriptome responses, associated eukaryotic microbiota functions and taxonomic community composition. Significant edaphic and plant ecotype effects were identified, demonstrating that meta-transcriptome-based functional analysis is a powerful tool for the study of natural plant-microbiome interactions.
Ochsner, Scott A.; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian
2016-01-01
The pregnane X receptor (PXR/NR1I2) and constitutive androstane receptor (CAR/NR1I3) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities. PMID:27409825
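The idea of ranking genes by how often they are differentially expressed across experiments can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function name, the input shape (one set of differentially expressed gene identifiers per experiment), and the placeholder gene labels are all hypothetical.

```python
from collections import Counter

def rank_by_de_frequency(experiments):
    """Rank genes by the fraction of experiments in which they are called
    differentially expressed.

    experiments: list of iterables of gene identifiers, one per experiment.
    Returns a list of (gene, fraction), highest fraction first; ties are
    broken alphabetically so the output is deterministic.
    """
    n = len(experiments)
    if n == 0:
        return []
    counts = Counter()
    for de_genes in experiments:
        counts.update(set(de_genes))  # count each gene once per experiment
    ranked = ((gene, count / n) for gene, count in counts.items())
    return sorted(ranked, key=lambda item: (-item[1], item[0]))

if __name__ == "__main__":
    # Placeholder labels, not real results.
    toy = [{"geneA", "geneB"}, {"geneA"}, {"geneA", "geneC"}, {"geneB"}]
    for gene, fraction in rank_by_de_frequency(toy):
        print(gene, fraction)
```

Counting each gene once per experiment keeps a single large dataset from dominating the ranking, which is one plausible reason to rank by frequency rather than by pooled fold change.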
Ochsner, Scott A; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian; McKenna, Neil J
2016-08-01
The pregnane X receptor (PXR/NR1I2) and constitutive androstane receptor (CAR/NR1I3) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities.
Fisher, Andrew G; Seaborne, Robert A; Hughes, Thomas M; Gutteridge, Alex; Stewart, Claire; Coulson, Judy M; Sharples, Adam P; Jarvis, Jonathan C
2017-12-01
Physical inactivity and disuse are major contributors to age-related muscle loss. Denervation of skeletal muscle has been previously used as a model with which to investigate muscle atrophy following disuse. Although gene regulatory networks that control skeletal muscle atrophy after denervation have been established, the transcriptome in response to the recovery of muscle after disuse and the associated epigenetic mechanisms that may function to modulate gene expression during skeletal muscle atrophy or recovery have yet to be investigated. We report that silencing the tibialis anterior muscle in rats with tetrodotoxin (TTX), administered to the common peroneal nerve, resulted in reductions in muscle mass of 7, 29, and 51% with corresponding reductions in muscle fiber cross-sectional area of 18, 42, and 69% after 3, 7, and 14 d of TTX, respectively. Of importance, 7 d of recovery, during which rodents resumed habitual physical activity, restored muscle mass from a reduction of 51% after 14 d TTX to a reduction of only 24% compared with sham control, returning muscle mass to levels observed at 7 d TTX administration (29% reduction). Transcriptome-wide analysis demonstrated that 3714 genes were differentially expressed across all conditions at a significance of P ≤ 0.001 after disuse-induced atrophy. Of interest, after 7 d of recovery, the expression of genes that were most changed during TTX had returned to that of the sham control. The 20 most differentially expressed genes after microarray analysis were identified across all conditions and were cross-referenced with the most frequently occurring differentially expressed genes between conditions. This gene subset included myogenin (MyoG), Hdac4, Ampd3, Trim63 (MuRF1), and acetylcholine receptor subunit α1 (Chrna1). 
Transcript expression of these genes and Fbxo32 (MAFbx), because of its previously identified role in disuse atrophy together with Trim63 (MuRF1), were confirmed by real-time quantitative RT-PCR, and DNA methylation of their promoter regions was analyzed by PCR and pyrosequencing. MyoG, Trim63 (MuRF1), Fbxo32 (MAFbx), and Chrna1 demonstrated significantly decreased DNA methylation at key time points after disuse-induced atrophy that corresponded with significantly increased gene expression. Of importance, after TTX cessation and 7 d of recovery, there was a marked increase in the DNA methylation profiles of Trim63 (MuRF1) and Chrna1 back to control levels. This also corresponded with the return of gene expression in the recovery group back to baseline expression observed in sham-surgery controls. To our knowledge, this is the first study to demonstrate that skeletal muscle atrophy in response to disuse is accompanied by dynamic epigenetic modifications that are associated with alterations in gene expression, and that these epigenetic modifications and gene expression profiles are reversible after skeletal muscle returns to normal activity. Fisher, A. G., Seaborne, R. A., Hughes, T. M., Gutteridge, A., Stewart, C., Coulson, J. M., Sharples, A. P., Jarvis, J. C. Transcriptomic and epigenetic regulation of disuse atrophy and the return to activity in skeletal muscle. © FASEB.
2011-01-01
Background Big sagebrush (Artemisia tridentata) is one of the most widely distributed and ecologically important shrub species in western North America. This species serves as a critical habitat and food resource for many animals and invertebrates. Habitat loss due to a combination of disturbances followed by establishment of invasive plant species is a serious threat to big sagebrush ecosystem sustainability. Lack of genomic data has limited our understanding of the evolutionary history and ecological adaptation in this species. Here, we report on the sequencing of expressed sequence tags (ESTs) and detection of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers in subspecies of big sagebrush. Results cDNA of A. tridentata sspp. tridentata and vaseyana were normalized and sequenced using the 454 GS FLX Titanium pyrosequencing technology. Assembly of the reads resulted in 20,357 contig consensus sequences in ssp. tridentata and 20,250 contigs in ssp. vaseyana. A BLASTx search against the non-redundant (NR) protein database using 29,541 consensus sequences obtained from a combined assembly resulted in 21,436 sequences with significant blast alignments (≤ 1e-15). A total of 20,952 SNPs and 119 polymorphic SSRs were detected between the two subspecies. SNPs were validated through various methods including sequence capture. Validation of SNPs in different individuals uncovered a high level of nucleotide variation in EST sequences. EST sequences of a third, tetraploid subspecies (ssp. wyomingensis) obtained by Illumina sequencing were mapped to the consensus sequences of the combined 454 EST assembly. Approximately one-third of the SNPs between sspp. tridentata and vaseyana identified in the combined assembly were also polymorphic within the two geographically distant ssp. wyomingensis samples. Conclusion We have produced a large EST dataset for Artemisia tridentata, which contains a large sample of the big sagebrush leaf transcriptome. 
SNP mapping among the three subspecies suggests the origin of ssp. wyomingensis via mixed ancestry. A large number of SNP and SSR markers provide the foundation for future research to address questions in big sagebrush evolution, ecological genetics, and conservation using genomic approaches. PMID:21767398
Gomez, Ana; Cardoso, Christiane; Genta, Fernando A; Terra, Walter R; Ferreira, Clélia
2013-08-01
The soluble midgut trehalase from Tenebrio molitor (TmTre1) was purified after several chromatographic steps, resulting in an enzyme of 58 kDa with a pH optimum of 5.3 (ionizing active groups in the free enzyme: pK(e1) = 3.8 ± 0.2, pK(e2) = 7.4 ± 0.2). The purified enzyme corresponds to the deduced amino acid sequence of a cloned cDNA (TmTre1-cDNA), because a single cDNA coding a soluble trehalase was found in the T. molitor midgut transcriptome. Furthermore, the mass of the protein predicted to be coded by TmTre1-cDNA agrees with that of the purified enzyme. TmTre1 has the essential catalytic groups Asp 315 and Glu 513 and the essential Arg residues R164, R217, R282. Carbodiimide inactivation of the purified enzyme at different pH values reveals an essential carboxyl group with pKa = 3.5 ± 0.3. Phenylglyoxal modified a single Arg residue with pKa = 7.5 ± 0.2, as observed in the soluble trehalase from Spodoptera frugiperda (SfTre1). Diethylpyrocarbonate modified a His residue that resulted in a less active enzyme with pK(e1) changed to 4.8 ± 0.2. In TmTre1 the modified His residue (putatively His 336) is more exposed than the His modified in SfTre1 (putatively His 210) and that affects the ionization of an Arg residue. The architecture of the active site of TmTre1 and SfTre1 is different, as shown by multiple inhibition analysis, the meaning of which demands further research. Trehalase sequences obtained from midgut transcriptomes (pyrosequencing and Illumina data) from 8 insects pertaining to 5 different orders were used in a cladogram, together with other representative sequences. The data suggest that the trehalase gene underwent duplication and divergence prior to the separation of the paraneopteran and holometabolan orders and that the soluble trehalase derived from the membrane-bound one by losing the C-terminal transmembrane loop. Copyright © 2013 Elsevier Ltd. All rights reserved.
Lamy, Pierre-Jean; Castan, Florence; Lozano, Nicolas; Montélion, Cécile; Audran, Patricia; Bibeau, Frédéric; Roques, Sylvie; Montels, Frédéric; Laberenne, Anne-Claire
2015-07-01
The detection of the BRAF V600E mutation in melanoma samples is used to select patients who should respond to BRAF inhibitors. Different techniques are routinely used to determine BRAF status in clinical samples. However, low tumor cellularity and tumor heterogeneity can affect the sensitivity of somatic mutation detection. Digital PCR (dPCR) is a next-generation genotyping method that clonally amplifies nucleic acids and allows the detection and quantification of rare mutations. Our aim was to evaluate the clinical routine performance of a new dPCR-based test to detect and quantify BRAF mutation load in 47 paraffin-embedded cutaneous melanoma biopsies. We compared the results obtained by dPCR with high-resolution melting curve analysis and pyrosequencing or with one of the allele-specific PCR methods available on the market. dPCR showed the lowest limit of detection. dPCR and allele-specific amplification detected the highest number of mutated samples. For the BRAF mutation load quantification both dPCR and pyrosequencing gave similar results with strong disparities in allele frequencies in the 47 tumor samples under study (from 0.7% to 79% of BRAF V600E mutations/sample). In conclusion, the four methods showed a high degree of concordance. dPCR was the more-sensitive method to reliably and easily detect mutations. Both pyrosequencing and dPCR could quantify the mutation load in heterogeneous tumor samples. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
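For readers unfamiliar with how a digital PCR readout becomes a mutation load, the following Python sketch shows the standard Poisson correction for partitions that received more than one template copy. It is a generic illustration, not the assay evaluated in the abstract above; the function names and the partition counts in the demo are hypothetical.

```python
import math

def copies_per_partition(positive, total):
    """Poisson-corrected mean number of target copies per partition.

    Uses lambda = -ln(1 - p), where p is the fraction of positive partitions.
    """
    if total <= 0 or not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total")
    return -math.log(1.0 - positive / total)

def mutant_fraction(mutant_positive, wildtype_positive, total):
    """Estimated mutant allele fraction: mutant / (mutant + wild type)."""
    lam_mut = copies_per_partition(mutant_positive, total)
    lam_wt = copies_per_partition(wildtype_positive, total)
    denom = lam_mut + lam_wt
    return lam_mut / denom if denom else 0.0

if __name__ == "__main__":
    # Hypothetical counts: 20,000 partitions, 1,000 mutant-positive,
    # 15,000 wild-type-positive.
    print(f"{mutant_fraction(1000, 15000, 20000):.2%}")
```

The correction matters most when a large share of partitions is positive: the naive ratio of positive partitions then overstates the minor allele, because wild-type partitions are more likely to hold several copies.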
Analysis of bacterial xylose isomerase gene diversity using gene-targeted metagenomics.
Nurdiani, Dini; Ito, Michihiro; Maruyama, Toru; Terahara, Takeshi; Mori, Tetsushi; Ugawa, Shin; Takeyama, Haruko
2015-08-01
Bacterial xylose isomerases (XI) are promising resources for efficient biofuel production from xylose in lignocellulosic biomass. Here, we investigated xylose isomerase gene (xylA) diversity in three soil metagenomes differing in plant vegetation and geographical location, using an amplicon pyrosequencing approach and two newly-designed primer sets. A total of 158,555 reads from three metagenomic DNA replicates for each soil sample were classified into 1127 phylotypes, detected in triplicate and defined by 90% amino acid identity. The phylotype coverage was estimated to be within the range of 84.0-92.7%. The xylA gene phylotypes obtained were phylogenetically distributed across the two known xylA groups. They shared 49-100% identities with their closest-related XI sequences in GenBank. Phylotypes demonstrating <90% identity with known XIs in the database accounted for 89% of the total xylA phylotypes. The differences among xylA members and compositions within each soil sample were significantly smaller than they were between different soils based on a UniFrac distance analysis, suggesting soil-specific xylA genotypes and taxonomic compositions. The differences among xylA members and their compositions in the soil were strongly correlated with 16S rRNA variation between soil samples, also assessed by amplicon pyrosequencing. This is the first report of xylA diversity in environmental samples assessed by amplicon pyrosequencing. Our data provide information regarding xylA diversity in nature, and can be a basis for the screening of novel xylA genotypes for practical applications. Copyright © 2015. Published by Elsevier B.V.
Morin, Alexander M; Gatev, Evan; McEwen, Lisa M; MacIsaac, Julia L; Lin, David T S; Koen, Nastassja; Czamara, Darina; Räikkönen, Katri; Zar, Heather J; Koenen, Karestan; Stein, Dan J; Kobor, Michael S; Jones, Meaghan J
2017-01-01
Cord blood is a commonly used tissue in environmental, genetic, and epigenetic population studies due to its ready availability and potential to inform on a sensitive period of human development. However, the introduction of maternal blood during labor or cross-contamination during sample collection may complicate downstream analyses. After discovering maternal contamination of cord blood in a cohort study of 150 neonates using Illumina 450K DNA methylation (DNAm) data, we used a combination of linear regression and random forest machine learning to create a DNAm-based screening method. We identified a panel of DNAm sites that could discriminate between contaminated and non-contaminated samples, then designed pyrosequencing assays to pre-screen DNA prior to being assayed on an array. Maternal contamination of cord blood was initially identified by unusual X chromosome DNA methylation patterns in 17 males. We utilized our DNAm panel to detect contaminated male samples and a proportional number of female samples in the same cohort. We validated our DNAm screening method on an additional 189-sample cohort using both pyrosequencing and DNAm arrays, as well as 9 publicly available cord blood 450K data sets. The rate of contamination varied from 0 to 10% within these studies, likely related to collection-specific methods. Maternal blood can contaminate cord blood during sample collection at appreciable levels across multiple studies. We have identified a panel of markers that can be used to identify this contamination, either post hoc after DNAm arrays have been completed, or in advance using a targeted technique like pyrosequencing.
Pyrosequencing analysis of oral microbiota shifting in various caries states in childhood.
Jiang, Wen; Ling, Zongxin; Lin, Xiaolong; Chen, Yadong; Zhang, Jie; Yu, Jinjin; Xiang, Charlie; Chen, Hui
2014-05-01
Dental caries is one of the most prevalent childhood diseases worldwide, but little is known about the dynamic characteristics of oral microbiota in the development of dental caries. To investigate the shifting bacterial profiles in different caries states, 60 children (3-7 years old) were enrolled in this study, including 30 caries-free subjects and 30 caries-active subjects. Supragingival plaques were collected from caries-active subjects on intact enamel, white spot lesions and carious dentin lesions. Plaques from caries-free subjects were used as a control. All samples were analyzed by 454 pyrosequencing based on 16S rRNA gene V1-V3 hypervariable regions. A total of 572,773 pyrosequencing reads passed the quality control and 25,444 unique phylotypes were identified, which represented 18 phyla and 145 genera. Reduced bacterial diversity in the cavitated dentin was observed as compared with the other groups. Thirteen genera (including Capnocytophaga, Fusobacterium, Porphyromonas, Abiotrophia, Comamonas, Tannerella, Eikenella, Paludibacter, Treponema, Actinobaculum, Stenotrophomonas, Aestuariimicrobium, and Peptococcus) were found to be associated with dental health, and the bacterial profiles differed considerably depending on caries status. Eight genera (including Cryptobacterium, Lactobacillus, Megasphaera, Olsenella, Scardovia, Shuttleworthia, Cryptobacterium, and Streptococcus) were increased significantly in cavitated dentin lesions, and Actinomyces and Corynebacterium were present at significantly high levels in white spot lesions (P < 0.05), while Flavobacterium, Neisseria, Bergeyella, and Derxia were enriched on the intact surfaces of caries-active individuals (P < 0.05). Our results showed that oral bacteria were specific to different stages of caries progression, which helps to inform the prevention and treatment of childhood dental caries.
2013-01-01
Background Isatis indigotica is a widely used herb for the clinical treatment of colds, fever, and influenza in Traditional Chinese Medicine (TCM). Various structural classes of compounds have been identified as effective ingredients. However, little is known at the genetic level about these active metabolites. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive dataset of I. indigotica. Results A database of 36,367 unigenes (average length = 1,115.67 bases) was generated by performing transcriptome sequencing. Based on the gene annotation of the transcriptome, 104 unigenes were identified covering most of the catalytic steps in the general biosynthetic pathways of indole, terpenoid, and phenylpropanoid. Subsequently, the organ-specific expression patterns of the genes involved in these pathways, and their responses to methyl jasmonate (MeJA) induction, were investigated. Metabolite profiling of the effective phenylpropanoids showed that the accumulation patterns of secondary metabolites were mostly correlated with the transcription of their biosynthetic genes. According to the analysis of the UDP-dependent glycosyltransferase (UGT) family, several flavonoids were indicated to exist in I. indigotica and were further identified by metabolic profiling using UPLC/Q-TOF. Moreover, applying transcriptome co-expression analysis, nine new, putative UGTs were suggested as flavonol glycosyltransferases and lignan glycosyltransferases. Conclusions This database provides a pool of candidate genes involved in biosynthesis of effective metabolites in I. indigotica. Furthermore, the comprehensive analysis and characterization of the significant pathways are expected to give a better insight regarding the diversity of chemical composition, synthetic characteristics, and the regulatory mechanisms which operate in this medicinal herb. PMID:24308360
Youssef, Noha H.; Couger, M. B.; Elshahed, Mostafa S.
2010-01-01
Background The adaptation of pyrosequencing technologies for use in culture-independent diversity surveys allowed for deeper sampling of ecosystems of interest. One area that is extremely well suited to pyrosequencing-based diversity surveys, but that has received surprisingly little attention so far, is the examination of fine scale (e.g. micrometer to millimeter) beta diversity in complex microbial ecosystems. Methodology/Principal Findings We examined the patterns of fine scale beta diversity in four adjacent sediment samples (1 mm apart) from the source of an anaerobic sulfide and sulfur rich spring (Zodletone spring) in southwestern Oklahoma, USA. Using pyrosequencing, a total of 292,130 16S rRNA gene sequences were obtained. The beta diversity patterns within the four datasets were examined using various qualitative and quantitative similarity indices. Low levels of beta diversity (high similarity indices) were observed between the four samples at the phylum level. However, at a putative species (OTU0.03) level, higher levels of beta diversity (lower similarity indices) were observed. Further examination of beta diversity patterns within dominant and rare members of the community indicated that at the putative species level, beta diversity is much higher within rare members of the community. Finally, sub-classification of rare members of the Zodletone spring community based on patterns of novelty and uniqueness, and further examination of fine scale beta diversity of each of these subgroups, indicated that members of the community that are unique but non-novel showed the highest beta diversity within these subgroups of the rare biosphere. Conclusions/Significance The results demonstrate the occurrence of high inter-sample diversity within seemingly identical samples from a complex habitat.
We reason that such unexpected diversity should be taken into consideration when exploring gamma diversity of various ecosystems, as well as planning for sequencing-intensive metagenomic surveys of highly complex ecosystems. PMID:20865128
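The contrast between qualitative and quantitative similarity indices in the abstract above can be illustrated with a minimal sketch. The abstract does not name the indices used, so Jaccard (presence/absence) and Bray-Curtis (abundance-weighted) are assumptions chosen for illustration, and the OTU tables are hypothetical.

```python
def jaccard_similarity(sample_a, sample_b):
    """Qualitative (presence/absence) similarity between two OTU count dicts."""
    otus_a = {otu for otu, n in sample_a.items() if n > 0}
    otus_b = {otu for otu, n in sample_b.items() if n > 0}
    union = otus_a | otus_b
    if not union:
        return 1.0
    return len(otus_a & otus_b) / len(union)


def bray_curtis_similarity(sample_a, sample_b):
    """Quantitative similarity: 1 minus Bray-Curtis dissimilarity on raw counts."""
    otus = set(sample_a) | set(sample_b)
    shared = sum(min(sample_a.get(o, 0), sample_b.get(o, 0)) for o in otus)
    total = sum(sample_a.values()) + sum(sample_b.values())
    if total == 0:
        return 1.0
    return 2 * shared / total


# Hypothetical OTU tables: two abundant shared OTUs, one rare OTU unique to each site.
site_1 = {"otu_a": 50, "otu_b": 30, "otu_c": 1}
site_2 = {"otu_a": 45, "otu_b": 35, "otu_d": 1}

qualitative = jaccard_similarity(site_1, site_2)
quantitative = bray_curtis_similarity(site_1, site_2)
```

In this toy case the rare, site-specific OTUs lower the presence/absence similarity far more than the abundance-weighted one, which is the kind of pattern the abstract attributes to rare community members.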
Kisand, Veljo; Lettieri, Teresa
2013-04-01
De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (<450 bps), which are presumed to aid in the analysis of uncharacterized genomes. The array of tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. 
Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize unknown bacteria with modest effort.
Feldmesser, Ester; Rosenwasser, Shilo; Vardi, Assaf; Ben-Dor, Shifra
2014-02-22
The advent of Next Generation Sequencing technologies and corresponding bioinformatics tools allows the definition of transcriptomes in non-model organisms. Non-model organisms are of great ecological and biotechnological significance, and consequently the understanding of their unique metabolic pathways is essential. Several methods that integrate de novo assembly with genome-based assembly have been proposed. Yet, there are many open challenges in defining genes, particularly where genomes are not available or incomplete. Despite the large numbers of transcriptome assemblies that have been performed, quality control of the transcript building process, particularly on the protein level, is rarely performed if ever. To test and improve the quality of the automated transcriptome reconstruction, we used manually defined and curated genes, several of them experimentally validated. Several approaches to transcript construction were utilized, based on the available data: a draft genome, high quality RNAseq reads, and ESTs. In order to maximize the contribution of the various data, we integrated methods including de novo and genome based assembly, as well as EST clustering. After each step a set of manually curated genes was used for quality assessment of the transcripts. The interplay between the automated pipeline and the quality control indicated which additional processes were required to improve the transcriptome reconstruction. We discovered that E. huxleyi has a very high percentage of non-canonical splice junctions, and relatively high rates of intron retention, which caused unique issues with the currently available tools. While individual tools missed genes and artificially joined overlapping transcripts, combining the results of several tools improved the completeness and quality considerably. 
The final collection, created from the integration of several quality control and improvement rounds, was compared to the manually defined set both on the DNA and protein levels, and resulted in an improvement of 20% versus any of the read-based approaches alone. To the best of our knowledge, this is the first time that an automated transcript definition is subjected to quality control using manually defined and curated genes and thereafter the process is improved. We recommend using a set of manually curated genes to troubleshoot transcriptome reconstruction.
Liu, Jun-Jun; Xiang, Yu
2011-01-01
WRKY transcription factors are key regulators of numerous biological processes in plant growth and development, as well as plant responses to abiotic and biotic stresses. Research on biological functions of plant WRKY genes has focused in the past on model plant species or species with largely characterized transcriptomes. However, a variety of non-model plants, such as forest conifers, are essential as feed, biofuel, and wood or for sustainable ecosystems. Identification of WRKY genes in these non-model plants is equally important for understanding the evolutionary and function-adaptive processes of this transcription factor family. Because of limited genomic information, the rarity of regulatory gene mRNAs in transcriptomes, and the sequence divergence to model organism genes, identification of transcription factors in non-model plants using methods similar to those generally used for model plants is difficult. This chapter describes a gene family discovery strategy for identification of WRKY transcription factors in conifers by a combination of in silico-based prediction and PCR-based experimental approaches. Compared to traditional cDNA library screening or EST sequencing at transcriptome scales, this integrated gene discovery strategy provides fast, simple, reliable, and specific methods to unveil the WRKY gene family at both genome and transcriptome levels in non-model plants.
Grace, Peter M; Hurley, Daniel; Barratt, Daniel T; Tsykin, Anna; Watkins, Linda R; Rolan, Paul E; Hutchinson, Mark R
2012-09-01
A quantitative, peripherally accessible biomarker for neuropathic pain has great potential to improve clinical outcomes. Based on the premise that peripheral and central immunity contribute to neuropathic pain mechanisms, we hypothesized that biomarkers could be identified from the whole blood of adult male rats, by integrating graded chronic constriction injury (CCI), ipsilateral lumbar dorsal quadrant (iLDQ) and whole blood transcriptomes, and pathway analysis with pain behavior. Correlational bioinformatics identified a range of putative biomarker genes for allodynia intensity, many encoding for proteins with a recognized role in immune/nociceptive mechanisms. A selection of these genes was validated in a separate replication study. Pathway analysis of the iLDQ transcriptome identified Fcγ and Fcε signaling pathways, among others. This study is the first to employ the whole blood transcriptome to identify pain biomarker panels. The novel correlational bioinformatics, developed here, selected such putative biomarkers based on a correlation with pain behavior and formation of signaling pathways with iLDQ genes. Future studies may demonstrate the predictive ability of these biomarker genes across other models and additional variables. © 2012 The Authors. Journal of Neurochemistry © 2012 International Society for Neurochemistry.
Multiplexed transcriptome analysis to detect ALK, ROS1 and RET rearrangements in lung cancer
Rogers, Toni-Maree; Arnau, Gisela Mir; Ryland, Georgina L.; Huang, Stephen; Lira, Maruja E.; Emmanuel, Yvette; Perez, Omar D.; Irwin, Darryl; Fellowes, Andrew P.; Wong, Stephen Q.; Fox, Stephen B.
2017-01-01
ALK, ROS1 and RET gene fusions are important predictive biomarkers for tyrosine kinase inhibitors in lung cancer. Currently, the gold standard method for gene fusion detection is Fluorescence In Situ Hybridization (FISH) and while highly sensitive and specific, it is also labour intensive, subjective in analysis, and unable to screen large numbers of gene fusions. Recent developments in high-throughput transcriptome-based methods may provide a suitable alternative to FISH as they are compatible with multiplexing and diagnostic workflows. However, the concordance between these different methods compared with FISH has not been evaluated. In this study we compared the results from three transcriptome-based platforms (Nanostring Elements, Agena LungFusion panel and ThermoFisher NGS fusion panel) to those obtained from ALK, ROS1 and RET FISH on 51 clinical specimens. Overall agreement of results ranged from 86–96% depending on the platform used. While all platforms were highly sensitive, both the Agena panel and ThermoFisher NGS fusion panel reported minor fusions that were not detectable by FISH. Our proof-of-principle study illustrates that transcriptome-based analyses are sensitive and robust methods for detecting actionable gene fusions in lung cancer and could provide a robust alternative to FISH testing in the diagnostic setting. PMID:28181564
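Overall agreement, sensitivity and specificity of a platform against FISH can be computed from paired calls. The sketch below is illustrative only: the function name and the six example calls are hypothetical and are not data from the study.

```python
def agreement_metrics(platform_calls, fish_calls):
    """Compare paired fusion-positive/negative calls, treating FISH as the reference."""
    if len(platform_calls) != len(fish_calls):
        raise ValueError("call lists must be the same length")
    pairs = list(zip(platform_calls, fish_calls))
    tp = sum(1 for p, f in pairs if p and f)          # both positive
    tn = sum(1 for p, f in pairs if not p and not f)  # both negative
    fp = sum(1 for p, f in pairs if p and not f)      # platform-only positive
    fn = sum(1 for p, f in pairs if not p and f)      # FISH-only positive
    return {
        "overall_agreement": (tp + tn) / len(pairs) if pairs else None,
        "sensitivity": tp / (tp + fn) if (tp + fn) else None,
        "specificity": tn / (tn + fp) if (tn + fp) else None,
    }


# Hypothetical calls for six specimens (True = fusion detected).
fish = [True, True, True, False, False, False]
platform = [True, True, True, True, False, False]
metrics = agreement_metrics(platform, fish)
```

A platform-only positive, such as a minor fusion not visible by FISH, lowers specificity and overall agreement under this scheme even if the call is biologically real, which is worth keeping in mind when FISH is the reference.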
Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil
2015-01-01
The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. PMID:25362073
rnaQUAST: a quality assessment tool for de novo transcriptome assemblies.
Bushmanova, Elena; Antipov, Dmitry; Lapidus, Alla; Suvorov, Vladimir; Prjibelski, Andrey D
2016-07-15
The ability to generate large RNA-Seq datasets created a demand for both de novo and reference-based transcriptome assemblers. However, while many transcriptome assemblers are now available, there is still no unified quality assessment tool for RNA-Seq assemblies. We present rnaQUAST, a tool for evaluating RNA-Seq assembly quality and benchmarking transcriptome assemblers using a reference genome and gene database. rnaQUAST calculates various metrics that demonstrate completeness and correctness levels of the assembled transcripts, and outputs them in a user-friendly report. rnaQUAST is implemented in Python and is freely available at http://bioinf.spbau.ru/en/rnaquast. Contact: ap@bioinf.spbau.ru. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Smola, Matthew J; Rice, Greggory M; Busan, Steven; Siegfried, Nathan A; Weeks, Kevin M
2015-11-01
Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistries exploit small electrophilic reagents that react with 2'-hydroxyl groups to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues by using reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as can be done for simple model RNAs. This protocol describes the experimental steps, implemented over 3 d, that are required to perform SHAPE probing and to construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots and provides useful troubleshooting information. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures and visualize probable and alternative helices, often in under 1 d. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles and entire transcriptomes.
CellAtlasSearch: a scalable search engine for single cells.
Srivastava, Divyanshu; Iyer, Arvind; Kumar, Vibhor; Sengupta, Debarka
2018-05-21
Owing to the advent of high throughput single cell transcriptomics, the past few years have seen exponential growth in the production of gene expression data. Recently, efforts have been made by various research groups to homogenize and store single cell expression from a large number of studies. The true value of this ever increasing data deluge can be unlocked by making it searchable. To this end, we propose CellAtlasSearch, a novel search architecture for high dimensional expression data, which is massively parallel as well as light-weight, thus infinitely scalable. In CellAtlasSearch, we use a Graphical Processing Unit (GPU) friendly version of Locality Sensitive Hashing (LSH) for unmatched speedup in data processing and query. Currently, CellAtlasSearch features over 300 000 reference expression profiles including both bulk and single-cell data. It enables the user to query individual single cell transcriptomes and finds matching samples from the database along with necessary meta information. CellAtlasSearch aims to assist researchers and clinicians in characterizing unannotated single cells. It also facilitates noise free, low dimensional representation of single-cell expression profiles by projecting them on a wide variety of reference samples. The web-server is accessible at: http://www.cellatlassearch.com.
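The general idea behind Locality Sensitive Hashing for expression search can be sketched with random-hyperplane signatures. This is a CPU-only illustration under stated assumptions, not the CellAtlasSearch implementation: the abstract does not describe its hash family, and all function names, profiles and parameters below are hypothetical.

```python
import random


def make_hyperplanes(n_bits, n_genes, seed=0):
    """Draw n_bits random hyperplanes (normal vectors) in gene-expression space."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(n_genes)] for _ in range(n_bits)]


def signature(profile, hyperplanes):
    """One bit per hyperplane: which side of it the expression profile falls on."""
    return tuple(
        1 if sum(w * x for w, x in zip(plane, profile)) >= 0 else 0
        for plane in hyperplanes
    )


def hamming(sig_a, sig_b):
    """Number of differing bits; small values suggest a small angle between profiles."""
    return sum(a != b for a, b in zip(sig_a, sig_b))


def search(query, reference_signatures, hyperplanes, top_k=1):
    """Rank reference names by Hamming distance to the query's signature."""
    q = signature(query, hyperplanes)
    ranked = sorted(reference_signatures,
                    key=lambda name: hamming(q, reference_signatures[name]))
    return ranked[:top_k]


# Hypothetical four-gene reference profiles.
profiles = {"cell_a": [5.0, 0.0, 1.0, 3.0], "cell_b": [0.0, 4.0, 2.0, 0.0]}
planes = make_hyperplanes(n_bits=16, n_genes=4)
index = {name: signature(p, planes) for name, p in profiles.items()}
best = search(profiles["cell_a"], index, planes)
```

Because queries are compared as short bit strings rather than full expression vectors, the comparison step is cheap and independent per reference, which is what makes this family of methods amenable to massively parallel hardware.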
Proteomics of drug resistance in Candida glabrata biofilms.
Seneviratne, C Jayampath; Wang, Yu; Jin, Lijian; Abiko, Y; Samaranayake, Lakshman P
2010-04-01
Candida glabrata is a fungal pathogen that causes a variety of mucosal and systemic infections among compromised patient populations with higher mortality rates. Previous studies have shown that the biofilm mode of growth of the fungus is highly resistant to antifungal agents compared with the free-floating or planktonic mode of growth. Therefore, in the present study, we used 2-D DIGE to evaluate the differential proteomic profiles of C. glabrata under planktonic and biofilm modes of growth. Candida glabrata biofilms were developed on polystyrene surfaces and age-matched planktonic cultures were obtained in parallel. Initially, biofilm architecture, viability, and antifungal susceptibility were evaluated. Proteins differentially expressed by more than 1.5-fold in DIGE analysis were subjected to MS/MS. The transcriptomic regulation of these biomarkers was evaluated by quantitative real-time PCR. Candida glabrata biofilms were highly resistant to the antifungals and biocides compared with the planktonic mode of growth. The Candida glabrata biofilm proteome, when compared with its planktonic proteome, showed upregulation of stress response proteins, while glycolysis enzymes were downregulated. A similar trend could be observed at the transcriptomic level. In conclusion, C. glabrata biofilms possess a higher amount of stress response proteins, which may potentially contribute to the higher antifungal resistance seen in C. glabrata biofilms.
Huang, Kailong; Zhang, Xu-Xiang; Shi, Peng; Wu, Bing; Ren, Hongqiang
2014-11-01
In order to comprehensively investigate bacterial virulence in drinking water, 454 pyrosequencing and Illumina high-throughput sequencing were used to detect potential pathogenic bacteria and virulence factors (VFs) in a full-scale drinking water treatment and distribution system. 16S rRNA gene pyrosequencing revealed high bacterial diversity in the drinking water (441-586 operational taxonomic units). Bacterial diversity decreased after chlorine disinfection, but increased after pipeline distribution. α-Proteobacteria was the most dominant taxonomic class. Alignment against the established pathogen database showed that several types of putative pathogens were present in the drinking water and Pseudomonas aeruginosa had the highest abundance (over 11‰ of total sequencing reads). Many pathogens disappeared after chlorine disinfection, but P. aeruginosa and Leptospira interrogans were still detected in the tap water. High-throughput sequencing revealed prevalence of various pathogenicity islands and virulence proteins in the drinking water, and translocases, transposons, Clp proteases and flagellar motor switch proteins were the predominant VFs. Both diversity and abundance of the detectable VFs increased after the chlorination, and decreased after the pipeline distribution. This study indicates that joint use of 454 pyrosequencing and Illumina sequencing can comprehensively characterize environmental pathogenesis, and several types of putative pathogens and various VFs are prevalent in drinking water. Copyright © 2014 Elsevier Inc. All rights reserved.
Detection of novel NF1 mutations and rapid mutation prescreening with Pyrosequencing.
Brinckmann, Anja; Mischung, Claudia; Bässmann, Ingelore; Kühnisch, Jirko; Schuelke, Markus; Tinschert, Sigrid; Nürnberg, Peter
2007-12-01
Neurofibromatosis type 1 (NF1) is caused by mutations in the neurofibromin (NF1) gene. Mutation analysis of NF1 is complicated by its large size, the lack of mutation hotspots, pseudogenes and frequent de novo mutations. Additionally, the search for NF1 mutations on the mRNA level is often hampered by nonsense-mediated mRNA decay (NMD) of the mutant allele. In this study we searched for mutations in a cohort of 38 patients and investigated the relationship between mutation type and allele-specific transcription from the wild-type versus mutant alleles. Quantification of relative mRNA transcript numbers was done by Pyrosequencing, a novel real-time sequencing method whose signals can be quantified very accurately. We identified 21 novel mutations comprising various mutation types. Pyrosequencing detected a definite relationship between allelic NF1 transcript imbalance due to NMD and mutation type in 24 of 29 patients who all carried frame-shift or nonsense mutations. NMD was absent in 5 patients with missense and silent mutations, as well as in 4 patients with splice-site mutations that did not disrupt the reading frame. Pyrosequencing was capable of detecting NMD even when the effects were only moderate. Diagnostic laboratories could thus exploit this effect for rapid prescreening for NF1 mutations as more than 60% of the mutations in this gene disrupt the reading frame and are prone to NMD.
USDA-ARS's Scientific Manuscript database
Although Campylobacter is an important food-borne human pathogen, there remains a lack of molecular diagnostic assays that are simple to use, cost-effective, and provide rapid results in research, clinical, or regulatory laboratories. Of the numerous Campylobacter assays that do exist, to our knowl...
Youssef, Noha; Sheik, Cody S.; Krumholz, Lee R.; Najar, Fares Z.; Roe, Bruce A.; Elshahed, Mostafa S.
2009-01-01
Pyrosequencing-based 16S rRNA gene surveys are increasingly utilized to study highly diverse bacterial communities, with special emphasis on utilizing the large number of sequences obtained (tens to hundreds of thousands) for species richness estimation. However, it is not yet clear how the number of operational taxonomic units (OTUs) and, hence, species richness estimates determined using shorter fragments at different taxonomic cutoffs correlates with the number of OTUs assigned using longer, nearly complete 16S rRNA gene fragments. We constructed a 16S rRNA clone library from an undisturbed tallgrass prairie soil (1,132 clones) and used it to compare species richness estimates obtained using eight pyrosequencing candidate fragments (99 to 361 bp in length) and the nearly full-length fragment. Fragments encompassing the V1 and V2 (V1+V2) region and the V6 region (generated using primer pairs 8F-338R and 967F-1046R) overestimated species richness; fragments encompassing the V3, V7, and V7+V8 hypervariable regions (generated using primer pairs 338F-530R, 1046F-1220R, and 1046F-1392R) underestimated species richness; and fragments encompassing the V4, V5+V6, and V6+V7 regions (generated using primer pairs 530F-805R, 805F-1046R, and 967F-1220R) provided estimates comparable to those obtained with the nearly full-length fragment. These patterns were observed regardless of the alignment method utilized or the parameter used to gauge comparative levels of species richness (number of OTUs observed, slope of scatter plots of pairwise distance values for short and nearly complete fragments, and nonparametric and parametric species richness estimates). Similar results were obtained when analyzing three other datasets derived from soil, adult Zebrafish gut, and basaltic formations in the East Pacific Rise. 
Regression analysis indicated that these observed discrepancies in species richness estimates within various regions could readily be explained by the proportions of hypervariable, variable, and conserved base pairs within an examined fragment. PMID:19561178
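As an illustration of a nonparametric species richness estimate of the kind mentioned above, the sketch below computes the bias-corrected Chao1 estimator from per-OTU read counts. The abstract does not say which estimators were used, so Chao1 is an assumption here, and the counts are hypothetical.

```python
def chao1(otu_counts):
    """Bias-corrected Chao1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)).

    F1 and F2 are the numbers of OTUs seen exactly once and exactly twice.
    """
    observed = sum(1 for n in otu_counts if n > 0)
    singletons = sum(1 for n in otu_counts if n == 1)
    doubletons = sum(1 for n in otu_counts if n == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


# Hypothetical read counts per OTU for one fragment at one distance cutoff.
counts = [10, 5, 2, 2, 1, 1, 1]
estimate = chao1(counts)
```

Since the estimate is driven by singleton and doubleton OTUs, a fragment that splits or merges rare sequences differently at a given cutoff will shift the estimate, which is consistent with the over- and underestimation patterns the abstract reports for different hypervariable regions.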
USDA-ARS's Scientific Manuscript database
Micronutrient malnutrition is the most common form of nutrient deficiency among populations having a cereal based-diet. Rice is the staple food for one third of the world’s population, but is a poor source of iron and zinc concentration. We have characterized the root transcriptome of diverse indica...
J. D. Tang; L. A. Parker; A. D. Perkins; T. S. Sonstegard; S. G. Schroeder; D. D. Nicholas; S. V. Diehl
2013-01-01
High-throughput transcriptomics was used to identify Fibroporia radiculosa genes that were differentially regulated during colonization of wood treated with a copper-based preservative. The transcriptome was profiled at two time points while the fungus was growing on wood treated with micronized copper quat (MCQ). A total of 917 transcripts were...
The NCCT high throughput transcriptomics (HTTr) screening program uses whole transcriptome profiling assay in human-derived cells to collect concentration-response data for large numbers (100s-1000s) of environmental chemicals. To contextualize HTTr data, chemical effects on cell...
USDA-ARS's Scientific Manuscript database
Illumina HiSeq technology was used to sequence the transcriptome from various dissected tissues and life stages from the horn fly, Haematobia irritans. These samples include eggs (0, 2, 4, and 9 hours post-oviposition), adult fly gut, adult fly legs, adult fly malpighian tubule, adult fly ovary, adu...
Between a Pod and a Hard Test: The Deep Evolution of Amoebae.
Kang, Seungho; Tice, Alexander K; Spiegel, Frederick W; Silberman, Jeffrey D; Pánek, Tomáš; Cepicka, Ivan; Kostka, Martin; Kosakyan, Anush; Alcântara, Daniel M C; Roger, Andrew J; Shadwick, Lora L; Smirnov, Alexey; Kudryavtsev, Alexander; Lahr, Daniel J G; Brown, Matthew W
2017-09-01
Amoebozoa is the eukaryotic supergroup sister to Obazoa, the lineage that contains the animals and Fungi, as well as their protistan relatives, and the breviate and apusomonad flagellates. Amoebozoa is extraordinarily diverse, encompassing important model organisms and significant pathogens. Although amoebozoans are integral to global nutrient cycles and present in nearly all environments, they remain vastly understudied. We present a robust phylogeny of Amoebozoa based on a broad representative set of taxa in a phylogenomic framework (325 genes). By sampling 61 taxa using culture-based and single-cell transcriptomics, our analyses show two major clades of Amoebozoa, Discosea, and Tevosa. This phylogeny refutes previous studies in major respects. Our results support the hypothesis that the last common ancestor of Amoebozoa was sexual and flagellated; it may also have had the ability to disperse propagules from a sporocarp-type fruiting body. Overall, the main macroevolutionary patterns in Amoebozoa appear to result from the parallel losses of homologous characters of a multiphase life cycle that included flagella, sex, and sporocarps rather than independent acquisition of convergent features. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu
2015-01-01
The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.
Impact of Transcriptomics on Our Understanding of Pulmonary Fibrosis
Vukmirovic, Milica; Kaminski, Naftali
2018-01-01
Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease characterized by aberrant remodeling of the lung parenchyma with extensive changes to the phenotypes of all lung resident cells. The introduction of transcriptomics, genome scale profiling of thousands of RNA transcripts, caused a significant inversion in IPF research. Instead of generating hypotheses based on animal models of disease, or biological plausibility, with limited validation in humans, investigators were able to generate hypotheses based on unbiased molecular analysis of human samples and then use animal models of disease to test their hypotheses. In this review, we describe the insights made from transcriptomic analysis of human IPF samples. We describe how transcriptomic studies led to identification of novel genes and pathways involved in the human IPF lung such as: matrix metalloproteinases, WNT pathway, epithelial genes, role of microRNAs among others, as well as conceptual insights such as the involvement of developmental pathways and deep shifts in epithelial and fibroblast phenotypes. The impact of lung and transcriptomic studies on disease classification, endotype discovery, and reproducible biomarkers is also described in detail. Despite these impressive achievements, the impact of transcriptomic studies has been limited because they analyzed bulk tissue and did not address the cellular and spatial heterogeneity of the IPF lung. We discuss new emerging technologies and applications, such as single-cell RNAseq and microenvironment analysis that may address cellular and spatial heterogeneity. We end by making the point that most current tissue collections and resources are not amenable to analysis using the novel technologies. To take advantage of the new opportunities, we need new efforts of sample collections, this time focused on access to all the microenvironments and cells in the IPF lung. PMID:29670881
Kairov, Ulykbek; Cantini, Laura; Greco, Alessandro; Molkenov, Askhat; Czerwinska, Urszula; Barillot, Emmanuel; Zinovyev, Andrei
2017-09-11
Independent Component Analysis (ICA) is a method that models gene expression data as the action of a set of statistically independent hidden factors. The output of ICA depends on a fundamental parameter: the number of components (factors) to compute. The optimal choice of this parameter, related to determining the effective data dimension, remains an open question in the application of blind source separation techniques to transcriptomic data. Here we address the question of optimizing the number of statistically independent components in the analysis of transcriptomic data for reproducibility of the components in multiple runs of ICA (within the same or within varying effective dimensions) and in multiple independent datasets. To this end, we introduce a ranking of independent components based on their stability in multiple ICA computation runs and define a distinguished number of components (Most Stable Transcriptome Dimension, MSTD) corresponding to the point of qualitative change of the stability profile. Based on a large body of data, we demonstrate that a sufficient number of dimensions is required for biological interpretability of the ICA decomposition and that the most stable components, with ranks below MSTD, are more likely to be reproduced in independent studies than the less stable ones. At the same time, we show that a transcriptomics dataset can be reduced to a relatively high number of dimensions without losing the interpretability of ICA, even though higher dimensions give rise to components driven by small gene sets. We suggest a protocol for applying ICA to transcriptomics data that allows components to be prioritized by their reproducibility, which strengthens the biological interpretation. Computing too few components (far fewer than MSTD) is not optimal for interpretability of the results. The components ranked within the MSTD range are more likely to be reproduced in independent studies.
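The idea of ranking components by their stability across repeated ICA runs can be illustrated with a small sketch. It is a simplified proxy and not necessarily the stability measure used in the study: each component of a reference run is scored by its best absolute Pearson correlation with any component of every other run, and the scores are averaged. The function names (`pearson`, `stability_scores`, `rank_by_stability`) and the toy numbers are hypothetical, and the component vectors are assumed to have been computed elsewhere.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def stability_scores(runs):
    """Score each component of runs[0] by how well other runs reproduce it.

    runs: list of runs; each run is a list of components; each component is
    a list of per-gene weights. The absolute correlation is used because ICA
    components are defined only up to sign and ordering.
    """
    reference, others = runs[0], runs[1:]
    scores = []
    for comp in reference:
        # Best match of this component within each of the other runs.
        best = [max(abs(pearson(comp, c)) for c in run) for run in others]
        scores.append(sum(best) / len(best))
    return scores


def rank_by_stability(runs):
    """Indices of reference components, most stable first."""
    scores = stability_scores(runs)
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Made-up toy data: 3 hypothetical runs, 2 components each, 6 genes.
run_a = [[1.0, 0.9, 0.1, 0.0, -0.1, 0.0], [0.0, 0.1, 0.0, 1.0, 0.8, -0.9]]
run_b = [[0.1, 0.0, 0.1, -1.0, -0.9, 0.8], [-1.0, -0.8, 0.0, 0.1, 0.0, 0.1]]
run_c = [[0.9, 1.0, 0.0, 0.1, 0.0, -0.1], [0.5, -0.4, 0.9, 0.2, -0.1, 0.3]]

scores = stability_scores([run_a, run_b, run_c])
ranking = rank_by_stability([run_a, run_b, run_c])
print(scores, ranking)
```

In this toy case the first component reappears in both other runs (with flipped sign in one), while the second is recovered in only one of them, so it ranks lower. A stability profile of this kind, plotted against component rank, is the sort of curve in which a qualitative change point such as MSTD would be sought.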
RNA-Seq Technology and Its Application in Fish Transcriptomics
Ba, Yi; Zhuang, Qianfeng
2014-01-01
High-throughput sequencing technologies, also known as next-generation sequencing (NGS) technologies, have revolutionized genomic research. In addition to the static genome, these state-of-the-art technologies have recently been used to analyze the dynamic transcriptome, and the resulting technology is termed RNA sequencing (RNA-seq). RNA-seq is free from many limitations of other transcriptomic approaches, such as microarrays and tag-based sequencing methods. Although RNA-seq has only been available for a short time, studies using this method have completely changed our perspective on the breadth and depth of eukaryotic transcriptomes. In the transcriptomics of teleost fishes, both model and non-model species have benefited from the RNA-seq approach and have seen tremendous advances in the past several years. RNA-seq has helped not only in mapping and annotating fish transcriptomes but also in our understanding of many biological processes in fish, such as development, adaptive evolution, host immune response, and stress response. In this review, we first provide an overview of each step of RNA-seq, from library construction to the bioinformatic analysis of the data. We then summarize and discuss the recent biological insights obtained from RNA-seq studies in a variety of fish species. PMID:24380445
Hume, Michael E; Barbosa, Nei A; Dowd, Scot E; Sakomura, Nilva K; Nalian, Armen G; Martynova-Van Kley, Alexandra; Oviedo-Rondón, Edgar O
2011-11-01
A protective digestive microflora helps prevent and reduce broiler infection and colonization by enteropathogens. In the current experiment, broilers fed diets supplemented with probiotics and essential oil (EO) blends were infected with a standard mix of Eimeria spp. to determine the effects of performance enhancers on ileal and cecal microbial communities (MCs). Eight treatment groups included four controls (uninfected-unmedicated [UU], unmedicated-infected, the antibiotic BMD plus the ionophore Coban as positive control, and the ionophore as negative control), and four treatments (probiotics BC-30 and Calsporin; and EO, Crina Poultry Plus, and Crina PoultryAF). Day-old broilers were raised to 14 days in floor pens on used litter and then were moved to Petersime batteries and inoculated at 15 days with mixed Eimeria spp. Ileal and cecal samples were collected at 14 days and 7 days postinfection. Digesta DNA was subjected to pyrosequencing for sequencing of individual cecal bacteria and to denaturing gradient gel electrophoresis (DGGE) for determination of changes in ileal and cecal MC according to percentage similarity coefficient (%SC). Pyrosequencing is very sensitive in detecting shifts in individual bacterial sequences, whereas DGGE is able to detect gross shifts in the entire MC. Combined, these techniques offer versatility in identifying modulation of broiler MC by feed additives and mild Eimeria infection. Pyrosequencing detected 147 bacterial species sequences. Additionally, pyrosequencing revealed the presence of relatively low levels of the potential human enteropathogens Campylobacter sp. and four Shigella spp. as well as the potential poultry pathogen Clostridium perfringens. Pre- and postinfection changes in ileal (56%SC) and cecal (78.5%SC) DGGE profiles resulted from the coccidia infection and from increased broiler age. Probiotics and EO changed MC from those seen in UU ilea and ceca. Results potentially reflect the performance enhancement above expectations in comparison to broilers not given the probiotics or the specific EO blends as feed supplements.
Altimari, Annalisa; de Biase, Dario; De Maglio, Giovanna; Gruppioni, Elisa; Capizzi, Elisa; Degiovanni, Alessio; D’Errico, Antonia; Pession, Annalisa; Pizzolitto, Stefano; Fiorentino, Michelangelo; Tallini, Giovanni
2013-01-01
Detection of KRAS mutations in archival pathology samples is critical for therapeutic appropriateness of anti-EGFR monoclonal antibodies in colorectal cancer. We compared the sensitivity, specificity, and accuracy of Sanger sequencing, ARMS-Scorpion (TheraScreen®) real-time polymerase chain reaction (PCR), pyrosequencing, chip array hybridization, and 454 next-generation sequencing to assess KRAS codon 12 and 13 mutations in 60 nonconsecutive selected cases of colorectal cancer. Twenty of the 60 cases were detected as wild-type KRAS by all methods with 100% specificity. Among the 40 mutated cases, 13 were discrepant with at least one method. The sensitivity was 85%, 90%, 93%, and 92%, and the accuracy was 90%, 93%, 95%, and 95% for Sanger sequencing, TheraScreen real-time PCR, pyrosequencing, and chip array hybridization, respectively. The main limitation of Sanger sequencing was its low analytical sensitivity, whereas TheraScreen real-time PCR, pyrosequencing, and chip array hybridization showed higher sensitivity but suffered from the limitations of predesigned assays. Concordance between the methods was k = 0.79 for Sanger sequencing and k > 0.85 for the other techniques. Tumor cell enrichment correlated significantly with the abundance of KRAS-mutated deoxyribonucleic acid (DNA), evaluated as ΔCt for TheraScreen real-time PCR (P = 0.03), percentage of mutation for pyrosequencing (P = 0.001), ratio for chip array hybridization (P = 0.003), and percentage of mutation for 454 next-generation sequencing (P = 0.004). Also, 454 next-generation sequencing showed the best cross correlation for quantification of mutation abundance compared with all the other methods (P < 0.001). Our comparison showed the superiority of next-generation sequencing over the other techniques in terms of sensitivity and specificity. Next-generation sequencing will replace Sanger sequencing as the reference technique for diagnostic detection of KRAS mutation in archival tumor tissues. 
PMID:23950653
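The sensitivity, specificity, and accuracy figures reported above follow the standard confusion-matrix definitions. The sketch below is a minimal illustration of that arithmetic. The 40 mutated and 20 wild-type cases come from the abstract, but the split of the mutated cases into 34 detected and 6 missed is an assumption chosen because it reproduces the 85% sensitivity and 90% accuracy reported for Sanger sequencing; the abstract does not give the per-method counts. The function name `diagnostic_metrics` is hypothetical.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                # mutated cases correctly called
    specificity = tn / (tn + fp)                # wild-type cases correctly called
    accuracy = (tp + tn) / (tp + fn + tn + fp)  # all cases correctly called
    return sensitivity, specificity, accuracy


# 40 mutated + 20 wild-type cases are from the abstract.
# tp=34 / fn=6 is an ASSUMED split, not a count reported in the source.
sens, spec, acc = diagnostic_metrics(tp=34, fn=6, tn=20, fp=0)
print(f"sensitivity={sens:.0%} specificity={spec:.0%} accuracy={acc:.0%}")
```

With 100% specificity fixed by the 20 concordant wild-type cases, accuracy in this cohort is driven entirely by how many of the 40 mutated cases a method detects.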
Switzeny, Olivier J; Christmann, Markus; Renovanz, Mirjam; Giese, Alf; Sommer, Clemens; Kaina, Bernd
2016-01-01
The DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) causes resistance of cancer cells to alkylating agents and, therefore, is a well-established predictive marker for high-grade gliomas that are routinely treated with alkylating drugs. Since MGMT is highly epigenetically regulated, the MGMT promoter methylation status is taken as an indicator of MGMT silencing, predicting the outcome of glioma therapy. MGMT promoter methylation is usually determined by methylation-specific PCR (MSP), which is a labor-intensive and error-prone method often used semi-quantitatively. Searching for alternatives, we used closed-tube high resolution melt (HRM) analysis, which is a quantitative method, and compared it with MSP and pyrosequencing regarding its predictive value. We analyzed glioblastoma cell lines with known MGMT activity and formalin-fixed samples from IDH1 wild-type high-grade glioma patients (WHO grade III/IV) treated with radiation and temozolomide by HRM, MSP, and pyrosequencing. The data were compared with respect to progression-free survival (PFS) and overall survival (OS) of patients with methylated and unmethylated MGMT status. A promoter methylation cut-off level relevant for PFS and OS was determined. In a multivariate Cox regression model, methylation of the MGMT promoter of high-grade gliomas analyzed by HRM, but not MSP, was found to be an independent predictive marker for OS. Univariate Kaplan-Meier analyses revealed significant and better discrimination between methylated and unmethylated tumors for both PFS and OS when quantitative HRM was used instead of MSP. Compared to MSP and pyrosequencing, the HRM method is simple, cost-effective, highly accurate, and fast. HRM is at least equivalent to pyrosequencing in quantifying the methylation level. It is superior to MSP in predicting PFS and OS of high-grade glioma patients and, therefore, can be recommended for routine determination of the MGMT status of gliomas.
Vezzulli, Luigi; Pezzati, Elisabetta; Huete-Stauffer, Carla; Pruzzo, Carla; Cerrano, Carlo
2013-01-01
Mass mortality events of benthic invertebrates in the Mediterranean Sea are becoming an increasing concern with catastrophic effects on the coastal marine environment. Sea surface temperature anomalies leading to physiological stress, starvation and microbial infections were identified as major factors triggering animal mortality. However, the highest occurrence of mortality episodes in particular geographic areas and occasionally in low-temperature deep environments suggests that other factors play a role as well. We conducted a comparative analysis of bacterial communities associated with the purple gorgonian Paramuricea clavata, one of the most affected species, collected at different geographic locations and depths, showing contrasting levels of anthropogenic disturbance and health status. Using massively parallel 16S rDNA gene pyrosequencing, we showed that the bacterial community associated with healthy P. clavata in pristine locations was dominated by a single genus, Endozoicomonas, within the order Oceanospirillales, which represented ∼90% of the overall bacterial community. P. clavata samples collected in human-impacted areas and during disease events had higher bacterial diversity and abundance of disease-related bacteria, such as vibrios, than samples collected in pristine locations, while showing a reduced dominance of Endozoicomonas spp. In contrast, bacterial symbionts exhibited remarkable stability in P. clavata collected at both euphotic and mesophotic depths in pristine locations, suggesting that fluctuations in environmental parameters such as temperature have a limited effect in structuring the bacterial holobiont. Interestingly, the coral pathogen Vibrio coralliilyticus was not found on diseased corals collected during a deep mortality episode, suggesting that neither temperature anomalies nor recognized microbial pathogens are sufficient on their own to explain the events. Overall, our data suggest that anthropogenic influence may play a significant role in determining the coral health status by affecting the composition of the associated microbial community. Stressful environmental events and microbial infections may thus be superimposed to compromise immunity and trigger mortality outbreaks. PMID:23840768